vacancespleinair.fr
robots.txt

Robots Exclusion Standard data for vacancespleinair.fr

Resource Scan

Scan Details

Site Domain vacancespleinair.fr
Base Domain vacancespleinair.fr
Scan Status Ok
Last Scan2024-10-27T05:23:48+00:00
Next Scan 2024-11-03T05:23:48+00:00

Last Scan

Scanned2024-10-27T05:23:48+00:00
URL https://vacancespleinair.fr/robots.txt
Domain IPs 104.21.5.104, 172.67.133.74, 2606:4700:3032::ac43:854a, 2606:4700:3033::6815:568
Response IP 104.21.5.104
Found Yes
Hash a88b60d459e3af7fd4d812f54d197ce319c132341761ea7c4e7ff13005c10e5a
SimHash 709c1f32febb

Groups

*

Rule Path
Disallow /wp-admin*
Disallow /*?
Disallow /wp-login.php*
Disallow /wp-includes
Disallow */trackback
Disallow /*/comments
Disallow /cgi-bin
Disallow /*.inc$
Disallow /*.gz
Disallow /*.cgi
Disallow /*?replytocom*
Disallow /wp-json
Allow /wp-admin/admin-ajax.php
Allow /wp-content/uploads
Allow /wp-content/themes/
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-*.svg
Allow /wp-*.pdf

ahrefsbot
aspiegelbot
blexbot
barkrowler
dotbot
mj12bot
mauibot
nimbostratus-bot
petalbot
semrushbot
seznambot
sogou
serpstatbot
trendiction

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 180

textbulkerbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://vacancespleinair.fr/sitemap_index.xml
sitemap https://vacancespleinair.fr/sitemap-news.xml
sitemap https://vacancespleinair.fr/sitemap.xml

Warnings

  • 1 invalid line.