gaultmillau.fr
robots.txt

Robots Exclusion Standard data for gaultmillau.fr

Resource Scan

Scan Details

Site Domain gaultmillau.fr
Base Domain gaultmillau.fr
Scan Status Ok
Last Scan2024-06-12T03:04:13+00:00
Next Scan 2024-06-19T03:04:13+00:00

Last Scan

Scanned2024-06-12T03:04:13+00:00
URL http://gaultmillau.fr/robots.txt
Redirect https://fr.gaultmillau.com/robots.txt
Redirect Domain fr.gaultmillau.com
Redirect Base gaultmillau.com
Domain IPs 213.186.33.5
Redirect IPs 151.101.131.52, 151.101.195.52, 151.101.3.52, 151.101.67.52
Response IP 199.232.47.52
Found Yes
Hash 0762ca7a50202208311b78401428f2e585ca43e7643ec8d552592f2228f9c018
SimHash 083556924783

Groups

woorank

Rule Path
Disallow

facebookexternalhit

Rule Path
Disallow

googlebot

Rule Path
Disallow

semrushbot

Rule Path
Disallow

twitterbot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

teoma

Rule Path
Disallow

gigabot

Rule Path
Disallow

robozilla

Rule Path
Disallow

nutch

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

baiduspider

Rule Path
Disallow

naverbot

Rule Path
Disallow

yeti

Rule Path
Disallow

yahoo-mmcrawler

Rule Path
Disallow

psbot

Rule Path
Disallow

bingbot/2.0

Rule Path
Disallow

yandexbot/3.0

Rule Path
Disallow

yahoo-blogs/v3.9

Rule Path
Disallow

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://fr.gaultmillau.com/sitemap/FR/index.xml