gaultmillau.com
robots.txt

Robots Exclusion Standard data for gaultmillau.com

Resource Scan

Scan Details

Site Domain gaultmillau.com
Base Domain gaultmillau.com
Scan Status Ok
Last Scan2024-05-31T11:05:23+00:00
Next Scan 2024-06-07T11:05:23+00:00

Last Scan

Scanned2024-05-31T11:05:23+00:00
URL https://gaultmillau.com/robots.txt
Redirect https://www.gaultmillau.com/robots.txt
Redirect Domain www.gaultmillau.com
Redirect Base gaultmillau.com
Domain IPs 54.76.137.79
Redirect IPs 151.101.131.52, 151.101.195.52, 151.101.3.52, 151.101.67.52
Response IP 199.232.47.52
Found Yes
Hash 00379e9d24ea85fa35017b63f868038267d4cb9433d8d51b3cbd2bee0763895b
SimHash 087556924783

Groups

woorank

Rule Path
Disallow

facebookexternalhit

Rule Path
Disallow

googlebot

Rule Path
Disallow

semrushbot

Rule Path
Disallow

twitterbot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

teoma

Rule Path
Disallow

gigabot

Rule Path
Disallow

robozilla

Rule Path
Disallow

nutch

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

baiduspider

Rule Path
Disallow

naverbot

Rule Path
Disallow

yeti

Rule Path
Disallow

yahoo-mmcrawler

Rule Path
Disallow

psbot

Rule Path
Disallow

bingbot/2.0

Rule Path
Disallow

yandexbot/3.0

Rule Path
Disallow

yahoo-blogs/v3.9

Rule Path
Disallow

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.gaultmillau.com/sitemap/WWW/index.xml