media.timeout.com
robots.txt

Robots Exclusion Standard data for media.timeout.com

Resource Scan

Scan Details

Site Domain media.timeout.com
Base Domain timeout.com
Scan Status Ok
Last Scan2024-05-22T11:11:47+00:00
Next Scan 2024-06-21T11:11:47+00:00

Last Scan

Scanned2024-05-22T11:11:47+00:00
URL https://media.timeout.com/robots.txt
Domain IPs 2600:9000:2003:5200:12:9a21:7900:93a1, 2600:9000:2003:5800:12:9a21:7900:93a1, 2600:9000:2003:7600:12:9a21:7900:93a1, 2600:9000:2003:7800:12:9a21:7900:93a1, 2600:9000:2003:a200:12:9a21:7900:93a1, 2600:9000:2003:a600:12:9a21:7900:93a1, 2600:9000:2003:b000:12:9a21:7900:93a1, 2600:9000:2003:f200:12:9a21:7900:93a1, 52.84.229.105, 52.84.229.128, 52.84.229.34, 52.84.229.62
Response IP 52.84.229.128
Found Yes
Hash bab3d6a81f2edffc02e0740acc70d7d37b86e0ce871ebce9512c907393b61044
SimHash 130654134292

Groups

fasterfox

Rule Path
Disallow /

nutch

Rule Path
Disallow /

spock

Rule Path
Disallow /

omniexplorer_bot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

becomebot

Rule Path
Disallow /

geniebot

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

*

Rule Path
Disallow /private/

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap http://www.timeout.fr/sitemap_france.xml.gz
sitemap http://www.timeout.fr/sitemap_paris.xml.gz

Comments

  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449

Warnings

  • 2 invalid lines.