toledoblade.com
robots.txt

Robots Exclusion Standard data for toledoblade.com

Resource Scan

Scan Details

Site Domain toledoblade.com
Base Domain toledoblade.com
Scan Status Ok
Last Scan2024-11-14T03:44:35+00:00
Next Scan 2024-11-21T03:44:35+00:00

Last Scan

Scanned2024-11-14T03:44:35+00:00
URL https://toledoblade.com/robots.txt
Redirect https://www.toledoblade.com/robots.txt
Redirect Domain www.toledoblade.com
Redirect Base toledoblade.com
Domain IPs 137.135.71.87
Redirect IPs 137.135.71.87
Response IP 137.135.71.87
Found Yes
Hash c04fab573484d45d9beee4e7bf1bbcf4f6a9514a3519cf2019a089ea2fbacc75
SimHash 12158bd125c3

Groups

msiecrawler

Rule Path
Disallow /

*

Rule Path
Disallow /ajaxquery/
Disallow /Our-Town-Business/
Disallow /Our-Town-Going-Out/
Disallow /Our-Town-Guest-Columns/
Disallow /Our-Town-Home/
Disallow /Our-Town-News/
Disallow /Our-Town-Police/
Disallow /Our-Town-Person-of-the-Week/
Disallow /Our-Town-Schools/
Disallow /Our-Town-Sports/
Disallow /Our-Town-User-Photos/
Disallow /Our-Town-User-Stories/
Disallow /Classifieds/
Disallow /pf3admin
Disallow /ugchandler/

facebookexternalhit/1.1

Rule Path
Disallow

Comments

  • Robots.txt
  • Be nice.

Warnings

  • `https` is not a known field.