illinoistimes.com
robots.txt
Robots Exclusion Standard data for illinoistimes.com
Resource Scan
Scan Details
Site Domain | illinoistimes.com |
Base Domain | illinoistimes.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-10-31T04:57:47+00:00 |
Next Scan | 2024-11-30T04:57:47+00:00 |
Last Successful Scan
Scanned | 2024-10-02T04:56:47+00:00 |
URL | https://www.illinoistimes.com/robots.txt |
Domain IPs | 104.21.42.234, 172.67.211.60, 2606:4700:3030::ac43:d33c, 2606:4700:3032::6815:2aea |
Response IP | 104.21.42.234 |
Found | Yes |
Hash | 86c1d16bf6604374786d272c9cf221c504ff35d95b45fe77b1ef7d4d9f0fa495 |
SimHash | a65df8446db0 |
Groups
*
Rule | Path |
---|---|
Disallow | /springfield/ArticleArchives |
Disallow | /springfield/CommentArchives |
Disallow | /springfield/EventSearch |
Disallow | /springfield/ImageArchives |
Disallow | /springfield/FilmSearch |
Disallow | /springfield/LocationSearch |
Disallow | /springfield/MemberSearch |
Disallow | /springfield/MovieTimes |
Disallow | /springfield/Search |
Disallow | /springfield/SlideshowArchives |
Disallow | /springfield/VideoArchives |
Other Records
Field | Value |
---|---|
sitemap | https://www.illinoistimes.com/springfield/Sitemap.xml |