newtimes.com
robots.txt
Robots Exclusion Standard data for newtimes.com
Resource Scan
Scan Details
Site Domain | newtimes.com |
Base Domain | newtimes.com |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-05-02T04:36:39+00:00 |
Next Scan | 2024-07-01T04:36:39+00:00 |
Last Successful Scan
Scanned | 2021-10-13T09:35:24+00:00 |
URL | http://newtimes.com/robots.txt |
Redirect | https://www.phoenixnewtimes.com/robots.txt |
Redirect Domain | www.phoenixnewtimes.com |
Redirect Base | phoenixnewtimes.com |
Found | Yes |
Hash | ca8cd3822e7d221cfe6c8944247b40d7d54ea18d8994cddd534bda419698036b |
SimHash | f45a1d40ddf7 |
Groups
*
Rule | Path |
---|---|
Disallow | /gyrobase/ |
Disallow | /phoenix/ArticleArchives |
Disallow | /phoenix/CommentArchives |
Disallow | /phoenix/EventSearch |
Disallow | /phoenix/ImageArchives |
Disallow | /phoenix/FilmSearch |
Disallow | /phoenix/LocationSearch |
Disallow | /phoenix/MemberSearch |
Disallow | /phoenix/MovieTimes |
Disallow | /phoenix/Search |
Disallow | /phoenix/SlideshowArchives |
Disallow | /phoenix/VideoArchives |
Other Records
Field | Value |
---|---|
sitemap | https://www.phoenixnewtimes.com/phoenix/Sitemap.xml |