pghcitypaper.com
robots.txt
Robots Exclusion Standard data for pghcitypaper.com
Resource Scan
Scan Details
Site Domain | pghcitypaper.com |
Base Domain | pghcitypaper.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-10-24T20:41:06+00:00 |
Next Scan | 2024-11-23T20:41:06+00:00 |
Last Successful Scan
Scanned | 2024-09-25T20:40:37+00:00 |
URL | https://pghcitypaper.com/robots.txt |
Redirect | https://www.pghcitypaper.com/robots.txt |
Redirect Domain | www.pghcitypaper.com |
Redirect Base | pghcitypaper.com |
Domain IPs | 104.21.17.75, 172.67.223.61, 2606:4700:3034::6815:114b, 2606:4700:3037::ac43:df3d |
Redirect IPs | 104.21.17.75, 172.67.223.61, 2606:4700:3034::6815:114b, 2606:4700:3037::ac43:df3d |
Response IP | 104.21.17.75 |
Found | Yes |
Hash | 21464ce73215e58406cf19d9a932ffcf50bbc4f36f65235d87072d832f0d9677 |
SimHash | d9649c054d17 |
Groups
*
Rule | Path |
---|---|
Disallow | /pittsburgh/ArticleArchives |
Disallow | /pittsburgh/CommentArchives |
Disallow | /pittsburgh/EventSearch |
Disallow | /pittsburgh/ImageArchives |
Disallow | /pittsburgh/FilmSearch |
Disallow | /pittsburgh/LocationSearch |
Disallow | /pittsburgh/MemberSearch |
Disallow | /pittsburgh/MovieTimes |
Disallow | /pittsburgh/Search |
Disallow | /pittsburgh/SlideshowArchives |
Disallow | /pittsburgh/VideoArchives |
Other Records
Field | Value |
---|---|
sitemap | https://www.pghcitypaper.com/pittsburgh/Sitemap.xml |