pghcitypaper.com
robots.txt

Robots Exclusion Standard data for pghcitypaper.com

Resource Scan

Scan Details

Site Domain pghcitypaper.com
Base Domain pghcitypaper.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-10-24T20:41:06+00:00
Next Scan 2024-11-23T20:41:06+00:00

Last Successful Scan

Scanned 2024-09-25T20:40:37+00:00
URL https://pghcitypaper.com/robots.txt
Redirect https://www.pghcitypaper.com/robots.txt
Redirect Domain www.pghcitypaper.com
Redirect Base pghcitypaper.com
Domain IPs 104.21.17.75, 172.67.223.61, 2606:4700:3034::6815:114b, 2606:4700:3037::ac43:df3d
Redirect IPs 104.21.17.75, 172.67.223.61, 2606:4700:3034::6815:114b, 2606:4700:3037::ac43:df3d
Response IP 104.21.17.75
Found Yes
Hash 21464ce73215e58406cf19d9a932ffcf50bbc4f36f65235d87072d832f0d9677
SimHash d9649c054d17

Groups

*

Rule Path
Disallow /pittsburgh/ArticleArchives
Disallow /pittsburgh/CommentArchives
Disallow /pittsburgh/EventSearch
Disallow /pittsburgh/ImageArchives
Disallow /pittsburgh/FilmSearch
Disallow /pittsburgh/LocationSearch
Disallow /pittsburgh/MemberSearch
Disallow /pittsburgh/MovieTimes
Disallow /pittsburgh/Search
Disallow /pittsburgh/SlideshowArchives
Disallow /pittsburgh/VideoArchives

Other Records

Field Value
sitemap https://www.pghcitypaper.com/pittsburgh/Sitemap.xml
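
The group and sitemap record above can be checked with Python's standard urllib.robotparser module. This is a minimal sketch, not the scanner's own method: the ROBOTS_TXT text is reconstructed from the rules listed in this report (the real file's exact formatting may differ), and the "ExampleBot" agent name and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rules listed in this report; the actual file's
# layout, comments, and ordering are assumptions.
ROBOTS_TXT = """\
User-agent: *
Disallow: /pittsburgh/ArticleArchives
Disallow: /pittsburgh/CommentArchives
Disallow: /pittsburgh/EventSearch
Disallow: /pittsburgh/ImageArchives
Disallow: /pittsburgh/FilmSearch
Disallow: /pittsburgh/LocationSearch
Disallow: /pittsburgh/MemberSearch
Disallow: /pittsburgh/MovieTimes
Disallow: /pittsburgh/Search
Disallow: /pittsburgh/SlideshowArchives
Disallow: /pittsburgh/VideoArchives
Sitemap: https://www.pghcitypaper.com/pittsburgh/Sitemap.xml
"""

BASE_URL = "https://www.pghcitypaper.com"


def build_parser(text):
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser, path, agent="ExampleBot"):
    """Return True if `agent` (a hypothetical crawler name) may fetch `path`."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    # Hypothetical paths chosen only to exercise the rules.
    for path in ("/pittsburgh/Search?q=jazz", "/pittsburgh/news"):
        print(path, is_allowed(parser, path))
    print(parser.site_maps())
```

Disallow rules match by path prefix and are case-sensitive, so /pittsburgh/Search also covers any URL that begins with that string. RobotFileParser.site_maps() requires Python 3.8 or later.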