Robots Exclusion Standard data for pghcitypaper.com

Resource Scan

Scan Details

Site Domain pghcitypaper.com
Base Domain pghcitypaper.com
Scan Status Ok
Last Scan 2024-05-15T01:07:16+00:00
Next Scan 2024-05-22T01:07:16+00:00

Last Scan

Scanned 2024-05-15T01:07:16+00:00
URL https://pghcitypaper.com/robots.txt
Redirect https://www.pghcitypaper.com/robots.txt
Redirect Domain www.pghcitypaper.com
Redirect Base pghcitypaper.com
Domain IPs 104.21.17.75, 172.67.223.61, 2606:4700:3034::6815:114b, 2606:4700:3037::ac43:df3d
Redirect IPs 104.21.17.75, 172.67.223.61, 2606:4700:3034::6815:114b, 2606:4700:3037::ac43:df3d
Response IP 104.21.17.75
Found Yes
Hash 39f5df3cf71f3f9056110bf892b77ddd034e128432d81090f6eb66ee088fabc2
SimHash 896c16042e13

Groups

mediapartners-google

Rule Path
Disallow (empty)

*

Rule Path
Disallow /gyrobase/ArticleArchives
Disallow /gyrobase/EventSearch
Disallow /gyrobase/FilmSearch
Disallow /gyrobase/LocationSearch
Disallow /gyrobase/MovieTimes
Disallow /gyrobase/Search
Disallow /pittsburgh/ArticleArchives
Disallow /pittsburgh/EventSearch
Disallow /pittsburgh/FilmSearch
Disallow /pittsburgh/LocationSearch
Disallow /pittsburgh/MovieTimes
Disallow /pittsburgh/Search
Disallow /insertions/

Other Records

Field Value
sitemap https://www.pittsburghcitypaper.ws/pittsburgh/Sitemap.xml
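
Example

The groups and records above can be checked against sample URLs with Python's standard urllib.robotparser. This is a minimal sketch, not the scanner's own method: the robots.txt text below is reconstructed from the report rather than copied from the live file, and the crawler name ExampleBot and the sample URLs are hypothetical. The empty Disallow in the mediapartners-google group is treated as "allow everything", which is how this parser reads an empty value.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report above (assumption: the live file may
# differ in ordering, comments, or whitespace).
ROBOTS_TXT = """\
User-agent: mediapartners-google
Disallow:

User-agent: *
Disallow: /gyrobase/ArticleArchives
Disallow: /gyrobase/EventSearch
Disallow: /gyrobase/FilmSearch
Disallow: /gyrobase/LocationSearch
Disallow: /gyrobase/MovieTimes
Disallow: /gyrobase/Search
Disallow: /pittsburgh/ArticleArchives
Disallow: /pittsburgh/EventSearch
Disallow: /pittsburgh/FilmSearch
Disallow: /pittsburgh/LocationSearch
Disallow: /pittsburgh/MovieTimes
Disallow: /pittsburgh/Search
Disallow: /insertions/

Sitemap: https://www.pittsburghcitypaper.ws/pittsburgh/Sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content from a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "https://www.pghcitypaper.com"

# "ExampleBot" is a hypothetical crawler name; it falls under the "*" group.
blocked = parser.can_fetch("ExampleBot", base + "/pittsburgh/Search")
allowed = parser.can_fetch("ExampleBot", base + "/")

# The mediapartners-google group has an empty Disallow, so nothing is blocked.
adsense = parser.can_fetch("mediapartners-google", base + "/pittsburgh/Search")

# site_maps() requires Python 3.8 or later.
sitemaps = parser.site_maps()

print(blocked, allowed, adsense, sitemaps)
```

Matching is by path prefix, so a rule such as /pittsburgh/Search also covers any longer path that begins with the same characters.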