posting.pghcitypaper.com
robots.txt
Robots Exclusion Standard data for posting.pghcitypaper.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | posting.pghcitypaper.com |
Base Domain | pghcitypaper.com |
Scan Status | Ok |
Last Scan | 2024-05-26T21:52:39+00:00 |
Next Scan | 2024-06-25T21:52:39+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-26T21:52:39+00:00 |
URL | https://posting.pghcitypaper.com/robots.txt |
Domain IPs | 209.104.5.141 |
Response IP | 209.104.5.141 |
Found | Yes |
Hash | 39f5df3cf71f3f9056110bf892b77ddd034e128432d81090f6eb66ee088fabc2 |
SimHash | 896c16042e13 |
Groups
*
Rule | Path |
---|---|
Disallow | /gyrobase/ArticleArchives |
Disallow | /gyrobase/EventSearch |
Disallow | /gyrobase/FilmSearch |
Disallow | /gyrobase/LocationSearch |
Disallow | /gyrobase/MovieTimes |
Disallow | /gyrobase/Search |
Disallow | /pittsburgh/ArticleArchives |
Disallow | /pittsburgh/EventSearch |
Disallow | /pittsburgh/FilmSearch |
Disallow | /pittsburgh/LocationSearch |
Disallow | /pittsburgh/MovieTimes |
Disallow | /pittsburgh/Search |
Disallow | /insertions/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.pittsburghcitypaper.ws/pittsburgh/Sitemap.xml |
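The `*` group and the sitemap record listed above can be checked against sample URLs with Python's standard `urllib.robotparser`. This is a minimal sketch, not the scanner's own method: the robots.txt text is reconstructed from the tables in this report (the original file's exact line order and whitespace are assumptions), and the `ExampleBot` user agent and the sample paths under `/pittsburgh/news/` and `/insertions/ad.html` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records tables in this report.
# The original file's exact layout is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /gyrobase/ArticleArchives
Disallow: /gyrobase/EventSearch
Disallow: /gyrobase/FilmSearch
Disallow: /gyrobase/LocationSearch
Disallow: /gyrobase/MovieTimes
Disallow: /gyrobase/Search
Disallow: /pittsburgh/ArticleArchives
Disallow: /pittsburgh/EventSearch
Disallow: /pittsburgh/FilmSearch
Disallow: /pittsburgh/LocationSearch
Disallow: /pittsburgh/MovieTimes
Disallow: /pittsburgh/Search
Disallow: /insertions/
Sitemap: https://www.pittsburghcitypaper.ws/pittsburgh/Sitemap.xml
"""

BASE = "https://posting.pghcitypaper.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text directly, without fetching it over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical crawler name; it falls under the "*" group.
blocked = parser.can_fetch("ExampleBot", BASE + "/pittsburgh/Search")
allowed = parser.can_fetch("ExampleBot", BASE + "/pittsburgh/news/")  # hypothetical path
ad_blocked = parser.can_fetch("ExampleBot", BASE + "/insertions/ad.html")  # hypothetical path

# site_maps() requires Python 3.8 or later.
sitemaps = parser.site_maps()

if __name__ == "__main__":
    print("search page fetchable:", blocked)
    print("news page fetchable:", allowed)
    print("insertions page fetchable:", ad_blocked)
    print("sitemaps:", sitemaps)
```

`urllib.robotparser` matches rules by simple path prefix, so a rule such as `/pittsburgh/Search` also covers any longer path that starts with that string, while `/insertions/` only covers paths below that directory.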