post-gazette.com
robots.txt

Robots Exclusion Standard data for post-gazette.com

Resource Scan

Scan Details

Site Domain post-gazette.com
Base Domain post-gazette.com
Scan Status Ok
Last Scan2025-11-17T07:54:52+00:00
Next Scan 2025-11-24T07:54:52+00:00

Last Scan

Scanned2025-11-17T07:54:52+00:00
URL https://post-gazette.com/robots.txt
Domain IPs 72.240.104.198
Response IP 72.240.104.198
Found Yes
Hash b114d6505500dc83dd08efa5bcffcfefe324a0725d10725cd217254f9ee9dac9
SimHash 280c4c2489b3

Groups

*

Rule Path
Disallow /admin
Disallow /*.print
Disallow /zillowarticles

Other Records

Field Value
sitemap https://www.post-gazette.com/sitemaps/ssl/news-sitemap.xml
sitemap https://www.post-gazette.com/sitemaps/ssl/sitemap.xml