newsinteractive.post-gazette.com
robots.txt

Robots Exclusion Standard data for newsinteractive.post-gazette.com

Resource Scan

Scan Details

Site Domain newsinteractive.post-gazette.com
Base Domain post-gazette.com
Scan Status Ok
Last Scan2025-11-24T22:40:23+00:00
Next Scan 2025-12-24T22:40:23+00:00

Last Scan

Scanned2025-11-24T22:40:23+00:00
URL https://newsinteractive.post-gazette.com/robots.txt
Domain IPs 159.135.18.103
Response IP 159.135.18.103
Found Yes
Hash 32067f0e37df8cb5fb5e82d4c7c8d8ffe9ab47e367427218c4c835942e475c44
SimHash 6a094a0049b3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /nowall/

Other Records

Field Value
sitemap https://newsinteractive.post-gazette.com/sitemap.xml