www2.philly.com
robots.txt
Robots Exclusion Standard data for www2.philly.com
Resource Scan
Scan Details
Site Domain | www2.philly.com |
Base Domain | philly.com |
Scan Status | Ok |
Last Scan | 2024-09-18T22:06:06+00:00 |
Next Scan | 2024-09-25T22:06:06+00:00 |
Last Scan
Scanned | 2024-09-18T22:06:06+00:00 |
URL | https://www2.philly.com/robots.txt |
Redirect | https://www.inquirer.com/robots.txt |
Redirect Domain | www.inquirer.com |
Redirect Base | inquirer.com |
Domain IPs | 2600:1413:b000:13::b857:c18d, 2600:1413:b000:13::b857:c191, 72.247.127.208, 72.247.127.211 |
Redirect IPs | 173.222.148.48, 23.49.60.40, 2600:1413:b000:13::b857:c18d, 2600:1413:b000:13::b857:c191 |
Response IP | 23.52.171.146 |
Found | Yes |
Hash | 33a675ad6039b5993018dfbdd4b348b3a0315a0950c62b2733b4f2b26afd3790 |
SimHash | 89a5cf5c8b9b |
Groups
*
Rule | Path |
---|---|
Disallow | /light/ |
Disallow | /search |
Disallow | /wires/ |
Disallow | /zzz-systest/ |
Disallow | /zzz_systest/ |
Disallow | */zzz-systest/ |
Disallow | */zzz_systest/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.inquirer.com/sitemaps/48hour-news-sitemap-partner.xml |
sitemap | https://www.inquirer.com/arc/outboundfeeds/sitemap-index-2/?outputType=xml |