Robots Exclusion Standard data for www2.philly.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | www2.philly.com |
Base Domain | philly.com |
Scan Status | Ok |
Last Scan | 2024-11-13T23:10:02+00:00 |
Next Scan | 2024-11-20T23:10:02+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-13T23:10:02+00:00 |
URL | https://www2.philly.com/robots.txt |
Redirect | https://www.inquirer.com/robots.txt |
Redirect Domain | www.inquirer.com |
Redirect Base | inquirer.com |
Domain IPs | 173.222.148.48, 23.49.60.43, 2600:1413:b000:13::b857:c18d, 2600:1413:b000:13::b857:c192 |
Redirect IPs | 173.222.148.48, 23.49.60.43, 2600:1413:b000:13::b857:c18d, 2600:1413:b000:13::b857:c192 |
Response IP | 23.45.207.177 |
Found | Yes |
Hash | 341a8154d1ba4a97997dff36c7e9c5d567cb9441efe8e68c3138b7dd70542a63 |
SimHash | 89158a5ceacb |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /light/ |
Disallow | /search |
Disallow | /sports/betting/ |
Disallow | /wires/ |
Disallow | /zzz-systest/ |
Disallow | /zzz_systest/ |
Disallow | */zzz-systest/ |
Disallow | */zzz_systest/ |
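
To illustrate how the Disallow rules above might apply to a request path, here is a minimal Python sketch. It assumes the usual Robots Exclusion Protocol matching conventions: each rule is a prefix match against the URL path, `*` matches any run of characters, and a trailing `$` anchors the end. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sample paths are made up for illustration. Because this group has no Allow rules, the sketch skips longest-match precedence.

```python
import re

# Disallow paths copied from the "*" group in the table above.
DISALLOW = [
    "/light/",
    "/search",
    "/sports/betting/",
    "/wires/",
    "/zzz-systest/",
    "/zzz_systest/",
    "*/zzz-systest/",
    "*/zzz_systest/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a start-anchored regex."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape each literal piece, then rejoin with '.*' where '*' appeared.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + pattern + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any Disallow rule matches the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


# Hypothetical paths, chosen only to exercise the rules.
for path in ["/search?q=test", "/news/zzz-systest/page", "/lightning", "/sports/"]:
    print(path, is_disallowed(path))
```

Note that `/search` has no trailing slash, so under prefix matching it would also cover paths such as `/searching`, while `/light/` would not cover `/lightning`.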
Other Records
Field | Value |
---|---|
sitemap | https://www.inquirer.com/sitemaps/48hour-news-sitemap-partner.xml |
sitemap | https://www.inquirer.com/arc/outboundfeeds/sitemap-index-2/?outputType=xml |
sitemap | https://www.inquirer.com/arc/outboundfeeds/sitemap-restaurant-page/ |
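
As a rough sketch of how a crawler could read these sitemap records, the Python standard library's `urllib.robotparser` exposes them through `site_maps()` (Python 3.8 and later). The `ROBOTS_LINES` list below is a reconstruction from the table above, not the fetched file itself, so its bytes would not reproduce the Hash value reported by the scan.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Other Records" table; an assumption, not the
# original file contents.
ROBOTS_LINES = [
    "Sitemap: https://www.inquirer.com/sitemaps/48hour-news-sitemap-partner.xml",
    "Sitemap: https://www.inquirer.com/arc/outboundfeeds/sitemap-index-2/?outputType=xml",
    "Sitemap: https://www.inquirer.com/arc/outboundfeeds/sitemap-restaurant-page/",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# site_maps() returns None when no Sitemap records were found.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```

The standard-library parser does not interpret `*` wildcards inside paths, so it is used here only for the sitemap records and not for evaluating the Disallow rules.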