philly.com
robots.txt
Robots Exclusion Standard data for philly.com
Resource Scan
Scan Details
Site Domain | philly.com |
Base Domain | philly.com |
Scan Status | Ok |
Last Scan | 2025-03-31T05:26:06+00:00 |
Next Scan | 2025-04-07T05:26:06+00:00 |
Last Scan
Scanned | 2025-03-31T05:26:06+00:00 |
URL | https://philly.com/robots.txt |
Redirect | https://www.inquirer.com:443/robots.txt |
Redirect Domain | www.inquirer.com |
Redirect Base | inquirer.com |
Domain IPs | 35.71.148.62, 52.223.20.214 |
Redirect IPs | 23.54.155.68, 23.54.155.69, 2600:1413:b000:13::b857:c18d, 2600:1413:b000:13::b857:c192 |
Response IP | 23.52.171.146 |
Found | Yes |
Hash | 341a8154d1ba4a97997dff36c7e9c5d567cb9441efe8e68c3138b7dd70542a63 |
SimHash | 89158a5ceacb |
Groups
*
Rule | Path |
---|---|
Disallow | /light/ |
Disallow | /search |
Disallow | /sports/betting/ |
Disallow | /wires/ |
Disallow | /zzz-systest/ |
Disallow | /zzz_systest/ |
Disallow | */zzz-systest/ |
Disallow | */zzz_systest/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.inquirer.com/sitemaps/48hour-news-sitemap-partner.xml |
sitemap | https://www.inquirer.com/arc/outboundfeeds/sitemap-index-2/?outputType=xml |
sitemap | https://www.inquirer.com/arc/outboundfeeds/sitemap-restaurant-page/ |