phillymag.com
robots.txt
Robots Exclusion Standard data for phillymag.com
Resource Scan
Scan Details
Site Domain | phillymag.com |
Base Domain | phillymag.com |
Scan Status | Ok |
Last Scan | 2024-05-22T02:29:25+00:00 |
Next Scan | 2024-05-29T02:29:25+00:00 |
Last Scan
Scanned | 2024-05-22T02:29:25+00:00 |
URL | https://phillymag.com/robots.txt |
Redirect | https://www.phillymag.com/robots.txt |
Redirect Domain | www.phillymag.com |
Redirect Base | phillymag.com |
Domain IPs | 71.19.234.34 |
Redirect IPs | 13.33.30.108, 13.33.30.124, 13.33.30.33, 13.33.30.96 |
Response IP | 13.33.30.108 |
Found | Yes |
Hash | f394285b660cb832e1983e5ced32d5227edefd8310f3cd9944f59e993bd2970a |
SimHash | cca6c90567b1 |
Groups
*
Rule | Path |
---|---|
Disallow | /search/ |
Disallow | /dentists/?geodir_search=* |
Disallow | /find-a-doctor/?geodir_search=* |
Disallow | /property-listings/?geodir_search=* |
Disallow | /senior-living/?geodir_search=* |
Disallow | /restaurant-finder/?geodir_search=* |
Disallow | /weddings/?geodir_search=* |
Disallow | /blaize/ |
Disallow | /restaurant-finder/ |
Disallow | /find-a-doctor/search/ |
Disallow | /dentists/search/ |
Disallow | /real-estate-agents/search/ |
Disallow | /weddings/search/ |
Disallow | /senior-living/search/ |
Disallow | /home-design/search/ |
Disallow | /private-schools/search/ |
*
Rule | Path |
---|---|
Disallow | /scrapertrap/ |
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Other Records
Field | Value |
---|---|
sitemap | https://www.phillymag.com/sitemap_index.xml |