phillymag.com
robots.txt

Robots Exclusion Standard data for phillymag.com

Resource Scan

Scan Details

Site Domain phillymag.com
Base Domain phillymag.com
Scan Status Ok
Last Scan 2024-05-22T02:29:25+00:00
Next Scan 2024-05-29T02:29:25+00:00

Last Scan

Scanned 2024-05-22T02:29:25+00:00
URL https://phillymag.com/robots.txt
Redirect https://www.phillymag.com/robots.txt
Redirect Domain www.phillymag.com
Redirect Base phillymag.com
Domain IPs 71.19.234.34
Redirect IPs 13.33.30.108, 13.33.30.124, 13.33.30.33, 13.33.30.96
Response IP 13.33.30.108
Found Yes
Hash f394285b660cb832e1983e5ced32d5227edefd8310f3cd9944f59e993bd2970a
SimHash cca6c90567b1

Groups

*

Rule Path
Disallow /search/
Disallow /dentists/?geodir_search=*
Disallow /find-a-doctor/?geodir_search=*
Disallow /property-listings/?geodir_search=*
Disallow /senior-living/?geodir_search=*
Disallow /restaurant-finder/?geodir_search=*
Disallow /weddings/?geodir_search=*
Disallow /blaize/
Disallow /restaurant-finder/
Disallow /find-a-doctor/search/
Disallow /dentists/search/
Disallow /real-estate-agents/search/
Disallow /weddings/search/
Disallow /senior-living/search/
Disallow /home-design/search/
Disallow /private-schools/search/

*

Rule Path
Disallow /scrapertrap/

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.phillymag.com/sitemap_index.xml
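
The records above can be checked programmatically. The following is a minimal sketch, assuming Python's standard-library urllib.robotparser. The ROBOTS_TXT string is a hypothetical reconstruction built from a few of the rules listed in this report, not the exact file that was scanned, and "ExampleBot" is a made-up user agent. Only literal path prefixes are included, because this parser does prefix matching and does not interpret the trailing * in the geodir_search rules. It also keeps only the first * group it encounters, so the later * groups are left out of the sketch.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of part of the scanned file (assumption:
# the real robots.txt may differ in order, spacing, and comments).
ROBOTS_TXT = """\
User-agent: *
Disallow: /search/
Disallow: /blaize/
Disallow: /restaurant-finder/

User-agent: bingbot
Crawl-delay: 10

Sitemap: https://www.phillymag.com/sitemap_index.xml
"""

BASE = "https://www.phillymag.com"


def build_parser(text):
    """Parse robots.txt text from memory instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a made-up agent; it falls through to the * group.
search_allowed = parser.can_fetch("ExampleBot", BASE + "/search/?q=cheesesteak")
news_allowed = parser.can_fetch("ExampleBot", BASE + "/news/")

# bingbot has its own group with no Disallow rules, so this parser
# treats every path as allowed for it and reports the crawl delay.
bing_search_allowed = parser.can_fetch("bingbot", BASE + "/search/")
bing_delay = parser.crawl_delay("bingbot")

# site_maps() requires Python 3.8 or newer.
sitemaps = parser.site_maps()

print(search_allowed, news_allowed, bing_search_allowed, bing_delay, sitemaps)
```

Other crawlers may resolve duplicate * groups and wildcard paths differently from this parser, so the results are illustrative of the rule semantics rather than a statement of how any particular crawler treats the site.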