cdn10.bostonmagazine.com
robots.txt

Robots Exclusion Standard data for cdn10.bostonmagazine.com

Resource Scan

Scan Details

Site Domain cdn10.bostonmagazine.com
Base Domain bostonmagazine.com
Scan Status Ok
Last Scan2024-09-23T12:10:01+00:00
Next Scan 2024-10-23T12:10:01+00:00

Last Scan

Scanned2024-09-23T12:10:01+00:00
URL https://cdn10.bostonmagazine.com/robots.txt
Domain IPs 138.199.46.68, 2400:52e0:1500::868:1
Response IP 138.199.46.68
Found Yes
Hash c2b2fbce2d0576a20d4d15e05e12397520ea4f20cc9e2163c646c1b7f56cf2e7
SimHash cc87c90167a3

Groups

*

Rule Path
Disallow /search/
Disallow /dentists/?geodir_search=*
Disallow /find-a-doctor/?geodir_search=*
Disallow /private-schools/?geodir_search=*
Disallow /wedding/?geodir_search=*
Disallow /blaize/
Disallow /find-a-doctor/search/
Disallow /dentists/search/
Disallow /real-estate-agents/search/
Disallow /weddings/search/
Disallow /senior-living/search/
Disallow /home-design/search/
Disallow /lawyers/search/
Disallow /private-schools/search/

*

Rule Path
Disallow /scrapertrap/

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.bostonmagazine.com/sitemap_index.xml