cdn10.bostonmagazine.com
robots.txt
Robots Exclusion Standard data for cdn10.bostonmagazine.com
Resource Scan
Scan Details
Site Domain | cdn10.bostonmagazine.com |
Base Domain | bostonmagazine.com |
Scan Status | Ok |
Last Scan | 2024-09-23T12:10:01+00:00 |
Next Scan | 2024-10-23T12:10:01+00:00 |
Last Scan
Scanned | 2024-09-23T12:10:01+00:00 |
URL | https://cdn10.bostonmagazine.com/robots.txt |
Domain IPs | 138.199.46.68, 2400:52e0:1500::868:1 |
Response IP | 138.199.46.68 |
Found | Yes |
Hash | c2b2fbce2d0576a20d4d15e05e12397520ea4f20cc9e2163c646c1b7f56cf2e7 |
SimHash | cc87c90167a3 |
Groups
*
Rule | Path |
---|---|
Disallow | /search/ |
Disallow | /dentists/?geodir_search=* |
Disallow | /find-a-doctor/?geodir_search=* |
Disallow | /private-schools/?geodir_search=* |
Disallow | /wedding/?geodir_search=* |
Disallow | /blaize/ |
Disallow | /find-a-doctor/search/ |
Disallow | /dentists/search/ |
Disallow | /real-estate-agents/search/ |
Disallow | /weddings/search/ |
Disallow | /senior-living/search/ |
Disallow | /home-design/search/ |
Disallow | /lawyers/search/ |
Disallow | /private-schools/search/ |
*
Rule | Path |
---|---|
Disallow | /scrapertrap/ |
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Other Records
Field | Value |
---|---|
sitemap | https://www.bostonmagazine.com/sitemap_index.xml |