Robots Exclusion Standard data for cdn1.bostonmagazine.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | cdn1.bostonmagazine.com |
Base Domain | bostonmagazine.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-03-13T18:50:59+00:00 |
Next Scan | 2024-06-11T18:50:59+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2023-10-23T17:37:27+00:00 |
URL | http://cdn1.bostonmagazine.com/robots.txt |
Redirect | https://www.bostonmagazine.com/robots.txt |
Redirect Domain | www.bostonmagazine.com |
Redirect Base | bostonmagazine.com |
Domain IPs | 64.74.126.6, 64.74.126.7, 64.74.126.8, 64.74.126.9, 64.74.126.10, 64.74.126.11, 64.74.126.12, 64.74.126.13 |
Redirect IPs | 13.33.33.29, 13.33.33.35, 13.33.33.38, 13.33.33.74 |
Response IP | 13.33.33.35 |
Found | Yes |
Hash | 4ec2568496a86fc5d466a59582e67c328a7d62ccf69faf863a443e323492de3d |
SimHash | 6997dc416603 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /search/ |
Disallow | /dentists/?geodir_search=* |
Disallow | /find-a-doctor/?geodir_search=* |
Disallow | /private-schools/?geodir_search=* |
Disallow | /wedding/?geodir_search=* |
User-agent: *
Rule | Path |
---|---|
Disallow | /scrapertrap/ |
User-agent: *
No rules defined. All paths allowed.
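The Disallow rules listed in the groups above can be checked against a request path with a small matcher. The following is a minimal sketch, not the scanner's own logic: it assumes all three `*` groups apply to every crawler and can be merged into one list, and it treats `*` inside a path as a wildcard and a trailing `$` as an end anchor, as common crawlers do. The names `DISALLOW`, `rule_to_regex`, and `is_allowed` are hypothetical.

```python
import re

# Disallow paths copied from the group tables above.
# Assumption: the separate "*" groups are merged into one rule list.
DISALLOW = [
    "/search/",
    "/dentists/?geodir_search=*",
    "/find-a-doctor/?geodir_search=*",
    "/private-schools/?geodir_search=*",
    "/wedding/?geodir_search=*",
    "/scrapertrap/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str) -> bool:
    """Return True when no Disallow rule matches the path (plus query string)."""
    return not any(rule_to_regex(rule).match(path) for rule in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/search/pizza", "/dentists/", "/dentists/?geodir_search=1"]:
        print(candidate, is_allowed(candidate))
```

Because the record contains no Allow rules, the sketch skips the longest-match precedence that a full parser would need when Allow and Disallow rules overlap.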
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
sitemap | https://www.bostonmagazine.com/sitemap_index.xml |
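The crawl-delay and sitemap records can be read back with Python's standard `urllib.robotparser`. This is a sketch under one assumption: the scan does not say which group the `crawl-delay` line belonged to, so the reconstructed file below places it in a `*` group. The variable names are illustrative only.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment of the robots.txt file.
# Assumption: the crawl-delay record sits inside a "User-agent: *" group.
ROBOTS_LINES = [
    "User-agent: *",
    "Crawl-delay: 10",
    "Sitemap: https://www.bostonmagazine.com/sitemap_index.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Seconds a crawler is asked to wait between requests.
delay = parser.crawl_delay("*")

# Sitemap URLs declared in the file (site_maps() requires Python 3.8+).
sitemaps = parser.site_maps()

if __name__ == "__main__":
    print(delay)
    print(sitemaps)
```

A crawler honoring these records would pause `delay` seconds between successive requests to the host and could seed its URL queue from the listed sitemap index.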