cdn1.bostonmagazine.com
robots.txt

Robots Exclusion Standard data for cdn1.bostonmagazine.com

Resource Scan

Scan Details

Site Domain cdn1.bostonmagazine.com
Base Domain bostonmagazine.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-03-13T18:50:59+00:00
Next Scan 2024-06-11T18:50:59+00:00

Last Successful Scan

Scanned2023-10-23T17:37:27+00:00
URL http://cdn1.bostonmagazine.com/robots.txt
Redirect https://www.bostonmagazine.com/robots.txt
Redirect Domain www.bostonmagazine.com
Redirect Base bostonmagazine.com
Domain IPs 64.74.126.10, 64.74.126.11, 64.74.126.12, 64.74.126.13, 64.74.126.6, 64.74.126.7, 64.74.126.8, 64.74.126.9
Redirect IPs 13.33.33.29, 13.33.33.35, 13.33.33.38, 13.33.33.74
Response IP 13.33.33.35
Found Yes
Hash 4ec2568496a86fc5d466a59582e67c328a7d62ccf69faf863a443e323492de3d
SimHash 6997dc416603

Groups

*

Rule Path
Disallow /search/
Disallow /dentists/?geodir_search=*
Disallow /find-a-doctor/?geodir_search=*
Disallow /private-schools/?geodir_search=*
Disallow /wedding/?geodir_search=*

*

Rule Path
Disallow /scrapertrap/

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.bostonmagazine.com/sitemap_index.xml