smithsonianmag.com
robots.txt

Robots Exclusion Standard data for smithsonianmag.com

Resource Scan

Scan Details

Site Domain smithsonianmag.com
Base Domain smithsonianmag.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-06T21:14:20+00:00
Next Scan 2024-11-05T21:14:20+00:00

Last Successful Scan

Scanned2024-07-09T21:10:07+00:00
URL https://smithsonianmag.com/robots.txt
Redirect https://www.smithsonianmag.com:443/robots.txt
Redirect Domain www.smithsonianmag.com
Redirect Base smithsonianmag.com
Domain IPs 44.206.34.104, 44.209.195.112, 52.206.86.201
Redirect IPs 104.22.6.9, 104.22.7.9, 172.67.5.56, 2606:4700:10::6816:609, 2606:4700:10::6816:709, 2606:4700:10::ac43:538
Response IP 104.22.7.9
Found Yes
Hash 98a263584e94bcf34903680cb41f143dac4aa9cbcdd5130b41432a6537e6fb3d
SimHash 1917c0566d36

Groups

*

Rule Path
Disallow /accounts/*
Disallow /museumdays/accounts/
Disallow /accounts/login/
Disallow /search?
Disallow /rss?
Disallow /article/preview/*/
Disallow /blogs/blogpost/preview/*
Disallow /dashboard/
Disallow /acoustic/

Other Records

Field Value
sitemap https://www.smithsonianmag.com/sitemap.xml