smithsonianmag.com
robots.txt
Robots Exclusion Standard data for smithsonianmag.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | smithsonianmag.com |
Base Domain | smithsonianmag.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-06T21:14:20+00:00 |
Next Scan | 2024-11-05T21:14:20+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-07-09T21:10:07+00:00 |
URL | https://smithsonianmag.com/robots.txt |
Redirect | https://www.smithsonianmag.com:443/robots.txt |
Redirect Domain | www.smithsonianmag.com |
Redirect Base | smithsonianmag.com |
Domain IPs | 44.206.34.104, 44.209.195.112, 52.206.86.201 |
Redirect IPs | 104.22.6.9, 104.22.7.9, 172.67.5.56, 2606:4700:10::6816:609, 2606:4700:10::6816:709, 2606:4700:10::ac43:538 |
Response IP | 104.22.7.9 |
Found | Yes |
Hash | 98a263584e94bcf34903680cb41f143dac4aa9cbcdd5130b41432a6537e6fb3d |
SimHash | 1917c0566d36 |
Groups
*
Rule | Path |
---|---|
Disallow | /accounts/* |
Disallow | /museumdays/accounts/ |
Disallow | /accounts/login/ |
Disallow | /search? |
Disallow | /rss? |
Disallow | /article/preview/*/ |
Disallow | /blogs/blogpost/preview/* |
Disallow | /dashboard/ |
Disallow | /acoustic/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.smithsonianmag.com/sitemap.xml |
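
The Disallow rules in the `*` group can be checked against a URL path with a small matcher. The following is a minimal sketch, not the scanner's own logic. It assumes the common wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end of the path, everything else is a prefix match). The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical, as are the example paths in the usage lines. Because the group lists no Allow rules, the sketch skips the longest-match tie-breaking that a full parser would need.

```python
import re

# Disallow patterns copied from the "*" group in the scan above.
DISALLOW = [
    "/accounts/*",
    "/museumdays/accounts/",
    "/accounts/login/",
    "/search?",
    "/rss?",
    "/article/preview/*/",
    "/blogs/blogpost/preview/*",
    "/dashboard/",
    "/acoustic/",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex (assumed semantics).

    '*' matches any run of characters; a trailing '$' anchors the end.
    All other characters are matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(rule.match(path) for rule in RULES)


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise the rules.
    for path in ["/search?q=fossils", "/search", "/accounts/profile", "/dashboard/"]:
        print(path, "blocked" if is_disallowed(path) else "allowed")
```

Note that `/search?` and `/rss?` only block URLs that carry a query string; the bare `/search` and `/rss` paths fall outside every rule in this group.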