theatlantic.com
robots.txt
Robots Exclusion Standard data for theatlantic.com
Resource Scan
Scan Details
Site Domain | theatlantic.com |
Base Domain | theatlantic.com |
Scan Status | Ok |
Last Scan | 2024-05-06T19:46:57+00:00 |
Next Scan | 2024-05-13T19:46:57+00:00 |
Last Scan
Scanned | 2024-05-06T19:46:57+00:00 |
URL | https://theatlantic.com/robots.txt |
Redirect | https://www.theatlantic.com/robots.txt |
Redirect Domain | www.theatlantic.com |
Redirect Base | theatlantic.com |
Domain IPs | 151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133 |
Redirect IPs | 199.232.194.133, 199.232.198.133 |
Response IP | 151.101.198.133 |
Found | Yes |
Hash | 046ecdf1f682e406ba73d3badee61761b68ded779fbbd5a3e638d6c797a3726e |
SimHash | 7108f953e411 |
Groups
*
Rule | Path |
---|---|
Disallow | /4624/TheAtlanticOnline/* |
Disallow | /magazine/archive/2010/11/letters-to-the-editor/308258/ |
Disallow | /magazine/archive/2010/11/letters-to-the-editor/308258/* |
Disallow | /ab/* |
Disallow | /video/embed/ |
Disallow | /video/iframe/* |
Disallow | /search/?*q=* |
Allow | /magazine/archive/2001/02/bill-clinton-and-his-consequences/303383/$ |
Disallow | /magazine/archive/2001/02/bill-clinton-and-his-consequences/303383/* |
Allow | / |
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
Other Records
Field | Value |
---|---|
sitemap | https://www.theatlantic.com/sitemap.xml |
sitemap | https://www.theatlantic.com/sponsored/sitemap.xml |