newrepublic.com
robots.txt
Robots Exclusion Standard data for newrepublic.com
Resource Scan
Scan Details
Site Domain | newrepublic.com |
Base Domain | newrepublic.com |
Scan Status | Ok |
Last Scan | 2024-11-06T19:32:49+00:00 |
Next Scan | 2024-11-13T19:32:49+00:00 |
Last Scan
Scanned | 2024-11-06T19:32:49+00:00 |
URL | https://newrepublic.com/robots.txt |
Domain IPs | 199.232.192.233, 199.232.196.233 |
Response IP | 199.232.192.233 |
Found | Yes |
Hash | 081b6f3100a9661c732f47a77c6235147999f433ebd4e5fbe32db02b7499e0c5 |
SimHash | 280c1a60f531 |
Groups
*
Rule | Path |
---|---|
Disallow | /search?* |
Disallow | /maz/* |
Disallow | /*?*id= |
Disallow | /socket/* |
Other Records
Field | Value |
---|---|
sitemap | https://newrepublic.com/sitemap/index.xml |
sitemap | https://newrepublic.com/sitemap/issues.xml |
sitemap | https://newrepublic.com/sitemap/verticals.xml |
sitemap | https://newrepublic.com/sitemap/tag-index.xml |
sitemap | https://newrepublic.com/sitemap/pages.xml |
sitemap | https://newrepublic.com/sitemap/authors.xml |
sitemap | https://newrepublic.com/sitemap/news.xml |