newrepublic.com
robots.txt

Robots Exclusion Standard data for newrepublic.com

Resource Scan

Scan Details

Site Domain newrepublic.com
Base Domain newrepublic.com
Scan Status Ok
Last Scan2024-11-06T19:32:49+00:00
Next Scan 2024-11-13T19:32:49+00:00

Last Scan

Scanned2024-11-06T19:32:49+00:00
URL https://newrepublic.com/robots.txt
Domain IPs 199.232.192.233, 199.232.196.233
Response IP 199.232.192.233
Found Yes
Hash 081b6f3100a9661c732f47a77c6235147999f433ebd4e5fbe32db02b7499e0c5
SimHash 280c1a60f531

Groups

upday

Rule Path
Allow /

*

Rule Path
Disallow /search?*
Disallow /maz/*
Disallow /*?*id=
Disallow /socket/*

Other Records

Field Value
sitemap https://newrepublic.com/sitemap/index.xml
sitemap https://newrepublic.com/sitemap/issues.xml
sitemap https://newrepublic.com/sitemap/verticals.xml
sitemap https://newrepublic.com/sitemap/tag-index.xml
sitemap https://newrepublic.com/sitemap/pages.xml
sitemap https://newrepublic.com/sitemap/authors.xml
sitemap https://newrepublic.com/sitemap/news.xml