newstatesman.com
robots.txt

Robots Exclusion Standard data for newstatesman.com

Resource Scan

Scan Details

Site Domain newstatesman.com
Base Domain newstatesman.com
Scan Status Ok
Last Scan 2024-04-28T06:04:05+00:00
Next Scan 2024-05-05T06:04:05+00:00

Last Scan

Scanned 2024-04-28T06:04:05+00:00
URL https://newstatesman.com/robots.txt
Redirect https://www.newstatesman.com/robots.txt
Redirect Domain www.newstatesman.com
Redirect Base newstatesman.com
Domain IPs 23.185.0.2, 2620:12a:8000::2, 2620:12a:8001::2
Redirect IPs 23.185.0.2, 2620:12a:8000::2, 2620:12a:8001::2
Response IP 23.185.0.2
Found Yes
Hash b0f0cd73dc794921688f562be7cf5e8662c279f8b8fc877a0bc75459402c889e
SimHash 590dd160ceb3

Groups

*
ravencrawler
rogerbot
dotbot
semrushbot
semrushbot-sa
powermapper
swiftbot
twitterbot

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /
Disallow /magazine/
Disallow /?s=*
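
The rules in this group overlap (for example, Allow / and Disallow /magazine/ both match /magazine/2024/04), so the outcome depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309, where the most specific matching pattern wins and Allow wins a tie. The names RULES, pattern_to_regex, and is_allowed are illustrative and not part of the scan data, and the rule order is simply the order listed above. Individual crawlers may resolve conflicts differently.

```python
import re

# Rules copied from the group above, in the order the scan lists them.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/"),
    ("disallow", "/magazine/"),
    ("disallow", "/?s=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, is_allowed("/wp-admin/admin-ajax.php") is True because the Allow pattern is longer than Disallow /wp-admin/, while is_allowed("/?s=brexit") is False because /?s=* is longer than the catch-all Allow /.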

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /
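
To see which of the three groups applies to a given crawler, the listing above can be expressed as a lookup table. This is a simplified sketch: it assumes an exact, case-insensitive match on the crawler's product token with a fallback to the * group, and it does not merge multiple matching groups. GROUPS and rules_for are hypothetical names introduced for illustration.

```python
# Agent names and rules copied from the Groups section above.
SHARED_AGENTS = [
    "*", "ravencrawler", "rogerbot", "dotbot", "semrushbot",
    "semrushbot-sa", "powermapper", "swiftbot", "twitterbot",
]
SHARED_RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/"),
    ("disallow", "/magazine/"),
    ("disallow", "/?s=*"),
]
BLOCK_ALL = [("disallow", "/")]

# Every agent in the first group shares one rule list.
GROUPS = {agent: SHARED_RULES for agent in SHARED_AGENTS}
GROUPS["yandex"] = BLOCK_ALL
GROUPS["yandexbot"] = BLOCK_ALL


def rules_for(product_token):
    """Return the rule list for a crawler, falling back to the '*' group."""
    return GROUPS.get(product_token.lower(), GROUPS["*"])
```

With this table, rules_for("YandexBot") yields the single Disallow / rule, while an unlisted crawler such as a hypothetical "ExampleBot" falls back to the shared rules.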

Other Records

Field Value
sitemap https://www.newstatesman.com/sitemap.xml
sitemap https://www.newstatesman.com/news-sitemap.xml
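
The sitemap records can be read programmatically with Python's standard urllib.robotparser module (site_maps() requires Python 3.8 or later). The sketch below feeds the parser only the two sitemap lines reported above rather than the live file, so the lines list is an illustrative excerpt, not the full robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Illustrative excerpt: only the two sitemap records reported by the scan.
lines = [
    "Sitemap: https://www.newstatesman.com/sitemap.xml",
    "Sitemap: https://www.newstatesman.com/news-sitemap.xml",
]

parser = RobotFileParser()
parser.parse(lines)

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
for url in sitemaps or []:
    print(url)
```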