zephr.newscientist.com
robots.txt

Robots Exclusion Standard data for zephr.newscientist.com

Resource Scan

Scan Details

Site Domain zephr.newscientist.com
Base Domain newscientist.com
Scan Status Ok
Last Scan2024-05-21T09:52:48+00:00
Next Scan 2024-05-28T09:52:48+00:00

Last Scan

Scanned2024-05-21T09:52:48+00:00
URL https://zephr.newscientist.com/robots.txt
Domain IPs 13.33.30.108, 13.33.30.124, 13.33.30.33, 13.33.30.96
Response IP 13.33.30.33
Found Yes
Hash 9091af20d74f253d9a3bb92f5b7d894e484af95fdff4e8398f14f4f085f80d04
SimHash 6e29ec2a86ab

Groups

piplbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /21632812681/
Disallow /activate-subscription/
Disallow /cgi-bin/
Disallow /commenting/
Disallow /data/design/
Disallow /error/
Disallow /feed/
Disallow /icons/
Disallow /login/
Disallow /logout/
Disallow /lost-password/
Disallow /my-account/
Disallow /registration/
Disallow /search/
Disallow /upfront/
Disallow /wp-admin/

*

Rule Path
Disallow /21632812681/
Disallow /activate-subscription/
Disallow /cgi-bin/
Disallow /commenting/
Disallow /data/design/
Disallow /error/
Disallow /feed/
Disallow /icons/
Disallow /login/
Disallow /logout/
Disallow /lost-password/
Disallow /my-account/
Disallow /registration/
Disallow /search/
Disallow /upfront/
Disallow /wp-admin/
Disallow /api/
Disallow /build/

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.newscientist.com/sitemap.xml
sitemap https://www.newscientist.com/nsj/sitemapindex.xml