stateimpact.npr.org
robots.txt

Robots Exclusion Standard data for stateimpact.npr.org

Resource Scan

Scan Details

Site Domain stateimpact.npr.org
Base Domain npr.org
Scan Status Ok
Last Scan 2024-09-18T21:46:34+00:00
Next Scan 2024-10-18T21:46:34+00:00

Last Scan

Scanned 2024-09-18T21:46:34+00:00
URL https://stateimpact.npr.org/robots.txt
Domain IPs 50.17.55.210
Response IP 50.17.55.210
Found Yes
Hash c24f2056ea7ac1e15c2fb8e09f8003157890f566dde3a3caa05c7017f0ab86fe
SimHash 427459028262

Groups

*

Rule Path
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes

Other Records

Field Value
crawl-delay 10

baiduspider
baiduspider-video
baiduspider-image
turnitinbot
yandex
youdaobot
moget
ichiro
naverbot
yeti
sogou spider
blexbot

Rule Path
Disallow /
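
To illustrate how the two groups above would be interpreted by a crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text is reconstructed from the groups listed in this report, so its exact layout is an assumption. Attaching `crawl-delay` to the `*` group is also an assumption, and `ExampleBot` is a hypothetical user agent used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report. The original file's exact
# layout is NOT known, and placing Crawl-delay in the "*" group is an assumption.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 10
Disallow: /wp-admin
Disallow: /wp-includes
Disallow: /wp-content/plugins
Disallow: /wp-content/cache
Disallow: /wp-content/themes

User-agent: baiduspider
User-agent: baiduspider-video
User-agent: baiduspider-image
User-agent: turnitinbot
User-agent: yandex
User-agent: youdaobot
User-agent: moget
User-agent: ichiro
User-agent: naverbot
User-agent: yeti
User-agent: sogou spider
User-agent: blexbot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network request is made)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "https://stateimpact.npr.org"

# "ExampleBot" is hypothetical; it matches no named group, so "*" applies.
print(parser.can_fetch("ExampleBot", base + "/wp-admin/"))  # blocked by the "*" group
print(parser.can_fetch("ExampleBot", base + "/"))           # not covered by any Disallow
print(parser.can_fetch("Baiduspider", base + "/"))          # blocked by "Disallow: /"
print(parser.crawl_delay("ExampleBot"))                     # delay taken from the "*" group
```

Note that `urllib.robotparser` matches user agents by case-insensitive substring, which may differ from how a particular crawler applies these rules.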

Warnings

  • 1 invalid line.