necu.org
robots.txt

Robots Exclusion Standard data for necu.org

Resource Scan

Scan Details

Site Domain necu.org
Base Domain necu.org
Scan Status Ok
Last Scan 2025-12-25T08:00:49+00:00
Next Scan 2026-01-24T08:00:49+00:00

Last Scan

Scanned 2025-12-25T08:00:49+00:00
URL https://necu.org/robots.txt
Domain IPs 2620:12a:8000::4, 2620:12a:8001::4, 67.227.159.183
Response IP 67.227.159.183
Found Yes
Hash 5388a93c4c4cd923ce885618e83532751e8e248a7535c4a987a60c5b56c32209
SimHash 25d25a82c783
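
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the response body, although the report does not name the algorithm. The following is a minimal sketch under that assumption; `body_fingerprint` and the sample body are hypothetical and are not the scanner's actual code or the file's real content.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body.

    Assumption: the report's Hash field is SHA-256. The report itself
    does not state which algorithm produced it.
    """
    return hashlib.sha256(body).hexdigest()


# Illustrative body only; this is not the content served by necu.org.
sample = b"User-agent: *\nDisallow: /\n"
digest = body_fingerprint(sample)
print(len(digest))  # a SHA-256 hex digest is always 64 characters
```

Comparing digests between scans is a cheap way to detect that the file changed at all, whereas a similarity hash such as the SimHash field is meant to indicate how much it changed.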

Groups

teoma

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

nutch

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

psbot

Rule Path
Disallow /

*

Rule Path
Disallow /
Disallow /cgi-bin/
Disallow /fonts/
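
The groups above can be checked with Python's standard `urllib.robotparser`. The sketch below assumes a robots.txt reconstructed from the listed groups, with only two of the named agents included for brevity. The layout of the real file may differ, and the agent name `SomeOtherBot` and the example paths are illustrative.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of part of the file from the groups listed
# above; the actual file served by necu.org is not shown in the report.
ROBOTS_TXT = """\
User-agent: teoma
Disallow: /

User-agent: baiduspider
Disallow: /

User-agent: *
Disallow: /
Disallow: /cgi-bin/
Disallow: /fonts/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# A named agent with its own group is matched against that group.
teoma_allowed = parser.can_fetch("teoma", "https://necu.org/index.html")

# Any agent without its own group falls back to the "*" group.
other_allowed = parser.can_fetch("SomeOtherBot", "https://necu.org/fonts/a.woff")

print(teoma_allowed, other_allowed)
```

Under standard prefix matching, `Disallow: /` in the `*` group already covers every path, so the `/cgi-bin/` and `/fonts/` rules do not block anything further.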

Other Records

Field Value
crawl-delay 20
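
A crawler that honors this value would wait about 20 seconds between requests. The report lists the directive under Other Records and does not say which group, if any, it belongs to. The sketch below therefore assumes, purely for illustration, that it sits inside the `*` group; the agent name `ExampleBot` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: crawl-delay is attached to the "*" group. The report does
# not confirm where the directive appears in the real file.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 20
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# crawl_delay() returns the delay in seconds for the matching group,
# or None when no delay applies to the given agent.
delay = parser.crawl_delay("ExampleBot")
print(delay)
```

A polite fetch loop would sleep for `delay` seconds between requests, for example with `time.sleep(delay)`, whenever the value is not `None`.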

Warnings

  • 1 invalid line.