cada1.org
robots.txt

Robots Exclusion Standard data for cada1.org

Resource Scan

Scan Details

Site Domain cada1.org
Base Domain cada1.org
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2025-12-31T18:03:35+00:00
Next Scan 2026-01-07T18:03:35+00:00

Last Successful Scan

Scanned2025-11-29T21:13:50+00:00
URL http://cada1.org/robots.txt
Redirect https://secure.cada1.org/robots.txt
Redirect Domain secure.cada1.org
Redirect Base cada1.org
Domain IPs 203.23.244.53
Redirect IPs 203.23.244.53
Response IP 203.23.244.53
Found Yes
Hash 5a289c4e6ec657d6f97e53c160f02516526ffd793220a81f26cc337801b0bf65
SimHash d854d00adb14

Groups

teleport

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

mercator-2.0

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

polybot

Rule Path
Disallow /

pjspider

Rule Path
Disallow /

wfarc

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

guidebot/5.3

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

orbbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /i4a/click

*

Rule Path
Disallow /i4a/manage-preferences

*

Rule Path
Disallow /_ai/

*

Rule Path
Disallow /i4a/utilities/

Comments

  • go away