cfala.org
robots.txt

Robots Exclusion Standard data for cfala.org

Resource Scan

Scan Details

Site Domain cfala.org
Base Domain cfala.org
Scan Status Ok
Last Scan2025-12-11T01:17:47+00:00
Next Scan 2026-01-10T01:17:47+00:00

Last Scan

Scanned2025-12-11T01:17:47+00:00
URL https://cfala.org/robots.txt
Domain IPs 203.23.244.112
Response IP 203.23.244.112
Found Yes
Hash 1321e09d5f1adc65b3264868c9e19a940b4acca76171f96f0596738c1d28ce4e
SimHash dadcc00adb1c

Groups

teleport

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

mercator-2.0

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

polybot

Rule Path
Disallow /

pjspider

Rule Path
Disallow /

wfarc

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

guidebot/5.3

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /i4a/memberDirectory/

proximic

Rule Path
Disallow /i4a/ams/staff/

blexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

orbbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /i4a/click

*

Rule Path
Disallow /i4a/manage-preferences

*

Rule Path
Disallow /_ai/

*

Rule Path
Disallow /i4a/utilities/

Comments

  • go away