cisca.org
robots.txt

Robots Exclusion Standard data for cisca.org

Resource Scan

Scan Details

Site Domain cisca.org
Base Domain cisca.org
Scan Status Ok
Last Scan2025-05-23T04:32:40+00:00
Next Scan 2025-06-22T04:32:40+00:00

Last Scan

Scanned2025-05-23T04:32:40+00:00
URL https://cisca.org/robots.txt
Domain IPs 203.23.244.64
Response IP 203.23.244.64
Found Yes
Hash fa8acdbf6fc55838d34414e2a11bf2e4fcab1064b37868edd9622d866f8d9f49
SimHash d8dcc442db14

Groups

teleport

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

mercator-2.0

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

polybot

Rule Path
Disallow /

pjspider

Rule Path
Disallow /

wfarc

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

guidebot/5.3

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /i4a/memberDirectory/

proximic

Rule Path
Disallow /i4a/ams/staff/

blexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

orbbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /i4a/click

*

Rule Path
Disallow /i4a/manage-preferences

*

Rule Path
Disallow /i4a/etrack

Comments

  • go away