ccas.net
robots.txt

Robots Exclusion Standard data for ccas.net

Resource Scan

Scan Details

Site Domain ccas.net
Base Domain ccas.net
Scan Status Ok
Last Scan 2025-05-20T23:23:50+00:00
Next Scan 2025-06-19T23:23:50+00:00

Last Scan

Scanned 2025-05-20T23:23:50+00:00
URL https://ccas.net/robots.txt
Domain IPs 203.23.244.79
Response IP 203.23.244.79
Found Yes
Hash 1eb316b220b504076e8d86718d2e78b8796fbaa1a003478497cdda99ddf1ade4
SimHash daecc482db3c
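
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest. The scan does not say which algorithm it uses or exactly which bytes it hashes, so the following is only a sketch under that assumption. The function name `robots_digest` and the sample body are hypothetical and are not taken from the ccas.net file.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical body for illustration only; not the real ccas.net robots.txt.
sample_body = b"User-agent: *\nDisallow: /custom/\n"
digest = robots_digest(sample_body)

# A SHA-256 hex digest always has 64 characters, like the Hash field above.
print(len(digest), digest)
```

Comparing the digest of a freshly fetched file against a stored value is one simple way to detect that the file changed between scans. Any single-byte difference produces a different digest.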

Groups

teleport

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

mercator-2.0

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

polybot

Rule Path
Disallow /

pjspider

Rule Path
Disallow /

wfarc

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

guidebot/5.3

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /i4a/memberDirectory/

proximic

Rule Path
Disallow /i4a/ams/staff/

*

Rule Path
Disallow /custom/

piplbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

orbbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /i4a/click

*

Rule Path
Disallow /i4a/manage-preferences
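
To illustrate how groups like the ones listed above are applied, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt fragment is a hypothetical reconstruction of a few groups from the listing, not the real file. Attaching the crawl-delay record to the dotbot group is an assumption based on where it appears in the listing, and `examplebot` is a made-up agent name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of a few groups from the listing above.
# Assumption: the crawl-delay record belongs to the dotbot group.
ROBOTS_FRAGMENT = """\
User-agent: rogerbot
Disallow: /

User-agent: dotbot
Disallow: /
Crawl-delay: 10

User-agent: proximic
Disallow: /i4a/ams/staff/

User-agent: *
Disallow: /custom/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_FRAGMENT)

# (agent, URL) pairs to check; "examplebot" is a made-up agent name
# that only matches the wildcard group.
checks = [
    ("rogerbot", "https://ccas.net/"),
    ("proximic", "https://ccas.net/i4a/ams/staff/list"),
    ("proximic", "https://ccas.net/custom/page"),
    ("examplebot", "https://ccas.net/custom/page"),
    ("examplebot", "https://ccas.net/about"),
]
for agent, url in checks:
    print(agent, url, parser.can_fetch(agent, url))

# Crawl-delay (in seconds) for an agent, or None if its group sets none.
print(parser.crawl_delay("dotbot"), parser.crawl_delay("examplebot"))
```

An agent with its own group follows only that group, so in this fragment proximic is not bound by the wildcard rule for /custom/. The listing repeats some agents (petalbot, dotbot and `*`). Parsers differ in how they combine repeated groups, so the fragment deliberately avoids them.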

Comments

  • go away