cawebdir.com
robots.txt

Robots Exclusion Standard data for cawebdir.com

Resource Scan

Scan Details

Site Domain cawebdir.com
Base Domain cawebdir.com
Scan Status Ok
Last Scan2025-06-28T22:12:16+00:00
Next Scan 2025-07-05T22:12:16+00:00

Last Scan

Scanned2025-06-28T22:12:16+00:00
URL https://cawebdir.com/robots.txt
Domain IPs 104.21.31.18, 172.67.174.165, 2606:4700:3031::ac43:aea5, 2606:4700:3037::6815:1f12
Response IP 104.21.31.18
Found Yes
Hash 2f25d38f80da014c3d9a8639c7e9f546fee62bb193066cb8327c371419211ed0
SimHash a55a5bc2c4b5

Groups

*

Rule Path
Disallow

Other Records

Field Value
crawl-delay 8

baiduspider

Rule Path
Disallow *.asp

semrushbot

Rule Path
Disallow /

yandex spider

Rule Path
Disallow *.pdf

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

Comments

  • Allow all
  • Server-Abuser List