cda-hd.be
robots.txt

Robots Exclusion Standard data for cda-hd.be

Resource Scan

Scan Details

Site Domain cda-hd.be
Base Domain cda-hd.be
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-10-09T09:52:04+00:00
Next Scan 2024-10-16T09:52:04+00:00

Last Successful Scan

Scanned 2024-04-08T11:45:08+00:00
URL https://cda-hd.be/robots.txt
Redirect https://www.cda-hd.be/robots.txt
Redirect Domain www.cda-hd.be
Redirect Base cda-hd.be
Domain IPs 104.21.44.102, 172.67.198.183, 2606:4700:3031::6815:2c66, 2606:4700:3035::ac43:c6b7
Redirect IPs 104.21.44.102, 172.67.198.183, 2606:4700:3031::6815:2c66, 2606:4700:3035::ac43:c6b7
Response IP 104.21.44.102
Found Yes
Hash 360e8a5d1916f5a3562917abc92c7cf0c2adc2bc0da9ac54b0d8d5eb91925d8e
SimHash 427cd0f0ee01

Groups

amazonbot

Rule Path
Disallow /search?*

dotbot

Rule Path
Disallow /search?*

Other Records

Field Value
crawl-delay 10

blexbot

Rule Path
Disallow /search?*

Other Records

Field Value
crawl-delay 10

seekportbot

Rule Path
Disallow /search?*

Other Records

Field Value
crawl-delay 10

ahrefsbot

Rule Path
Disallow /search?*

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /search?*

Other Records

Field Value
crawl-delay 10

semrushbot

Rule Path
Disallow /search?*

Other Records

Field Value
crawl-delay 10

barkrowler

Rule Path
Disallow /search?*

Other Records

Field Value
crawl-delay 10

gptbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /
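
The groups above can be checked against sample paths with a small matcher. The following is a minimal sketch, not the scanner's own logic: it assumes the common robots.txt convention that `*` matches any run of characters and a trailing `$` anchors the end of the path, and it only models the Disallow rules and crawl-delay values listed in this scan. The names `RULES`, `CRAWL_DELAY`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers introduced for illustration.

```python
import re

# User agents taken from the Groups section of the last successful scan.
SEARCH_ONLY = ["amazonbot", "dotbot", "blexbot", "seekportbot",
               "ahrefsbot", "mj12bot", "semrushbot", "barkrowler"]
FULL_BLOCK = ["gptbot", "exabot", "slurp", "cliqzbot", "dataforseobot"]

# Disallow patterns per user agent, as listed in the scan.
RULES = {agent: ["/search?*"] for agent in SEARCH_ONLY}
RULES.update({agent: ["/"] for agent in FULL_BLOCK})

# crawl-delay 10 is listed for every search-only group except amazonbot.
CRAWL_DELAY = {agent: 10 for agent in SEARCH_ONLY if agent != "amazonbot"}


def pattern_to_regex(pattern):
    """Translate a path pattern into a regex (assumed wildcard semantics)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(agent, path):
    """Return True if no listed Disallow pattern matches the path prefix."""
    patterns = RULES.get(agent.lower(), [])
    return not any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for agent, path in [("ahrefsbot", "/search?q=film"),
                        ("ahrefsbot", "/search"),
                        ("gptbot", "/index.html")]:
        print(agent, path, is_allowed(agent, path))
```

Because the scan lists no Allow rules and no `*` user-agent group, the sketch skips longest-match precedence and treats any agent not listed above as unrestricted.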