cdict.net
robots.txt

Robots Exclusion Standard data for cdict.net

Resource Scan

Scan Details

Site Domain cdict.net
Base Domain cdict.net
Scan Status Ok
Last Scan 2024-11-02T01:02:34+00:00
Next Scan 2024-11-09T01:02:34+00:00

Last Scan

Scanned 2024-11-02T01:02:34+00:00
URL https://cdict.net/robots.txt
Domain IPs 104.21.10.52, 172.67.131.60, 2606:4700:3033::6815:a34, 2606:4700:3033::ac43:833c
Response IP 104.21.10.52
Found Yes
Hash 23f78ab66069846599da7c37f5e707ecad4efe090aa095d3b749e70d637328a2
SimHash 204dd490c493

Groups

mediapartners-google

Rule Path
Disallow

googlebot

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

alexabot

Rule Path
Disallow

slurp

Rule Path
Disallow

Other Records

Field Value
crawl-delay 30

msnbot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 30

scooter

Rule Path
Disallow

Other Records

Field Value
crawl-delay 30

baiduspider

Rule Path
Disallow

Other Records

Field Value
crawl-delay 30

sogou

Rule Path
Disallow

Other Records

Field Value
crawl-delay 30

ia_archiver

Rule Path
Disallow /
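
As a rough illustration of how the groups listed above behave, the sketch below feeds a small fragment to Python's standard urllib.robotparser. The fragment is reconstructed from this report, so its exact text and layout are assumptions rather than the real file. It covers the slurp group (empty Disallow, crawl-delay 30) and the ia_archiver group (Disallow /).

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of two groups from the report;
# the real robots.txt may differ in casing, ordering, and spacing.
ROBOTS_FRAGMENT = """\
User-agent: slurp
Disallow:
Crawl-delay: 30

User-agent: ia_archiver
Disallow: /
"""


def load_fragment(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = load_fragment(ROBOTS_FRAGMENT)

# An empty Disallow value blocks nothing, so slurp may fetch any path,
# but it is asked to wait 30 seconds between requests.
slurp_delay = parser.crawl_delay("slurp")
slurp_allowed = parser.can_fetch("slurp", "https://cdict.net/anything")

# "Disallow: /" blocks every path for ia_archiver.
archiver_allowed = parser.can_fetch("ia_archiver", "https://cdict.net/anything")
```

An empty Disallow path is easy to misread as "everything is disallowed"; under the Robots Exclusion Protocol it means the opposite, which is why the groups above with an empty path only add a crawl delay.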

*

Rule Path
Disallow /q/
Disallow /*.wav$
Disallow /*?

Other Records

Field Value
sitemap http://cdict.net/sitemap.xml
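
The three Disallow patterns in the * group use the wildcard characters * and $, which Python's urllib.robotparser treats as literal text. The following is a minimal sketch of a matcher for these patterns, assuming the common convention that * matches any run of characters and a trailing $ anchors the end of the URL. The function names and the hard-coded rule list are hypothetical, and real crawlers also weigh Allow rules and rule length, which this sketch ignores.

```python
import re

# Disallow patterns copied from the "*" group in the report.
DISALLOW_PATTERNS = ["/q/", "/*.wav$", "/*?"]


def rule_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate one robots.txt path pattern into a compiled regex.

    "*" becomes ".*"; a trailing "$" anchors the match at the end of the
    URL; every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(rule_to_regex(p).match(path_and_query) for p in DISALLOW_PATTERNS)
```

Under these assumptions, is_disallowed("/q/hello"), is_disallowed("/audio/a.wav"), and is_disallowed("/search?x=1") are all True, while is_disallowed("/about") is False. In other words, the group blocks the /q/ tree, any URL ending in .wav, and any URL carrying a query string.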