cdict.giga.net.tw
robots.txt

Robots Exclusion Standard data for cdict.giga.net.tw

Resource Scan

Scan Details

Site Domain cdict.giga.net.tw
Base Domain giga.net.tw
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-07-10T07:18:52+00:00
Next Scan 2024-10-08T07:18:52+00:00

Last Successful Scan

Scanned 2023-02-24T06:33:31+00:00
URL http://cdict.giga.net.tw/robots.txt
Redirect https://cdict.net/robots.txt
Redirect Domain cdict.net
Redirect Base cdict.net
Domain IPs 139.162.66.30
Redirect IPs 139.162.66.30
Response IP 139.162.66.30
Found Yes
Hash 23f78ab66069846599da7c37f5e707ecad4efe090aa095d3b749e70d637328a2
SimHash 204dd490c493

Groups

mediapartners-google

Rule Path
Disallow

googlebot

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

alexabot

Rule Path
Disallow

slurp

Rule Path
Disallow

Other Records

Field Value
crawl-delay 30

msnbot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 30

scooter

Rule Path
Disallow

Other Records

Field Value
crawl-delay 30

baiduspider

Rule Path
Disallow

Other Records

Field Value
crawl-delay 30

sogou

Rule Path
Disallow

Other Records

Field Value
crawl-delay 30

ia_archiver

Rule Path
Disallow /

*

Rule Path
Disallow /q/
Disallow /*.wav$
Disallow /*?

Other Records

Field Value
sitemap http://cdict.net/sitemap.xml
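
The three Disallow paths in the * group use the wildcard syntax that major crawlers support: * matches any run of characters and a trailing $ anchors the pattern to the end of the URL path. The sketch below is one way to test a path against those rules, not how any particular crawler implements matching. The helper names pattern_to_regex and is_disallowed are hypothetical, and the sample paths are made up for illustration.

```python
import re

# Disallow paths copied from the "*" group in the scan above.
DISALLOW = ["/q/", "/*.wav$", "/*?"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the match to the end of the path. Everything
    else is treated literally, as a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    # Hypothetical paths, not taken from the scanned site.
    for sample in ["/q/hello", "/audio/hello.wav", "/search?word=cat", "/about.html"]:
        print(sample, is_disallowed(sample))
```

Under these assumptions, the rules block everything below /q/, any path ending in .wav, and any URL that carries a query string, while leaving plain pages such as /about.html crawlable.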