Robots Exclusion Standard data for gucn.cn

Resource Scan

Scan Details

Site Domain gucn.cn
Base Domain gucn.cn
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-11-07T07:56:02+00:00
Next Scan 2025-02-05T07:56:02+00:00

Last Successful Scan

Scanned 2023-09-22T05:00:22+00:00
URL http://gucn.cn/robots.txt
Domain IPs 114.113.224.67
Response IP 114.113.224.67
Found Yes
Hash 9dd483fe4cc4251d4cd3454d7712b0fb7fe537d340b0064dabbc80f621af523a
SimHash 185edd524a52

Groups

semrushbot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

linguee

Rule Path
Disallow /

alphaseobot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

alphaseobot-sa

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /controller/
Disallow /member/
Disallow /data/
Disallow /admin/

Other Records

Field Value
crawl-delay 18000

baiduspider

Rule Path
Disallow /controller/
Disallow /member/
Disallow /data/
Disallow /admin/

sosospider

Rule Path
Disallow /controller/
Disallow /member/
Disallow /data/
Disallow /admin/

sogou spider

Rule Path
Disallow /controller/
Disallow /member/
Disallow /data/
Disallow /admin/
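
The groups above can be approximated as a robots.txt file and checked with Python's standard-library parser. This is only a sketch: the file text is rebuilt from the parsed groups in this report, not copied from the original file, so casing, comments, ordering details, and the invalid lines noted under Warnings are not reproduced. The helper name build_robots_txt and the list names are illustrative assumptions.

```python
from urllib.robotparser import RobotFileParser

# Assumption: these lists mirror the groups shown in the report, in report order.
# The original file's exact text is not available here.
FULL_BLOCK = [
    "semrushbot", "ltx71", "semrushbot-sa", "linguee",
    "alphaseobot", "ahrefsbot", "alphaseobot-sa", "blexbot",
]
PARTIAL_BLOCK = ["mj12bot", "baiduspider", "sosospider", "sogou spider"]
PRIVATE_PATHS = ["/controller/", "/member/", "/data/", "/admin/"]


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt from the reported groups (hypothetical helper)."""
    lines = []
    for agent in FULL_BLOCK:
        # These agents are disallowed from the whole site.
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    for agent in PARTIAL_BLOCK:
        # These agents are only kept out of the four listed directories.
        lines.append(f"User-agent: {agent}")
        lines += [f"Disallow: {path}" for path in PRIVATE_PATHS]
        if agent == "mj12bot":
            # The report lists crawl-delay 18000 under the mj12bot group.
            lines.append("Crawl-delay: 18000")
        lines.append("")
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

# Query the reconstructed rules for a few of the reported agents.
print(parser.can_fetch("ahrefsbot", "http://gucn.cn/"))
print(parser.can_fetch("baiduspider", "http://gucn.cn/admin/"))
print(parser.can_fetch("baiduspider", "http://gucn.cn/"))
print(parser.crawl_delay("mj12bot"))
```

Because the report shows no wildcard (`*`) group, this reconstruction places no restrictions on crawlers that are not listed; the original file may differ if any of the invalid lines were meant to be rules.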

Warnings

  • 5 invalid lines.