cgddz.cc
robots.txt

Robots Exclusion Standard data for cgddz.cc

Resource Scan

Scan Details

Site Domain cgddz.cc
Base Domain cgddz.cc
Scan Status Ok
Last Scan2025-08-29T03:39:58+00:00
Next Scan 2025-09-28T03:39:58+00:00

Last Scan

Scanned2025-08-29T03:39:58+00:00
URL https://cgddz.cc/robots.txt
Redirect https://51aw.com/robots.txt
Redirect Domain 51aw.com
Redirect Base 51aw.com
Domain IPs 154.207.77.84, 156.255.123.84
Redirect IPs 104.21.61.39, 172.67.205.202, 2606:4700:3033::ac43:cdca, 2606:4700:3037::6815:3d27
Response IP 104.21.61.39
Found Yes
Hash eaa629243b5f540f2a621839ee1fc8ec94651a8e0f44675cc163d34fe9e52ca8
SimHash 6615d013ff71

Groups

*

Rule Path
Disallow /admin*
Disallow /danmaku/*
Disallow /feed/*
Disallow /archives/*/comment-page-*
Disallow /comment-page-*?replyTo=*
Disallow /*.html/comment-page-*
Disallow /upload*/xiao/*
Disallow /*?replyTo=*
Disallow /*?ysclid=*
Disallow /*?continueFlag=*
Disallow /cmt/
Disallow */respond-post-*/
Disallow /*?*
Allow /
Allow /archives/
Allow /tag/
Allow /category/
Allow /author/

googlebot

Rule Path
Disallow /feed/

bingbot

Rule Path
Disallow /feed/

yandex

Rule Path
Allow /feed/
Allow /feed/rss/
Allow /feed/atom/

Other Records

Field Value
sitemap https://51aw.com/sitemap.xml

Warnings

  • `clean-param` is not a known field.