news.cnyes.com
robots.txt

Robots Exclusion Standard data for news.cnyes.com

Resource Scan

Scan Details

Site Domain news.cnyes.com
Base Domain cnyes.com
Scan Status Ok
Last Scan 2024-09-29T03:16:10+00:00
Next Scan 2024-10-29T03:16:10+00:00

Last Scan

Scanned 2024-09-29T03:16:10+00:00
URL https://news.cnyes.com/robots.txt
Domain IPs 96.17.96.29, 96.17.96.31
Response IP 23.32.29.106
Found Yes
Hash 99f32c6b683f0470bf6d5bd97f21a95bbf8de5815468696e807b64fb2d536b77
SimHash 4d1bc81217b0

Groups

*

Rule Path
Allow /
Disallow /*/*/*?desktop=true
Disallow /sonews*/*.shtm

googlebot

Rule Path
Allow /
Disallow /*/*/*?desktop=true
Disallow /sonews*/*.shtml

bingbot

Rule Path
Allow /
Disallow /*.axd$
Disallow /webmis*
Disallow /WebmisBuy/
Disallow /offwebmis/
Disallow /inc/
Disallow /upFile/
Disallow /Ajax.aspx
Disallow /error.aspx
Disallow /*.xls$
Disallow /*.csv$
Disallow /*.doc$
Disallow /test/
Disallow /news/Ajax.aspx
Disallow /news/Ajax2.aspx
Disallow /news/headline.html
Disallow /news/search.html
Disallow /Content/
Disallow /content/
Disallow /*/*/*?desktop=true
Disallow /sonews*/*.shtml
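
Several of the paths above use the `*` wildcard and the `$` end anchor, which Python's standard `urllib.robotparser` does not expand (it does plain prefix matching). The following is a minimal sketch of how such rules could be evaluated, assuming the precedence described in RFC 9309: the longest matching pattern wins, and Allow wins a tie. The names `pattern_to_regex`, `is_allowed`, `STAR_RULES`, `BINGBOT_RULES` and the sample paths are hypothetical, `BINGBOT_RULES` copies only a subset of the bingbot group, and percent-encoding normalization is left out. How the scanner itself evaluates these rules is not stated on this page.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$' the pattern only has to match a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path: str) -> bool:
    """Return True if `path` may be fetched under `rules`.

    `rules` is a list of (kind, pattern) pairs, kind being "allow" or
    "disallow". The longest matching pattern wins; Allow wins ties.
    A path matched by no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]

# Rules copied from the "*" group listed above.
STAR_RULES = [
    ("allow", "/"),
    ("disallow", "/*/*/*?desktop=true"),
    ("disallow", "/sonews*/*.shtm"),
]

# A subset of the bingbot group, enough to show '$' and case sensitivity.
BINGBOT_RULES = [
    ("allow", "/"),
    ("disallow", "/*.xls$"),
    ("disallow", "/Content/"),
    ("disallow", "/content/"),
]

# Hypothetical paths, chosen only to exercise the patterns.
print(is_allowed(STAR_RULES, "/news/id/123"))               # only "/" matches
print(is_allowed(STAR_RULES, "/news/id/123?desktop=true"))  # wildcard rule matches
print(is_allowed(BINGBOT_RULES, "/reports/q1.xls"))         # "$" anchors the suffix
print(is_allowed(BINGBOT_RULES, "/reports/q1.xlsx"))        # ".xls$" no longer matches
```

Matching is case-sensitive in this sketch, which is consistent with the bingbot group listing both `/Content/` and `/content/` as separate rules.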

Other Records

Field Value
sitemap https://news.cnyes.com/cnyes-sitemap/desktop-news.xml.gz
sitemap https://news.cnyes.com/cnyes-sitemap/tag.xml.gz
sitemap https://news.cnyes.com/cnyes-sitemap/topic.xml.gz
sitemap https://news.cnyes.com/rss/v1/news/category/all
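
The sitemap records above can be read back out of a robots.txt body with the standard library. The sketch below assumes Python 3.8 or later (where `RobotFileParser.site_maps()` is available) and parses an inline sample rather than fetching anything; `ROBOTS_SNIPPET` is a hypothetical reconstruction built from the records listed on this page, not the verbatim file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction from the records above, not the verbatim file.
ROBOTS_SNIPPET = """\
User-agent: *
Allow: /

Sitemap: https://news.cnyes.com/cnyes-sitemap/desktop-news.xml.gz
Sitemap: https://news.cnyes.com/cnyes-sitemap/tag.xml.gz
Sitemap: https://news.cnyes.com/cnyes-sitemap/topic.xml.gz
Sitemap: https://news.cnyes.com/rss/v1/news/category/all
"""

parser = RobotFileParser()
parser.parse(ROBOTS_SNIPPET.splitlines())

# site_maps() returns None when no Sitemap lines were found.
sitemaps = parser.site_maps() or []

# Separate gzip-compressed XML sitemaps from other entries (here, an RSS URL).
gzipped = [url for url in sitemaps if url.endswith(".xml.gz")]
other = [url for url in sitemaps if not url.endswith(".xml.gz")]

for url in sitemaps:
    print(url)
```

The fourth record points at an RSS endpoint rather than an XML sitemap file; the sitemaps protocol accepts RSS and Atom feeds as sitemaps, so a consumer would need to handle both shapes.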