download.cnet.com
robots.txt

Robots Exclusion Standard data for download.cnet.com

Resource Scan

Scan Details

Site Domain download.cnet.com
Base Domain cnet.com
Scan Status Ok
Last Scan 2024-05-03T13:39:32+00:00
Next Scan 2024-05-10T13:39:32+00:00

Last Scan

Scanned 2024-05-03T13:39:32+00:00
URL https://download.cnet.com/robots.txt
Domain IPs 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91
Response IP 199.232.45.91
Found Yes
Hash 02a39c070cfe086af0e3c2a5f068d2fe737279ccff5fb85bf151ba0f54907022
SimHash 7994d0306aa3

Groups

*

Rule Path
Disallow */3055-*
Disallow */3001-*
Disallow *?sort=*
Disallow /download-launch?*
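
The four Disallow paths above rely on the `*` wildcard, which matches any run of characters. A minimal Python sketch of how a crawler might evaluate them follows. The helper names (`robots_pattern_to_regex`, `is_disallowed`) and the sample paths in the usage note are hypothetical, not taken from the scan. Because this group contains no Allow rules, the sketch skips longest-match precedence.

```python
import re

def robots_pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))

# Disallow paths listed for the '*' group in the report.
DISALLOW = ["*/3055-*", "*/3001-*", "*?sort=*", "/download-launch?*"]
RULES = [robots_pattern_to_regex(p) for p in DISALLOW]

def is_disallowed(path_and_query: str) -> bool:
    # re.match anchors at the start of the string, as robots.txt rules do.
    return any(rule.match(path_and_query) for rule in RULES)
```

With these rules, a made-up path such as `/windows/3055-example.html` or `/windows/?sort=name` would be reported as disallowed, while `/windows/` would not.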

brightbot 1.0
claudebot
juzi.bot
gptbot

Rule Path
Disallow /

onetszukaj
sputnikbot
nigma.ru
webalta
webalta crawler
http://mail.ru
mail.ru_bot
rambler
holmes
startsiden
seznambot
yeti
sosospider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1
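
As a rough illustration of how the second and third groups resolve, the sketch below feeds a partial reconstruction of them to Python's standard `urllib.robotparser`. The fragment is an assumption: it lists only a few of the user agents, and it places the crawl-delay record in the third group, as the report's layout suggests. The `*` group is left out because `urllib.robotparser` uses plain prefix matching and does not expand `*` wildcards in paths.

```python
from urllib.robotparser import RobotFileParser

# Partial, assumed reconstruction of two groups from the report; the real
# file may order or spell these lines differently.
FRAGMENT = """\
User-agent: claudebot
User-agent: gptbot
Disallow: /

User-agent: seznambot
User-agent: yeti
Crawl-delay: 1
"""

parser = RobotFileParser()
parser.parse(FRAGMENT.splitlines())

# Agents in the second group are blocked from every path.
blocked = parser.can_fetch("GPTBot", "https://download.cnet.com/")

# Agents in the third group have no path rules, only a crawl delay.
allowed = parser.can_fetch("Yeti", "https://download.cnet.com/")
delay = parser.crawl_delay("Yeti")  # requires Python 3.6+
```

Under these assumptions `blocked` is `False`, `allowed` is `True`, and `delay` is `1` (seconds between requests).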

Other Records

Field Value
sitemap https://download.cnet.com/sitemap-category.xml
sitemap https://download.cnet.com/sitemap-index-platforms-alternatives.xml
sitemap https://download.cnet.com/sitemap-index-platforms-app.xml
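
The sitemap records can be read back with the same standard-library parser. This is a small sketch, assuming the records appear in the file as ordinary `Sitemap:` lines; `site_maps()` needs Python 3.8 or later and returns `None` when no sitemap lines are present.

```python
from urllib.robotparser import RobotFileParser

# Sitemap records from the report, written as assumed robots.txt lines.
SITEMAP_LINES = [
    "Sitemap: https://download.cnet.com/sitemap-category.xml",
    "Sitemap: https://download.cnet.com/sitemap-index-platforms-alternatives.xml",
    "Sitemap: https://download.cnet.com/sitemap-index-platforms-app.xml",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# Fall back to an empty list when the file declares no sitemaps.
sitemaps = parser.site_maps() or []
```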

Comments

  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449

Warnings

  • 1 invalid line.