cosmobotb.com
robots.txt

Robots Exclusion Standard data for cosmobotb.com

Resource Scan

Scan Details

Site Domain cosmobotb.com
Base Domain cosmobotb.com
Scan Status Ok
Last Scan 2025-11-06T19:45:57+00:00
Next Scan 2025-11-13T19:45:57+00:00

Last Scan

Scanned 2025-11-06T19:45:57+00:00
URL https://cosmobotb.com/robots.txt
Redirect https://www.cosmopolitan.com.hk/robots.txt
Redirect Domain www.cosmopolitan.com.hk
Redirect Base cosmopolitan.com.hk
Domain IPs 104.21.43.54, 172.67.220.192, 2606:4700:3031::ac43:dcc0, 2606:4700:3032::6815:2b36
Redirect IPs 170.33.12.207
Response IP 170.33.12.207
Found Yes
Hash f4ea64dfc523b8f6aff57d06d03710f5bf13e0f1197d2672cafdc913edbff0cb
SimHash 739e8b5037f0

Groups

googlebot

Rule Path
Allow /tag/
Disallow /index.php
Disallow /action/
Disallow /mcrs/

*

Rule Path
Disallow /index.php
Disallow /action/
Disallow /tag/
Disallow /mcrs/

Other Records

Field Value
crawl-delay 10
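
The googlebot and * groups differ only in how they treat /tag/, and the crawl-delay record sits in the * group. The sketch below shows one way to check that with Python's standard urllib.robotparser. It rests on assumptions: the robots.txt text is reconstructed from the rules listed in this report (the real file's line order and its invalid lines are not shown here), and the /tag/example path and the ExampleCrawler agent name are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the report's rule tables, not the real file.
ROBOTS_TXT = """\
User-agent: googlebot
Allow: /tag/
Disallow: /index.php
Disallow: /action/
Disallow: /mcrs/

User-agent: *
Disallow: /index.php
Disallow: /action/
Disallow: /tag/
Disallow: /mcrs/
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Hypothetical URL under /tag/ on the redirect target domain.
tag_url = "https://www.cosmopolitan.com.hk/tag/example"

# Googlebot matches its own group, which allows /tag/.
googlebot_allowed = parser.can_fetch("Googlebot", tag_url)

# An agent with no group of its own falls back to the * group.
other_allowed = parser.can_fetch("ExampleCrawler", tag_url)
other_delay = parser.crawl_delay("ExampleCrawler")

# The googlebot group carries no crawl-delay record of its own.
googlebot_delay = parser.crawl_delay("Googlebot")

print(googlebot_allowed, other_allowed, other_delay, googlebot_delay)
```

Note that urllib.robotparser applies the first matching rule in a group, whereas some crawlers use longest-path matching, so results for overlapping rules may differ between parsers.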

mediapartners-google

Rule Path
Disallow

proximic

Rule Path
Disallow

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

deepseekbot

Rule Path
Disallow /

pangubot

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cosmopolitan.com.hk/sitemap.xml
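
Two kinds of Disallow record appear in the groups above: an empty path (mediapartners-google, proximic) and a path of / (gptbot and the other crawlers listed after it). Under the Robots Exclusion Standard an empty Disallow blocks nothing, while Disallow / blocks the whole site. The sketch below illustrates the difference with Python's standard urllib.robotparser. It is a reconstruction from two of the groups in this report, not the real file, and the ExampleCrawler agent name is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from two groups in the report, not the real file.
ROBOTS_TXT = """\
User-agent: mediapartners-google
Disallow:

User-agent: gptbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

home_url = "https://www.cosmopolitan.com.hk/"

# Empty Disallow: nothing is blocked for this agent.
adsense_allowed = parser.can_fetch("Mediapartners-Google", home_url)

# Disallow /: every path is blocked for this agent.
gptbot_allowed = parser.can_fetch("GPTBot", home_url)

# Hypothetical agent with no matching group in this reduced file: allowed.
unlisted_allowed = parser.can_fetch("ExampleCrawler", home_url)

print(adsense_allowed, gptbot_allowed, unlisted_allowed)
```

In the full file an unlisted agent would instead fall under the * group, so the last check only holds for this reduced two-group example.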

Comments

  • OpenAI
  • Common Crawl
  • Anthropic
  • Anthropic AI
  • Google
  • Apple
  • Amazon
  • Meta
  • ByteDance
  • Cohere
  • DeepSeek
  • Huawei
  • Allen Institute
  • Diffbot
  • Omgili
  • Webzio-Extended

Warnings

  • 13 invalid lines.