darnews.com
robots.txt

Robots Exclusion Standard data for darnews.com

Resource Scan

Scan Details

Site Domain darnews.com
Base Domain darnews.com
Scan Status Ok
Last Scan2025-03-29T13:46:26+00:00
Next Scan 2025-04-05T13:46:26+00:00

Last Scan

Scanned2025-03-29T13:46:26+00:00
URL https://darnews.com/robots.txt
Redirect https://www.darnews.com/robots.txt
Redirect Domain www.darnews.com
Redirect Base darnews.com
Domain IPs 76.76.21.21
Redirect IPs 54.192.18.107, 54.192.18.11, 54.192.18.125, 54.192.18.19
Response IP 108.156.144.72
Found Yes
Hash 51edce68c472183bec74b6d73563d9f7ebcb78249c8f18e2ae92b6868337e8e6
SimHash 59109940e470

Groups

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

youbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.darnews.com/sitemap.xml

Comments

  • Block specific AI crawlers
  • Allow all other crawlers
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.