herworld.com
robots.txt

Robots Exclusion Standard data for herworld.com

Resource Scan

Scan Details

Site Domain herworld.com
Base Domain herworld.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-08-27T04:21:45+00:00
Next Scan 2024-10-26T04:21:45+00:00

Last Successful Scan

Scanned2024-06-29T03:48:53+00:00
URL https://herworld.com/robots.txt
Redirect https://www.herworld.com/robots.txt
Redirect Domain www.herworld.com
Redirect Base herworld.com
Domain IPs 108.156.133.115, 108.156.133.66, 108.156.133.8, 108.156.133.90
Redirect IPs 13.33.88.122, 13.33.88.21, 13.33.88.68, 13.33.88.9
Response IP 13.33.88.9
Found Yes
Hash bc9683d7386bd3fc2e7ff51353d25013b6477a27e4800a1f2e3206975755c910
SimHash d0589900e933

Groups

*

Rule Path
Disallow /*/feed/$
Disallow /advanced-galleries/*
Disallow /search/*
Disallow /topics/

Other Records

Field Value
crawl-delay 10

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.herworld.com/_plat/api/sitemap.xml

Comments

  • For new training only
  • Not for training, only for user requests
  • Marker for disabling Bard and Vertex AI
  • Multi-purpose, commercial uses; including LLMs