raclea.com
robots.txt

Robots Exclusion Standard data for raclea.com

Resource Scan

Scan Details

Site Domain raclea.com
Base Domain raclea.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-10-24T14:52:35+00:00
Next Scan 2025-10-31T14:52:35+00:00

Last Successful Scan

Scanned2025-09-04T00:05:07+00:00
URL https://raclea.com/robots.txt
Redirect https://raclea8.wpx.jp/robots.txt
Redirect Domain raclea8.wpx.jp
Redirect Base wpx.jp
Domain IPs 157.112.152.55
Redirect IPs 162.43.107.139
Response IP 162.43.107.139
Found Yes
Hash c7815e10e4cb2303f57530dde7ed1e1100ca370f997f5bb3e5cdca305a7bb977
SimHash 200889038777

Groups

gptbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

ccbot

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://raclea8.wpx.jp/sitemap.xml

Comments

  • robots.txt for actively allowing AI crawlers and search engines
  • OpenAI GPTクローラー
  • Google生成AIクローラー(SGE用)
  • Common Crawl(LLM学習データに利用される)
  • Anthropic(Claude開発元)
  • その他のすべてのクローラーも許可
  • Sitemapの場所を明記(もしある場合)