/.well-known/

Log In Sign Up

raclea.com
robots.txt

Robots Exclusion Standard data for raclea.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	raclea.com
Base Domain	raclea.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-10-24T14:52:35+00:00
Next Scan	2025-10-31T14:52:35+00:00

Last Successful Scan

Scanned	2025-09-04T00:05:07+00:00
URL	https://raclea.com/robots.txt
Redirect	https://raclea8.wpx.jp/robots.txt
Redirect Domain	raclea8.wpx.jp
Redirect Base	wpx.jp
Domain IPs	157.112.152.55
Redirect IPs	162.43.107.139
Response IP	162.43.107.139
Found	Yes
Hash	c7815e10e4cb2303f57530dde7ed1e1100ca370f997f5bb3e5cdca305a7bb977
SimHash	200889038777

Groups

gptbot

Rule

Path

Allow

/

google-extended

Rule

Path

Allow

/

ccbot

Rule

Path

Allow

/

anthropic-ai

Rule

Path

Allow

/

*

Rule

Path

Allow

/

Back to top

Other Records

Field

Value

sitemap

https://raclea8.wpx.jp/sitemap.xml

Back to top

Comments

robots.txt for actively allowing AI crawlers and search engines
OpenAI GPTクローラー
Google生成AIクローラー（SGE用）
Common Crawl（LLM学習データに利用される）
Anthropic（Claude開発元）
その他のすべてのクローラーも許可
Sitemapの場所を明記（もしある場合）

Back to top