kuxumarin.hatenablog.com
robots.txt

Robots Exclusion Standard data for kuxumarin.hatenablog.com

Resource Scan

Scan Details

Site Domain kuxumarin.hatenablog.com
Base Domain hatenablog.com
Scan Status Ok
Last Scan 2025-06-25T02:52:43+00:00
Next Scan 2025-07-25T02:52:43+00:00

Last Scan

Scanned 2025-06-25T02:52:43+00:00
URL https://kuxumarin.hatenablog.com/robots.txt
Domain IPs 35.75.255.9, 54.199.90.60
Response IP 35.75.255.9
Found Yes
Hash f91d347808cbaf6f836204f3a71b6a4e97d13484efea5c9e35c8395d34da6c93
SimHash 291c4945c0d3

Groups

*

Rule Path
Disallow /api/
Disallow /draft/
Disallow /preview

mediapartners-google

Rule Path
Disallow /draft/
Disallow /preview

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

perplexity-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

Other Records

Field Value
sitemap https://kuxumarin.hatenablog.com/sitemap_index.xml
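
The groups and sitemap record above can be checked programmatically. The following is a minimal sketch, not the site's actual file: it rebuilds a robots.txt from the scanned groups (the original file's casing, ordering, and grouping of user agents are assumptions) and queries it with Python's standard `urllib.robotparser`. The helper names `build_robots_txt`, `make_parser`, and `allowed`, and the example path `/entry/example`, are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

SITE = "https://kuxumarin.hatenablog.com"
SITEMAP = f"{SITE}/sitemap_index.xml"

# User agents that the scan lists with a single "Disallow /" rule.
FULL_BLOCK = [
    "gptbot", "google-extended", "applebot-extended", "anthropic-ai",
    "claudebot", "cohere-ai", "perplexitybot", "perplexity-ai",
    "chatgpt-user", "oai-searchbot", "ccbot", "meta-externalagent",
]

# Groups as reported by the scan: user agent -> disallowed path prefixes.
GROUPS = {
    "*": ["/api/", "/draft/", "/preview"],
    "mediapartners-google": ["/draft/", "/preview"],
    **{agent: ["/"] for agent in FULL_BLOCK},
}


def build_robots_txt(groups, sitemap):
    """Render the scanned groups as robots.txt text (assumed layout)."""
    lines = []
    for agent, paths in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"Disallow: {path}" for path in paths)
        lines.append("")  # blank line ends the group
    lines.append(f"Sitemap: {sitemap}")
    return "\n".join(lines)


def make_parser():
    """Parse the reconstructed text without any network access."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt(GROUPS, SITEMAP).splitlines())
    return parser


def allowed(parser, agent, path):
    """Return True if `agent` may fetch `path` on the scanned site."""
    return parser.can_fetch(agent, SITE + path)


if __name__ == "__main__":
    rp = make_parser()
    for agent in ("Googlebot", "Mediapartners-Google", "GPTBot"):
        for path in ("/entry/example", "/api/foo", "/preview"):
            print(agent, path, allowed(rp, agent, path))
    print(rp.site_maps())
```

Under these assumptions, a crawler with no group of its own (for example `Googlebot`) falls back to the `*` group, `mediapartners-google` is not subject to the `/api/` rule because its own group omits it, and each of the listed AI crawlers is denied every path. Note that `urllib.robotparser` matches rules by simple path prefix, so `Disallow /preview` also covers paths such as `/preview/123`.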