gs.hatenadiary.jp
robots.txt

Robots Exclusion Standard data for gs.hatenadiary.jp

Resource Scan

Scan Details

Site Domain gs.hatenadiary.jp
Base Domain hatenadiary.jp
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-10T22:13:20+00:00
Next Scan 2025-11-09T22:13:20+00:00

Last Successful Scan

Scanned2025-06-19T16:37:42+00:00
URL https://gs.hatenadiary.jp/robots.txt
Domain IPs 35.75.255.9, 54.199.90.60
Response IP 35.75.255.9
Found Yes
Hash 573b39dff44ab648726e80eb9cb5adc7f6fac22b2cae78049667de159ee8ff39
SimHash 211e4945c0c3

Groups

*

Rule Path
Disallow /api/
Disallow /draft/
Disallow /preview

mediapartners-google

Rule Path
Disallow /draft/
Disallow /preview

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

perplexity-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gs.hatenadiary.jp/sitemap_index.xml