gudemi.hatenadiary.jp
robots.txt

Robots Exclusion Standard data for gudemi.hatenadiary.jp

Resource Scan

Scan Details

Site Domain gudemi.hatenadiary.jp
Base Domain hatenadiary.jp
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-09-05T06:14:37+00:00
Next Scan 2025-11-04T06:14:37+00:00

Last Successful Scan

Scanned 2025-06-14T18:09:44+00:00
URL https://gudemi.hatenadiary.jp/robots.txt
Domain IPs 35.75.255.9, 54.199.90.60
Response IP 35.75.255.9
Found Yes
Hash 57be9700ce3f1f1403b07c69f574fcc16a9eb5f8d48d6530d77ef700b9518449
SimHash 091c4940e003

Groups

*

Rule Path
Disallow /api/
Disallow /draft/
Disallow /preview

mediapartners-google

Rule Path
Disallow /draft/
Disallow /preview

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

perplexity-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gudemi.hatenadiary.jp/sitemap_index.xml
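The groups and sitemap record above can be checked against sample URLs with Python's standard urllib.robotparser module. The sketch below is only an approximation: ROBOTS_TXT is a reconstruction from the listed groups, not the original file (its exact layout, ordering, and any comments are unknown, so it would not reproduce the Hash shown above), and the names BLOCKED_AGENTS, ROBOTS_TXT, and build_parser are hypothetical helpers introduced for illustration. Note that urllib.robotparser uses simple prefix matching, which may differ in edge cases from how individual crawlers interpret the rules.

```python
from urllib.robotparser import RobotFileParser

# Agents that the report lists with a single "Disallow /" rule.
BLOCKED_AGENTS = [
    "gptbot", "google-extended", "applebot-extended", "anthropic-ai",
    "claudebot", "cohere-ai", "perplexitybot", "perplexity-ai",
    "chatgpt-user", "oai-searchbot", "ccbot", "meta-externalagent",
]

# Reconstruction of the rules from the report (an assumption, not the
# original file contents).
ROBOTS_TXT = (
    "User-agent: *\n"
    "Disallow: /api/\n"
    "Disallow: /draft/\n"
    "Disallow: /preview\n"
    "\n"
    "User-agent: mediapartners-google\n"
    "Disallow: /draft/\n"
    "Disallow: /preview\n"
    "\n"
    + "".join(f"User-agent: {agent}\nDisallow: /\n\n" for agent in BLOCKED_AGENTS)
    + "Sitemap: https://gudemi.hatenadiary.jp/sitemap_index.xml\n"
)


def build_parser() -> RobotFileParser:
    """Parse the reconstructed rules without any network access."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser()
    base = "https://gudemi.hatenadiary.jp"
    for agent, path in [
        ("gptbot", "/"),
        ("claudebot", "/entry/example"),
        ("mediapartners-google", "/api/example"),
        ("mediapartners-google", "/preview"),
        ("SomeOtherBot", "/draft/example"),
        ("SomeOtherBot", "/entry/example"),
    ]:
        verdict = "allowed" if parser.can_fetch(agent, base + path) else "disallowed"
        print(f"{agent:22} {path:16} {verdict}")
```

The paths such as /entry/example and the agent name SomeOtherBot are made-up inputs used only to exercise the rules. Because mediapartners-google has its own group, the /api/ restriction from the * group does not apply to it under this parser.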