fuhca.hateblo.jp
robots.txt

Robots Exclusion Standard data for fuhca.hateblo.jp

Archived Snapshots

Resource Scan

Scan Details

Site Domain	fuhca.hateblo.jp
Base Domain	hateblo.jp
Scan Status	Ok
Last Scan	2025-10-13T15:13:44+00:00
Next Scan	2025-11-12T15:13:44+00:00

Last Scan

Scanned	2025-10-13T15:13:44+00:00
URL	https://fuhca.hateblo.jp/robots.txt
Domain IPs	13.33.45.10, 13.33.45.102, 13.33.45.12, 13.33.45.57
Response IP	13.33.45.12
Found	Yes
Hash	14c16f8b9fd98ec1159ebad743887925a19d075f3896f21ba94ff3aacd01f440
SimHash	691e4a40c093

Groups

*

Rule	Path
Disallow	/api/
Disallow	/draft/
Disallow	/preview
Disallow	/iframe/blog_bookmarks_count

Rule

Path

Disallow

/api/

Disallow

/draft/

Disallow

/preview

Disallow

/iframe/blog_bookmarks_count

mediapartners-google

Rule	Path
Disallow	/draft/
Disallow	/preview

Rule

Path

Disallow

/draft/

Disallow

/preview

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

cohere-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

perplexitybot

Rule	Path
Disallow	/

Rule

Path

Disallow

perplexity-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

oai-searchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

meta-externalagent

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://fuhca.hateblo.jp/sitemap_index.xml

Field

Value

sitemap

https://fuhca.hateblo.jp/sitemap_index.xml

fuhca.hateblo.jprobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

mediapartners-google

gptbot

google-extended

applebot-extended

anthropic-ai

claudebot

cohere-ai

perplexitybot

perplexity-ai

chatgpt-user

oai-searchbot

ccbot

meta-externalagent

Other Records

fuhca.hateblo.jp
robots.txt