/.well-known/

Robots Exclusion Standard data for hrhtv.me

Resource Scan

Scan Details

Site Domain	hrhtv.me
Base Domain	hrhtv.me
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a server error.
Last Scan	2025-08-26T00:52:35+00:00
Next Scan	2025-09-25T00:52:35+00:00

Last Successful Scan

Scanned	2025-07-21T00:25:39+00:00
URL	https://hrhtv.me/robots.txt
Domain IPs	2600:9000:271a:1a00:18:2b09:b400:93a1, 2600:9000:271a:4200:18:2b09:b400:93a1, 2600:9000:271a:7600:18:2b09:b400:93a1, 2600:9000:271a:8c00:18:2b09:b400:93a1, 2600:9000:271a:a400:18:2b09:b400:93a1, 2600:9000:271a:ac00:18:2b09:b400:93a1, 2600:9000:271a:e00:18:2b09:b400:93a1, 2600:9000:271a:fc00:18:2b09:b400:93a1, 3.165.75.41, 3.165.75.43, 3.165.75.92, 3.165.75.95
Response IP	3.165.75.43
Found	Yes
Hash	6f6209c6a3467d59e809226bbcb61304605ba4b729418dc7cbb96f6a80560eb8
SimHash	702ba902ccf6
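
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. As a minimal sketch, assuming (the scan report does not say so) that the hash is SHA-256 over the raw response body, a comparable digest could be computed as follows. The function name `body_digest` and the sample bytes are hypothetical; the sample is not the archived file, so it will not reproduce the value listed above.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, NOT the actual archived robots.txt.
sample = b"User-agent: *\nDisallow: /search\n"
digest = body_digest(sample)
print(len(digest))  # a SHA-256 hex digest is always 64 characters
```

Comparing digests between scans is a cheap way to detect that the file changed at all; the SimHash field presumably serves the complementary purpose of judging how similar two versions are.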

Groups

User-agent	*
Rule	Path
Disallow	/search

User-agent	anthropic-ai, applebot-extended, bytespider, ccbot, chatgpt-user, claudebot, cohere-ai, diffbot, facebookbot, gptbot, imagesiftbot, meta-externalagent, meta-externalfetcher, omgilibot, perplexitybot, timpibot
Rule	Path
Disallow	/maps*
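
The two groups above can be checked against sample paths with a small matcher. This is a minimal sketch, not the scanner's own logic: the robots.txt text is reconstructed from the listing (the original layout and capitalization are assumptions), the helper names `parse_groups`, `pattern_to_regex`, and `is_allowed` are hypothetical, and agent names are matched by exact lowercase comparison as a simplification. It follows the usual Robots Exclusion Protocol conventions: `*` in a path matches any run of characters, a crawler obeys only the group that names it (falling back to `*`), and the longest matching rule wins. It is hand-rolled because Python's `urllib.robotparser` treats rule paths as literal prefixes and would not expand the `*` in `/maps*`.

```python
import re

# Agent names as shown in the Groups listing.
LLM_AGENTS = [
    "anthropic-ai", "applebot-extended", "bytespider", "ccbot",
    "chatgpt-user", "claudebot", "cohere-ai", "diffbot", "facebookbot",
    "gptbot", "imagesiftbot", "meta-externalagent", "meta-externalfetcher",
    "omgilibot", "perplexitybot", "timpibot",
]

# Reconstructed file (assumption: the real file's exact layout is unknown).
ROBOTS_TXT = (
    "User-agent: *\nDisallow: /search\n\n"
    + "".join(f"User-agent: {name}\n" for name in LLM_AGENTS)
    + "Disallow: /maps*\n"
)


def parse_groups(text):
    """Return a list of (agents, rules); each rule is a (kind, path) tuple."""
    groups, agents, rules = [], [], []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if rules:  # a user-agent line after rules starts a new group
                groups.append((agents, rules))
                agents, rules = [], []
            agents.append(value.lower())
        elif field in ("allow", "disallow") and agents:
            rules.append((field, value))
    if agents:
        groups.append((agents, rules))
    return groups


def pattern_to_regex(path):
    """Translate a rule path: '*' matches anything, a trailing '$' anchors the end."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(piece) for piece in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(agent, url_path, groups):
    """Apply the group naming `agent`, else the '*' group; longest match wins."""
    agent = agent.lower()
    chosen = [rules for agents, rules in groups if agent in agents]
    if not chosen:
        chosen = [rules for agents, rules in groups if "*" in agents]
    best_kind, best_len = "allow", -1  # no matching rule means allowed
    for rules in chosen:
        for kind, path in rules:
            if not path:
                continue  # an empty Disallow matches nothing
            if pattern_to_regex(path).match(url_path):
                longer = len(path) > best_len
                tie_for_allow = len(path) == best_len and kind == "allow"
                if longer or tie_for_allow:
                    best_kind, best_len = kind, len(path)
    return best_kind == "allow"


groups = parse_groups(ROBOTS_TXT)
print(is_allowed("gptbot", "/maps/uk", groups))      # False
print(is_allowed("somebot", "/search?q=x", groups))  # False
print(is_allowed("somebot", "/maps/uk", groups))     # True
print(is_allowed("gptbot", "/search", groups))       # True
```

The last line illustrates a side effect of group selection: because a named crawler follows only its own group, the listed bots are not bound by the `/search` rule in the `*` group under this reading.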

Comments

Explicit rules for common LLM bots that might not attribute source data.