hrhtv.me
robots.txt

Robots Exclusion Standard data for hrhtv.me

Resource Scan

Scan Details

Site Domain hrhtv.me
Base Domain hrhtv.me
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2025-08-26T00:52:35+00:00
Next Scan 2025-09-25T00:52:35+00:00

Last Successful Scan

Scanned2025-07-21T00:25:39+00:00
URL https://hrhtv.me/robots.txt
Domain IPs 2600:9000:271a:1a00:18:2b09:b400:93a1, 2600:9000:271a:4200:18:2b09:b400:93a1, 2600:9000:271a:7600:18:2b09:b400:93a1, 2600:9000:271a:8c00:18:2b09:b400:93a1, 2600:9000:271a:a400:18:2b09:b400:93a1, 2600:9000:271a:ac00:18:2b09:b400:93a1, 2600:9000:271a:e00:18:2b09:b400:93a1, 2600:9000:271a:fc00:18:2b09:b400:93a1, 3.165.75.41, 3.165.75.43, 3.165.75.92, 3.165.75.95
Response IP 3.165.75.43
Found Yes
Hash 6f6209c6a3467d59e809226bbcb61304605ba4b729418dc7cbb96f6a80560eb8
SimHash 702ba902ccf6

Groups

*

Rule Path
Disallow /search

anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
cohere-ai
diffbot
facebookbot
gptbot
imagesiftbot
meta-externalagent
meta-externalfetcher
omgilibot
perplexitybot
timpibot

Rule Path
Disallow /maps*

Comments

  • Explicit rules for common LLM bots that might not attribute source data.