he.distance.to
robots.txt
Robots Exclusion Standard data for he.distance.to
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | he.distance.to |
| Base Domain | distance.to |
| Scan Status | Ok |
| Last Scan | 2026-02-19T05:51:46+00:00 |
| Next Scan | 2026-03-21T05:51:46+00:00 |
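The gap between the last and next scan can be read off the two timestamps. A minimal sketch, assuming both values are ISO 8601 strings exactly as shown above; the variable names are illustrative, and the resulting interval is derived from the values rather than from any documented scanner setting:

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2026-02-19T05:51:46+00:00")
next_scan = datetime.fromisoformat("2026-03-21T05:51:46+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)
```

For these two values the interval works out to 30 days.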
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-02-19T05:51:46+00:00 |
| URL | https://he.distance.to/robots.txt |
| Domain IPs | 172.66.41.14, 172.66.42.242, 2606:4700:3108::ac42:290e, 2606:4700:3108::ac42:2af2 |
| Response IP | 172.66.41.14 |
| Found | Yes |
| Hash | 29e0af581ebea2983fb03a62ef27e8a38c1f2235920ffe4654b57e9879da6d11 |
| SimHash | 46354953c5d4 |
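The Hash value is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say which bytes were hashed. A minimal sketch of a change check under that assumption (SHA-256 over the raw response body); `sha256_hex` and `is_unchanged` are hypothetical helper names, not part of the scanner:

```python
import hashlib

# Hash value copied from the Last Scan table.
RECORDED_HASH = "29e0af581ebea2983fb03a62ef27e8a38c1f2235920ffe4654b57e9879da6d11"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a freshly fetched body against a recorded digest.

    Assumption: the recorded value is SHA-256 over the raw bytes.
    """
    return sha256_hex(body) == recorded.lower()
```

An exact digest only answers whether the file changed at all. The shorter SimHash field presumably serves as a similarity fingerprint for near-duplicate detection; its exact construction is not given in the report, so it is not reproduced here.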
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
ai2bot
ai2bot-dolma
amazonbot
anthropic-ai
applebot
applebot-extended
brightbot 1.0
bytespider
ccbot
chatgpt-user
claude-web
claudebot
cohere-ai
cohere-training-data-crawler
crawlspace
diffbot
duckassistbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalfetcher
oai-searchbot
omgili
omgilibot
pangubot
perplexitybot
petalbot
scrapy
semrushbot-ocob
semrushbot-swa
sidetrade indexer bot
timpibot
velenpublicwebcrawler
webzio-extended
youbot
| Rule | Path |
|---|---|
| Disallow | / |
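The two groups above can be checked with Python's standard `urllib.robotparser`. A minimal sketch, assuming the file looks roughly like the abbreviated reconstruction in `ROBOTS_TXT` (only four of the listed agents are included, and the real file's exact layout is not shown in this report); `is_allowed` is a hypothetical helper:

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction of the two groups (assumption, not the real file).
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: gptbot
User-agent: ccbot
User-agent: claudebot
User-agent: bytespider
Disallow: /
"""


def is_allowed(user_agent: str, url: str) -> bool:
    """Return True if the reconstructed rules let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "CCBot", "Googlebot"):
        print(agent, is_allowed(agent, "https://he.distance.to/"))
```

Agents named in the second group are denied every path, while any other agent falls through to the `*` group and is allowed. Note that `urllib.robotparser` matches agent names by case-insensitive substring, which may differ in detail from how a given crawler interprets the file.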
Warnings
- `content-signal` is not a known field.
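A warning of this kind can arise when a parser meets a field name outside its known set. A minimal sketch, assuming a known-field list of `user-agent`, `allow`, `disallow`, `sitemap`, and `crawl-delay` (the scanner's actual list is not documented here); `unknown_field_warnings` is a hypothetical helper, and the `Content-Signal` value in the sample is made up for illustration:

```python
# Assumed set of recognised field names (lowercase).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_field_warnings(text: str) -> list[str]:
    """Return one warning per distinct unrecognised field name."""
    warnings = []
    seen = set()
    for raw in text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        # Field names are case-insensitive; split on the first colon only.
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.add(field)
            warnings.append(f"`{field}` is not a known field.")
    return warnings


# Hypothetical sample; the directive value is illustrative only.
SAMPLE = """\
User-agent: *
Content-Signal: search=yes
Allow: /
Sitemap: https://he.distance.to/sitemap.xml
"""

print(unknown_field_warnings(SAMPLE))
```

Unknown fields are reported rather than treated as errors, so the remaining rules in the file still apply.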
Comments