h-net.social
robots.txt
Robots Exclusion Standard data for h-net.social
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | h-net.social |
Base Domain | h-net.social |
Scan Status | Ok |
Last Scan | 2024-06-28T14:34:03+00:00 |
Next Scan | 2024-06-29T14:34:03+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-28T14:34:03+00:00 |
URL | https://h-net.social/robots.txt |
Domain IPs | 35.9.18.75 |
Response IP | 35.9.18.75 |
Found | Yes |
Hash | 8df099da114875ea9ad1549de70e82df45ab76ba11ea9a1f2a3518bb4410c16b |
SimHash | d34451307f21 |
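The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the response body, although the scan does not name the algorithm. The following is a minimal sketch under that assumption. The function name `body_digest` is hypothetical and the sample body is illustrative, not the real file.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body."""
    return hashlib.sha256(body).hexdigest()


# Illustrative body only; the real robots.txt content is not reproduced here.
sample = b"User-agent: *\nDisallow: /interact/\n"
digest = body_digest(sample)
print(len(digest))  # length of the hex digest, for comparison with the Hash field
```

Comparing the digest of a freshly fetched body with the stored Hash would be one way to tell whether the file changed between scans, provided the assumption about the algorithm holds.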
Groups
accserver
admantx
adsbot
affiliatelabz
ahrefsbot
aihitbot
alittle client
alphabot
amazonbot
anthropic
aspiegelbot
auto spider
awariorssbot
awariosmartbot
babya discoverer
baidu
barkrowler
biginfolabs
blexbot
botnet
brands-bot
brandwatch
buck
builtwith
bytespider
ccbot
censysinspect
chatgpt
checkmarknetwork
chrome privacy preserving prefetch proxy
claudebot
claude-web
cliqzbot
cohere
cosmos
crawlagent
crawlson
css certificate spider
c\xd0\xbemp\xd0\xb0tible
dark_nexus_qbot
dataforseobot
daum
diffbot
digext
digitalshadowsbot
discordbot
domaincrawler
domains project
dotbot
dts agent
ebidag
expanse
eyemonit
facebookbot
facebookexternalhit
facebot
fatboykimcombot
finbot
funkwhale
garlikcrawler
gdnplus.com
gluten free crawler
go 1.1 package http
gokurou
google-extended
gptbot
grapeshot
gtnachatvru003
http banner detection
headlesschrome
infotigerbot
internet-structure-research-project-bot
internetmeasurement
ioncrawl
ips-agent
iviacrawler
jooblebot
kstandbot
libfetch
libwww-perl
lightspeedsystemscrawler
linkedinbot
ltx71
lwp::simple
magpie-crawler
mail.ru_bot
masscan
mazbot
mechanize
mediatoolkitbot
megaindex.ru
miniflux
mj12bot
mnogosearch
mojeek
monsidobot
moreover
mozilla/0
msiecrawler
msnbot
mybot
neevabot
netcraft
netsystemsresearch
nimbostratus-bot
nlpproject.info research
nutch
offline explorer
omgili
onalyticabot
p3p validator
peer39_crawler
pandalytics
panscient
paperlibot
perplexitybot
pdrlabs.net
petalbot
pinterest
proximic
proxychecker
researchscan
rootshell
safednsbot
scrapy
screaming frog seo spider
search.marginalia.nu
searchatlas.com seo crawler
seekportbot
semrushbot
seobilitybot
serendeputybot
seznambot
slurp
sogou web spider
tchelebi
trendiction
trendsmapresolver
twitterbot
turnitinbot
ubermetrics
virusdie crawler
vuhuvbot
wbsearchbot
webcapture
webzip
wf search
wget
whoareyoubot
xenu link sleuth
xforce-security
xmpp tiscali communicator
xovibot
yak
yisouspider
youbot
zoombot
zyborg
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | /media_proxy/ |
Disallow | /interact/ |
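The effect of the two groups can be checked with Python's standard `urllib.robotparser`. This is a sketch under stated assumptions: the robots.txt lines below are a reconstruction from the tables, only three of the listed agents are included for brevity, and the real file's layout and ordering are not shown by the scan. The agent name `ExampleFetcher` and the helper `build_parser` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan tables (assumed layout, abbreviated agent list).
ROBOTS_LINES = [
    "User-agent: gptbot",
    "User-agent: claudebot",
    "User-agent: ccbot",
    "Disallow: /",
    "",
    "User-agent: *",
    "Disallow: /media_proxy/",
    "Disallow: /interact/",
]


def build_parser(lines):
    """Parse robots.txt lines into a RobotFileParser without any network access."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


parser = build_parser(ROBOTS_LINES)
BASE = "https://h-net.social"

# A named agent from the first group is blocked everywhere.
print(parser.can_fetch("GPTBot", BASE + "/"))
# Any other agent falls through to the "*" group.
print(parser.can_fetch("ExampleFetcher", BASE + "/about"))
print(parser.can_fetch("ExampleFetcher", BASE + "/interact/1"))
print(parser.can_fetch("ExampleFetcher", BASE + "/media_proxy/x"))
```

Under these assumptions, agents named in the first group are denied every path, while all other agents are denied only the `/media_proxy/` and `/interact/` prefixes.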
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
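A crawler that honors this value would pause between requests. The scan lists crawl-delay under Other Records without saying which group it belongs to, so the sketch below assumes it sits in the `*` group. The agent name `ExampleFetcher`, the helper `wait_seconds`, and the fallback value are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed placement: crawl-delay inside the "*" group (not confirmed by the scan).
ROBOTS_LINES = [
    "User-agent: *",
    "Crawl-delay: 10",
    "Disallow: /media_proxy/",
    "Disallow: /interact/",
]

delay_parser = RobotFileParser()
delay_parser.parse(ROBOTS_LINES)


def wait_seconds(parser: RobotFileParser, agent: str, fallback: float = 1.0) -> float:
    """Return the crawl delay for an agent, or a fallback when none is declared."""
    delay = parser.crawl_delay(agent)
    return float(delay) if delay is not None else fallback


pause = wait_seconds(delay_parser, "ExampleFetcher")
print(pause)  # seconds a polite crawler would sleep between requests
```

A fetch loop would call `time.sleep(pause)` between requests. The sleep is left out here so the example runs instantly.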
Warnings
- 3 invalid lines.
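The scan does not say which lines it rejected or what rule it applied. As a rough illustration only, one plausible check treats any line that is neither blank, a comment, nor of the form `field: value` as invalid. The function `count_invalid_lines` and the sample text are hypothetical and do not reproduce the scanner's logic or the site's file.

```python
def count_invalid_lines(text: str) -> int:
    """Count lines that are not blank, not comments, and not 'field: value' records."""
    invalid = 0
    for raw in text.splitlines():
        # Drop trailing comments, then surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        field, sep, _value = line.partition(":")
        # A record needs a colon and a non-empty field name before it.
        if not sep or not field.strip():
            invalid += 1
    return invalid


# Hypothetical sample: the second and fourth lines are malformed.
sample_text = "User-agent: *\nDisallow /interact/\n# note\n: orphan value\n"
print(count_invalid_lines(sample_text))
```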