h-net.social
robots.txt

Robots Exclusion Standard data for h-net.social

Resource Scan

Scan Details

Site Domain h-net.social
Base Domain h-net.social
Scan Status Ok
Last Scan 2024-06-28T14:34:03+00:00
Next Scan 2024-06-29T14:34:03+00:00

Last Scan

Scanned 2024-06-28T14:34:03+00:00
URL https://h-net.social/robots.txt
Domain IPs 35.9.18.75
Response IP 35.9.18.75
Found Yes
Hash 8df099da114875ea9ad1549de70e82df45ab76ba11ea9a1f2a3518bb4410c16b
SimHash d34451307f21

Groups

accserver
admantx
adsbot
affiliatelabz
ahrefsbot
aihitbot
alittle client
alphabot
amazonbot
anthropic
aspiegelbot
auto spider
awariorssbot
awariosmartbot
babya discoverer
baidu
barkrowler
biginfolabs
blexbot
botnet
brands-bot
brandwatch
buck
builtwith
bytespider
ccbot
censysinspect
chatgpt
checkmarknetwork
chrome privacy preserving prefetch proxy
claudebot
claude-web
cliqzbot
cohere
cosmos
crawlagent
crawlson
css certificate spider
c\xd0\xbemp\xd0\xb0tible
dark_nexus_qbot
dataforseobot
daum
diffbot
digext
digitalshadowsbot
discordbot
domaincrawler
domains project
dotbot
dts agent
ebidag
expanse
eyemonit
facebookbot
facebookexternalhit
facebot
fatboykimcombot
finbot
funkwhale
garlikcrawler
gdnplus.com
gluten free crawler
go 1.1 package http
gokurou
google-extended
gptbot
grapeshot
gtnachatvru003
http banner detection
headlesschrome
infotigerbot
internet-structure-research-project-bot
internetmeasurement
ioncrawl
ips-agent
iviacrawler
jooblebot
kstandbot
libfetch
libwww-perl
lightspeedsystemscrawler
linkedinbot
ltx71
lwp::simple
magpie-crawler
mail.ru_bot
masscan
mazbot
mechanize
mediatoolkitbot
megaindex.ru
miniflux
mj12bot
mnogosearch
mojeek
monsidobot
moreover
mozilla/0
msiecrawler
msnbot
mybot
neevabot
netcraft
netsystemsresearch
nimbostratus-bot
nlpproject.info research
nutch
offline explorer
omgili
onalyticabot
p3p validator
peer39_crawler
pandalytics
panscient
paperlibot
perplexitybot
pdrlabs.net
petalbot
pinterest
proximic
proxychecker
researchscan
rootshell
safednsbot
scrapy
screaming frog seo spider
search.marginalia.nu
searchatlas.com seo crawler
seekportbot
semrushbot
seobilitybot
serendeputybot
seznambot
slurp
sogou web spider
tchelebi
trendiction
trendsmapresolver
twitterbot
turnitinbot
ubermetrics
virusdie crawler
vuhuvbot
wbsearchbot
webcapture
webzip
wf search
wget
whoareyoubot
xenu link sleuth
xforce-security
xmpp tiscali communicator
xovibot
yak
yisouspider
youbot
zoombot
zyborg

Rule Path
Disallow /

*

Rule Path
Disallow /media_proxy/
Disallow /interact/
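
The effect of the two groups above can be checked with a short sketch using Python's standard urllib.robotparser. The robots.txt text below is an abbreviated reconstruction made for illustration (only three of the listed agents are included, and the exact layout of the real file is an assumption); the helper name `allowed` and the agent name `ExampleFetcher` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated, reconstructed robots.txt (assumption: the real file lists
# every agent from the first group above before its "Disallow: /" rule).
ROBOTS_TXT = """\
User-agent: gptbot
User-agent: claudebot
User-agent: ccbot
Disallow: /

User-agent: *
Disallow: /media_proxy/
Disallow: /interact/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://h-net.social" + path)


if __name__ == "__main__":
    # A listed agent is blocked from the whole site.
    print("GPTBot /about:", allowed("GPTBot", "/about"))
    # Any other agent falls through to the "*" group.
    print("ExampleFetcher /about:", allowed("ExampleFetcher", "/about"))
    print("ExampleFetcher /media_proxy/x.png:",
          allowed("ExampleFetcher", "/media_proxy/x.png"))
```

Agents named in the first group are refused every path, while all other agents are refused only the /media_proxy/ and /interact/ prefixes. Note that urllib.robotparser matches agent names case-insensitively by substring, which may differ from how a given crawler interprets the file.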

Other Records

Field Value
crawl-delay 10

Warnings

  • 3 invalid lines.