houjindata.com
robots.txt

Robots Exclusion Standard data for houjindata.com

Resource Scan

Scan Details

Site Domain houjindata.com
Base Domain houjindata.com
Scan Status Ok
Last Scan2025-12-09T13:26:18+00:00
Next Scan 2025-12-23T13:26:18+00:00

Last Scan

Scanned2025-12-09T13:26:18+00:00
URL https://houjindata.com/robots.txt
Domain IPs 104.21.40.136, 172.67.152.4, 2606:4700:3031::6815:2888, 2606:4700:3035::ac43:9804
Response IP 104.21.40.136
Found Yes
Hash deca402e8dba9fd4e5c8cefb2a99d4beed3cf819712502851afcae6c6723644e
SimHash d1160450fb8e

Groups

seekportbot

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

adsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

zgrab

Rule Path
Disallow /

feeddemon

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

ioncrawl

Rule Path
Disallow /

neevabot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

checkmarknetwork

Rule Path
Disallow /

intelx.io_bot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

linespider

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

censysinspect

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

asktbfxtv

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

crawldaddy

Rule Path
Disallow /

coolpadwebkit

Rule Path
Disallow /

java

Rule Path
Disallow /

feedly

Rule Path
Disallow /

universalfeedparser

Rule Path
Disallow /

apachebench

Rule Path
Disallow /

swiftbot

Rule Path
Disallow /

zmeu

Rule Path
Disallow /

obot

Rule Path
Disallow /

jaunty

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

yyspider

Rule Path
Disallow /

digext

Rule Path
Disallow /

httpclient

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

httrack

Rule Path
Disallow /

apache-httpclient

Rule Path
Disallow /

harvest

Rule Path
Disallow /

audit

Rule Path
Disallow /

dirbuster

Rule Path
Disallow /

pangolin

Rule Path
Disallow /

nmap

Rule Path
Disallow /

sqln

Rule Path
Disallow /

hydra

Rule Path
Disallow /

parser

Rule Path
Disallow /

libwww

Rule Path
Disallow /

bbbike

Rule Path
Disallow /

sqlmap

Rule Path
Disallow /

w3af

Rule Path
Disallow /

owasp

Rule Path
Disallow /

nikto

Rule Path
Disallow /

fimap

Rule Path
Disallow /

havij

Rule Path
Disallow /

babykrokodil

Rule Path
Disallow /

netsparker

Rule Path
Disallow /

httperf

Rule Path
Disallow /

spbot

Rule Path
Disallow /

dnyzbot

Rule Path
Disallow /

researchscan

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

extlinksbot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

scooter

Rule Path
Disallow /

lycos_spider

Rule Path
Disallow /

fast-webcrawler

Rule Path
Disallow /

feedburner

Rule Path
Disallow /

internetmeasurement

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

amazon

Rule Path
Disallow /

applebot

Rule Path
Disallow /

awariobot

Rule Path
Disallow /

baispider

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

censysinspect

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

dnyzbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

extlinksbot

Rule Path
Disallow /

facebook

Rule Path
Disallow /

fiddler

Rule Path
Disallow /

go-http-client

Rule Path
Disallow /

googleother

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

member

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

net clr

Rule Path
Disallow /

okhttp

Rule Path
Disallow /

pangubot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

python-requests

Rule Path
Disallow /

researchscan

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

serankingbacklinksbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

shell

Rule Path
Disallow /

spbot

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

*

Rule Path
Disallow /*.js$
Disallow /*.css$
Disallow /wp-content/
Disallow /wp-includes/
Disallow /go/