
Robots Exclusion Standard data for houseofnames.com

Resource Scan

Scan Details

Site Domain houseofnames.com
Base Domain houseofnames.com
Scan Status Ok
Last Scan 2024-05-19T07:20:31+00:00
Next Scan 2024-06-18T07:20:31+00:00

Last Scan

Scanned 2024-05-19T07:20:31+00:00
URL https://houseofnames.com/robots.txt
Domain IPs 130.211.27.46
Response IP 130.211.27.46
Found Yes
Hash cc66b39e04885633cc3f95fe2b4f2902e94035517a2885e83ac081109ac8e77f
SimHash 5f1e47f38f3d

Groups

*

Rule Path
Disallow /secure/honcheckout.asp
Disallow /cookie_detect.asp
Disallow /cookiesDisabled.html
Disallow /specialpriceprod.asp
Disallow /namesearch.asp
Disallow /multinamesearch.asp
Disallow /honsearchresults.asp
Disallow /nameresults.asp
Disallow /multinameresults.asp
Disallow /qx?
Disallow /flash/
Disallow /i/t/
Disallow /filenotfound.asp
Disallow /errorfiles/
Disallow /images/pixel_trans.gif?asp=fc
Disallow /images/pixel_trans.gif
Disallow /mk/
Disallow /xq/asp/item
Disallow /checkout.asp
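
As a rough illustration of how a crawler might apply the `*` group above, the sketch below rebuilds a few of the listed rules as robots.txt text and checks paths with Python's standard `urllib.robotparser`. The agent name `examplebot`, the helper `is_allowed`, and the sample paths are assumptions made for the example, and the reconstructed text is only a subset of the real file.

```python
from urllib.robotparser import RobotFileParser

# A subset of the "*" rules listed above, rebuilt as robots.txt text.
# The exact layout of the real file is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /secure/honcheckout.asp
Disallow: /namesearch.asp
Disallow: /flash/
Disallow: /checkout.asp
"""


def is_allowed(path, agent="examplebot"):
    """Return True if `agent` (hypothetical name) may fetch `path`."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, "https://houseofnames.com" + path)


if __name__ == "__main__":
    for path in ("/flash/intro.swf", "/checkout.asp", "/some-page"):
        print(path, is_allowed(path))
```

The standard-library parser matches rules by path prefix, so anything under `/flash/` is treated as disallowed while an unlisted path is allowed.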

yandex

No rules defined. All paths allowed.

Other Records

Field Value Comment
crawl-delay 4 specifies a 4 second timeout
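
The yandex group carries no path rules, only a crawl-delay record. A minimal sketch of reading that value with `urllib.robotparser` (Python 3.6 or later) follows; the two-line robots.txt text is reconstructed from the report, and the helper name `delay_for` and the agent `examplebot` are assumptions for illustration.

```python
from urllib.robotparser import RobotFileParser

# The yandex group as reported above, rebuilt as robots.txt text (assumed layout).
ROBOTS_TXT = """\
User-agent: yandex
Crawl-delay: 4
"""


def delay_for(agent):
    """Return the crawl delay in seconds for `agent`, or None if none is set."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.crawl_delay(agent)


if __name__ == "__main__":
    print(delay_for("yandex"))      # delay declared for this group
    print(delay_for("examplebot"))  # no matching group in this fragment
```

A polite client would sleep for the returned number of seconds between requests; crawl-delay is a non-standard extension, so not every crawler honors it.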

ia_archiver

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

fetch api request

Rule Path
Disallow /

psbot

Rule Path
Disallow /

custom bot/robot

Product Comment
custom bot/robot 20
Rule Path
Disallow /

w3crobot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

asterias crawler

Rule Path
Disallow /

ms frontpage

Rule Path
Disallow /

iaea

Rule Path
Disallow /

sohu-search

Rule Path
Disallow /

szukacz (www.szukacz.pl)

Rule Path
Disallow /

sherlock

Rule Path
Disallow /

almaden.ibm.com/cs/crawler

Rule Path
Disallow /

libwww-perl

Rule Path
Disallow /

microsoft url control

Rule Path
Disallow /

appie

Rule Path
Disallow /

arachmo

Rule Path
Disallow /

mac finder

Rule Path
Disallow /

dts agent

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

missigua locator

Rule Path
Disallow /

larbin

Rule Path
Disallow /

jakarta commons-httpclient

Rule Path
Disallow /

nutchcvs (nutch.org)

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

findlinks

Rule Path
Disallow /

asterias

Rule Path
Disallow /

faxobot

Rule Path
Disallow /

netcraft web server survey

Rule Path
Disallow /

plantynet_webrobot

Rule Path
Disallow /

spam bot

Rule Path
Disallow /

advanced email extractor

Rule Path
Disallow /

zeus

Rule Path
Disallow /

gaisbot

Rule Path
Disallow /

iltrovatore-setaccio

Rule Path
Disallow /

stupid email harvester

Rule Path
Disallow /

webgo is

Rule Path
Disallow /

worqmada

Rule Path
Disallow /

picgrabber

Rule Path
Disallow /

docomo

Rule Path
Disallow /

openfind data gatherer

Rule Path
Disallow /

test

Rule Path
Disallow /

omniweb

Rule Path
Disallow /

e-collector

Rule Path
Disallow /

emailsyphon

Rule Path
Disallow /

webmon

Rule Path
Disallow /

grub

Rule Path
Disallow /

ia_archiver/1.6

Rule Path
Disallow /

microsoft-webdav

Rule Path
Disallow /

frontpage

Rule Path
Disallow /

spiderman

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

microsoft url control - 6.00.8862

Rule Path
Disallow /

microsoft data access internet publishing provider protocol discovery

Rule Path
Disallow /

microsoft data access

Rule Path
Disallow /

microsoft-webdav-miniredir/5.1.2600

Rule Path
Disallow /

webalta crawler/2.0

Rule Path
Disallow /

neevabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /
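
Most of the groups above block a named agent from the whole site with `Disallow /`. The sketch below shows how an agent-specific group takes precedence over the `*` group when checked with `urllib.robotparser`. Only two of the listed agents are included, the user-agent strings passed in are hypothetical, and `can_crawl` is a helper name introduced for the example.

```python
from urllib.robotparser import RobotFileParser

# Two of the fully blocked agents plus one "*" rule, rebuilt as robots.txt text.
# This is an abbreviated reconstruction, not the complete file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /checkout.asp

User-agent: ahrefsbot
Disallow: /

User-agent: mj12bot
Disallow: /
"""


def can_crawl(agent, path):
    """Return True if `agent` may fetch `path` under the rules above."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, "https://houseofnames.com" + path)


if __name__ == "__main__":
    print(can_crawl("AhrefsBot/7.0", "/"))            # matched by its own group
    print(can_crawl("examplebot", "/"))               # falls back to "*"
    print(can_crawl("examplebot", "/checkout.asp"))   # blocked by "*"
```

The parser compares the lowercased product token of the user-agent string against each group name, so a version suffix such as `/7.0` does not prevent the match.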

Other Records

Field Value
sitemap https://www.houseofnames.com/sitemap_index.xml
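
For completeness, the sitemap record can be read with the same standard-library parser. This is a small sketch assuming Python 3.8 or later, where `RobotFileParser.site_maps()` is available; the helper name `sitemaps_in` is an assumption, and the single-line robots.txt text is rebuilt from the record above.

```python
from urllib.robotparser import RobotFileParser

# The sitemap record reported above, rebuilt as a robots.txt line.
ROBOTS_TXT = "Sitemap: https://www.houseofnames.com/sitemap_index.xml\n"


def sitemaps_in(text):
    """Return the list of sitemap URLs declared in `text`, or None if absent."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser.site_maps()


if __name__ == "__main__":
    print(sitemaps_in(ROBOTS_TXT))
```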