Robots Exclusion Standard data for douglasallen.co.uk
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | douglasallen.co.uk |
Base Domain | douglasallen.co.uk |
Scan Status | Ok |
Last Scan | 2024-10-08T10:59:32+00:00 |
Next Scan | 2024-11-07T10:59:32+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-08T10:59:32+00:00 |
URL | https://douglasallen.co.uk/robots.txt |
Redirect | https://www.douglasallen.co.uk/robots.txt |
Redirect Domain | www.douglasallen.co.uk |
Redirect Base | douglasallen.co.uk |
Domain IPs | 75.2.60.5 |
Redirect IPs | 13.228.199.255, 2406:da18:b3d:e201::64, 2406:da18:b3d:e202::64, 52.74.166.77 |
Response IP | 52.74.166.77 |
Found | Yes |
Hash | 01a871ddeb11f36c9dc9815a27fa0868ed50241b23944917e4e2b6684ab5823a |
SimHash | 339472f3c9b0 |
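The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not state which algorithm was used or exactly which bytes were hashed, so the following is only a sketch under the assumption that a scanner hashes the raw response body to detect changes between scans. The function names `body_digest` and `has_changed` are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 digest of a fetched robots.txt body as lowercase hex.

    Assumption: SHA-256 over the raw bytes; the report does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a newly fetched body."""
    return body_digest(body) != previous_digest


# Illustrative body only, not the real file content.
sample = b"User-agent: *\nDisallow: /email/\n"
stored = body_digest(sample)
unchanged = has_changed(stored, sample)
changed = has_changed(stored, sample + b"Disallow: /14days/\n")
```

An exact digest like this only signals that something changed; a similarity hash such as the SimHash field would be needed to judge how much.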
Groups
*
Rule | Path |
---|---|
Disallow | /email/ |
Disallow | /customer-complaint/ |
Disallow | /14days/ |
Disallow | /referral-fees/ |
Disallow | /feature-your-dog/ |
Disallow | /about-cubitt-and-west-estate-agents/video-pre-view-1/ |
Disallow | /terms-and-conditions/ |
Disallow | /privacy-policy/ |
Disallow | /cookie-policy/ |
Disallow | /buy-property/video-pre-view/ |
Disallow | /status.txt |
ahrefsbot
ahrefssiteaudit
adbeat_bot
alexibot
aqua_products
archive.org_bot
archive
asterias
b2w/0.1
backdoorbot/1.0
becomebot
blekkobot
blexbot
blowfish/1.0
bookmark search tool
botalot
builtbottough
bullseye/1.0
bunnyslippers
ccbot
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
chroot
copernic
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
dittospyder
dotbot
dumbot
emailwolf
enterprise_search
enterprise_search/1.0
erocrawler
es
exabot
extractorpro
fairad client
flaming attackbot
foobot
gaisbot
getright/4.2
gigabot
grub
grub-client
go-http-client
harvest/1.5
hatena antenna
hloader
http://www.searchengineworld.com bot
http://www.webmasterworld.com bot
httplib
humanlinks
ia_archiver
ia_archiver/1.6
infonavirobot
iron33/1.0.2
jamesbot
jennybot
jetbot
jetbot/1.0
jorgee
kenjin spider
keyword density/0.9
larbin
lexibot
libweb/clshttp
linkextractorpro
linkpadbot
linkscan/8.1a unix
linkwalker
lnspiderguy
looksmart
lwp-trivial
lwp-trivial/1.34
mata hari
megalodon
microsoft url control
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
moget
moget/2.1
mozilla
mozilla/3
mozilla/4
mozilla/4.0 (compatible; bullseye; windows 95)
mozilla/4.0 (compatible; msie 4.0; windows 2000)
mozilla/4.0 (compatible; msie 4.0; windows 95)
mozilla/4.0 (compatible; msie 4.0; windows 98)
mozilla/4.0 (compatible; msie 4.0; windows nt)
mozilla/4.0 (compatible; msie 4.0; windows xp)
mozilla/5
msiecrawler
naver
nerdybot
netants
netmechanic
nicerspro
nutch
offline explorer
openbot
openfind
openfind data gathere
oracle ultra search
perman
prowebwalker
psbot
python-urllib
queryn metasearch
radiation retriever 1.1
repomonkey
repomonkey bait & tackle/v1.01
rma
rogerbot
scooter
screaming frog seo spider
searchpreview
semrushbot
semrushbot-sa
seokicks-robot
sitesnagger
sootle
spankbot
spanner
spbot
stanford
stanford comp sci
stanford compclub
stanford compsciclub
stanford spiderboys
surveybot_ignoreip
suzuran
szukacz/1.4
teleport
teleportpro
telesoft
teoma
the intraformant
thenomad
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
typhoeus
url control
url_spider_pro
urly warning
vci
vci webviewer vci webviewer win32
web image collector
webauto
webbandit
webbandit/3.50
webcopier
webenhancer
webmasterworld extractor
webmasterworldforumbot
websauger
website quester
webster pro
webstripper
webvac
webzip
webzip/4.0
wget/1.5.3
wget/1.6
www-collector-e
xenu's
zeus
zeus 32297 webster pro v2.9 win32
zeus link scout
Rule | Path |
---|---|
Disallow | / |
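To illustrate how the two groups above combine, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text is reconstructed from a small subset of the rules listed in this report, not copied from the live file, and `examplebot` is a hypothetical crawler name that appears in neither group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed subset of the scanned rules (assumption: the real file
# groups the named agents under a single "Disallow: /" as shown above).
ROBOTS_TXT = """\
User-agent: *
Disallow: /email/
Disallow: /privacy-policy/
Disallow: /status.txt

User-agent: ahrefsbot
User-agent: semrushbot
Disallow: /

Sitemap: https://www.douglasallen.co.uk/sitemap.xml
"""

BASE = "https://www.douglasallen.co.uk"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# An unlisted crawler falls back to the "*" group: only listed paths are blocked.
generic_home = parser.can_fetch("examplebot", BASE + "/")
generic_email = parser.can_fetch("examplebot", BASE + "/email/")

# A crawler named in the second group is blocked from the whole site.
blocked_home = parser.can_fetch("semrushbot", BASE + "/")

# Sitemap records are exposed separately from the groups (Python 3.8+).
sitemaps = parser.site_maps()
```

Note that `urllib.robotparser` matches agent names by case-insensitive substring, so other crawlers may interpret entries such as `mozilla` differently.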
Other Records
Field | Value |
---|---|
sitemap | https://www.douglasallen.co.uk/sitemap.xml |
Warnings
- `host` is not a known field.