pdc-big.co.uk
robots.txt
Robots Exclusion Standard data for pdc-big.co.uk
Resource Scan
Scan Details
Site Domain | pdc-big.co.uk |
Base Domain | pdc-big.co.uk |
Scan Status | Ok |
Last Scan | 2025-04-03T15:09:33+00:00 |
Next Scan | 2025-05-03T15:09:33+00:00 |
Last Scan
Scanned | 2025-04-03T15:09:33+00:00 |
URL | https://www.pdc-big.co.uk/robots.txt |
Domain IPs | 40.89.190.184 |
Response IP | 40.89.190.184 |
Found | Yes |
Hash | ec90e86a244caebb938b238bce78c7260cb09d0af79e9e3ec13d87d0cf6f58a7 |
SimHash | c371a355829b |
Groups
*
Rule | Path |
---|---|
Disallow | /index.php/ |
Disallow | /catalog/category/view/ |
Disallow | /catalog/product/gallery/ |
Disallow | /catalog/product/view/ |
Disallow | /catalog/product_compare/ |
Disallow | /catalogs/bulk/view/ |
Disallow | /checkout/ |
Disallow | /pdc_checkout/ |
Disallow | /customer/ |
Disallow | /dto/ |
Disallow | /newsletter/ |
Disallow | /nle/ |
Disallow | /poll/ |
Disallow | /review/ |
Disallow | /sales/ |
Disallow | /sendfriend/ |
Disallow | /tag/ |
Disallow | /var/ |
Disallow | /wishlist/ |
Disallow | /*?dir= |
Disallow | /*?limit= |
Disallow | /*?mode= |
Disallow | /*?order= |
Disallow | /*?SID= |
admantx
ahrefsbot
ahrefsbot
archive-org.com
baiduspider
betabot
blackwidow
blexbot
ccbot
chinaclaw
cmscrawler
cognitiveseo
contextad\ bot
crystalsemantics
custo
disco
domainoptima
domainsigma
dotbot
dotbot
download\ demon
easouspider
ecatch
eirgrabber
emailsiphon
emailwolf
exabot
exabot
express\ webpictures
extractorpro
eyenetie
flashget
fr-crawler
genieo
getright
getweb!
gigabot
go!zilla
go-ahead-got-it
golden-praga
grabnet
grafula
grapeshotcrawler
hmview
httpclient
httrack
ia_archiver
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
james\ bot
jetcar
joc\ web\ spider
larbin
leechftp
leikibot
libcurl
linkdexbot
lipperhey
livelap
lssrocket
magpie
mass\ downloader
meanpathbot
megaindex
memorybot
midown\ tool
mister\ pix
mj12bot
mj12bot
navroad
nearsite
nerdybot
net\ vampire
netants
netseer
netspider
netzip
nutch
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
petalbot
pleasebot
proximic
realdownload
reget
riddler
rogerbot
ru_bot
safesearch
searchmetricsbot
semalt
semrushbot
seokicks
seznambot
showyoubot
sistrix
sitesnagger
slurp
smartdownload
smtbot
sogou
spbot
spiderbot
stackoverflow
superbot
superhttp
surfbot
takeout
teleport\ pro
tineye
turnitinbot
twitter
twittmemebot
umbot
voideye
voilabot
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wotbox
wwwoffle
xaldon\ webspider
xovibot
yandex
zeus
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.pdc-big.co.uk/sitemap_en_gb.xml |
Warnings
- 2 invalid lines.
Comments