pdc-big.co.uk
robots.txt

Robots Exclusion Standard data for pdc-big.co.uk

Resource Scan

Scan Details

Site Domain pdc-big.co.uk
Base Domain pdc-big.co.uk
Scan Status Ok
Last Scan2025-04-03T15:09:33+00:00
Next Scan 2025-05-03T15:09:33+00:00

Last Scan

Scanned2025-04-03T15:09:33+00:00
URL https://www.pdc-big.co.uk/robots.txt
Domain IPs 40.89.190.184
Response IP 40.89.190.184
Found Yes
Hash ec90e86a244caebb938b238bce78c7260cb09d0af79e9e3ec13d87d0cf6f58a7
SimHash c371a355829b

Groups

*

Rule Path
Disallow /index.php/
Disallow /catalog/category/view/
Disallow /catalog/product/gallery/
Disallow /catalog/product/view/
Disallow /catalog/product_compare/
Disallow /catalogs/bulk/view/
Disallow /checkout/
Disallow /pdc_checkout/
Disallow /customer/
Disallow /dto/
Disallow /newsletter/
Disallow /nle/
Disallow /poll/
Disallow /review/
Disallow /sales/
Disallow /sendfriend/
Disallow /tag/
Disallow /var/
Disallow /wishlist/
Disallow /*?dir=
Disallow /*?limit=
Disallow /*?mode=
Disallow /*?order=
Disallow /*?SID=

admantx
ahrefsbot
ahrefsbot
archive-org.com
baiduspider
betabot
blackwidow
blexbot
ccbot
chinaclaw
cmscrawler
cognitiveseo
contextad\ bot
crystalsemantics
custo
disco
domainoptima
domainsigma
dotbot
dotbot
download\ demon
easouspider
ecatch
eirgrabber
emailsiphon
emailwolf
exabot
exabot
express\ webpictures
extractorpro
eyenetie
flashget
fr-crawler
genieo
getright
getweb!
gigabot
go!zilla
go-ahead-got-it
golden-praga
grabnet
grafula
grapeshotcrawler
hmview
httpclient
httrack
ia_archiver
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
james\ bot
jetcar
joc\ web\ spider
larbin
leechftp
leikibot
libcurl
linkdexbot
lipperhey
livelap
lssrocket
magpie
mass\ downloader
meanpathbot
megaindex
memorybot
midown\ tool
mister\ pix
mj12bot
mj12bot
navroad
nearsite
nerdybot
net\ vampire
netants
netseer
netspider
netzip
nutch
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
petalbot
pleasebot
proximic
realdownload
reget
riddler
rogerbot
ru_bot
safesearch
searchmetricsbot
semalt
semrushbot
seokicks
seznambot
showyoubot
sistrix
sitesnagger
slurp
smartdownload
smtbot
sogou
spbot
spiderbot
stackoverflow
superbot
superhttp
surfbot
takeout
teleport\ pro
tineye
turnitinbot
twitter
twittmemebot
umbot
voideye
voilabot
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wotbox
wwwoffle
xaldon\ webspider
xovibot
yandex
zeus

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.pdc-big.co.uk/sitemap_en_gb.xml

Comments

  • Filters
  • SID
  • Bots

Warnings

  • 2 invalid lines.