distrelec.be
robots.txt

Robots Exclusion Standard data for distrelec.be

Resource Scan

Scan Details

Site Domain distrelec.be
Base Domain distrelec.be
Scan Status Ok
Last Scan 2024-05-26T04:30:41+00:00
Next Scan 2024-06-25T04:30:41+00:00

Last Scan

Scanned 2024-05-26T04:30:41+00:00
URL https://distrelec.be/robots.txt
Redirect https://www.distrelec.be/robots.txt
Redirect Domain www.distrelec.be
Redirect Base distrelec.be
Domain IPs 52.58.122.104
Redirect IPs 2600:9000:2753:3a00:f:ffaf:24c0:93a1, 2600:9000:2753:4000:f:ffaf:24c0:93a1, 2600:9000:2753:5000:f:ffaf:24c0:93a1, 2600:9000:2753:6000:f:ffaf:24c0:93a1, 2600:9000:2753:7e00:f:ffaf:24c0:93a1, 2600:9000:2753:9000:f:ffaf:24c0:93a1, 2600:9000:2753:ba00:f:ffaf:24c0:93a1, 2600:9000:2753:ee00:f:ffaf:24c0:93a1, 54.192.18.109, 54.192.18.12, 54.192.18.87, 54.192.18.88
Response IP 108.157.254.124
Found Yes
Hash 49ce8511484ff31237517e7b8b6b495d7aa11b605388b4f223bab7048493c4da
SimHash 79ddc830ac9b

Groups

*

Rule Path
Disallow *?q=*&filter
Disallow *%26filterURL%3D*
Disallow */search
Disallow */cart
Disallow */checkout
Disallow */my-account
Disallow */login
Disallow *//app/etc/local.xml
Disallow */medias/sys_master/*.gz
Disallow */availability*
Disallow */bom-tool-upload*
Disallow */bom-tool*
Disallow *undefined*
Disallow */special-shops/*
Disallow */shop-in-shop/*
Disallow */Web/Downloads/*
Disallow */shopping/*
Disallow /compliance-document/*
Disallow */rs-welcome
Disallow */rs-registration
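
Most of the paths in this group use the `*` wildcard. The following is a minimal sketch of how a crawler might test a URL path against such patterns, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and matching starts at the beginning of the path. The helper names (`pattern_to_regex`, `is_disallowed`) and the sample paths are hypothetical, only a subset of the rules is listed, and Allow rules and percent-encoding normalisation are ignored.

```python
import re

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex (hypothetical helper)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces, then join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path, disallow_patterns):
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in disallow_patterns)

# A subset of the Disallow paths listed in the "*" group above.
RULES = ["*/search", "*/cart", "*/checkout", "/compliance-document/*", "*undefined*"]

# Illustrative paths only; they are not taken from the scanned site.
print(is_disallowed("/en/search?q=resistor", RULES))         # True
print(is_disallowed("/en/cart", RULES))                      # True
print(is_disallowed("/en/product/123", RULES))               # False
print(is_disallowed("/compliance-document/pdf/abc", RULES))  # True
```

Because every rule in this group is a Disallow, a single match is enough to block the path; a file that mixes Allow and Disallow rules would also need a precedence step, which this sketch omits.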

cazoodlebot
gigabot
admantx
wget
semvisubot
baiduspider
baiduspider-image
blexbot
yandexbot
yandex
sogou
exabot
spbot
fetchbot
betabot
linkpadbot
mail.ru_bot
seznambot
dotbot
yeti
smtbot
findxbot
genio
bubing
proximic
coccoc
grapeshotcrawler
savetheworldheritage.org
chatgpt-user
gptbot
claudebot
brightbot

Rule Path
Disallow /
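
The effect of this second group can be checked with Python's standard `urllib.robotparser`. The sketch below is illustrative only: the robots.txt text is a reconstruction from the scan data, not the file as served, and it includes just three of the listed user agents. The standard parser treats rule paths as plain prefixes, so only the blanket `Disallow: /` group is modelled here. The test URL and the `Googlebot` agent are assumptions chosen for the example.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment (assumption): three of the listed agents share one group.
ROBOTS_TXT = """\
User-agent: gptbot
User-agent: claudebot
User-agent: wget
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Hypothetical URL on the scanned host, used only for illustration.
url = "https://www.distrelec.be/en/"

# Agent names are compared case-insensitively.
print(parser.can_fetch("GPTBot", url))     # False: listed in the blocked group
print(parser.can_fetch("ClaudeBot", url))  # False
print(parser.can_fetch("Googlebot", url))  # True: no group in this fragment applies
```

In the full file, an agent that is not in the blocked list would fall back to the `*` group rather than being allowed everywhere.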

Other Records

Field Value
sitemap https://www.distrelec.be/sitemap_index.xml

Comments

  • Block Bad-Bots or Useless-Bot