lampenwelt.de
robots.txt

Robots Exclusion Standard data for lampenwelt.de

Resource Scan

Scan Details

Site Domain lampenwelt.de
Base Domain lampenwelt.de
Scan Status Ok
Last Scan2024-09-15T03:19:35+00:00
Next Scan 2024-10-15T03:19:35+00:00

Last Scan

Scanned2024-09-15T03:19:35+00:00
URL https://lampenwelt.de/robots.txt
Redirect https://www.lampenwelt.de/robots.txt
Redirect Domain www.lampenwelt.de
Redirect Base lampenwelt.de
Domain IPs 34.107.167.247
Redirect IPs 34.107.167.247
Response IP 34.107.167.247
Found Yes
Hash dbbca2274580e807656307669f29acec6d3387c9f02e58701ee919704aaffc62
SimHash 4106f068cbf3

Groups

*
adidxbot

Product Comment
adidxbot explicit mentioning of bing ads bot. It does not adhere to the agent wildcard.
Rule Path Comment
Disallow /checkout/ -
Disallow /customer/ -
Disallow /wishlist/ -
Disallow /catalogsearch/ -
Disallow /app/ -
Disallow /lib/ -
Disallow /*.php$ -
Disallow /*SID%3D -
Disallow /index.php/ -
Disallow /banner/ajax -
Disallow /lw_related/ajax -
Disallow /documents/ -
Disallow */article/*nocache%3D -
Disallow /bloom/widget/* -
Disallow /*?*&* block irrelevant filter combinations
Disallow /*?*~* block irrelevant filter value combinations
Disallow /*?*~* block tilde
Disallow /*?*&*&*p=* block irrelevant filter combinations with pagination
Disallow /*?*~*&p=* block irrelevant filter value combinations with pagination
Disallow /*?*~*&p=* block tilde
Allow /*?*&p=* allow relevant filter with pagination
Disallow /*?*___* -
Disallow /*?*category=* -
Disallow /*?*light_bulb=* -
Disallow /*?SESSIONNAME=* -
Allow /*utm_* -
Allow /*lw_view%3Dnocontent* -
Allow /*display%3Dproducts* -
Allow /*lw_om_view%3Drecotop* -
Allow /*block_sku%3D* -

barkrowler
buck
velenpublicwebcrawler
imagesiftbot

Rule Path
Disallow *

gptbot
applebot

Rule Path
Disallow /*?*p=*

adsbot-google-mobile

Rule Path
Disallow /bloom/widget/*

adsbot-google

Rule Path
Disallow /bloom/widget/*

mediapartners-google

Rule Path
Disallow /bloom/widget/*

googlebot-image

Rule Path
Disallow /bloom/widget/*

googlebot-video

Rule Path
Disallow /bloom/widget/*

pinterestbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.lampenwelt.de/media/sitemap/sitemap_de_de.xml

Comments

  • Disallow not needed crawlers
  • Disallow pagination crawls where not needed
  • Disallow widget crawls