delonghi.com
robots.txt

Robots Exclusion Standard data for delonghi.com

Resource Scan

Scan Details

Site Domain delonghi.com
Base Domain delonghi.com
Scan Status Ok
Last Scan2024-10-18T20:50:10+00:00
Next Scan 2024-11-17T20:50:10+00:00

Last Scan

Scanned2024-10-18T20:50:10+00:00
URL https://delonghi.com/robots.txt
Redirect https://www.delonghi.com/robots.txt
Redirect Domain www.delonghi.com
Redirect Base delonghi.com
Domain IPs 35.190.208.151
Redirect IPs 125.252.219.182
Response IP 125.252.219.182
Found Yes
Hash 4bed35c4b70a1535865da2abbc5e9b10738d1b17047b3591470349243d533ebf
SimHash bc5957b6adc1

Groups

*

Rule Path
Disallow /*cart
Disallow /*checkout
Disallow /*my-account
Disallow /he-il
Disallow /kk-kz
Disallow /en-th
Disallow /en-pk
Disallow /en-in
Disallow /ru-ru/search-results?q=
Disallow /ru-ru/customer-support/faqs/-----?q=
Disallow /en-sys
Disallow /en-ar
Disallow /ru-ua

trendictionbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /
Allow /ru-ru

yandexbot

Rule Path
Disallow /
Allow /ru-ru

googlebot

Rule Path
Disallow /*_openstat
Disallow /*from%3Dadwords
Disallow /*?from=
Disallow /*%26sort%3D
Disallow /?refpage=
Disallow /ru-ru/search-results?q=
Disallow /ru-ru/customer-support/faqs/-----?q=
Disallow /*cart
Disallow /*checkout
Disallow /*my-account

sogou news spider

Rule Path
Disallow /
Allow /zh-cn

baiduspider

Rule Path
Disallow /
Allow /zh-cn

zumbot

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

voilabot

Rule Path
Disallow /

orangebot

Rule Path
Disallow /

jobdiggerspider

Rule Path
Disallow /

yeti

Rule Path
Disallow /
Allow /ko-kr

cazoodlebot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.delonghi.com/cs-cz/sitemap.xml
sitemap https://www.delonghi.com/da-dk/sitemap.xml
sitemap https://www.delonghi.com/de-at/sitemap.xml
sitemap https://www.delonghi.com/de-ch/sitemap.xml
sitemap https://www.delonghi.com/de-de/sitemap.xml
sitemap https://www.delonghi.com/el-gr/sitemap.xml
sitemap https://www.delonghi.com/en-ae/sitemap.xml
sitemap https://www.delonghi.com/en-au/sitemap.xml
sitemap https://www.delonghi.com/en-ca/sitemap.xml
sitemap https://www.delonghi.com/en-gb/sitemap.xml
sitemap https://www.delonghi.com/en-nz/sitemap.xml
sitemap https://www.delonghi.com/en-us/sitemap.xml
sitemap https://www.delonghi.com/es-es/sitemap.xml
sitemap https://www.delonghi.com/fi-fi/sitemap.xml
sitemap https://www.delonghi.com/fr-be/sitemap.xml
sitemap https://www.delonghi.com/fr-ca/sitemap.xml
sitemap https://www.delonghi.com/fr-ch/sitemap.xml
sitemap https://www.delonghi.com/fr-fr/sitemap.xml
sitemap https://www.delonghi.com/hr-hr/sitemap.xml
sitemap https://www.delonghi.com/hu-hu/sitemap.xml
sitemap https://www.delonghi.com/it-it/sitemap.xml
sitemap https://www.delonghi.com/ja-jp/sitemap.xml
sitemap https://www.delonghi.com/ko-kr/sitemap.xml
sitemap https://www.delonghi.com/lt-lt/sitemap.xml
sitemap https://www.delonghi.com/nb-no/sitemap.xml
sitemap https://www.delonghi.com/nl-be/sitemap.xml
sitemap https://www.delonghi.com/nl-nl/sitemap.xml
sitemap https://www.delonghi.com/pl-pl/sitemap.xml
sitemap https://www.delonghi.com/pt-pt/sitemap.xml
sitemap https://www.delonghi.com/ro-ro/sitemap.xml
sitemap https://www.delonghi.com/sl-si/sitemap.xml
sitemap https://www.delonghi.com/sk-sk/sitemap.xml
sitemap https://www.delonghi.com/sv-se/sitemap.xml
sitemap https://www.delonghi.com/zh-cn/sitemap.xml

Comments

  • For all robots
  • Block access to specific groups of SAP pages
  • Block access to specific non existing local folders
  • Block CazoodleBot as it does not present correct accept content headers
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot
  • Allow spiders to discover all the local sitemaps

Warnings

  • 4 invalid lines.