duepstore.it
robots.txt

Robots Exclusion Standard data for duepstore.it

Resource Scan

Scan Details

Site Domain duepstore.it
Base Domain duepstore.it
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-11-13T11:09:46+00:00
Next Scan 2024-11-20T11:09:46+00:00

Last Successful Scan

Scanned 2024-10-13T10:50:24+00:00
URL https://duepstore.it/robots.txt
Domain IPs 217.64.201.138
Response IP 217.64.201.138
Found Yes
Hash dd11f32fc7ec67502a56498123cb9772e5a8e7bc4d3d3ef56bb39f9dafafa749
SimHash 0165325240b1

Groups

semrushbot
semrushbot-sa
mail.ru
dotbot
blexbot
blexbot/1.0
istellabot
istellabot/1.01.18
istellabot/1.01.18 +http://www.tiscali.it/
istellabot/1.10.2 +http://www.tiscali.it/
mozilla/5.0 (compatible; istellabot/1.01.18 +http://www.tiscali.it/)
turnitinbot
mj12bot
smtbot
smtbot/1.0
alphabot
alphaseobot
alphaseobot-sa
seekbot
seekport crawler
linguee bot

Rule Path
Disallow /

*

Rule Path
Disallow /wishlist/index/add/
Disallow /catalog/product_compare/add/
Disallow /newsletter/subscriber/new/

*

Rule Path
Disallow /CVS
Disallow /*.svn$
Disallow /*.idea$
Disallow /*.sql$
Disallow /*.tgz$
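
The `*` and `$` in the paths above are pattern syntax, not literal characters: under the convention described in RFC 9309, `*` matches any run of characters and a trailing `$` anchors the rule to the end of the URL path, while a rule without `$` is a prefix match. A minimal Python sketch of that matching follows. The helper names `rule_to_regex` and `matches` are hypothetical, percent-encoding is ignored, and this is not necessarily how any particular crawler or this scanner implements it.

```python
import re


def rule_to_regex(rule_path):
    """Translate a robots.txt path pattern into a compiled regex (illustrative helper)."""
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape literal text and turn each '*' into "any run of characters".
    pattern = ".*".join(re.escape(part) for part in rule_path.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def matches(rule_path, url_path):
    """Return True if the rule covers the URL path (prefix match unless anchored)."""
    return rule_to_regex(rule_path).match(url_path) is not None


print(matches("/*.sql$", "/backup/dump.sql"))   # rule ends the path with .sql
print(matches("/*.sql$", "/backup/dump.sql.gz"))
print(matches("/CVS", "/CVS/Entries"))
```

Under this reading, `/*.sql$` covers any path ending in `.sql` but not `/backup/dump.sql.gz`, and `/CVS` covers everything beneath that prefix.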

*

Rule Path
Disallow /admin/
Disallow /app/
Disallow /cgi-bin/
Disallow /downloader/
Disallow /errors/
Disallow /includes/
Disallow /lib/
Disallow /magento/
Disallow /pkginfo/
Disallow /shell/
Disallow /var/
Disallow /php_barcode/

*

Rule Path
Disallow /api.php
Disallow /cron.php
Disallow /cron.sh
Disallow /error_log
Disallow /get.php
Disallow /install.php
Disallow /LICENSE.html
Disallow /LICENSE.txt
Disallow /LICENSE_AFL.txt
Disallow /STATUS.txt
Disallow /README.txt
Disallow /RELEASE_NOTES.txt
Disallow /cleanup.php

*

Rule Path
Disallow /checkout/
Disallow /onestepcheckout/
Disallow /customer/
Disallow /customer/account/
Disallow /customer/account/login/

*

Rule Path
Disallow /public/
Disallow /statistiche/
Disallow /support/
Disallow /tracking/
Disallow /mysetting/
Disallow /mapcategory/
Disallow /ebay-track/
Disallow

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

adsbot-google-mobile

Rule Path
Disallow

adsbot-google-mobile-apps

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow
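
The groups above can be read as: the listed SEO and scraper bots are blocked from the whole site, all other crawlers are blocked from specific paths, and the Google crawlers have an empty `Disallow`, which disallows nothing. A hedged sketch of that reading using Python's standard `urllib.robotparser` is shown below. The robots.txt text is a shortened reconstruction for illustration, not the file as served: the several `*` groups are merged into one (this parser keeps only the first `*` group), and wildcard rules are left out because this parser does plain prefix matching. The agent name `ExampleBot` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction of a few groups from the scan (assumption, not the real file).
ROBOTS_TXT = """\
User-agent: semrushbot
User-agent: mj12bot
Disallow: /

User-agent: *
Disallow: /checkout/
Disallow: /customer/
Disallow: /api.php

User-agent: googlebot
Disallow:
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent, path):
    """Ask the parser whether `agent` may fetch `path` on the scanned host."""
    return parser.can_fetch(agent, "https://duepstore.it" + path)


for agent, path in [
    ("mj12bot", "/"),
    ("ExampleBot", "/checkout/cart/"),
    ("ExampleBot", "/catalog/"),
    ("googlebot", "/checkout/cart/"),
]:
    print(agent, path, allowed(agent, path))
```

Note that a crawler with its own group ignores the `*` rules entirely, which is why `googlebot` may fetch `/checkout/cart/` in this sketch.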

Other Records

Field Value
sitemap http://www.duepstore.it/sitemap.xml

Comments

  • Block bot crawlers
  • Block crawling of the wishlist (wishlist) and product comparison (product_compare) pages
  • Do not crawl development files and folders: CVS, svn directories and dump files
  • Do not crawl common Magento technical folders
  • Do not crawl common Magento files
  • Do not crawl checkout and user account pages
  • Personal script

Warnings

  • `user agent` is not a known field.
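
A warning like this typically means a line uses a field name the parser does not recognise, for example `User agent` written without the hyphen. A minimal sketch of such a check is below. The set of known fields and the sample text are assumptions made for illustration; the scanner's actual list and the offending line in the real file are not shown in this report.

```python
# Field names this sketch treats as known (an assumption, not the scanner's list).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_fields(robots_txt):
    """Return unrecognised field names in order of first appearance."""
    found = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found


# Hypothetical sample; the first line is missing the hyphen in "User-agent".
sample = (
    "User agent: ExampleBot\n"
    "Disallow: /\n"
    "\n"
    "User-agent: *\n"
    "Disallow: /checkout/\n"
)
print(unknown_fields(sample))
```

Rules that follow a misspelled `User agent` line are not attached to the crawler the author intended, so the group may be ignored or merged into a neighbouring one depending on the parser.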