farmasave.it
robots.txt

Robots Exclusion Standard data for farmasave.it

Resource Scan

Scan Details

Site Domain farmasave.it
Base Domain farmasave.it
Scan Status Ok
Last Scan2024-10-20T11:17:51+00:00
Next Scan 2024-11-19T11:17:51+00:00

Last Scan

Scanned2024-10-20T11:17:51+00:00
URL https://farmasave.it/robots.txt
Redirect https://www.farmasave.it/robots.txt
Redirect Domain www.farmasave.it
Redirect Base farmasave.it
Domain IPs 75.2.81.85, 99.83.215.215
Redirect IPs 83.229.32.44
Response IP 83.229.32.44
Found Yes
Hash 8c0f6b03d4d861d714d9564c82b092cabd9f3362ea45552d8f3c6f68791d4f00
SimHash ae94bd4347f1

Groups

*

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow

bingbot

Rule Path
Disallow

bingbot

Rule Path
Disallow

msnbot

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

yahoo! slurp

Rule Path
Disallow

duckduckbot

Rule Path
Disallow

*

Rule Path
Disallow /app/

*

Rule Path
Disallow /feed/

*
*

Rule Path Comment
Disallow /test-rapido-covid-19 -
Disallow /404/ -
Disallow /app/ -
Disallow /cgi-bin/ -
Disallow /downloader/ -
Disallow /admin/ -
Disallow /errors/ -
Disallow /includes/ -
Disallow /lib/ -
Disallow /magento/ -
Disallow /media/captcha/ -
Disallow /media/customer/ -
Disallow /media/dhl/ -
Disallow /media/downloadable/ -
Disallow /media/import/ -
Disallow /media/pdf/ -
Disallow /media/sales/ -
Disallow /media/tmp/ -
Disallow /media/wysiwyg/ -
Disallow /media/xmlconnect/ -
Disallow /pkginfo/ -
Disallow /report/ -
Disallow /scripts/ -
Disallow /shell/ -
Disallow /stats/ -
Disallow /var/ -
Disallow /fatture/ -
Disallow /index.php/ -
Disallow /catalog/product_compare/ -
Disallow /catalog/category/view/ -
Disallow /catalog/product/view/ -
Disallow /catalog/product/gallery/ -
Disallow /catalogsearch/ -
Disallow /checkout/ -
Disallow /control/ -
Disallow /contacts/ -
Disallow /customer/ -
Disallow /customize/ -
Disallow /newsletter/ -
Disallow /poll/ -
Disallow /review/ -
Disallow /sendfriend/ -
Disallow /tag/ -
Disallow /wishlist/ -
Disallow /api.php -
Disallow /cron.php -
Disallow /cron.sh -
Disallow /error_log -
Disallow /install.php -
Disallow /LICENSE.html -
Disallow /LICENSE.txt -
Disallow /LICENSE_AFL.txt -
Disallow /STATUS.txt -
Disallow /get.php Magento 1.5+
Disallow /README.txt -
Disallow /RELEASE_NOTES.txt -
Disallow /cleanup.php -
Disallow /apc.php -
Disallow /memcache.php -
Disallow /phpinfo.php -
Disallow /*.php$ -
Disallow /*?SID= -
Disallow /rss* -
Disallow /*PHPSESSID -

Other Records

Field Value
sitemap https://www.farmasave.it/sitemap/sitemap_farma_sum.xml

Comments

  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more informationsk abocut the robots.txt standard, see:
  • http://www.robotstxt.org/wc/robots.html
  • For syntax checking, see:
  • http://www.sxw.org.uk/computing/robots/check.html
  • Prevent blocking URL parameters with robots.txt
  • Use Google Webmaster Tools > Crawl > Url parameters instead
  • Crawlers Setup
  • Directories
  • Disallow: /js/
  • Disallow: /media/
  • Disallow: /media/catalog/
  • Disallow: /media/css/
  • Disallow: /media/css_secure/
  • Disallow: /media/js/
  • Disallow: /skin/
  • Paths (clean URLs)
  • Files
  • Do not index the general technical directories and files on a server
  • Paths (no clean URLs)
  • Disallow: /*.js$
  • Disallow: /*.css$

Warnings

  • 1 invalid line.