solopiante.it
robots.txt

Robots Exclusion Standard data for solopiante.it

Resource Scan

Scan Details

Site Domain solopiante.it
Base Domain solopiante.it
Scan Status Ok
Last Scan 2025-10-28T03:35:38+00:00
Next Scan 2025-11-27T03:35:38+00:00

Last Scan

Scanned 2025-10-28T03:35:38+00:00
URL https://solopiante.it/robots.txt
Domain IPs 104.26.12.142, 104.26.13.142, 172.67.68.71, 2606:4700:20::681a:c8e, 2606:4700:20::681a:d8e, 2606:4700:20::ac43:4447
Response IP 172.67.68.71
Found Yes
Hash e50e1f8e855d35d67f8d74d33fe26c409fabce572770d69084728fe7aa16a4bf
SimHash ea5e98298434

Groups

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

google-extended

Rule Path
Allow /

claude-web

Rule Path
Allow /

facebookbot

Rule Path
Allow /

meta-externalagent

Rule Path
Allow /

bingbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

youbot

Rule Path
Allow /

ccbot

Rule Path
Allow /

ai2bot

Rule Path
Allow /

imagesiftbot

Rule Path
Allow /

googlebot

Rule Path
Allow /

*

Rule Path
Allow /*.css
Allow /*.js
Allow /*.png
Allow /*.jpg
Allow /*.jpeg
Allow /*.webp
Allow */modules/*.css
Allow */modules/*.js
Allow */modules/*.png
Allow */modules/*.jpg
Allow */modules/*.xml
Allow /contattaci
Disallow /negozi*
Disallow /*?orderby=
Disallow /*?orderway=
Disallow /*?tag=
Disallow /*?id_currency=
Disallow /*?search_query=
Disallow /*?back=
Disallow /*?n=
Disallow /*%26orderby%3D
Disallow /*%26orderway%3D
Disallow /*%26tag%3D
Disallow /*%26id_currency%3D
Disallow /*%26search_query%3D
Disallow /*%26back%3D
Disallow /*%26n%3D
Disallow /*controller%3Daddresses
Disallow /*controller%3Daddress
Disallow /*controller%3Dauthentication
Disallow /*controller%3Dcart
Disallow /*controller%3Ddiscount
Disallow /*controller%3Dfooter
Disallow /*controller%3Dget-file
Disallow /*controller%3Dheader
Disallow /*controller%3Dhistory
Disallow /*controller%3Didentity
Disallow /*controller%3Dimages.inc
Disallow /*controller%3Dinit
Disallow /*controller%3Dmy-account
Disallow /*controller%3Dorder
Disallow /*controller%3Dorder-slip
Disallow /*controller%3Dorder-detail
Disallow /*controller%3Dorder-follow
Disallow /*controller%3Dorder-return
Disallow /*controller%3Dorder-confirmation
Disallow /*controller%3Dpagination
Disallow /*controller%3Dpassword
Disallow /*controller%3Dpdf-invoice
Disallow /*controller%3Dpdf-order-return
Disallow /*controller%3Dpdf-order-slip
Disallow /*controller%3Dproduct-sort
Disallow /*controller%3Dsearch
Disallow /*controller%3Dstatistics
Disallow /*controller%3Dattachment
Disallow /*controller%3Dguest-tracking
Disallow */classes/
Disallow */config/
Disallow */controllers/
Disallow */css/
Disallow */download/
Disallow */js/
Disallow */localization/
Disallow */log/
Disallow */mails/
Disallow */override/
Disallow */pdf/
Disallow */src/
Disallow */tools/
Disallow */translations/
Disallow */upload/
Disallow */vendor/
Disallow */web/
Disallow */webservice/
Disallow /password-recovery
Disallow /address
Disallow /addresses
Disallow /login
Disallow /cart
Disallow /discount
Disallow /order-history
Disallow /identity
Disallow /my-account
Disallow /order-follow
Disallow /credit-slip
Disallow /order
Disallow /search
Disallow /guest-tracking
Disallow /order-confirmation
Allow /*.css
Allow /*.js
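
The `*` group above mixes wildcard Allow and Disallow rules, so which rule wins for a given URL is not obvious from the listing. The following is a minimal sketch of how such rules could be evaluated, assuming the longest-match semantics described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The function names `rule_to_regex` and `is_allowed` are hypothetical, only a small subset of the rules listed above is included, and individual crawlers may resolve conflicts differently.

```python
import re

# A subset of the '*' group rules listed above, as (verdict, pattern) pairs.
RULES = [
    ("allow", "/*.css"),
    ("allow", "/*.js"),
    ("allow", "/contattaci"),
    ("disallow", "/negozi*"),
    ("disallow", "/*?orderby="),
    ("disallow", "*/js/"),
    ("disallow", "/cart"),
    ("disallow", "/order"),
]


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the URL path, as in RFC 9309.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, url_path: str) -> bool:
    """Return True if url_path may be crawled under the given rules."""
    best = None  # ((pattern length, is_allow), verdict)
    for verdict, pattern in rules:
        if rule_to_regex(pattern).match(url_path):
            key = (len(pattern), verdict == "allow")
            if best is None or key > best[0]:
                best = (key, verdict)
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1] == "allow"


if __name__ == "__main__":
    for path in ["/cart", "/contattaci", "/negozi-roma",
                 "/piante?orderby=price", "/themes/js/app.js"]:
        print(path, is_allowed(RULES, path))
```

Under these assumptions, `/themes/js/app.js` matches both `Disallow */js/` and `Allow /*.js`. Both patterns are five characters long, so the tie goes to Allow in this sketch.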

Comments

  • robots.txt automatically generated by PrestaShop e-commerce open-source solution
  • http://www.prestashop.com - http://www.prestashop.com/forums
  • This file is to prevent the crawling and indexing of certain parts of your site by web crawlers and spiders run by sites like Yahoo! and Google. By telling these "robots" where not to go on your site, you save bandwidth and server resources.
  • For more information about the robots.txt standard, see: http://www.robotstxt.org/robotstxt.html
  • OpenAI bots (ChatGPT, GPT-4)
  • Google bot (Bard/Gemini)
  • Anthropic bot (Claude)
  • Meta bots (Facebook/Instagram AI)
  • Microsoft bot (Copilot/Bing AI)
  • Perplexity bot
  • You.com bot
  • Common bot for AI crawling
  • Other emerging AI bots
  • Traditional Googlebot (important for Google's AI)
  • Allow Directives
  • Private pages
  • Directories
  • Disallow: */cache/
  • Files
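
The comments above label the named groups by vendor, while the restrictive rules all sit in the `*` group. A minimal sketch of how a crawler could pick its group follows, assuming the RFC 9309 behaviour of matching the crawler's product token case-insensitively and falling back to `*` when no named group matches. The `GROUPS` mapping is abbreviated from the report and `select_group` is a hypothetical helper name.

```python
# Abbreviated from the Groups section: named bots get a single "Allow /",
# everything else falls through to the '*' group.
GROUPS = {
    "gptbot": [("allow", "/")],
    "claude-web": [("allow", "/")],
    "googlebot": [("allow", "/")],
    "*": [("disallow", "/cart"), ("disallow", "/order")],  # subset only
}


def select_group(groups, product_token: str):
    """Return the rule list a crawler with this product token would follow.

    Assumption: exact, case-insensitive match on the product token,
    with '*' as the fallback group.
    """
    key = product_token.lower()
    if key in groups:
        return groups[key]
    return groups.get("*", [])


if __name__ == "__main__":
    print(select_group(GROUPS, "GPTBot"))
    print(select_group(GROUPS, "SomeOtherBot"))
```

Under this reading, a crawler that matches a named group follows only that group, so the Disallow rules in the `*` group would not apply to it.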