chiaracriniti.com
robots.txt

Robots Exclusion Standard data for chiaracriniti.com

Resource Scan

Scan Details

Site Domain chiaracriniti.com
Base Domain chiaracriniti.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't establish SSL connection.
Last Scan2025-11-06T02:45:33+00:00
Next Scan 2026-02-04T02:45:33+00:00

Last Successful Scan

Scanned2024-03-23T00:33:44+00:00
URL https://chiaracriniti.com/robots.txt
Domain IPs 93.95.216.100
Response IP 93.95.216.100
Found Yes
Hash f1a6b5e3f6e69a71c53516c4bce70e2414429d23dab6814239500999eb80907e
SimHash e21f0d1b49fc

Groups

*

Rule Path
Disallow /administrator/
Disallow /api/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/

Comments

  • If the Joomla site is installed within a folder
  • eg www.example.com/joomla/ then the robots.txt file
  • MUST be moved to the site root
  • eg www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to all of the
  • paths.
  • eg the Disallow rule for the /administrator/ folder MUST
  • be changed to read
  • Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • https://www.robotstxt.org/orig.html