insite.guru
robots.txt

Robots Exclusion Standard data for insite.guru

Resource Scan

Scan Details

Site Domain insite.guru
Base Domain insite.guru
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-09-12T17:37:57+00:00
Next Scan 2025-12-11T17:37:57+00:00

Last Successful Scan

Scanned2025-05-16T16:35:57+00:00
URL https://insite.guru/robots.txt
Domain IPs 104.193.142.73
Response IP 104.193.142.73
Found Yes
Hash f1a6b5e3f6e69a71c53516c4bce70e2414429d23dab6814239500999eb80907e
SimHash e21f0d1b49fc

Groups

*

Rule Path
Disallow /administrator/
Disallow /api/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/

Comments

  • If the Joomla site is installed within a folder
  • eg www.example.com/joomla/ then the robots.txt file
  • MUST be moved to the site root
  • eg www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to all of the
  • paths.
  • eg the Disallow rule for the /administrator/ folder MUST
  • be changed to read
  • Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • https://www.robotstxt.org/orig.html