golfsitges.com
robots.txt

Robots Exclusion Standard data for golfsitges.com

Resource Scan

Scan Details

Site Domain golfsitges.com
Base Domain golfsitges.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2026-01-26T20:43:54+00:00
Next Scan 2026-03-27T20:43:54+00:00

Last Successful Scan

Scanned2025-11-04T16:30:15+00:00
URL https://golfsitges.com/robots.txt
Domain IPs 213.158.86.103
Response IP 213.158.86.103
Found Yes
Hash f1a6b5e3f6e69a71c53516c4bce70e2414429d23dab6814239500999eb80907e
SimHash e21f0d1b49fc

Groups

*

Rule Path
Disallow /administrator/
Disallow /api/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/

Comments

  • If the Joomla site is installed within a folder
  • eg www.example.com/joomla/ then the robots.txt file
  • MUST be moved to the site root
  • eg www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to all of the
  • paths.
  • eg the Disallow rule for the /administrator/ folder MUST
  • be changed to read
  • Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • https://www.robotstxt.org/orig.html