michelefreeman.org
robots.txt

Robots Exclusion Standard data for michelefreeman.org

Resource Scan

Scan Details

Site Domain michelefreeman.org
Base Domain michelefreeman.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-11-06T11:13:17+00:00
Next Scan 2026-02-04T11:13:17+00:00

Last Successful Scan

Scanned2024-03-23T11:29:32+00:00
URL https://michelefreeman.org/robots.txt
Domain IPs 35.213.141.17
Response IP 35.213.141.17
Found Yes
Hash 1ccf627c353e775b75435613c161bf3e3670e245f0e5cb143154bfa6d159170e
SimHash e21c1d1b43f4

Groups

*

Rule Path
Allow /*.js*
Allow /*.css*
Allow /*.png*
Allow /*.jpg*
Allow /*.gif*
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /tmp/

Other Records

Field Value
crawl-delay 10

Comments

  • If the Joomla site is installed within a folder
  • eg www.example.com/joomla/ then the robots.txt file
  • MUST be moved to the site root
  • eg www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to all of the
  • paths.
  • eg the Disallow rule for the /administrator/ folder MUST
  • be changed to read
  • Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://tool.motoricerca.info/robots-checker.phtml