carpenterhq.com
robots.txt

Robots Exclusion Standard data for carpenterhq.com

Resource Scan

Scan Details

Site Domain carpenterhq.com
Base Domain carpenterhq.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't establish SSL connection.
Last Scan 2024-08-16T18:05:03+00:00
Next Scan 2024-11-14T18:05:03+00:00

Last Successful Scan

Scanned 2024-04-19T17:10:46+00:00
URL https://carpenterhq.com/robots.txt
Domain IPs 216.218.206.54
Response IP 216.218.206.54
Found Yes
Hash d974f42f070bb96d4601c706c54f883b73a2e8334117a270682608e58c6f8cc4
SimHash a21d1d1803f4

Groups

*

Rule Path
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/
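As a rough illustration of how a crawler might apply the group above, the sketch below rebuilds a robots.txt body from the listed rules and checks a few paths with Python's standard `urllib.robotparser`. The names `DISALLOWED`, `build_robots_txt`, `is_allowed` and the agent string `ExampleBot` are hypothetical, and the rebuilt text is an assumption about the file's layout, not a copy of the scanned file.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths copied from the "*" group in the scan record.
DISALLOWED = [
    "/administrator/", "/bin/", "/cache/", "/cli/", "/components/",
    "/includes/", "/installation/", "/language/", "/layouts/",
    "/libraries/", "/logs/", "/modules/", "/plugins/", "/tmp/",
]


def build_robots_txt(paths, user_agent="*"):
    """Render one robots.txt group (assumed layout, not the original file)."""
    lines = [f"User-agent: {user_agent}"]
    lines += [f"Disallow: {path}" for path in paths]
    return "\n".join(lines) + "\n"


# Parse the rebuilt text locally; no network request is made.
parser = RobotFileParser()
parser.parse(build_robots_txt(DISALLOWED).splitlines())


def is_allowed(url_path, agent="ExampleBot"):
    """Return True if the (hypothetical) agent may fetch url_path."""
    return parser.can_fetch(agent, url_path)
```

With these rules, `is_allowed("/index.php")` is true and `is_allowed("/administrator/index.php")` is false. The standard-library parser matches by path prefix, so `/administrator` without the trailing slash is not covered by the `/administrator/` rule.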

Comments

  • If the Joomla site is installed within a folder, e.g. www.example.com/joomla/, then the robots.txt file MUST be moved to the site root, e.g. www.example.com/robots.txt, AND the joomla folder name MUST be prefixed to all of the paths.
  • E.g. the Disallow rule for the /administrator/ folder MUST be changed to read: Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see: http://www.robotstxt.org/orig.html
  • For syntax checking, see: http://tool.motoricerca.info/robots-checker.phtml
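The folder-prefix instruction in the comments can be sketched as a small helper. This is only an illustration under stated assumptions: `prefix_disallow_paths` and `render_rules` are hypothetical names, and the folder name `joomla` is taken from the comment's own example.

```python
def prefix_disallow_paths(paths, folder):
    """Prefix each Disallow path with the folder the site is installed in.

    An empty folder (site installed at the root) leaves the paths unchanged.
    """
    folder = folder.strip("/")
    if not folder:
        return list(paths)
    return [f"/{folder}{path}" for path in paths]


def render_rules(paths):
    """Format paths as Disallow lines for a robots.txt at the site root."""
    return [f"Disallow: {path}" for path in paths]


# Example from the comments: /administrator/ under a "joomla" folder.
rules = render_rules(prefix_disallow_paths(["/administrator/", "/tmp/"], "joomla"))
```

Here `rules` holds `Disallow: /joomla/administrator/` and `Disallow: /joomla/tmp/`, which would go in the robots.txt at the site root rather than inside the joomla folder.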