actuelconstruction.com
robots.txt

Robots Exclusion Standard data for actuelconstruction.com

Resource Scan

Scan Details

Site Domain actuelconstruction.com
Base Domain actuelconstruction.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-04-15T08:31:51+00:00
Next Scan 2024-07-14T08:31:51+00:00

Last Successful Scan

Scanned2023-06-21T08:30:07+00:00
URL https://actuelconstruction.com/robots.txt
Domain IPs 91.216.107.208
Response IP 91.216.107.208
Found Yes
Hash d974f42f070bb96d4601c706c54f883b73a2e8334117a270682608e58c6f8cc4
SimHash a21d1d1803f4

Groups

*

Rule Path
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/

Comments

  • If the Joomla site is installed within a folder
  • eg www.example.com/joomla/ then the robots.txt file
  • MUST be moved to the site root
  • eg www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to all of the
  • paths.
  • eg the Disallow rule for the /administrator/ folder MUST
  • be changed to read
  • Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://tool.motoricerca.info/robots-checker.phtml