cloudcis.com
robots.txt

Robots Exclusion Standard data for cloudcis.com

Resource Scan

Scan Details

Site Domain cloudcis.com
Base Domain cloudcis.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-09-15T10:25:44+00:00
Next Scan 2025-11-14T10:25:44+00:00

Last Successful Scan

Scanned2025-06-23T19:26:57+00:00
URL http://cloudcis.com/robots.txt
Domain IPs 2a01:488:42:1000:50ed:844a:ffb0:d14f, 87.230.39.173
Response IP 87.230.39.173
Found Yes
Hash b246e97d24f7e63775eb519c2baafb72bc22ac2767a29f242f1c0b99cc243f10
SimHash a21d1d1803f4

Groups

*

Rule Path
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/

Comments

  • If the Joomla site is installed within a folder
  • eg www.example.com/joomla/ then the robots.txt file
  • MUST be moved to the site root
  • eg www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to all of the
  • paths.
  • eg the Disallow rule for the /administrator/ folder MUST
  • be changed to read
  • Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://tool.motoricerca.info/robots-checker.phtml