sumy.life
robots.txt

Robots Exclusion Standard data for sumy.life

Resource Scan

Scan Details

Site Domain sumy.life
Base Domain sumy.life
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-03-12T06:37:16+00:00
Next Scan 2024-06-10T06:37:16+00:00

Last Successful Scan

Scanned 2023-11-14T05:26:53+00:00
URL https://sumy.life/robots.txt
Domain IPs 104.26.14.188, 104.26.15.188, 172.67.74.27, 2606:4700:20::681a:ebc, 2606:4700:20::681a:fbc, 2606:4700:20::ac43:4a1b
Response IP 104.26.14.188
Found Yes
Hash 97c87f1e050ea0ead076373282f50734e90f71734b825931f9a4739ca0d5ace8
SimHash a21d1d1803f4

Groups

*

Rule Path
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/
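
The rules above can be checked against a given path with Python's standard urllib.robotparser module. The following is a minimal sketch, not part of the scan itself: the ROBOTS_TXT text is reconstructed from the rule table, and the is_allowed helper and the "ExampleBot" user agent are hypothetical names chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table above (group "*"); an assumption,
# not a byte-for-byte copy of the scanned file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /administrator/
Disallow: /bin/
Disallow: /cache/
Disallow: /cli/
Disallow: /components/
Disallow: /includes/
Disallow: /installation/
Disallow: /language/
Disallow: /layouts/
Disallow: /libraries/
Disallow: /logs/
Disallow: /modules/
Disallow: /plugins/
Disallow: /tmp/
"""

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the (hypothetical) agent may fetch the given path."""
    return parser.can_fetch(agent, "https://sumy.life" + path)


if __name__ == "__main__":
    for path in ("/", "/administrator/index.php", "/tmp/file.txt"):
        print(path, is_allowed(path))
```

Because every rule ends with a trailing slash, the match is a prefix match on the directory: anything under /administrator/ is disallowed, while paths outside the listed directories remain allowed.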

Comments

  • If the Joomla site is installed within a folder, eg www.example.com/joomla/, then the robots.txt file MUST be moved to the site root, eg www.example.com/robots.txt, AND the joomla folder name MUST be prefixed to all of the paths.
  • eg the Disallow rule for the /administrator/ folder MUST be changed to read Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see: http://www.robotstxt.org/orig.html
  • For syntax checking, see: http://tool.motoricerca.info/robots-checker.phtml
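
The path prefixing described in the comments can be sketched as a small text transformation. This is an illustration only, assuming a simple "Field: value" line format; the prefix_rules function is a hypothetical helper, not something shipped with Joomla.

```python
def prefix_rules(robots_txt: str, folder: str) -> str:
    """Prefix every Allow/Disallow path with the given folder name.

    Hypothetical helper: lines that are not Allow/Disallow rules
    (comments, User-agent, blank lines, empty rules) pass through unchanged.
    """
    prefix = "/" + folder.strip("/")
    out = []
    for line in robots_txt.splitlines():
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() in ("allow", "disallow"):
            path = value.strip()
            # Only rewrite rules that carry an absolute path.
            if path.startswith("/"):
                line = f"{field.strip()}: {prefix}{path}"
        out.append(line)
    return "\n".join(out)


if __name__ == "__main__":
    original = "User-agent: *\nDisallow: /administrator/\nDisallow: /tmp/"
    print(prefix_rules(original, "joomla"))
```

The rewritten file still has to be placed at the site root, as the comments state; the sketch only covers the path rewrite.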