parokiboro.org
robots.txt

Robots Exclusion Standard data for parokiboro.org

Resource Scan

Scan Details

Site Domain parokiboro.org
Base Domain parokiboro.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-04-18T12:12:47+00:00
Next Scan 2024-06-17T12:12:47+00:00

Last Successful Scan

Scanned2024-02-12T12:10:58+00:00
URL https://parokiboro.org/robots.txt
Domain IPs 103.253.212.248, 2001:df0:27b:2::c0c7
Response IP 103.253.212.248
Found Yes
Hash 22aa0adeed7730e6a83520a4013a3500ccb280a2f55c57b5a59183ffd05d611b
SimHash a31f155943e5

Groups

*

Rule Path
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/

Comments

  • If the Joomla site is installed within a folder such as at
  • e.g. www.example.com/joomla/ the robots.txt file MUST be
  • moved to the site root at e.g. www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to the disallowed
  • path, e.g. the Disallow rule for the /administrator/ folder
  • MUST be changed to read Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://tool.motoricerca.info/robots-checker.phtml