grsu.by
robots.txt

Robots Exclusion Standard data for grsu.by

Resource Scan

Scan Details

Site Domain grsu.by
Base Domain grsu.by
Scan Status Ok
Last Scan2024-09-28T04:06:27+00:00
Next Scan 2024-10-28T04:06:27+00:00

Last Scan

Scanned2024-09-28T04:06:27+00:00
URL https://grsu.by/robots.txt
Redirect https://www.grsu.by/robots.txt
Redirect Domain www.grsu.by
Redirect Base grsu.by
Domain IPs 195.50.7.205
Redirect IPs 195.50.7.205
Response IP 195.50.7.205
Found Yes
Hash fb385ce0ef8dbcdce09912bb18a1f7a99aa7dc351debff3f885dbca1361b39df
SimHash e10e151f8be4

Groups

*

Rule Path
Disallow /administrator
Disallow /bin
Disallow /cache
Disallow /cli
Disallow /components
Disallow /includes
Disallow /installation
Disallow /language
Disallow /layouts
Disallow /libraries
Disallow /logs
Disallow /modules
Disallow /plugins
Disallow /pma
Disallow /tmp
Disallow /publications.php
Allow /cache/widgetkit
Allow /components/com_k2/css
Allow /plugins/system
Allow /modules/mod_djimageslider
Allow /components/com_k2/images/system/blank.gif

Other Records

Field Value
sitemap https://www.grsu.by/sitemap.xml

Comments

  • If the Joomla site is installed within a folder such as at
  • e.g. www.example.com/joomla/ the robots.txt file MUST be
  • moved to the site root at e.g. www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to the disallowed
  • path, e.g. the Disallow rule for the /administrator/ folder
  • MUST be changed to read Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://tool.motoricerca.info/robots-checker.phtml

Warnings

  • `host` is not a known field.