marshruti.eu
robots.txt

Robots Exclusion Standard data for marshruti.eu

Archived Snapshots

Resource Scan

Scan Details

Site Domain	marshruti.eu
Base Domain	marshruti.eu
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-10-17T00:04:41+00:00
Next Scan	2025-10-31T00:04:41+00:00

Last Successful Scan

Scanned	2025-10-02T00:03:15+00:00
URL	https://marshruti.eu/robots.txt
Domain IPs	79.98.104.27
Response IP	79.98.104.27
Found	Yes
Hash	7f30e9fb9558ab904cec4a2e0bff641096876d8234ca117da0fea6b8df24671b
SimHash	c01e1551cfe4

Groups

googlebot

Rule	Path
Allow	*.css
Allow	*.js
Allow	*.jpg
Allow	*.png
Allow	*.gif

Rule

Path

Allow

*.css

Allow

*.js

Allow

*.jpg

Allow

*.png

Allow

*.gif

*

Rule	Path
Allow	/.js
Allow	/.css
Allow	/.png
Allow	/.jpg
Allow	/.gif
Disallow	/administrator/
Disallow	/cli/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/libraries/
Disallow	/logs/
Disallow	/tmp/

Rule

Path

Allow

/*.js*

Allow

/*.css*

Allow

/*.png*

Allow

/*.jpg*

Allow

/*.gif*

Disallow

/administrator/

Disallow

/cli/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/libraries/

Disallow

/logs/

Disallow

/tmp/

Back to top

Other Records

Field	Value
sitemap	https://marshruti.eu/index.php?option=com_jmap&view=sitemap&format=xml

Field

Value

sitemap

https://marshruti.eu/index.php?option=com_jmap&view=sitemap&format=xml

Back to top

Comments

If the Joomla site is installed within a folder such as at
e.g. www.example.com/joomla/ the robots.txt file MUST be
moved to the site root at e.g. www.example.com/robots.txt
AND the joomla folder name MUST be prefixed to the disallowed
path, e.g. the Disallow rule for the /administrator/ folder
MUST be changed to read Disallow: /joomla/administrator/
For more information about the robots.txt standard, see:
http://www.robotstxt.org/orig.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html
Googlebot
global

Back to top

marshruti.eurobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

googlebot

*

Other Records

Comments

marshruti.eu
robots.txt