mohe.gov.my
robots.txt

Robots Exclusion Standard data for mohe.gov.my

Archived Snapshots

Resource Scan

Scan Details

Site Domain	mohe.gov.my
Base Domain	mohe.gov.my
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-04-03T17:06:33+00:00
Next Scan	2025-07-02T17:06:33+00:00

Last Successful Scan

Scanned	2024-11-12T17:05:10+00:00
URL	https://www.mohe.gov.my/robots.txt
Domain IPs	104.21.69.210, 172.67.213.91, 2606:4700:3034::6815:45d2, 2606:4700:3036::ac43:d55b
Response IP	104.21.69.210
Found	Yes
Hash	5d47a72861702b182db2b2b74a0189c1b16cba65d5bf67c2bd542fb4f9a695dd
SimHash	f21c151b43f4

Groups

*

Rule	Path
Allow	/.js
Allow	/.css
Allow	/.png
Allow	/.jpg
Allow	/.gif
Disallow	/administrator/
Disallow	/bin/
Disallow	/cache/
Disallow	/cli/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/layouts/
Disallow	/libraries/
Disallow	/logs/
Disallow	/tmp/

Rule

Path

Allow

/*.js*

Allow

/*.css*

Allow

/*.png*

Allow

/*.jpg*

Allow

/*.gif*

Disallow

/administrator/

Disallow

/bin/

Disallow

/cache/

Disallow

/cli/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/layouts/

Disallow

/libraries/

Disallow

/logs/

Disallow

/tmp/

Back to top

Comments

If the Joomla site is installed within a folder
eg www.example.com/joomla/ then the robots.txt file
MUST be moved to the site root
eg www.example.com/robots.txt
AND the joomla folder name MUST be prefixed to all of the
paths.
eg the Disallow rule for the /administrator/ folder MUST
be changed to read
Disallow: /joomla/administrator/
For more information about the robots.txt standard, see:
http://www.robotstxt.org/orig.html
For syntax checking, see:
http://tool.motoricerca.info/robots-checker.phtml

Back to top

mohe.gov.myrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

Comments

mohe.gov.my
robots.txt