global-standard.org
robots.txt

Robots Exclusion Standard data for global-standard.org

Resource Scan

Scan Details

Site Domain	global-standard.org
Base Domain	global-standard.org
Scan Status	Ok
Last Scan	2025-12-03T15:48:36+00:00
Next Scan	2026-01-02T15:48:36+00:00

Last Scan

Scanned	2025-12-03T15:48:36+00:00
URL	https://global-standard.org/robots.txt
Domain IPs	45.144.187.21
Response IP	45.144.187.21
Found	Yes
Hash	7fc02c41c3062834e9fec9e7232000f58561a4b362f26eb141ed2c12e6b6cc59
SimHash	e01e1559c3f5
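
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the scan does not state which algorithm was used. Assuming SHA-256 over the raw response body, a fingerprint for detecting changes between scans could be computed roughly as follows. The function name and the sample body are hypothetical and are not the scanned file.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256; the report
    itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical body for illustration only, not the file that was scanned.
sample = b"User-agent: *\nDisallow: /cache/\n"
digest = robots_fingerprint(sample)
print(len(digest))  # a SHA-256 hex digest is always 64 characters long
```

Comparing the digest of a fresh fetch with the stored one is enough to tell whether the file changed. A similarity hash such as the SimHash field serves a different purpose, estimating how much it changed, and is not covered by this sketch.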

Groups

*

Rule	Path
Allow	/*.js*
Allow	/*.css*
Allow	/*.png*
Allow	/*.jpg*
Allow	/*.gif*
Disallow	/administrator/
Disallow	/cache/
Disallow	/cli/

gptbot

Rule	Path
Disallow	/

chatgpt-user

Rule	Path
Disallow	/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/libraries/
Disallow	/logs/
Disallow	/tmp/
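
To see how a crawler might interpret the three groups listed above, here is a minimal sketch of a rule evaluator. It assumes the longest-match semantics described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie), which the scan report itself does not state. The wildcard Allow paths are taken in their `/*.js*` form, and the user-agent lookup is simplified to an exact, case-insensitive name match. `ExampleBot` and all function names are hypothetical.

```python
import re

# Rules per user-agent group, transcribed from the Groups section of the scan.
GROUPS = {
    "*": [
        ("allow", "/*.js*"), ("allow", "/*.css*"), ("allow", "/*.png*"),
        ("allow", "/*.jpg*"), ("allow", "/*.gif*"),
        ("disallow", "/administrator/"), ("disallow", "/cache/"),
        ("disallow", "/cli/"),
    ],
    "gptbot": [("disallow", "/")],
    "chatgpt-user": [
        ("disallow", "/"), ("disallow", "/includes/"),
        ("disallow", "/installation/"), ("disallow", "/language/"),
        ("disallow", "/libraries/"), ("disallow", "/logs/"),
        ("disallow", "/tmp/"),
    ],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Evaluate `path` against the group for `user_agent` (fallback: '*').

    Assumption: longest matching pattern wins; Allow wins an equal-length tie.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


print(is_allowed("GPTBot", "/index.php"))                    # False
print(is_allowed("ExampleBot", "/index.php"))                # True
print(is_allowed("ExampleBot", "/administrator/index.php"))  # False
```

Under these assumptions, every path is blocked for `gptbot` and `chatgpt-user` because `Disallow: /` matches everything, which also makes the remaining Disallow lines in the `chatgpt-user` group redundant. Real crawlers may resolve Allow/Disallow conflicts differently, so treat the results as illustrative.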

Other Records

Field	Value
sitemap	https://global-standard.org/index.php?option=com_jmap&view=sitemap&format=xml&lang=en

Comments

If the Joomla site is installed within a folder such as at
e.g. www.example.com/joomla/ the robots.txt file MUST be
moved to the site root at e.g. www.example.com/robots.txt
AND the joomla folder name MUST be prefixed to the disallowed
path, e.g. the Disallow rule for the /administrator/ folder
MUST be changed to read Disallow: /joomla/administrator/
For more information about the robots.txt standard, see:
http://www.robotstxt.org/orig.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html
JSitemap entries
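
The subfolder instruction in the comments above can be illustrated with a small helper that prefixes a folder name to every Disallow path. This is only a sketch: the function name is hypothetical, the `joomla` folder comes from the comment's own example, and only Disallow lines are rewritten because that is all the comment describes.

```python
def prefix_disallow_rules(lines, folder):
    """Prefix `folder` to each Disallow path; other lines pass through unchanged."""
    prefix = "/" + folder.strip("/")
    out = []
    for line in lines:
        directive, sep, path = line.partition(":")
        path = path.strip()
        if sep and directive.strip().lower() == "disallow" and path.startswith("/"):
            out.append(f"{directive.strip()}: {prefix}{path}")
        else:
            out.append(line)
    return out


rules = ["User-agent: *", "Disallow: /administrator/", "Disallow: /cache/"]
for line in prefix_disallow_rules(rules, "joomla"):
    print(line)
# User-agent: *
# Disallow: /joomla/administrator/
# Disallow: /joomla/cache/
```

The rewritten file would then be served from the site root, as the comment requires, rather than from inside the Joomla folder.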
