michelefreeman.org
robots.txt

Robots Exclusion Standard data for michelefreeman.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	michelefreeman.org
Base Domain	michelefreeman.org
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-11-06T11:13:17+00:00
Next Scan	2026-02-04T11:13:17+00:00

Last Successful Scan

Scanned	2024-03-23T11:29:32+00:00
URL	https://michelefreeman.org/robots.txt
Domain IPs	35.213.141.17
Response IP	35.213.141.17
Found	Yes
Hash	1ccf627c353e775b75435613c161bf3e3670e245f0e5cb143154bfa6d159170e
SimHash	e21c1d1b43f4

Groups

*

Rule	Path
Allow	/.js
Allow	/.css
Allow	/.png
Allow	/.jpg
Allow	/.gif
Disallow	/administrator/
Disallow	/bin/
Disallow	/cache/
Disallow	/cli/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/layouts/
Disallow	/libraries/
Disallow	/logs/
Disallow	/tmp/

Rule

Path

Allow

/*.js*

Allow

/*.css*

Allow

/*.png*

Allow

/*.jpg*

Allow

/*.gif*

Disallow

/administrator/

Disallow

/bin/

Disallow

/cache/

Disallow

/cli/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/layouts/

Disallow

/libraries/

Disallow

/logs/

Disallow

/tmp/

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

10

Back to top

Comments

If the Joomla site is installed within a folder
eg www.example.com/joomla/ then the robots.txt file
MUST be moved to the site root
eg www.example.com/robots.txt
AND the joomla folder name MUST be prefixed to all of the
paths.
eg the Disallow rule for the /administrator/ folder MUST
be changed to read
Disallow: /joomla/administrator/
For more information about the robots.txt standard, see:
http://www.robotstxt.org/orig.html
For syntax checking, see:
http://tool.motoricerca.info/robots-checker.phtml

Back to top

michelefreeman.orgrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

Other Records

Comments

michelefreeman.org
robots.txt