cirp.net
robots.txt

Robots Exclusion Standard data for cirp.net

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cirp.net
Base Domain	cirp.net
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-07-25T23:41:34+00:00
Next Scan	2025-08-24T23:41:34+00:00

Last Successful Scan

Scanned	2025-06-03T23:12:25+00:00
URL	https://cirp.net/robots.txt
Domain IPs	2001:41d0:b00:6200::3, 91.134.190.172
Response IP	91.134.190.172
Found	Yes
Hash	0012621b9ed74483d551f86a7dec27c2fa4c34f70a107480d28f74303dc3b6dc
SimHash	a31d1d180bf4

Groups

*

Rule	Path
Disallow	/administrator/
Disallow	/bin/
Disallow	/cache/
Disallow	/cli/
Disallow	/components/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/layouts/
Disallow	/libraries/
Disallow	/logs/
Disallow	/modules/
Disallow	/plugins/
Disallow	/tmp/

Rule

Path

Disallow

/administrator/

Disallow

/bin/

Disallow

/cache/

Disallow

/cli/

Disallow

/components/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/layouts/

Disallow

/libraries/

Disallow

/logs/

Disallow

/modules/

Disallow

/plugins/

Disallow

/tmp/

Back to top

Comments

If the Joomla site is installed within a folder
eg www.example.com/joomla/ then the robots.txt file
MUST be moved to the site root
eg www.example.com/robots.txt
AND the joomla folder name MUST be prefixed to all of the
paths.
eg the Disallow rule for the /administrator/ folder MUST
be changed to read
Disallow: /joomla/administrator/
For more information about the robots.txt standard, see:
http://www.robotstxt.org/orig.html
For syntax checking, see:
http://tool.motoricerca.info/robots-checker.phtml

Back to top

Warnings

`disollow` is not a known field.

Back to top

cirp.netrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

Comments

Warnings

cirp.net
robots.txt