catch-newz.com
robots.txt

Robots Exclusion Standard data for catch-newz.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	catch-newz.com
Base Domain	catch-newz.com
Scan Status	Ok
Last Scan	2024-09-25T10:54:10+00:00
Next Scan	2024-10-02T10:54:10+00:00

Last Scan

Scanned	2024-09-25T10:54:10+00:00
URL	https://catch-newz.com/robots.txt
Domain IPs	146.88.233.252
Response IP	146.88.233.252
Found	Yes
Hash	aef6be027d924db4f974757ee35b6c02abc74dd34c969daac867926b69b9abfb
SimHash	e21e155943f5

Groups

*

Rule	Path
Allow	/.js
Allow	/.css
Allow	/.png
Allow	/.jpg
Allow	/.gif
Disallow	/administrator/
Disallow	/bin/
Disallow	/cache/
Disallow	/cli/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/layouts/
Disallow	/libraries/
Disallow	/logs/
Disallow	/tmp/

Rule

Path

Allow

/*.js*

Allow

/*.css*

Allow

/*.png*

Allow

/*.jpg*

Allow

/*.gif*

Disallow

/administrator/

Disallow

/bin/

Disallow

/cache/

Disallow

/cli/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/layouts/

Disallow

/libraries/

Disallow

/logs/

Disallow

/tmp/

Back to top

Other Records

Field	Value
sitemap	https://www.catch-newz.com/index.php?option=com_jmap&view=sitemap&format=xml

Field

Value

sitemap

https://www.catch-newz.com/index.php?option=com_jmap&view=sitemap&format=xml

Back to top

Comments

If the Joomla site is installed within a folder such as at
e.g. www.example.com/joomla/ the robots.txt file MUST be
moved to the site root at e.g. www.example.com/robots.txt
AND the joomla folder name MUST be prefixed to the disallowed
path, e.g. the Disallow rule for the /administrator/ folder
MUST be changed to read Disallow: /joomla/administrator/
For more information about the robots.txt standard, see:
http://www.robotstxt.org/orig.html
For syntax checking, see:
http://tool.motoricerca.info/robots-checker.phtml

Back to top

catch-newz.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Comments

catch-newz.com
robots.txt