cs.rutgers.edu
robots.txt

Robots Exclusion Standard data for cs.rutgers.edu

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cs.rutgers.edu
Base Domain	rutgers.edu
Scan Status	Ok
Last Scan	2024-05-25T07:41:19+00:00
Next Scan	2024-06-24T07:41:19+00:00

Last Scan

Scanned	2024-05-25T07:41:19+00:00
URL	https://cs.rutgers.edu/robots.txt
Redirect	https://www.cs.rutgers.edu/robots.txt
Redirect Domain	www.cs.rutgers.edu
Redirect Base	rutgers.edu
Domain IPs	128.6.48.178, 2620:0:d60:addd::b2
Redirect IPs	128.6.48.178, 2620:0:d60:addd::b2
Response IP	128.6.48.178
Found	Yes
Hash	2ea455c708ddef4d40f26facb95fb878fb55b19c355c3f0c85f5beb9f7097aa8
SimHash	e21f0d5bc9fc

Groups

*

Rule	Path
Disallow	/administrator/
Disallow	/api/
Disallow	/bin/
Disallow	/cache/
Disallow	/cli/
Disallow	/components/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/layouts/
Disallow	/libraries/
Disallow	/logs/
Disallow	/modules/
Disallow	/plugins/
Disallow	/tmp/

Rule

Path

Disallow

/administrator/

Disallow

/api/

Disallow

/bin/

Disallow

/cache/

Disallow

/cli/

Disallow

/components/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/layouts/

Disallow

/libraries/

Disallow

/logs/

Disallow

/modules/

Disallow

/plugins/

Disallow

/tmp/

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Comments

If the Joomla site is installed within a folder
eg www.example.com/joomla/ then the robots.txt file
MUST be moved to the site root
eg www.example.com/robots.txt
AND the joomla folder name MUST be prefixed to all of the
paths.
eg the Disallow rule for the /administrator/ folder MUST
be changed to read
Disallow: /joomla/administrator/
For more information about the robots.txt standard, see:
https://www.robotstxt.org/orig.html

Back to top

cs.rutgers.edurobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

amazonbot

bytespider

Comments

cs.rutgers.edu
robots.txt