polaris.jackson.sparcc.org
robots.txt

Robots Exclusion Standard data for polaris.jackson.sparcc.org

Resource Scan

Scanned	2025-08-08T06:16:15+00:00
URL	https://polaris.jackson.sparcc.org/robots.txt
Domain IPs	3.218.203.100, 34.206.169.120, 54.84.154.202
Response IP	34.206.169.120
Found	Yes
Hash	4aae486cceccf06e2dc2b0309542ddb5b3da6bbb4dee73d724385871abe38f44
SimHash	ac12994ac565

Rule	Path
Disallow	/
Allow	/group/25228991/blog
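
The two rules above overlap, since every path that starts with /group/25228991/blog also starts with /. A minimal sketch of how a crawler could resolve that, assuming the longest-match precedence described in RFC 9309 (the longest matching path wins, and Allow wins a tie). The scan does not show which User-agent group the rules belong to, so the sketch assumes they apply to the crawler in question. The names RULES and is_allowed are hypothetical, and wildcard patterns are not handled.

```python
# Rules copied from the Rule/Path table in this scan.
RULES = [
    ("disallow", "/"),
    ("allow", "/group/25228991/blog"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        # The longer prefix wins; on equal length, "allow" wins.
        if len(prefix) > best_len or (len(prefix) == best_len and kind == "allow"):
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed
```

Under these assumptions only the blog path and anything beneath it is crawlable. A parser that applies rules in file order (first match wins) would instead stop at Disallow / and block the blog path as well, so the result depends on which precedence the crawler implements.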

Field	Value
crawl-delay	10
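
A sketch of how a client might honor the crawl-delay value above. crawl-delay is not part of RFC 9309 and crawlers that support it do not all interpret it the same way; this example assumes it means a pause of 10 seconds between consecutive requests. The function fetch_politely and its fetch and sleep parameters are hypothetical names introduced for illustration.

```python
import time

CRAWL_DELAY_SECONDS = 10  # value of the crawl-delay field in this scan


def fetch_politely(paths, fetch, delay=CRAWL_DELAY_SECONDS, sleep=time.sleep):
    """Call fetch(path) for each path, pausing `delay` seconds between requests."""
    results = []
    for index, path in enumerate(paths):
        if index > 0:
            sleep(delay)  # wait between requests, not before the first one
        results.append(fetch(path))
    return results
```

Passing sleep as a parameter keeps the pacing logic testable without actually waiting; a real crawler would leave the default time.sleep in place.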

$Id: robots.txt,v 1.9.2.1 2008/12/10 20:12:19 goba Exp $
robots.txt

This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.

This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt

For more information about the robots.txt standard, see:
http://www.robotstxt.org/wc/robots.html

For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html

See commit 32573af8f1721f855a91dae507c601ae0e255f0b to see what this looks like prior to the App Cutover

Directories
