rappahannock.com
robots.txt

Robots Exclusion Standard data for rappahannock.com

Resource Scan

Scan Details

Site Domain	rappahannock.com
Base Domain	rappahannock.com
Scan Status	Ok
Last Scan	2024-08-31T07:27:44+00:00
Next Scan	2024-09-30T07:27:44+00:00

Last Scan

Scanned	2024-08-31T07:27:44+00:00
URL	https://rappahannock.com/robots.txt
Redirect	https://www.rappahannock.com/robots.txt
Redirect Domain	www.rappahannock.com
Redirect Base	rappahannock.com
Domain IPs	167.114.119.161
Redirect IPs	167.114.119.161
Response IP	167.114.119.161
Found	Yes
Hash	8618a253073c31a3aa4b0513a8fe06d87b678c73eb56b2f994f3b68daab84eba
SimHash	a8965d1ac764

Groups

*

Rule	Path
Disallow	/apps/
Disallow	/conf/
Disallow	/lib/
Disallow	/tests/
Disallow	/LICENSE
Disallow	/README.md
Disallow	/elefant
Disallow	/nginx.conf
Disallow	/phpunit.xml.dist
Disallow	/composer.json
Disallow	/admin/
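
As a rough illustration of how a crawler might evaluate the rules above, here is a minimal Python sketch using the standard-library `urllib.robotparser`. The `ROBOTS_TXT` text is reconstructed from the table, so the original file's exact layout is an assumption, and the `ExampleBot` user agent and the sample URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan table; the real file's layout may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /apps/
Disallow: /conf/
Disallow: /lib/
Disallow: /tests/
Disallow: /LICENSE
Disallow: /README.md
Disallow: /elefant
Disallow: /nginx.conf
Disallow: /phpunit.xml.dist
Disallow: /composer.json
Disallow: /admin/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.rappahannock.com"
AGENT = "ExampleBot"  # hypothetical crawler name

# Map a few sample paths to whether the "*" group permits fetching them.
results = {
    path: parser.can_fetch(AGENT, BASE + path)
    for path in ["/", "/about", "/admin/login", "/admin", "/composer.json"]
}
```

Under this parser, rules match by path prefix, so `/admin/login` and `/composer.json` are blocked, while `/admin` (no trailing slash) and the hypothetical `/about` are still allowed.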

Other Records

Field	Value
crawl-delay	10
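
A small Python sketch of what a 10-second crawl delay implies for request pacing. It assumes the record belongs to the `*` group; the report lists it separately, so its position in the original file is not certain. `ExampleBot` and the helper `fetch_offsets` are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumption: the crawl-delay record sits inside the "*" group.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Delay in seconds that applies to a hypothetical crawler named ExampleBot.
delay = parser.crawl_delay("ExampleBot")


def fetch_offsets(n_requests, delay_seconds):
    """Return the earliest start time, in seconds, for each of n requests."""
    return [i * delay_seconds for i in range(n_requests)]


# Three consecutive requests would start no earlier than 0 s, 10 s and 20 s.
offsets = fetch_offsets(3, delay)
```

`crawl-delay` is not part of the Robots Exclusion Protocol as standardized in RFC 9309, so not every crawler honors it.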

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
For more information about the robots.txt standard, see:
http://www.robotstxt.org/wc/robots.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html
