smithharper.org
robots.txt

Robots Exclusion Standard data for smithharper.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	smithharper.org
Base Domain	smithharper.org
Scan Status	Ok
Last Scan	2025-11-25T07:24:52+00:00
Next Scan	2025-12-25T07:24:52+00:00

Last Scan

Scanned	2025-11-25T07:24:52+00:00
URL	https://smithharper.org/robots.txt
Domain IPs	104.21.18.166, 172.67.182.192, 2606:4700:3032::6815:12a6, 2606:4700:3035::ac43:b6c0
Response IP	104.21.18.166
Found	Yes
Hash	f585dee808db5d7ca6e79aab3f6d82510d131049ae5e2f9ce46f9c2aff66b151
SimHash	ab3cc47863c8

Groups

*

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	120

Field

Value

crawl-delay

120

*

Rule	Path
Disallow	/administrator/
Disallow	/cache/
Disallow	/components/
Disallow	/images/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/libraries/
Disallow	/media/
Disallow	/modules/
Disallow	/plugins/
Disallow	/templates/
Disallow	/tmp/
Disallow	/xmlrpc/

Rule

Path

Disallow

/administrator/

Disallow

/cache/

Disallow

/components/

Disallow

/images/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/libraries/

Disallow

/media/

Disallow

/modules/

Disallow

/plugins/

Disallow

/templates/

Disallow

/tmp/

Disallow

/xmlrpc/

Back to top

smithharper.orgrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

*

smithharper.org
robots.txt