smithharper.org
robots.txt
Robots Exclusion Standard data for smithharper.org
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | smithharper.org |
| Base Domain | smithharper.org |
| Scan Status | Ok |
| Last Scan | 2025-11-25T07:24:52+00:00 |
| Next Scan | 2025-12-25T07:24:52+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-25T07:24:52+00:00 |
| URL | https://smithharper.org/robots.txt |
| Domain IPs | 104.21.18.166, 172.67.182.192, 2606:4700:3032::6815:12a6, 2606:4700:3035::ac43:b6c0 |
| Response IP | 104.21.18.166 |
| Found | Yes |
| Hash | f585dee808db5d7ca6e79aab3f6d82510d131049ae5e2f9ce46f9c2aff66b151 |
| SimHash | ab3cc47863c8 |
Groups
User-agent: *
No Allow or Disallow rules are defined in this group, so all paths are allowed.
Other Records
| Field | Value |
|---|---|
| crawl-delay | 120 |
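As a rough illustration of what a crawl-delay of 120 implies, the sketch below assumes the common interpretation of the value as a minimum number of seconds between successive requests. The directive is not part of RFC 9309, and crawlers differ in how, or whether, they honor it. The function name `max_requests_per_day` is hypothetical.

```python
# Assumption: crawl-delay is the minimum number of seconds between requests.
CRAWL_DELAY_SECONDS = 120  # value reported in the Other Records table
SECONDS_PER_DAY = 24 * 60 * 60


def max_requests_per_day(delay_seconds: int) -> int:
    """Upper bound on daily requests for a crawler that waits
    delay_seconds between consecutive requests."""
    if delay_seconds <= 0:
        raise ValueError("delay_seconds must be positive")
    return SECONDS_PER_DAY // delay_seconds


print(max_requests_per_day(CRAWL_DELAY_SECONDS))  # 720
```

Under that assumption, a crawler that respects the delay could fetch at most 720 pages from the site per day.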
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /administrator/ |
| Disallow | /cache/ |
| Disallow | /components/ |
| Disallow | /images/ |
| Disallow | /includes/ |
| Disallow | /installation/ |
| Disallow | /language/ |
| Disallow | /libraries/ |
| Disallow | /media/ |
| Disallow | /modules/ |
| Disallow | /plugins/ |
| Disallow | /templates/ |
| Disallow | /tmp/ |
| Disallow | /xmlrpc/ |
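The rules above can be checked against sample URLs with Python's standard `urllib.robotparser`. This is a minimal sketch, not a copy of the scanned file: it assumes the two `*` groups are merged into a single group (RFC 9309 asks crawlers to combine groups that match the same user agent, whereas CPython's parser appears to keep only the first `*` group it sees). The user-agent name `ExampleBot` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Disallowed prefixes as listed in the report.
DISALLOWED = [
    "/administrator/", "/cache/", "/components/", "/images/",
    "/includes/", "/installation/", "/language/", "/libraries/",
    "/media/", "/modules/", "/plugins/", "/templates/", "/tmp/",
    "/xmlrpc/",
]

# Assumption: both "*" groups merged into one group for parsing.
ROBOTS_LINES = ["User-agent: *", "Crawl-delay: 120"]
ROBOTS_LINES += [f"Disallow: {path}" for path in DISALLOWED]


def build_parser(lines: list[str]) -> RobotFileParser:
    """Parse robots.txt lines without making a network request."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


parser = build_parser(ROBOTS_LINES)
base = "https://smithharper.org"

# Hypothetical sample paths, chosen only to exercise the rules.
for path in ("/", "/index.php", "/administrator/index.php", "/images/logo.png"):
    allowed = parser.can_fetch("ExampleBot", base + path)
    print(f"{path}: {'allowed' if allowed else 'disallowed'}")

print("crawl-delay:", parser.crawl_delay("ExampleBot"))
```

Matching here is by simple path prefix, so anything outside the fourteen listed directories stays fetchable.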