uswebsites.com
robots.txt

Robots Exclusion Standard data for uswebsites.com

Resource Scan

Scan Details

Site Domain uswebsites.com
Base Domain uswebsites.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-08-09T20:47:15+00:00
Next Scan 2025-11-07T20:47:15+00:00

Last Successful Scan

Scanned 2025-03-20T20:45:02+00:00
URL https://uswebsites.com/robots.txt
Domain IPs 40.160.24.203
Response IP 40.160.24.203
Found Yes
Hash 76fb4307a3898102e9db4bf0b54d06bcefa54e24ad7e2b1210baf4805bd51945
SimHash ab3cc56c62ec

Groups

mediapartners-google*
*

Rule Path
Disallow /administrator/
Disallow /cache/
Disallow /components/
Disallow /images/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /templates/
Disallow /tmp/
Disallow /xmlrpc/
Disallow /cgi-bin/
Disallow /cgi-bin/
Disallow /tmp/
Disallow /conf/
Disallow /counter/
Disallow /editpage/
Disallow /stats/
Disallow /etc/
Disallow /submit/digest/
Disallow /submit/forum/
Disallow /hitcounts/
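
The rules above can be checked against concrete paths with Python's standard `urllib.robotparser`. This is a minimal sketch, not the site's actual file: the report lists two groups but does not show which rules belong to which, so the code assumes every Disallow rule sits in the `*` group. The helper name `is_allowed` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths as listed in the report (repeated entries appear once here).
DISALLOWED = [
    "/administrator/", "/cache/", "/components/", "/images/",
    "/includes/", "/installation/", "/language/", "/libraries/",
    "/media/", "/modules/", "/plugins/", "/templates/", "/tmp/",
    "/xmlrpc/", "/cgi-bin/", "/conf/", "/counter/", "/editpage/",
    "/stats/", "/etc/", "/submit/digest/", "/submit/forum/",
    "/hitcounts/",
]

# Assumption: all rules belong to the "*" group. The original file's
# exact text is not reproduced in the report.
lines = ["User-agent: *"] + [f"Disallow: {path}" for path in DISALLOWED]

parser = RobotFileParser()
parser.parse(lines)  # parse in-memory lines; no network request is made


def is_allowed(path: str, user_agent: str = "*") -> bool:
    """Return True if user_agent may fetch the given path on uswebsites.com."""
    return parser.can_fetch(user_agent, "https://uswebsites.com" + path)


if __name__ == "__main__":
    for sample in ["/index.html", "/administrator/index.php", "/submit/"]:
        print(sample, is_allowed(sample))
```

Each rule is matched as a path prefix, so `/submit/` itself stays fetchable while anything under `/submit/forum/` is blocked.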