thenewx.org
robots.txt
Robots Exclusion Standard data for thenewx.org
Resource Scan
Scan Details
Site Domain | thenewx.org |
Base Domain | thenewx.org |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-19T18:46:27+00:00 |
Next Scan | 2024-09-26T18:46:27+00:00 |
Last Successful Scan
Scanned | 2024-07-17T14:36:01+00:00 |
URL | https://thenewx.org/robots.txt |
Domain IPs | 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91 |
Response IP | 151.101.65.91 |
Found | Yes |
Hash | 4bf9d5dc6711a1e255407c0e0aa42ebbbea1f9338b06e67958a946ec334317d1 |
SimHash | 845d5e42a601 |
Groups
*
Rule | Path |
---|---|
Disallow | /account/ |
Disallow | /goto/ |
Disallow | /login/ |
Disallow | /search/ |
Disallow | /members/ |
Disallow | /admin.php |
Disallow | /business/directory |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://thenewx.org/sitemap.xml |