Robots Exclusion Standard data for newcelica.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | newcelica.org |
Base Domain | newcelica.org |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-11-02T16:48:34+00:00 |
Next Scan | 2024-11-09T16:48:34+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-07-19T04:04:26+00:00 |
URL | https://newcelica.org/robots.txt |
Domain IPs | 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91 |
Response IP | 151.101.193.91 |
Found | Yes |
Hash | d8cf692943df0076d2545c1d0b6fa7bb9b7defccbdb7e14264693b8d9c0f9fd7 |
SimHash | 84195842a603 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /account/ |
Disallow | /goto/ |
Disallow | /login/ |
Disallow | /search/ |
Disallow | /members/ |
Disallow | /admin.php |
Disallow | /business/directory |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://newcelica.org/sitemap.xml |
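The group rules and sitemap record above can be checked against sample paths with Python's standard `urllib.robotparser`. This is a minimal sketch, not the scanner's own logic. The `ROBOTS_TXT` text is an assumed reconstruction from the tables (the file's exact bytes are not shown here, so it will not reproduce the listed Hash), and the agent name `ExampleBot`, the helper `is_allowed`, and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the Groups and Other Records tables.
# Rule order follows the table; the original whitespace and comments are unknown.
ROBOTS_TXT = """\
User-agent: *
Disallow: /account/
Disallow: /goto/
Disallow: /login/
Disallow: /search/
Disallow: /members/
Disallow: /admin.php
Disallow: /business/directory
Allow: /

Sitemap: https://newcelica.org/sitemap.xml
"""

BASE = "https://newcelica.org"

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path, agent="ExampleBot"):
    """Return True if the hypothetical agent may fetch BASE + path."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # Hypothetical sample paths, chosen only to exercise the rules.
    for path in ["/", "/account/settings", "/admin.php", "/business/"]:
        print(path, is_allowed(path))
    print(parser.site_maps())
```

Python's parser applies the first matching rule in file order, whereas RFC 9309 specifies the longest matching path. With the `Disallow` lines listed before `Allow: /`, both approaches give the same answer for these rules.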