Robots Exclusion Standard data for thecombineforum.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | thecombineforum.com |
Base Domain | thecombineforum.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-15T06:34:35+00:00 |
Next Scan | 2024-09-22T06:34:35+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-07-13T06:31:39+00:00 |
URL | https://thecombineforum.com/robots.txt |
Domain IPs | 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91 |
Response IP | 151.101.65.91 |
Found | Yes |
Hash | 9259040de20f509f6e94ba83726fd20d23ff9ee1aa0d6b89bb502f209f188c8a |
SimHash | c479d842a201 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /account/ |
Disallow | /goto/ |
Disallow | /login/ |
Disallow | /search/ |
Disallow | /members/ |
Disallow | /admin.php |
Disallow | /business/directory |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://thecombineforum.com/sitemap.xml |
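
The rules and sitemap record above can be checked programmatically. The sketch below rebuilds a robots.txt body from the two tables and evaluates a few paths with Python's standard `urllib.robotparser`. The exact text of the original file (comments, blank lines, line order) is not recorded in this scan, so `ROBOTS_TXT` is an assumed reconstruction, and `BASE_URL`, `is_allowed`, and the sample paths such as `/threads/example` are hypothetical names chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction from the Groups and Other Records tables.
# The real file's formatting and ordering may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /account/
Disallow: /goto/
Disallow: /login/
Disallow: /search/
Disallow: /members/
Disallow: /admin.php
Disallow: /business/directory
Allow: /

Sitemap: https://thecombineforum.com/sitemap.xml
"""

BASE_URL = "https://thecombineforum.com"

# Parse the in-memory text instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path, user_agent="*"):
    """Return True if the parsed rules let user_agent fetch path."""
    return parser.can_fetch(user_agent, BASE_URL + path)


if __name__ == "__main__":
    # Sample paths are illustrative, not taken from the site.
    for path in ("/", "/threads/example", "/members/1", "/admin.php"):
        print(path, is_allowed(path))
    print(parser.site_maps())
```

`urllib.robotparser` applies rules in file order and stops at the first match, whereas some crawlers use longest-match precedence. Because every `Disallow` path here is longer than the catch-all `Allow: /` and is listed before it, both strategies should give the same result for this rule set.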