lccc.edu
robots.txt
Robots Exclusion Standard data for lccc.edu
Resource Scan
Scan Details
Site Domain | lccc.edu |
Base Domain | lccc.edu |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a server error. |
Last Scan | 2024-10-22T09:26:30+00:00 |
Next Scan | 2025-01-20T09:26:30+00:00 |
Last Successful Scan
Scanned | 2023-06-08T03:46:20+00:00 |
URL | https://www.lccc.edu/robots.txt |
Domain IPs | 67.22.129.60 |
Response IP | 67.22.129.60 |
Found | Yes |
Hash | 79bae4eb075d9a3aabf638619470151d5fca9a40690967f0b263f0cc45c4b492 |
SimHash | 61407502e9d2 |
Groups
*
Rule | Path |
---|---|
Disallow | /Forms* |
Disallow | /Landing-Pages* |
Disallow | /Advertisments* |
Disallow | /ThankYou* |
Disallow | /Unpublished* |
Disallow | /Intranet* |