santaclarita.com
robots.txt
Robots Exclusion Standard data for santaclarita.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | santaclarita.com |
Base Domain | santaclarita.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-06-02T14:27:06+00:00 |
Next Scan | 2024-08-01T14:27:06+00:00 |
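
The failure reason above is a client error (an HTTP 4xx status) on the robots.txt fetch. As a rough sketch of how a crawler might react, the helper below follows the guidance in RFC 9309: a 4xx response means the file is "unavailable" and crawling may proceed without restrictions, while a 5xx response means "unreachable" and the crawler should assume everything is disallowed. The function name and the returned labels are illustrative assumptions, not part of this scanner.

```python
def robots_policy_for_status(status: int) -> str:
    """Map an HTTP status from a robots.txt fetch to a crawl policy.

    Hypothetical helper; the labels are made up for this sketch.
    """
    if 200 <= status < 300:
        return "parse"          # file retrieved: parse and apply its rules
    if 400 <= status < 500:
        return "allow_all"      # "unavailable": no restrictions apply
    if 500 <= status < 600:
        return "disallow_all"   # "unreachable": assume complete disallow
    return "other"              # e.g. redirects, which the caller must follow


# A 404 or 403 on robots.txt would be treated as "no rules".
print(robots_policy_for_status(404))
```

Some clients are stricter than this sketch; Python's `urllib.robotparser`, for example, treats 401 and 403 as "disallow everything".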
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-04-04T12:33:53+00:00 |
URL | https://santaclarita.com/robots.txt |
Domain IPs | 104.26.6.218, 104.26.7.218, 172.67.68.250, 2606:4700:20::681a:6da, 2606:4700:20::681a:7da, 2606:4700:20::ac43:44fa |
Response IP | 104.26.7.218 |
Found | Yes |
Hash | 5b962cabeca39721327631e21ded6f398209b832322ae6f08995accbc65ee9d6 |
SimHash | 6c04d5104215 |
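
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming it is SHA-256 over the raw response body (both are assumptions), a freshly fetched copy could be compared against the recorded value roughly like this; the function names are hypothetical.

```python
import hashlib

# Hash recorded by the last successful scan (copied from the table above).
RECORDED_HASH = "5b962cabeca39721327631e21ded6f398209b832322ae6f08995accbc65ee9d6"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes) -> bool:
    """True if the body hashes to the recorded value.

    Assumes the scanner hashed the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == RECORDED_HASH
```

A mismatch would only indicate that the file changed, or that the scanner hashes something other than the raw body.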
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /message_board/listings/ |
Disallow | /message_board/ajax/check_for_thread_replies.php |
Disallow | /classifieds/flag.php |
Disallow | /beat/beat.php |
Disallow | /message_board/search.php |
Disallow | /restaurants/search.php |
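
To illustrate how these rules apply to individual URLs, the sketch below rebuilds a robots.txt from the table above and evaluates a few paths with Python's standard `urllib.robotparser`. The reconstructed file text is an assumption (the original file may contain comments or other directives this report does not show), and `ExampleBot` and `is_allowed` are hypothetical names.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table; the real file may differ in detail.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Disallow: /message_board/listings/
Disallow: /message_board/ajax/check_for_thread_replies.php
Disallow: /classifieds/flag.php
Disallow: /beat/beat.php
Disallow: /message_board/search.php
Disallow: /restaurants/search.php
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Check a site-relative path against the reconstructed rules."""
    return parser.can_fetch(agent, "https://santaclarita.com" + path)


for path in ("/", "/admin/", "/message_board/", "/message_board/search.php"):
    print(path, is_allowed(path))
```

Because each rule is a path prefix, `/message_board/` itself remains crawlable while the listed paths beneath it are not.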