santaclarita.com
robots.txt

Robots Exclusion Standard data for santaclarita.com

Resource Scan

Scan Details

Site Domain santaclarita.com
Base Domain santaclarita.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-06-02T14:27:06+00:00
Next Scan 2024-08-01T14:27:06+00:00

Last Successful Scan

Scanned2024-04-04T12:33:53+00:00
URL https://santaclarita.com/robots.txt
Domain IPs 104.26.6.218, 104.26.7.218, 172.67.68.250, 2606:4700:20::681a:6da, 2606:4700:20::681a:7da, 2606:4700:20::ac43:44fa
Response IP 104.26.7.218
Found Yes
Hash 5b962cabeca39721327631e21ded6f398209b832322ae6f08995accbc65ee9d6
SimHash 6c04d5104215

Groups

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

microsoft

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /admin/
Disallow /message_board/listings/
Disallow /message_board/ajax/check_for_thread_replies.php
Disallow /classifieds/flag.php
Disallow /beat/beat.php
Disallow /message_board/search.php
Disallow /restaurants/search.php