newcaney.com
robots.txt

Robots Exclusion Standard data for newcaney.com

Resource Scan

Scan Details

Site Domain newcaney.com
Base Domain newcaney.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-08-09T17:47:58+00:00
Next Scan 2024-11-07T17:47:58+00:00

Last Successful Scan

Scanned 2023-09-10T21:38:31+00:00
URL https://newcaney.com/robots.txt
Domain IPs 172.66.40.141, 172.66.43.115, 2606:4700:3108::ac42:288d, 2606:4700:3108::ac42:2b73
Response IP 172.66.43.115
Found Yes
Hash 5b962cabeca39721327631e21ded6f398209b832322ae6f08995accbc65ee9d6
SimHash 6c04d5104215

Groups

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

microsoft

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /admin/
Disallow /message_board/listings/
Disallow /message_board/ajax/check_for_thread_replies.php
Disallow /classifieds/flag.php
Disallow /beat/beat.php
Disallow /message_board/search.php
Disallow /restaurants/search.php
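
The groups above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser. The ROBOTS_TXT text is a reconstruction from the records listed in this report, not the original file: its exact layout, ordering, and any comments are assumptions, and the original may have listed several user agents in a single group. The agent name "ExampleBot" and the helper build_parser are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan records above (assumed layout, not the original file).
ROBOTS_TXT = """\
User-agent: bingbot
Crawl-delay: 10

User-agent: msnbot
Crawl-delay: 10

User-agent: microsoft
Crawl-delay: 10

User-agent: slurp
Crawl-delay: 10

User-agent: *
Disallow: /admin/
Disallow: /message_board/listings/
Disallow: /message_board/ajax/check_for_thread_replies.php
Disallow: /classifieds/flag.php
Disallow: /beat/beat.php
Disallow: /message_board/search.php
Disallow: /restaurants/search.php
"""

BASE = "https://newcaney.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Named groups define no path rules, so every path is allowed for them.
bingbot_admin = parser.can_fetch("bingbot", BASE + "/admin/")
bingbot_delay = parser.crawl_delay("bingbot")

# Any other agent falls through to the "*" group and its Disallow rules.
other_admin = parser.can_fetch("ExampleBot", BASE + "/admin/")
other_home = parser.can_fetch("ExampleBot", BASE + "/")
other_delay = parser.crawl_delay("ExampleBot")

print("bingbot /admin/ allowed:", bingbot_admin)
print("bingbot crawl-delay:", bingbot_delay)
print("ExampleBot /admin/ allowed:", other_admin)
print("ExampleBot / allowed:", other_home)
print("ExampleBot crawl-delay:", other_delay)
```

Under this reconstruction, an agent that matches a named group is governed only by that group, so the Disallow rules under * do not apply to bingbot, msnbot, microsoft, or slurp. Agents matching no named group get the * rules and have no crawl-delay set.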