bellesahouse.com
robots.txt

Robots Exclusion Standard data for bellesahouse.com

Resource Scan

Scan Details

Site Domain bellesahouse.com
Base Domain bellesahouse.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-09-12T17:16:21+00:00
Next Scan 2024-12-11T17:16:21+00:00

Last Successful Scan

Scanned 2023-04-25T04:23:25+00:00
URL https://bellesahouse.com/robots.txt
Domain IPs 104.26.8.140, 104.26.9.140, 172.67.73.172, 2606:4700:20::681a:88c, 2606:4700:20::681a:98c, 2606:4700:20::ac43:49ac
Response IP 172.67.73.172
Found Yes
Hash 5caf298b7b22694c745bc39a0eed4c740d3742c34de76973249252f97942de99
SimHash 27089c504933

Groups

*
msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

baiduspider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

yandex

Rule Path
Disallow /*?sortby=*
Disallow /*%26sortby%3D*
Disallow /*?letter=*
Disallow /*%26letter%3D*
Disallow /*%26page%3D*
Allow /models?sortby=*

Other Records

Field Value
crawl-delay 20
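
The yandex rules above combine wildcard patterns with one more specific Allow. The following is a minimal sketch of how such rules could be evaluated, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The names `YANDEX_RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and paths are compared literally with no percent-encoding normalization, so real crawlers may behave differently.

```python
import re

# Rules copied from the yandex group above: (directive, path pattern).
YANDEX_RULES = [
    ("disallow", "/*?sortby=*"),
    ("disallow", "/*%26sortby%3D*"),
    ("disallow", "/*?letter=*"),
    ("disallow", "/*%26letter%3D*"),
    ("disallow", "/*%26page%3D*"),
    ("allow", "/models?sortby=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=YANDEX_RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern decides, and Allow wins ties.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            longer = len(pattern) > best_len
            tie_allow = len(pattern) == best_len and directive == "allow"
            if longer or tie_allow:
                best_len = len(pattern)
                allowed = directive == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/models?sortby=newest")` is true because the Allow pattern is longer than the matching Disallow, while the same query on any other path (for example the made-up `/videos?sortby=newest`) is blocked.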

Other Records

Field Value
sitemap https://www.bellesahouse.com/sitemaps.xml
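
The crawl-delay values recorded for the groups above could be looked up per crawler roughly as follows. This is only a sketch: `CRAWL_DELAYS` and `crawl_delay_for` are hypothetical names, the substring match on the user-agent string is a simplification of product-token matching, and crawl-delay is a non-standard field that not every crawler honors.

```python
# Crawl-delay values (seconds) taken from the groups listed in this report.
CRAWL_DELAYS = {
    "*": 5,
    "msnbot": 5,
    "baiduspider": 30,
    "yandex": 20,
}


def crawl_delay_for(user_agent):
    """Return the crawl-delay for a user-agent string.

    Assumption: a group applies when its name appears, case-insensitively,
    anywhere in the user-agent string; otherwise the '*' group is used.
    """
    token = user_agent.lower()
    for name, delay in CRAWL_DELAYS.items():
        if name != "*" and name in token:
            return delay
    return CRAWL_DELAYS["*"]
```

A polite client would wait at least the returned number of seconds between requests to the site.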