plougastelsportsnature.com
robots.txt

Robots Exclusion Standard data for plougastelsportsnature.com

Resource Scan

Scan Details

Site Domain plougastelsportsnature.com
Base Domain plougastelsportsnature.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-08-10T21:47:05+00:00
Next Scan 2024-11-08T21:47:05+00:00

Last Successful Scan

Scanned 2023-10-16T21:10:35+00:00
URL https://plougastelsportsnature.com/robots.txt
Domain IPs 185.128.239.52
Response IP 185.128.239.52
Found Yes
Hash 0951b3e351bf52c4355c3bbd792064ba55955647fea5fa9fec7e16724e2844d8
SimHash 6a004455c733

Groups

*

Rule Path
Allow /
Disallow /contact
Disallow /mail/subscribe
Disallow /mail/valid-*
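
The rules in the * group overlap (Allow / matches every path), so which rule wins depends on how a crawler resolves precedence. The sketch below assumes the longest-match rule from RFC 9309, where the most specific matching pattern wins and a tie goes to Allow. The names RULES, pattern_to_regex and is_allowed are hypothetical helpers written for this illustration; they are not part of the scanned file or of any crawler's implementation.

```python
import re

# Rules of the "*" group, copied from the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/contact"),
    ("disallow", "/mail/subscribe"),
    ("disallow", "/mail/valid-*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


for path in ("/", "/events", "/contact", "/mail/subscribe", "/mail/valid-abc123"):
    print(path, "allowed" if is_allowed(path) else "disallowed")
```

Under this assumption, /contact, /mail/subscribe and anything starting with /mail/valid- are blocked, while every other path falls through to Allow /. Because matching is by prefix, a path such as /contact-us would also be blocked.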

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

spbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://plougastelsportsnature.com/sitemap-news.xml
sitemap https://plougastelsportsnature.com/sitemap.xml
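
The crawl-delay and sitemap records listed in this report can be cross-checked with Python's standard urllib.robotparser module. The robots.txt text in the sketch below is a reconstruction from the groups shown here, not the file as served: the order of the groups, the blank lines, and whether msnbot and bingbot shared one group in the original are all assumptions.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report; layout and group order are assumed.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Disallow: /contact
Disallow: /mail/subscribe
Disallow: /mail/valid-*

User-agent: msnbot
Crawl-delay: 5

User-agent: bingbot
Crawl-delay: 5

User-agent: spbot
Crawl-delay: 10

Sitemap: https://plougastelsportsnature.com/sitemap-news.xml
Sitemap: https://plougastelsportsnature.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Crawl-delay per named group, in seconds.
delays = {agent: parser.crawl_delay(agent) for agent in ("msnbot", "bingbot", "spbot")}
print(delays)

# Sitemap records apply to the whole file, independent of any group.
sitemaps = parser.site_maps()  # available in Python 3.8 and later
print(sitemaps)

# A crawler that matches a named group uses only that group, so bingbot
# is not bound by the Disallow rules of the "*" group.
print(parser.can_fetch("bingbot", "https://plougastelsportsnature.com/contact"))
```

Note that urllib.robotparser applies Allow and Disallow rules in file order and treats * in a path literally, so it is used here only for the crawl-delay, sitemap and group-selection lookups, not to evaluate the * group's path rules.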