cguittard.com
robots.txt

Robots Exclusion Standard data for cguittard.com

Resource Scan

Scan Details

Site Domain cguittard.com
Base Domain cguittard.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-10-25T21:28:18+00:00
Next Scan 2025-01-23T21:28:18+00:00

Last Successful Scan

Scanned2024-06-28T21:25:54+00:00
URL https://cguittard.com/robots.txt
Domain IPs 185.128.239.52
Response IP 185.128.239.52
Found Yes
Hash 0f82d5fdd3305c3387c8ee2d0d03c93757576b61009b20d378976b878be455f2
SimHash 6a0cd053c733

Groups

*

Rule Path
Allow /
Disallow /contact
Disallow /mail/subscribe
Disallow /mail/valid-*

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

spbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://cguittard.com/sitemap-news.xml
sitemap https://cguittard.com/sitemap.xml