bourret.ca
robots.txt

Robots Exclusion Standard data for bourret.ca

Resource Scan

Scan Details

Site Domain bourret.ca
Base Domain bourret.ca
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-10-13T09:57:26+00:00
Next Scan 2025-10-20T09:57:26+00:00

Last Successful Scan

Scanned2025-09-12T08:42:13+00:00
URL https://bourret.ca/robots.txt
Domain IPs 51.222.43.14
Response IP 51.222.43.14
Found Yes
Hash d08eb4634380e245f43578073ddeb3e82a24bc8015794f6cf61b7107f6212b6e
SimHash 711491c68670

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-activate.php
Disallow /wp-app.php
Disallow /wp-blog-header.php
Disallow /wp-comments-post.php
Disallow /wp-config-sample.php
Disallow /wp-config.php
Disallow /wp-cron.php
Disallow /wp-links-opml.php
Disallow /wp-load.php
Disallow /wp-login.php
Disallow /wp-mail.php
Disallow /wp-pass.php
Disallow /wp-register.php
Disallow /wp-settings.php
Disallow /wp-signup.php
Disallow /wp-trackback.php
Disallow /xmlrpc.php

gptbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.bourret.ca/sitemap.xml

Comments

  • Allow OpenAI GPTBot
  • Allow Google generative content crawler