Robots Exclusion Standard data for well.ca

Resource Scan

Scan Details

Site Domain well.ca
Base Domain well.ca
Scan Status Ok
Last Scan 2024-09-18T17:33:56+00:00
Next Scan 2024-09-25T17:33:56+00:00

Last Scan

Scanned 2024-09-18T17:33:56+00:00
URL https://well.ca/robots.txt
Domain IPs 3.225.41.227, 3.230.27.133
Response IP 3.230.27.133
Found Yes
Hash 839be9b41e120b98333b176f70cf732ba1dbb24bb08ff2abf3f8af3c0ac1fbca
SimHash 0bb0c9089e8b
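
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a later fetch of the file could be compared against the recorded value to detect changes. The function names `sha256_hex` and `has_changed` are hypothetical, and the exact bytes the scanner hashed (raw body versus normalized text) are not known.

```python
import hashlib

# Hash value recorded in the scan above. Treating it as SHA-256 is an
# assumption based only on its length (64 hex characters).
RECORDED_HASH = "839be9b41e120b98333b176f70cf732ba1dbb24bb08ff2abf3f8af3c0ac1fbca"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded_hash: str = RECORDED_HASH) -> bool:
    """Report whether the fetched body differs from the recorded digest."""
    return sha256_hex(body) != recorded_hash.lower()
```

A caller would pass the raw response body of https://well.ca/robots.txt to `has_changed`. A mismatch may mean either that the file changed or that the scanner hashes a different representation of it.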

Groups

psbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

neevabot

Rule Path
Disallow /fr/searchresult.html
Disallow /*?*main_page=advanced_search_result*
Disallow /searchresult.html
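
The second neevabot rule uses the `*` wildcard, which in the Robots Exclusion Protocol matches any sequence of characters (a trailing `$` anchors the end of the URL). The following is a rough sketch of how such a pattern could be matched against a path and query string. The helper names `rule_to_regex` and `is_blocked`, and the sample URL `/index.php?...`, are hypothetical and not taken from the scan.

```python
import re


def rule_to_regex(rule: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into an anchored regex.

    '*' matches any run of characters; a trailing '$' pins the end.
    Without '$', the rule is a prefix match.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_blocked(rule: str, path_and_query: str) -> bool:
    """Return True if the Disallow rule matches the path and query."""
    return rule_to_regex(rule).match(path_and_query) is not None


rule = "/*?*main_page=advanced_search_result*"
# Hypothetical URL used only for illustration.
print(is_blocked(rule, "/index.php?main_page=advanced_search_result&keyword=soap"))
print(is_blocked(rule, "/searchresult.html"))
```

This sketch evaluates a single rule in isolation. A full matcher would also select the applicable group and resolve conflicts between Allow and Disallow rules.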

yandexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

linespider

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /fr/cgi-bin/
Disallow /ajax_index.php
Disallow /fr/ajax_index.php
Disallow /tell_a_friend/
Disallow /fr/tell_a_friend/
Disallow /cookie_usage.html
Disallow /fr/cookie_usage.html
Disallow /account.html
Disallow /fr/account.html
Disallow /shopping_cart.html
Disallow /fr/shopping_cart.html
Disallow /login.html
Disallow /fr/login.html
Disallow /terms.html
Disallow /fr/terms.html
Disallow /searchresult.html
Disallow /fr/searchresult.html
Disallow /badrobots.php
Disallow /fr/badrobots.php
Disallow /fr/compte.html
Disallow /fr/mon_panier.html
Disallow /fr/connexion.html
Disallow /fr/conditions_generales.html
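
To illustrate how the groups above apply to a crawler, here is a small sketch using Python's standard `urllib.robotparser`. The excerpt below is reconstructed from a few of the listed rules and is not the full file. The user agent `examplebot` is a hypothetical name that falls through to the `*` group. Note that `urllib.robotparser` does not interpret `*` wildcards inside paths, so the wildcard neevabot rule is left out.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the rules listed in this report
# (assumed layout; the real file may order or format them differently).
ROBOTS_EXCERPT = """\
User-agent: psbot
Disallow: /

User-agent: *
Disallow: /cgi-bin/
Disallow: /account.html
Disallow: /fr/compte.html
Disallow: /searchresult.html
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_EXCERPT)

# psbot has its own group that disallows everything.
print(parser.can_fetch("psbot", "https://well.ca/"))
# An unlisted agent uses the '*' group: the home page is allowed,
# the account page is not.
print(parser.can_fetch("examplebot", "https://well.ca/"))
print(parser.can_fetch("examplebot", "https://well.ca/account.html"))
```

Parsing from a string keeps the sketch independent of network access. For a live check, `RobotFileParser.set_url` followed by `read` would fetch the file instead.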

Other Records

Field Value
sitemap https://well.ca/sitemap_wellca_index.xml