theralist.ca
robots.txt

Robots Exclusion Standard data for theralist.ca

Resource Scan

Scan Details

Site Domain theralist.ca
Base Domain theralist.ca
Scan Status Ok
Last Scan2025-06-17T07:21:38+00:00
Next Scan 2025-07-17T07:21:38+00:00

Last Scan

Scanned2025-06-17T07:21:38+00:00
URL https://theralist.ca/robots.txt
Domain IPs 104.26.4.36, 104.26.5.36, 172.67.74.115, 2606:4700:20::681a:424, 2606:4700:20::681a:524, 2606:4700:20::ac43:4a73
Response IP 104.26.5.36
Found Yes
Hash 6669b3e8a873112e00f6da12a76f535b6b20f65bc2d3e886fff5cf12b8beeff5
SimHash 495b542dc9fb

Groups

semrushbot
twitterbot
facebookexternalhit
facebookcatalog
meta-externalagent
dataforseobot
serpstatbot
velenpublicwebcrawler

Rule Path
Disallow /

*

Rule Path
Disallow /events
Disallow /cdn-cgi
Disallow /therapists/*?s=
Disallow /therapists/*?l=
Disallow /therapists/*?a=
Disallow /therapists/*?t=
Disallow /therapists/*?la=
Disallow /therapists/*?g=
Disallow /therapists/*?r=
Disallow /therapists/*?ap=
Disallow /therapists/*?*&s=
Disallow /therapists/*?*&l=
Disallow /therapists/*?*&a=
Disallow /therapists/*?*&t=
Disallow /therapists/*?*&la=
Disallow /therapists/*?*&g=
Disallow /therapists/*?*&r=
Disallow /therapists/*?*&ap=

Other Records

Field Value
sitemap https://theralist.ca/sitemap.xml