learninghouse.ca
robots.txt

Robots Exclusion Standard data for learninghouse.ca

Resource Scan

Scan Details

Site Domain learninghouse.ca
Base Domain learninghouse.ca
Scan Status Ok
Last Scan2025-09-01T00:13:33+00:00
Next Scan 2025-10-01T00:13:33+00:00

Last Scan

Scanned2025-09-01T00:13:33+00:00
URL https://learninghouse.ca/robots.txt
Domain IPs 104.26.2.184, 104.26.3.184, 172.67.75.16, 2606:4700:20::681a:2b8, 2606:4700:20::681a:3b8, 2606:4700:20::ac43:4b10
Response IP 104.26.2.184
Found Yes
Hash c94753f742c98612d01f71e905fc733e1695e034bc748fb6ab6ab5f66269e22b
SimHash 501c942b7b92

Groups

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

wget

Rule Path
Disallow /

curl

Rule Path
Disallow /

*

Rule Path
Disallow