piwik.canlii.org
robots.txt

Robots Exclusion Standard data for piwik.canlii.org

Resource Scan

Scan Details

Site Domain piwik.canlii.org
Base Domain canlii.org
Scan Status Ok
Last Scan2024-09-19T17:24:08+00:00
Next Scan 2024-10-19T17:24:08+00:00

Last Scan

Scanned2024-09-19T17:24:08+00:00
URL https://piwik.canlii.org/robots.txt
Domain IPs 52.60.114.91
Response IP 52.60.114.91
Found Yes
Hash 2a3003f6d24a2d984ec7d0e9510e9363477e6128a457fe74889530931007da81
SimHash ca14d321d4f5

Groups

googlebot
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
google favicon
googlebot-news
googlebot-image
googlebot-video
mediapartners-google
apis-google
duplexweb-google
bingbot
slurp
duckduckbot
baiduspider
ahrefsbot
rogerbot
yandexbot
dotbot
twitterbot
bingpreview
linkedinbot
yandexbot
facebot
facebookexternalhit
msnbot
msnbot-media

Rule Path
Disallow /
Allow /matomo.php
Allow /piwik.php
Allow /matomo.js
Allow /piwik.js
Allow /js/