piwik.web.cern.ch
robots.txt

Robots Exclusion Standard data for piwik.web.cern.ch

Resource Scan

Scan Details

Site Domain piwik.web.cern.ch
Base Domain cern.ch
Scan Status Ok
Last Scan2024-10-20T07:56:58+00:00
Next Scan 2024-11-19T07:56:58+00:00

Last Scan

Scanned2024-10-20T07:56:58+00:00
URL https://piwik.web.cern.ch/robots.txt
Domain IPs 137.138.6.31, 2001:1458:201:8b::100:1c8
Response IP 137.138.6.31
Found Yes
Hash 2a3003f6d24a2d984ec7d0e9510e9363477e6128a457fe74889530931007da81
SimHash ca14d321d4f5

Groups

googlebot
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
google favicon
googlebot-news
googlebot-image
googlebot-video
mediapartners-google
apis-google
duplexweb-google
bingbot
slurp
duckduckbot
baiduspider
ahrefsbot
rogerbot
yandexbot
dotbot
twitterbot
bingpreview
linkedinbot
yandexbot
facebot
facebookexternalhit
msnbot
msnbot-media

Rule Path
Disallow /
Allow /matomo.php
Allow /piwik.php
Allow /matomo.js
Allow /piwik.js
Allow /js/