piwik.web.cern.ch
robots.txt

Robots Exclusion Standard data for piwik.web.cern.ch

Resource Scan

Scan Details

Site Domain piwik.web.cern.ch
Base Domain cern.ch
Scan Status Ok
Last Scan 2024-05-22T22:23:11+00:00
Next Scan 2024-06-21T22:23:11+00:00

Last Scan

Scanned 2024-05-22T22:23:11+00:00
URL https://piwik.web.cern.ch/robots.txt
Domain IPs 188.184.79.178, 2001:1458:d00:6f::100:2ff
Response IP 188.184.79.178
Found Yes
Hash c6b9faa03e9785941f15ed14ecd6bdfe0a71f2dfceb1f6a378b58e2e826be2dd
SimHash 0a5d17a96b26

Groups

*

Rule Path
Allow /authoring/
Allow /cnellist/
Allow /cms-tracker/
Allow /club-badminton-bst2016/
Allow /CLICr/
Allow /davidc/
Allow /droussea/
Allow /dd4hep/
Allow /edreyer/
Allow /edreyer
Allow /flair
Allow /gfasanel/
Allow /geant4/
Allow /grid-deployment/
Allow /glite/
Allow /gsalam/
Allow /hrussell/
Allow /ibarrien/
Allow /lhcbqa/
Allow /lhcbproject/Publications/
Allow /luxqed/
Allow /mad/
Allow /mdudek/
Allow /mig/
Allow /mcfayden/
Allow /pmarino/
Allow /pvankov/
Allow /project-allpix-squared/
Allow /radnext-network/
Allow /rtomas/
Allow /swedish/
Allow /spectrum/
Allow /shoh/
Allow /srettie/
Allow /siodmok/
Allow /stapley/
Allow /sviel/
Allow /theofil/
Allow /track-it-documentation/
Allow /tvami/
Allow /hedberg/
Disallow /yachting/BBoard
Allow /yachting/
Disallow /
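The effect of the `*` group above can be checked with a small sketch using Python's standard `urllib.robotparser`. The robots.txt lines below are reconstructed from a subset of the table rows, so their exact wording is an assumption, and the agent name `example-bot` and the test paths are hypothetical. Note that `urllib.robotparser` applies the first matching rule in file order, which may differ from crawlers that use longest-match precedence; for the rows chosen here both approaches should agree.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from a subset of the "*" group rows above (assumed wording;
# the live file lists many more Allow lines before the final Disallow).
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /geant4/",
    "Disallow: /yachting/BBoard",
    "Allow: /yachting/",
    "Disallow: /",
]

BASE = "https://piwik.web.cern.ch"
AGENT = "example-bot"  # hypothetical agent name; falls through to the "*" group


def build_parser(lines):
    """Parse robots.txt lines in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


parser = build_parser(ROBOTS_LINES)

# Hypothetical paths chosen to exercise each kind of rule in the group.
for path in ("/geant4/index.html", "/yachting/", "/yachting/BBoard/post", "/private/"):
    verdict = "allowed" if parser.can_fetch(AGENT, BASE + path) else "blocked"
    print(f"{path}: {verdict}")
```

Under these assumptions, listed project directories are reachable, `/yachting/BBoard` is carved out of the otherwise allowed `/yachting/` tree, and everything not explicitly allowed falls under the closing `Disallow /`.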

fast enterprise crawler 6 used by cern (project-search@cern.ch)

Rule Path
Disallow
Disallow /proj-fgc/

fast enterprise crawler for sharepoint used by cern (project-search@cern.ch)

Rule Path
Disallow
Disallow /proj-fgc/

cern search crawler

Rule Path
Allow /
Disallow
Disallow /proj-fgc/

Comments

  • Robots.txt file for all AFS and EOS Web servers
  • Same file for all servers