www.europarl.europa.eu
robots.txt

Robots Exclusion Standard data for www.europarl.europa.eu

Resource Scan

Scan Details

Site Domain www.europarl.europa.eu
Base Domain europa.eu
Scan Status Ok
Last Scan2024-05-19T19:26:00+00:00
Next Scan 2024-06-18T19:26:00+00:00

Last Scan

Scanned2024-05-19T19:26:00+00:00
URL https://www.europarl.europa.eu/robots.txt
Domain IPs 99.84.238.112, 99.84.238.134, 99.84.238.167, 99.84.238.201
Response IP 18.165.171.100
Found Yes
Hash a94da91599a477f9b36b47e50b2666a82b29de9da7c8498d503dfb1c76e6721b
SimHash 6914f9350eb5

Groups

*

Rule Path
Disallow /calendar/
Disallow /debats/
Disallow /pv1/
Disallow /pv2/
Disallow /searchdeb/
Disallow /guidemep_info_2009/
Disallow /votre-europarl/
Disallow /comparl/
Disallow /parliament/public/traineeship/secured/
Disallow /parliament/public/transltraineeship/secured
Disallow /activities/committees/studies/
Disallow /activities/committees/studiesCom/
Disallow /meps/*/pdf*
Disallow /meps/*/xml*
Disallow /committees/*/studiesdownload.html
Disallow /RegistreWeb/search/simple.htm
Disallow /RegistreWeb/search/typedoc.htm
Disallow /RegistreWeb/search/advanced.htm
Disallow /common/mail/
Disallow /meps/*/eptvAjax
Disallow /thinktank/*/pdf/search.html
Disallow /ecard/
Disallow /newsgate/
Disallow /100books/*/downloadPDF
Disallow /plenary/*/vod.html
Disallow /delegations/*/*/members/pdf.html
Disallow /pdf/traineeships/

Comments

  • USD 119677 + USD I-88842
  • DDU, REVAMPING : URL is now allowed Disallow: /committees/
  • DDU, pb sur recherche des études sans paramètres
  • KAD, 08/11/2018 - ne pas indexer les générations de listes de MEP [ERPL-8599]
  • DDU, 15/04/2013 - ne pas indexer les fichiers d'études (pb timeout) [SDISPT-756]
  • DDU, 03/02/2015 - pb registre REGI-1726
  • DDU, 04/02/2015 - pb registre REGI-1726
  • ERPL-4988
  • ERPL-8599
  • Discard thinktank PDF generators
  • R-1534197
  • 100Books RWD
  • List of VOD - ERPL-7502
  • Discard MEP lists
  • traineeships should not be indexed anymore

Warnings

  • 6 invalid lines.