cervantes.es
robots.txt

Robots Exclusion Standard data for cervantes.es

Resource Scan

Scan Details

Site Domain cervantes.es
Base Domain cervantes.es
Scan Status Ok
Last Scan5/15/2025, 1:31:19 PM
Next Scan 6/14/2025, 1:31:19 PM

Last Scan

Scanned5/15/2025, 1:31:19 PM
URL https://cervantes.es/robots.txt
Redirect https://www.cervantes.es/robots.txt
Redirect Domain www.cervantes.es
Redirect Base cervantes.es
Domain IPs 193.146.5.10
Redirect IPs 193.146.5.10
Response IP 193.146.5.10
Found Yes
Hash 968db3d0d15cb6eae4d8d32ca938f53abbce9a425171b20de219a8f309301d39
SimHash 4561fe34c501

Groups

googlebot
bingbot
slurp
yandex
askjeeves
baiduspider
ia_archiver

Rule Path
Disallow /Vtemp/
Disallow /seg_nivel/
Disallow /boletin/
Disallow /lengua_y_ensenanza/espacio_lenguas_ibericas/
Disallow /sobre_instituto_cervantes/espacios_profesionales/
Disallow /lengua_y_ensenanza/comprofes/inscripcion/
Disallow /default_pruebas.htm
Disallow /cultura_espanola/informacion_noindex.htm
Disallow /bibliotecas_documentacion_espanol/default-noindex.htm
Disallow /france/cours_espagnol_paris_bordeaux/
Disallow /france/cours_espagnol_paris_lyon/
Disallow /lengua_y_ensenanza/csf/
Disallow /memoria_ic_web_2011-2012/

Other Records

Field Value
crawl-delay 20

*

Rule Path
Disallow /