paginasamarillas.es
robots.txt

Robots Exclusion Standard data for paginasamarillas.es

Resource Scan

Scan Details

Site Domain paginasamarillas.es
Base Domain paginasamarillas.es
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-10-07T07:58:28+00:00
Next Scan 2025-01-05T07:58:28+00:00

Last Successful Scan

Scanned2022-09-07T22:48:04+00:00
URL https://paginasamarillas.es/robots.txt
Redirect https://www.paginasamarillas.es/robots.txt
Redirect Domain www.paginasamarillas.es
Redirect Base paginasamarillas.es
Response IP 45.60.36.114
Found Yes
Hash 4cf48de43c8e9588b49092bbc7fe51ba748489d043cd73fa7c95e4c8f666ca7a
SimHash c8519ac6aa10

Groups

doc

Rule Path
Disallow /

fetch

Rule Path
Disallow /

*

Rule Path
Disallow /_Incapsula_Resource
Disallow /altaig_dinamico.asp
Disallow /marcopaol.html
Disallow /functions/
Disallow /click.asp
Disallow /ajax/
Disallow /commonsAjax/
Disallow /f/bipAjax/mailNotification
Disallow /fichas/valoracion.action
Disallow /*jsessionid
Disallow /srvpags/
Disallow /fichas/usuario
Disallow /numeroDescargas
Disallow /poisservice/
Disallow /wp-admin/
Disallow /articulos/*/feed/
Disallow /contratacion/
Disallow /paol-presentation-fich-webapp/
Disallow */?itm_source=
Disallow */%26id_busq%3D
Disallow */%26site%3D
Disallow */?pext=
Disallow */?ub=

Comments

  • robots.txt