paginasamarillas.es
robots.txt
Robots Exclusion Standard data for paginasamarillas.es
Resource Scan
Scan Details
Site Domain | paginasamarillas.es |
Base Domain | paginasamarillas.es |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-10-07T07:58:28+00:00 |
Next Scan | 2025-01-05T07:58:28+00:00 |
Last Successful Scan
Scanned | 2022-09-07T22:48:04+00:00 |
URL | https://paginasamarillas.es/robots.txt |
Redirect | https://www.paginasamarillas.es/robots.txt |
Redirect Domain | www.paginasamarillas.es |
Redirect Base | paginasamarillas.es |
Response IP | 45.60.36.114 |
Found | Yes |
Hash | 4cf48de43c8e9588b49092bbc7fe51ba748489d043cd73fa7c95e4c8f666ca7a |
SimHash | c8519ac6aa10 |
Groups
*
Rule | Path |
---|---|
Disallow | /_Incapsula_Resource |
Disallow | /altaig_dinamico.asp |
Disallow | /marcopaol.html |
Disallow | /functions/ |
Disallow | /click.asp |
Disallow | /ajax/ |
Disallow | /commonsAjax/ |
Disallow | /f/bipAjax/mailNotification |
Disallow | /fichas/valoracion.action |
Disallow | /*jsessionid |
Disallow | /srvpags/ |
Disallow | /fichas/usuario |
Disallow | /numeroDescargas |
Disallow | /poisservice/ |
Disallow | /wp-admin/ |
Disallow | /articulos/*/feed/ |
Disallow | /contratacion/ |
Disallow | /paol-presentation-fich-webapp/ |
Disallow | */?itm_source= |
Disallow | */%26id_busq%3D |
Disallow | */%26site%3D |
Disallow | */?pext= |
Disallow | */?ub= |
Comments