protocolo.org
robots.txt

Robots Exclusion Standard data for protocolo.org

Resource Scan

Scan Details

Site Domain protocolo.org
Base Domain protocolo.org
Scan Status Ok
Last Scan 2024-11-14T03:10:14+00:00
Next Scan 2024-11-21T03:10:14+00:00

Last Scan

Scanned 2024-11-14T03:10:14+00:00
URL https://protocolo.org/robots.txt
Domain IPs 82.223.22.103
Response IP 82.223.22.103
Found Yes
Hash 078a439103c4ee00b814838b88a5f7077d10d52947c5848925ee9b100a897b33
SimHash 6b57dc7bc792
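
The Hash field above has 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. As a minimal sketch, assuming the digest is SHA-256 over the raw response body, a change between two scans could be detected as follows. The function names `body_digest` and `has_changed` are hypothetical and used only for illustration.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 digest of a robots.txt body as lowercase hex.

    Assumption: the report's Hash field is SHA-256 over the raw body.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_digest: str) -> bool:
    """Compare a freshly fetched body against a digest stored from an earlier scan."""
    return body_digest(body) != previous_digest.strip().lower()
```

A caller would pass the bytes fetched from the URL listed above, together with the digest recorded by the previous scan. The SimHash field is a separate similarity fingerprint and is not covered by this sketch.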

Groups

*

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /portal/
Disallow /compras/
Disallow /clientes/
Disallow /usuarios/
Disallow /expecial_navidad/
Disallow /portal-protocolo-etiqueta/
Disallow /portal-protocolo-y-etiqueta/
Disallow /traspasados_2505/
Allow /portal/cg-rss.pl
Allow /portal/cg_rss.pl
Allow /extra/sitemap.xml.gz
Allow /extra/bk-sitemap.xml.gz
Allow /extra/extfiles/
Allow /extra/desimg/
Allow /extra/script/
Allow /extra/estilo/
Allow /portal/cgPixel.pl
Allow /portal/cgMasVistos.pl
Allow /portal/cgRelMenuArt.pl
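
The group above mixes a broad `Allow /` with narrower Disallow rules and still narrower Allow exceptions such as `/portal/cg-rss.pl`. Under the longest-match precedence described in RFC 9309, the most specific matching rule wins, and an Allow wins a tie. The following is a minimal sketch of that evaluation for this rule set. It assumes plain prefix matching, which is sufficient here because none of the listed paths use the `*` or `$` wildcards, and it skips percent-encoding normalization. The name `is_allowed` is hypothetical.

```python
# Rules copied from the "*" group listed above, in their original order.
RULES = [
    ("allow", "/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/portal/"),
    ("disallow", "/compras/"),
    ("disallow", "/clientes/"),
    ("disallow", "/usuarios/"),
    ("disallow", "/expecial_navidad/"),
    ("disallow", "/portal-protocolo-etiqueta/"),
    ("disallow", "/portal-protocolo-y-etiqueta/"),
    ("disallow", "/traspasados_2505/"),
    ("allow", "/portal/cg-rss.pl"),
    ("allow", "/portal/cg_rss.pl"),
    ("allow", "/extra/sitemap.xml.gz"),
    ("allow", "/extra/bk-sitemap.xml.gz"),
    ("allow", "/extra/extfiles/"),
    ("allow", "/extra/desimg/"),
    ("allow", "/extra/script/"),
    ("allow", "/extra/estilo/"),
    ("allow", "/portal/cgPixel.pl"),
    ("allow", "/portal/cgMasVistos.pl"),
    ("allow", "/portal/cgRelMenuArt.pl"),
]


def is_allowed(path: str, rules=RULES) -> bool:
    """Longest-match evaluation: the longest matching prefix decides.

    A path that matches no rule is allowed; on equal length, allow wins.
    """
    best_len = -1
    best_allow = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "allow"
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow
```

With these rules, `is_allowed("/portal/cg-rss.pl")` is true because the 17-character Allow path is longer than the `/portal/` Disallow, while any other path under `/portal/` remains blocked. Parsers that apply the first matching rule instead of the longest one would stop at `Allow /` and treat every path as allowed, so results can differ between crawlers.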

Other Records

Field Value
sitemap https://www.protocolo.org/sitemap.xml