lapieza.io
robots.txt

Robots Exclusion Standard data for lapieza.io

Resource Scan

Scan Details

Site Domain lapieza.io
Base Domain lapieza.io
Scan Status Ok
Last Scan2025-11-03T16:45:56+00:00
Next Scan 2025-11-17T16:45:56+00:00

Last Scan

Scanned2025-11-03T16:45:56+00:00
URL https://lapieza.io/robots.txt
Domain IPs 104.18.8.214, 104.18.9.214, 2606:4700::6812:8d6, 2606:4700::6812:9d6
Response IP 104.18.8.214
Found Yes
Hash 1c3398ec8f4a9649699a3191a534af8cadf8e918ec6d39dd03a9639657aacaf5
SimHash f5555280cdf5

Groups

*

Rule Path
Disallow /aplicar/*
Disallow /apply/*
Disallow /profile
Disallow /perfil
Disallow /login
Disallow /empresas/*
Disallow /companies/*
Disallow /oferta/*
Disallow /offer/*
Disallow /forgot-pass
Disallow /settings
Disallow /404
Disallow /deleted-account
Disallow /benefits
Disallow /beneficios
Disallow /es/*
Disallow /en/*
Disallow /pt/*
Allow /
Allow /vacantes
Allow /contacto
Allow /aviso-de-privacidad
Allow /terminos-y-condiciones
Allow /vacante/*
Allow /vacancy/*
Allow /faqs

Other Records

Field Value
sitemap https://lapieza.io/sitemap.xml