crepi.org
robots.txt

Robots Exclusion Standard data for crepi.org

Resource Scan

Scan Details

Site Domain crepi.org
Base Domain crepi.org
Scan Status Ok
Last Scan2025-11-15T10:35:47+00:00
Next Scan 2025-11-29T10:35:47+00:00

Last Scan

Scanned2025-11-15T10:35:47+00:00
URL https://crepi.org/robots.txt
Redirect https://www.crepi.org/robots.txt
Redirect Domain www.crepi.org
Redirect Base crepi.org
Domain IPs 91.236.153.95
Redirect IPs 91.236.153.95
Response IP 91.236.153.95
Found Yes
Hash 780ccb0eb659e6c2221a9b4f4b3ad715186ed36065ba2f3bedf1000844c8b5b2
SimHash 52091a9087d2

Groups

*

Rule Path
Disallow /les-entreprises-engagees.html?
Disallow /les-entreprises-engagees/*.html?
Disallow /profils-candidats.html?
Disallow /profils-candidats/*.html?
Disallow /annonces-emploi.html?
Disallow /annonces-emploi/*.html?

Other Records

Field Value
sitemap https://www.crepi.org/sitemap.xml

Comments

  • Search on companies page and company detail page
  • Search on applicants page and applicant detail page
  • Search on job offers page and job offer detail page
  • Sitemap