lhh.com
robots.txt

Robots Exclusion Standard data for lhh.com

Resource Scan

Scan Details

Site Domain lhh.com
Base Domain lhh.com
Scan Status Ok
Last Scan2026-01-12T11:36:22+00:00
Next Scan 2026-02-11T11:36:22+00:00

Last Scan

Scanned2026-01-12T11:36:22+00:00
URL https://www.lhh.com/robots.txt
Domain IPs 13.107.213.59, 13.107.246.59, 2620:1ec:46::59, 2620:1ec:bdf::59
Response IP 13.107.213.59
Found Yes
Hash 1280213cd2f6613575f06addf9448752f7b94f3d6794971027b190bc9c241bc6
SimHash a996da61c689

Groups

*

Rule Path
Disallow /fr/fr/job/r/fr/detail-offre/*
Disallow /fr/fr/detail-offre/*
Disallow /fr/fr/offre-emploi/?*
Disallow /detail-offre/*
Disallow /offre-emploi/?*
Disallow /cgi-bin/
Disallow /static/
Disallow /sitecore*/content/
Disallow /upload/
Disallow /xsl/
Disallow /media/spain/landings/pdfs/*
Disallow */figma*
Disallow */site-settings/*
Disallow *?k=*
Disallow *pageNum%3D*

facebot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

googlebot

Rule Path
Allow /*.css$
Allow /*.js$
Allow /*.jpg$
Allow /*.jpeg$
Allow /*.png$
Allow /*.gif$
Allow /*.svg$

Other Records

Field Value
sitemap https://www.lhh.com/sitemap.xml
sitemap https://www.lhh.com/sitemap-index.xml

Warnings

  • 1 invalid line.