careerlink.com
robots.txt

Robots Exclusion Standard data for careerlink.com

Resource Scan

Scan Details

Site Domain careerlink.com
Base Domain careerlink.com
Scan Status Ok
Last Scan2025-12-07T00:21:44+00:00
Next Scan 2026-01-06T00:21:44+00:00

Last Scan

Scanned2025-12-07T00:21:44+00:00
URL https://careerlink.com/robots.txt
Domain IPs 104.26.4.50, 104.26.5.50, 172.67.68.226, 2606:4700:20::681a:432, 2606:4700:20::681a:532, 2606:4700:20::ac43:44e2
Response IP 104.26.5.50
Found Yes
Hash bdfeb40c91e8d6615a8a33a4f5be617574cdae18b8dd633c2fc37f75562eeb95
SimHash 1d0f5c418240

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /master/
Disallow /pend/
Disallow /regent/
Disallow /reports/
Disallow /resume/
Disallow /emp/
Disallow /staff/
Disallow /profile/
Disallow /public_profile/
Disallow /jobapp/
Disallow /search?*

Other Records

Field Value
crawl-delay 120

trovitbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://careerlink.com/sitemap_https.xml
sitemap https://careerlink.com/sitemap_jobs.xml
sitemap https://secure.careerlink.com/sitemap_jobs.html

Comments

  • robots.txt for all careerlink.com domains
  • $Id: robots.txt,v 1.45 2010/07/26 14:24:00 chad Exp $
  • Disallow: /images/
  • Disallow: /pix/
  • Disallow: /js/
  • Disallow: /css/