careermine.com
robots.txt

Robots Exclusion Standard data for careermine.com

Resource Scan

Scan Details

Site Domain careermine.com
Base Domain careermine.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-05-31T04:26:26+00:00
Next Scan 2024-08-29T04:26:26+00:00

Last Successful Scan

Scanned2022-10-07T02:02:32+00:00
URL https://www.careermine.com/robots.txt
Response IP 54.192.111.2, 54.192.111.115, 54.192.111.56, 54.192.111.47
Found Yes
Hash 241644f881bfe76bfc50939a324a249c9f2a05370241e085d6ea39bb083086f2
SimHash 280027dd4614

Groups

*

Rule Path
Disallow /session-img/
Disallow /invalid-request/
Disallow /document/
Disallow /analytics/
Disallow */searchjobs/*
Disallow */jobsrss/*
Disallow /jobsrss/*
Disallow */jbequicksignup/*
Disallow */emailjob/*
Disallow /your-jobs/*
Disallow /external-redirect-registration*
Disallow */emailjob/*
Disallow */previewjob/*

Other Records

Field Value
sitemap https://www.careermine.com/sitemapindex.xml

Comments

  • Robot exclusion file
  • The following pages require registration and login