careermine.com
robots.txt
Robots Exclusion Standard data for careermine.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | careermine.com |
Base Domain | careermine.com |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-05-31T04:26:26+00:00 |
Next Scan | 2024-08-29T04:26:26+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2022-10-07T02:02:32+00:00 |
URL | https://www.careermine.com/robots.txt |
Response IP | 54.192.111.2, 54.192.111.115, 54.192.111.56, 54.192.111.47 |
Found | Yes |
Hash | 241644f881bfe76bfc50939a324a249c9f2a05370241e085d6ea39bb083086f2 |
SimHash | 280027dd4614 |
Groups
*
Rule | Path |
---|---|
Disallow | /session-img/ |
Disallow | /invalid-request/ |
Disallow | /document/ |
Disallow | /analytics/ |
Disallow | */searchjobs/* |
Disallow | */jobsrss/* |
Disallow | /jobsrss/* |
Disallow | */jbequicksignup/* |
Disallow | */emailjob/* |
Disallow | /your-jobs/* |
Disallow | /external-redirect-registration* |
Disallow | */emailjob/* |
Disallow | */previewjob/* |
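
Several of the paths above use `*` wildcards, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how a crawler might test a URL path against these rules, assuming RFC 9309 semantics (`*` matches any run of characters, a trailing `$` anchors the end, and patterns match from the start of the path). The helper names `pattern_to_regex` and `is_disallowed` are hypothetical and not part of the scan data.

```python
import re

# Disallow paths copied from the "*" group in the table above.
DISALLOW = [
    "/session-img/",
    "/invalid-request/",
    "/document/",
    "/analytics/",
    "*/searchjobs/*",
    "*/jobsrss/*",
    "/jobsrss/*",
    "*/jbequicksignup/*",
    "*/emailjob/*",
    "/your-jobs/*",
    "/external-redirect-registration*",
    "*/emailjob/*",
    "*/previewjob/*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    Assumes RFC 9309 semantics: '*' matches any run of characters and a
    trailing '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches from the start of `path`."""
    # re.match anchors at the beginning of the string, as the rules require.
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for candidate in ["/en/searchjobs/mining", "/document/abc", "/jobs/123"]:
        print(candidate, is_disallowed(candidate))
```

Because this group contains no `Allow` rules, the sketch skips the longest-match precedence step that a full implementation would need when `Allow` and `Disallow` patterns overlap.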
Other Records
Field | Value |
---|---|
sitemap | https://www.careermine.com/sitemapindex.xml |