sitecorejobs.net
robots.txt
Robots Exclusion Standard data for sitecorejobs.net
Resource Scan
Scan Details
Site Domain | sitecorejobs.net |
Base Domain | sitecorejobs.net |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-09-13T01:18:44+00:00 |
Next Scan | 2024-12-12T01:18:44+00:00 |
Last Successful Scan
Scanned | 2022-10-25T10:40:12+00:00 |
URL | https://www.sitecorejobs.net/robots.txt |
Response IP | 104.17.85.204, 104.17.86.204, 104.17.89.204, 104.17.87.204, 104.17.88.204 |
Found | Yes |
Hash | d67c8391cbd88f6fb01b5365663d05100a13002d26761c8da0080ab648c049c5 |
SimHash | a79dcc076771 |
Groups
*
Rule | Path |
---|---|
Disallow | /jobs/*/tracker |
Disallow | /jobs/*/preview |
Disallow | /jobs/*/applicants |
Disallow | /jobs/*/manage |
Disallow | /messages/* |
Disallow | /applicants/new |
Disallow | /backfills/latest_jobs |
Disallow | /auth/* |
Disallow | /clk/* |
Disallow | /employers/* |
Disallow | /c/* |
Disallow | /s/* |
Disallow | /e/* |
Disallow | /g/* |
Disallow | /n/* |
Disallow | /Salaries/* |
Disallow | /*?*page= |
Disallow | /*?*lat= |
Disallow | /*?*long= |
Disallow | /*?*sort= |
Comments