jobs.washingtonpost.com
robots.txt
Robots Exclusion Standard data for jobs.washingtonpost.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | jobs.washingtonpost.com |
Base Domain | washingtonpost.com |
Scan Status | Ok |
Last Scan | 3/3/2025, 1:48:06 PM |
Next Scan | 4/2/2025, 1:48:06 PM |
Last Scan
Field | Value |
---|---|
Scanned | 3/3/2025, 1:48:06 PM |
URL | https://jobs.washingtonpost.com/robots.txt |
Domain IPs | 3.164.85.3, 3.164.85.36, 3.164.85.52, 3.164.85.76 |
Response IP | 18.165.140.63 |
Found | Yes |
Hash | 23646a9d4600730818fb16f6c058999f3005457e972c5ccaa136b53362e03087 |
SimHash | 08803f54ce10 |
Groups
*
Rule | Path |
---|---|
Disallow | /session-img/ |
Disallow | /invalid-request/ |
Disallow | /document/ |
Disallow | /analytics/ |
Disallow | */searchjobs/* |
Disallow | */jobsrss/* |
Disallow | /jobsrss/* |
Disallow | */jbequicksignup/* |
Disallow | */emailjob/* |
Disallow | /your-jobs* |
Disallow | /external-redirect-registration/* |
Disallow | */previewjob/* |
Disallow | */previewjob/* |
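Several of the paths above use the `*` wildcard, which Python's standard `urllib.robotparser` matches literally rather than as a wildcard. The following is a minimal sketch of how a crawler might test a URL path against these rules, assuming the common wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end of the path). The helper names `pattern_to_regex` and `is_disallowed` and the sample paths in the usage note are hypothetical, chosen only for illustration. Because the group has no Allow rules, the sketch skips longest-match precedence.

```python
import re

# Disallow paths copied from the "*" group above
# (the repeated */previewjob/* entry is listed once).
DISALLOW = [
    "/session-img/",
    "/invalid-request/",
    "/document/",
    "/analytics/",
    "*/searchjobs/*",
    "*/jobsrss/*",
    "/jobsrss/*",
    "*/jbequicksignup/*",
    "*/emailjob/*",
    "/your-jobs*",
    "/external-redirect-registration/*",
    "*/previewjob/*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters and a
    trailing '$' anchors the end of the path. Everything else is
    matched literally, as a prefix of the URL path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    # re.Pattern.match anchors at the beginning of the string, which
    # gives the prefix-match behaviour robots.txt rules rely on.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)
```

With these rules, hypothetical paths such as `/us/searchjobs/?q=editor` and `/your-jobs/saved` would be reported as disallowed, while a path such as `/job/12345/editor/` would not match any pattern.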
Other Records
Field | Value |
---|---|
sitemap | https://jobs.washingtonpost.com/sitemapindex.xml |