jobs.washingtonpost.com
robots.txt

Robots Exclusion Standard data for jobs.washingtonpost.com

Resource Scan

Scan Details

Site Domain jobs.washingtonpost.com
Base Domain washingtonpost.com
Scan Status Ok
Last Scan 3/3/2025, 1:48:06 PM
Next Scan 4/2/2025, 1:48:06 PM

Last Scan

Scanned 3/3/2025, 1:48:06 PM
URL https://jobs.washingtonpost.com/robots.txt
Domain IPs 3.164.85.3, 3.164.85.36, 3.164.85.52, 3.164.85.76
Response IP 18.165.140.63
Found Yes
Hash 23646a9d4600730818fb16f6c058999f3005457e972c5ccaa136b53362e03087
SimHash 08803f54ce10
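
A content hash like the one above is typically used to tell whether the file changed between scans. The following is a minimal sketch of that check, not the scanner's actual method. It assumes the hash is SHA-256 over the raw response body: the 64-hex-character length is consistent with SHA-256, but the report does not name the algorithm. The function names `content_hash` and `has_changed` are hypothetical.

```python
import hashlib

# Hash value reported in the scan details above.
REPORTED_HASH = "23646a9d4600730818fb16f6c058999f3005457e972c5ccaa136b53362e03087"


def content_hash(body: bytes) -> str:
    """Return the hex digest of the body (assumption: SHA-256)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str = REPORTED_HASH) -> bool:
    """True when the body's digest differs from the previously stored one."""
    return content_hash(body) != previous_hash
```

If the assumption about the algorithm or the hashed bytes is wrong, a freshly fetched copy of the file will not reproduce the reported value even when the file is unchanged, so the comparison is only meaningful against hashes produced by the same function.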

Groups

*

Rule Path
Disallow /session-img/
Disallow /invalid-request/
Disallow /document/
Disallow /analytics/
Disallow */searchjobs/*
Disallow */jobsrss/*
Disallow /jobsrss/*
Disallow */jbequicksignup/*
Disallow */emailjob/*
Disallow /your-jobs*
Disallow /external-redirect-registration/*
Disallow */previewjob/*
Disallow */previewjob/*
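
Several of the rules above use the `*` wildcard, which Python's standard `urllib.robotparser` does not expand. As a rough illustration of how such patterns apply to a URL path, here is a minimal sketch that treats `*` as "any run of characters" and a trailing `$` as an end anchor. The helper names and the sample paths in the usage note are hypothetical, and the sketch ignores Allow rules and longest-match precedence because this group has only Disallow rules.

```python
import re

# Disallow patterns copied from the "*" group above (duplicate entry dropped).
DISALLOW = [
    "/session-img/",
    "/invalid-request/",
    "/document/",
    "/analytics/",
    "*/searchjobs/*",
    "*/jobsrss/*",
    "/jobsrss/*",
    "*/jbequicksignup/*",
    "*/emailjob/*",
    "/your-jobs*",
    "/external-redirect-registration/*",
    "*/previewjob/*",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored_end else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path: str) -> bool:
    """True when the path matches any Disallow pattern in the group."""
    return any(rule.match(path) for rule in RULES)
```

For example, `is_disallowed("/analytics/report")` and `is_disallowed("/en/searchjobs/?page=2")` both return `True`, while `is_disallowed("/job/12345/editor/")` returns `False`.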

Other Records

Field Value
sitemap https://jobs.washingtonpost.com/sitemapindex.xml

Comments

  • Robot exclusion file
  • The following pages require registration and login