jobs.farmersguardian.com
robots.txt

Robots Exclusion Standard data for jobs.farmersguardian.com

Resource Scan

Scan Details

Site Domain jobs.farmersguardian.com
Base Domain farmersguardian.com
Scan Status Ok
Last Scan2024-05-12T07:15:14+00:00
Next Scan 2024-06-11T07:15:14+00:00

Last Scan

Scanned2024-05-12T07:15:14+00:00
URL https://jobs.farmersguardian.com/robots.txt
Domain IPs 18.239.199.3, 18.239.199.32, 18.239.199.40, 18.239.199.79
Response IP 18.165.171.127
Found Yes
Hash 8c85324d382f04ef8993404b9f52b050c6f008925dab39feee4c6173fec874d9
SimHash 08007f544e95

Groups

*

Rule Path
Disallow /session-img/
Disallow /invalid-request/
Disallow /document/
Disallow /analytics/
Disallow */searchjobs/*
Disallow */jobsrss/*
Disallow /jobsrss/*
Disallow */jbequicksignup/*
Disallow */emailjob/*
Disallow /your-jobs*
Disallow */previewjob/*

Other Records

Field Value
sitemap https://jobs.farmersguardian.com/sitemapindex.xml

Comments

  • Robot exclusion file
  • The following pages require registration and login