topschooljobs.org
robots.txt

Robots Exclusion Standard data for topschooljobs.org

Resource Scan

Scan Details

Site Domain topschooljobs.org
Base Domain topschooljobs.org
Scan Status Ok
Last Scan 2024-05-21T10:58:13+00:00
Next Scan 2024-06-20T10:58:13+00:00

Last Scan

Scanned 2024-05-21T10:58:13+00:00
URL https://www.topschooljobs.org/robots.txt
Domain IPs 18.155.202.111, 18.155.202.20, 18.155.202.41, 18.155.202.96
Response IP 108.157.52.124
Found Yes
Hash cfd4e869b0c279eb11a6208b6594e039c4948cb7d5f193260567bf0011d91e1f
SimHash 0a803d54ce14

Groups

*

Rule Path
Disallow /session-img/
Disallow /invalid-request/
Disallow /document/
Disallow /analytics/
Disallow /apply-profile/
Disallow */searchjobs/*
Disallow */jobsrss/*
Disallow /jobsrss/*
Disallow */jbequicksignup/*
Disallow */emailjob/*
Disallow /your-jobs*
Disallow /external-redirect-registration/*
Disallow */previewjob/*
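
Several of the paths above use `*` wildcards, so a plain prefix comparison is not enough to decide whether a URL path is covered. The sketch below is one possible way to check a path against the `*` group, assuming the common convention that `*` matches any run of characters and that a trailing `$` anchors the end of the path. The function names `pattern_to_regex` and `is_disallowed` are illustrative and do not come from the report.

```python
import re

# Disallow paths copied from the "*" group in the report above.
DISALLOW = [
    "/session-img/",
    "/invalid-request/",
    "/document/",
    "/analytics/",
    "/apply-profile/",
    "*/searchjobs/*",
    "*/jobsrss/*",
    "/jobsrss/*",
    "*/jbequicksignup/*",
    "*/emailjob/*",
    "/your-jobs*",
    "/external-redirect-registration/*",
    "*/previewjob/*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the start, which gives prefix semantics.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/document/123", "/en/searchjobs/?q=teacher", "/jobs/teacher-123"]:
        print(candidate, is_disallowed(candidate))
```

Because this group has no Allow rules, the sketch skips the longest-match precedence step that a full parser would need when Allow and Disallow patterns overlap.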

rogerbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
sitemap https://www.topschooljobs.org/sitemapindex.xml
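
As a rough illustration of how a crawler might read these two records, the sketch below feeds a small reconstructed file to Python's standard `urllib.robotparser`. The report does not show which group the `crawl-delay` line belongs to, so placing it under `rogerbot` is an assumption made only so the parser has a group to attach it to. The `ROBOTS_LINES` text is a hypothetical reconstruction, not the site's actual file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the relevant lines. Placing Crawl-delay
# under rogerbot is an assumption; the report lists it without a group.
ROBOTS_LINES = [
    "User-agent: rogerbot",
    "Crawl-delay: 10",
    "",
    "Sitemap: https://www.topschooljobs.org/sitemapindex.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)  # parse in memory, no network request is made

# Seconds to wait between requests for this agent, or None if unset.
delay = parser.crawl_delay("rogerbot")

# Sitemap URLs declared in the file (site_maps() needs Python 3.8+).
sitemaps = parser.site_maps()

if __name__ == "__main__":
    print("crawl delay:", delay)
    print("sitemaps:", sitemaps)
```

A crawler that honors the delay would sleep for that many seconds between successive requests to the host.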

Comments

  • Robot exclusion file
  • The following pages require registration and login