applynow.com.au
robots.txt

Robots Exclusion Standard data for applynow.com.au

Resource Scan

Scan Details

Site Domain applynow.com.au
Base Domain applynow.com.au
Scan Status Ok
Last Scan 2024-05-02T20:17:30+00:00
Next Scan 2024-06-01T20:17:30+00:00

Last Scan

Scanned 2024-05-02T20:17:30+00:00
URL https://www.applynow.com.au/robots.txt
Domain IPs 104.17.247.70, 104.17.248.70, 104.17.249.70, 104.17.250.70, 104.17.251.70, 2606:4700::6811:f746, 2606:4700::6811:f846, 2606:4700::6811:f946, 2606:4700::6811:fa46, 2606:4700::6811:fb46
Response IP 104.17.250.70
Found Yes
Hash 4d80df24a57f16c6334558ca59f992565cf299d318787bd6eb870fe05a209225
SimHash a39dcc876761

Groups

*

Rule Path
Disallow /jobs/*/tracker
Disallow /jobs/*/preview
Disallow /jobs/*/applicants
Disallow /jobs/*/manage
Disallow /messages/*
Disallow /applicants/new
Disallow /backfills/latest_jobs
Disallow /auth/*
Disallow /clk/*
Disallow /employers/*
Disallow /c/*
Disallow /s/*
Disallow /e/*
Disallow /g/*
Disallow /n/*
Disallow /Salaries/*
Disallow /*?*lat=
Disallow /*?*long=
Disallow /*?*sort=
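
Several of the paths above use the `*` wildcard. As a rough illustration of how such patterns are commonly interpreted, the Python sketch below translates a pattern into a regular expression and tests a few paths against it. It assumes `*` matches any run of characters, a trailing `$` anchors the end of the path, and matching starts at the beginning of the path. The helper names `rule_to_regex` and `is_disallowed` and the sample paths are hypothetical, and individual crawlers may interpret the rules differently.

```python
import re

def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex (illustrative only)."""
    # Assumption: a trailing "$" anchors the pattern to the end of the path.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_disallowed(path, patterns):
    """Return True if any pattern matches the path from its first character."""
    return any(rule_to_regex(p).match(path) is not None for p in patterns)

# A few Disallow paths copied from the "*" group above.
SAMPLE_RULES = ["/jobs/*/tracker", "/messages/*", "/*?*sort="]

if __name__ == "__main__":
    # The paths below are made-up examples, not URLs taken from the scan.
    for path in ["/jobs/123/tracker", "/jobs/123", "/search?q=x&sort=date"]:
        print(path, is_disallowed(path, SAMPLE_RULES))
```

This sketch ignores Allow rules and longest-match precedence, since the `*` group in this scan contains only Disallow rules.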

yandeximages

Rule Path
Disallow /

yandex

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

pcore-http

Rule Path
Disallow /

bubing

Rule Path
Disallow /

companybook-crawler

Rule Path
Disallow /

wotbox/2.01

Rule Path
Disallow /

ccbot/2.0

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

ebibot

Rule Path
Disallow /

pcore-http/v0.24.5

Rule Path
Disallow /

testitest1

Rule Path
Disallow /

vegi bot

Rule Path
Disallow /

istellabot/t.1

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

ltx71 - (http://ltx71.com/)

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • ZR integration blocks
  • block search query params