jobtome.com
robots.txt

Robots Exclusion Standard data for jobtome.com

Resource Scan

Scan Details

Site Domain jobtome.com
Base Domain jobtome.com
Scan Status Ok
Last Scan2024-10-30T22:48:29+00:00
Next Scan 2024-11-06T22:48:29+00:00

Last Scan

Scanned2024-10-30T22:48:29+00:00
URL https://jobtome.com/robots.txt
Domain IPs 104.26.14.161, 104.26.15.161, 172.67.69.217
Response IP 104.26.15.161
Found Yes
Hash a3786a985237bd68894f534e73858c12e8f07279641e964159671ed17278a730
SimHash 6306d2c5e315

Groups

*

Rule Path
Disallow /search/getjob
Disallow /i/*
Disallow /cd/*
Disallow /private/search
Disallow *fo%3D1
Disallow /feed-wel/
Disallow /ads/
Disallow *%26orgn%3D113
Disallow /api/*
Disallow /before-redirect
Disallow /messenger-redirect
Disallow /suggested-job
Disallow /contract-consent

mediapartners-google
adidxbot

Rule Path
Allow *

grapeshot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

baiduspider
yisouspider

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

seekportbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

dataforseobot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

imagesiftbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

Comments

  • &&&&&
  • &&&&&
  • @@@
  • @@@
  • @@@
  • @@@
  • @@@
  • @@@
  • @@@
  • @@@
  • @@@
  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449

Warnings

  • 4 invalid lines.