ca.jobtome.com
robots.txt

Robots Exclusion Standard data for ca.jobtome.com

Resource Scan

Scan Details

Site Domain ca.jobtome.com
Base Domain jobtome.com
Scan Status Ok
Last Scan 2024-05-13T16:26:39+00:00
Next Scan 2024-06-12T16:26:39+00:00
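
The two timestamps above imply a fixed rescan interval. A minimal sketch of that arithmetic, using only the values shown in this report (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2024-05-13T16:26:39+00:00")
next_scan = datetime.fromisoformat("2024-06-12T16:26:39+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 30
```

For this record the next scan is scheduled exactly 30 days after the last one; the report itself does not say whether that interval applies to other sites.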

Last Scan

Scanned 2024-05-13T16:26:39+00:00
URL https://ca.jobtome.com/robots.txt
Domain IPs 104.26.14.161, 104.26.15.161, 172.67.69.217
Response IP 172.67.69.217
Found Yes
Hash 05b8045b6be2aaf7a00cfa81c67f6aa53931932836b3f89e44d46b5e38de6434
SimHash 4706d3d56211
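
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm or say how the body is normalized before hashing, so the following is only a sketch under the assumption that the hash is SHA-256 over the raw response bytes. `sample_body` is a made-up stand-in, not the real file, so its digest will not equal the value listed above.

```python
import hashlib

# Hypothetical robots.txt body, used only to show the call shape.
sample_body = b"User-agent: *\nDisallow: /ads/\n"

# ASSUMPTION: the scanner hashes the raw bytes with SHA-256.
digest = hashlib.sha256(sample_body).hexdigest()
print(len(digest))  # 64
```

Comparing digests between two scans is a cheap way to detect that the file changed at all; the shorter SimHash field presumably serves to detect near-duplicates, but its construction is not described here.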

Groups

*

Rule Path
Disallow /search/getjob
Disallow /i/*
Disallow /cd/*
Disallow /private/search
Disallow *fo%3D1
Disallow /feed-wel/
Disallow /ads/
Disallow *%26orgn%3D113
Disallow /api/*
Disallow /before-redirect
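
Several paths in this group use `*` wildcards, including two that start with one. A rough sketch of how such patterns are commonly evaluated follows. It assumes the widely used convention that `*` matches any run of characters, a trailing `$` anchors the end, and everything else is a prefix match. Real crawlers also normalize percent-encoding before comparing, which this sketch skips. The function names and the sample paths are hypothetical.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = [
    "/search/getjob", "/i/*", "/cd/*", "/private/search", "*fo%3D1",
    "/feed-wel/", "/ads/", "*%26orgn%3D113", "/api/*", "/before-redirect",
]

def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches the URL path."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start only, which gives prefix semantics.
    return re.match(regex, path) is not None

def is_disallowed(path: str) -> bool:
    """This group has no Allow rules, so any matching Disallow blocks."""
    return any(rule_matches(p, path) for p in DISALLOW)

print(is_disallowed("/api/v1/jobs"))          # True
print(is_disallowed("/search?q=x%26fo%3D1"))  # True
print(is_disallowed("/jobs/developer"))       # False
```

Groups that mix Allow and Disallow need a precedence rule (typically the longest matching pattern wins), which this group does not require.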

mediapartners-google
adidxbot

Rule Path
Allow *

grapeshot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

baiduspider
yisouspider

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

timpibot

Rule Path
Disallow /
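
The groups listed above can be read as a lookup from user agent to rule set: a crawler applies the group that names it and falls back to the `*` group otherwise. A minimal sketch of that selection, assuming a simple case-insensitive exact match on the agent name (real crawlers match on their product token, which can be more lenient). `GROUPS` and `rules_for` are illustrative names, not part of the scan output.

```python
# Agent names and rules copied from the Groups section above.
BLOCK_ALL = [("Disallow", "/")]
GROUPS = {
    ("*",): [
        ("Disallow", "/search/getjob"), ("Disallow", "/i/*"),
        ("Disallow", "/cd/*"), ("Disallow", "/private/search"),
        ("Disallow", "*fo%3D1"), ("Disallow", "/feed-wel/"),
        ("Disallow", "/ads/"), ("Disallow", "*%26orgn%3D113"),
        ("Disallow", "/api/*"), ("Disallow", "/before-redirect"),
    ],
    ("mediapartners-google", "adidxbot"): [("Allow", "*")],
    ("grapeshot",): BLOCK_ALL,
    ("petalbot",): BLOCK_ALL,
    ("gptbot",): BLOCK_ALL,
    ("baiduspider", "yisouspider"): BLOCK_ALL,
    ("bytespider",): BLOCK_ALL,
    ("sogou web spider",): BLOCK_ALL,
    ("sogou inst spider",): BLOCK_ALL,
    ("timpibot",): BLOCK_ALL,
}

def rules_for(agent: str):
    """Return the rules of the group naming this agent, else the '*' group."""
    agent = agent.lower()
    for agents, rules in GROUPS.items():
        if agent in agents:
            return rules
    return GROUPS[("*",)]

print(rules_for("GPTBot"))           # [('Disallow', '/')]
print(rules_for("AdIdxBot"))         # [('Allow', '*')]
print(len(rules_for("examplebot")))  # 10
```

In this file the two advertising crawlers are allowed everywhere, nine named crawlers are blocked entirely, and every other agent gets the ten path rules of the `*` group.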

Comments

  • &&&&&
  • &&&&&
  • @@@
  • @@@
  • @@@
  • @@@
  • @@@
  • @@@
  • @@@
  • @@@
  • @@@
  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449

Warnings

  • 4 invalid lines.