rekrute.com
robots.txt

Robots Exclusion Standard data for rekrute.com

Resource Scan

Scan Details

Site Domain rekrute.com
Base Domain rekrute.com
Scan Status Ok
Last Scan2024-05-30T18:58:13+00:00
Next Scan 2024-06-29T18:58:13+00:00

Last Scan

Scanned2024-05-30T18:58:13+00:00
URL https://www.rekrute.com/robots.txt
Domain IPs 178.33.249.143
Response IP 178.33.249.143
Found Yes
Hash bd1c52b1285b419d9d24e7cd07f39f8977feb55e830149283df9a2160de11c40
SimHash 4354519be8a9

Groups

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Disallow /user/photo.jpg*

blexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot/7~bl

Rule Path
Disallow /

dotbot/1.2

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yandexbot/3.0

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

uptimerobot/2.0

Rule Path
Disallow /

ezooms robot

Rule Path
Disallow /

perl lwp

Rule Path
Disallow /

wiseguys robot

Rule Path
Disallow /

turnitin robot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

turnitin bot

Rule Path
Disallow /

facebookexternalhit bot

Rule Path
Disallow /

turnitinbot/3.0 (http://www.turnitin.com/robot/crawlerinfo.html)

Rule Path
Disallow /

turnitinbot/3.0

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

pimonster

Rule Path
Disallow /

pimonster

Rule Path
Disallow /

eccp/1.0 (search@eniro.com)

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image
mozilla/5.0 (compatible; baiduspider/2.0; +http://www.baidu.com/search/spider.html)
mozilla/5.0 (compatible; baiduspider/3.0; +http://www.baidu.com/search/spider.html)
mozilla/5.0 (compatible; baiduspider/4.0; +http://www.baidu.com/search/spider.html)
mozilla/5.0 (compatible; baiduspider/5.0; +http://www.baidu.com/search/spider.html)
baiduspider/2.0
baiduspider/3.0
baiduspider/4.0
baiduspider/5.0

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

gsa-crawler (enterprise; t4-knhh62cdkc2w3; gsa_manage@nikon-sys.co.jp)

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

mail.ru_bot/2.0

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

mail.ru_bot/2.0; +http://go.mail.ru/help/robots

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mj12bot/v1.4.3

Rule Path
Disallow /

mj12bot/v1.4.8

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

nutch

Rule Path
Disallow /

spock

Rule Path
Disallow /

omniexplorer_bot

Rule Path
Disallow /

becomebot

Rule Path
Disallow /

geniebot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot/1.2

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

sbider/nutch

Rule Path
Disallow /

jyxobot

Rule Path
Disallow /

magent

Rule Path
Disallow /

speedy spider

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

huasai

Rule Path
Disallow /

datacha0s

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

atomic_email_hunter

Rule Path
Disallow /

mp3bot

Rule Path
Disallow /

winhttp

Rule Path
Disallow /

betabot

Rule Path
Disallow /

core-project

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

java

Rule Path
Disallow /

libwww-perl

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

bingbot/2.0

Rule Path
Disallow /

applebot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

amazonbot/0.1

Rule Path
Disallow /

adsbot/3.1

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

msnbot/2.0b

Rule Path
Disallow /

dataforseobot/1.0

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

jooblebot/2.0

Rule Path
Disallow /

gptbot/1.0

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

awariobot/1.0

Rule Path
Disallow /

serpstatbot/2.1

Rule Path
Disallow /

twitterbot/1.0

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

jobbot/2.0

Rule Path
Disallow /

barkrowler/0.9

Rule Path
Disallow /

applebot/0.1

Rule Path
Disallow /

discordbot/2.0

Rule Path
Disallow /

uptimerobot/2.0

Rule Path
Disallow /

facebookexternalhit/1.1

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

anthropicbot

Rule Path
Disallow /

claude

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

mail.ru_bot/2.0

Rule Path
Disallow /

blexbot/1.0

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.rekrute.com/sitemap.xml

Comments

  • Allowed crawlers
  • Block BlexBot
  • Block AhrefsBot
  • Block SemrushBot
  • Block SemrushBot/7~bl
  • Block DotBot/1.2
  • Block Yandex
  • Block SEOkicks
  • Block SISTRIX
  • Block Uptime robot
  • Block Ezooms Robot
  • Block Perl LWP
  • Block WiseGuys Robot
  • Block Turnitin Robot
  • User-agent: facebookexternalhit
  • Disallow: /
  • Block Heritrix
  • Block pricepi
  • Block Searchmetrics Bot
  • User-agent: SearchmetricsBot
  • Disallow: /
  • Block Eniro
  • Block Baidu
  • Block SoGou
  • Block Youdao
  • Block Nikon JP Crawler
  • Block MegaIndex.ru
  • Crawlers that are kind enough to obey, but which we'd rather not have
  • unless they're feeding search engines.
  • Some bots are known to be trouble, particularly those designed to copy
  • entire sites or download them for offline viewing. Please obey robots.txt.
  • Block MegaIndex.ru
  • Allow: /$
  • Allow: /foire-aux-questions.html
  • Allow: /recruiter_area.html
  • Allow: /contactez-nous.html
  • Allow: /home-recruteur.html
  • Block DotBot
  • Block AwarioBot
  • Block serpstatbot
  • Block Twitterbot
  • Block Mediatoolkitbot
  • Block JobBot
  • Block Barkrowler
  • Block Applebot
  • Block Discordbot
  • Block UptimeRobot
  • Block facebookexternalhit
  • Block claudebot

Warnings

  • 4 invalid lines.