praktikumsanzeigen.de
robots.txt

Robots Exclusion Standard data for praktikumsanzeigen.de

Resource Scan

Scan Details

Site Domain praktikumsanzeigen.de
Base Domain praktikumsanzeigen.de
Scan Status Ok
Last Scan2024-09-26T01:57:09+00:00
Next Scan 2024-10-03T01:57:09+00:00

Last Scan

Scanned2024-09-26T01:57:09+00:00
URL https://praktikumsanzeigen.de/robots.txt
Redirect https://www.praktikumsanzeigen.de/robots.txt
Redirect Domain www.praktikumsanzeigen.de
Redirect Base praktikumsanzeigen.de
Domain IPs 136.243.84.231, 2a01:4f8:191:8105::20
Redirect IPs 136.243.84.231, 2a01:4f8:191:8105::20
Response IP 136.243.84.231
Found Yes
Hash 6b680b2a9751e9a9d02f75221d5d480d57adadd57d261e01990e07db70efc5ca
SimHash 6374f6a34371

Groups

*

Rule Path
Disallow /go/*
Disallow /impressum.html
Disallow /datenschutz.html
Disallow /agb.html
Disallow /agb_ideenkraftwerk_gmbh.pdf

bingbot

Rule Path
Disallow /go/*
Disallow /impressum.html
Disallow /datenschutz.html
Disallow /agb.html
Disallow /agb_ideenkraftwerk_gmbh.pdf

Other Records

Field Value
crawl-delay 2

iccrawler – icjobs

Rule Path
Disallow /

mediapartners-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow /stellenanzeigen-inserieren/firmenlogos/
Disallow /stellenanzeigen-inserieren/firmenlogos/*
Disallow /images/
Disallow /images/*
Disallow /img/
Disallow /img/*

msnbot-media

Rule Path
Disallow /stellenanzeigen-inserieren/firmenlogos/
Disallow /stellenanzeigen-inserieren/firmenlogos/*
Disallow /images/
Disallow /images/*
Disallow /img/
Disallow /img/*

psbot

Rule Path
Disallow /stellenanzeigen-inserieren/firmenlogos/
Disallow /stellenanzeigen-inserieren/firmenlogos/*
Disallow /images/
Disallow /images/*
Disallow /img/
Disallow /img/*

magpie-crawler

Rule Path
Disallow /stellenanzeigen-inserieren/firmenlogos/
Disallow /stellenanzeigen-inserieren/firmenlogos/*
Disallow /images/
Disallow /images/*
Disallow /img/
Disallow /img/*

yandeximages

Rule Path
Disallow /stellenanzeigen-inserieren/firmenlogos/
Disallow /stellenanzeigen-inserieren/firmenlogos/*
Disallow /images/
Disallow /images/*
Disallow /img/
Disallow /img/*

yandexmedia

Rule Path
Disallow /stellenanzeigen-inserieren/firmenlogos/
Disallow /stellenanzeigen-inserieren/firmenlogos/*
Disallow /images/
Disallow /images/*
Disallow /img/
Disallow /img/*

yahoo-mmcrawler

Rule Path
Disallow /stellenanzeigen-inserieren/firmenlogos/
Disallow /stellenanzeigen-inserieren/firmenlogos/*
Disallow /images/
Disallow /images/*
Disallow /img/
Disallow /img/*

msnbot

Rule Path
Disallow /go/*
Disallow /impressum.html
Disallow /datenschutz.html
Disallow /agb.html
Disallow /agb_ideenkraftwerk_gmbh.pdf

Other Records

Field Value
crawl-delay 2

ahrefsbot

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mtrobot

Rule Path
Disallow /

lcc

Rule Path
Disallow /