talentplus.de
robots.txt

Robots Exclusion Standard data for talentplus.de

Resource Scan

Scan Details

Site Domain talentplus.de
Base Domain talentplus.de
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-05-31T18:20:14+00:00
Next Scan 2024-08-29T18:20:14+00:00

Last Successful Scan

Scanned2023-10-11T18:27:00+00:00
URL https://talentplus.de/robots.txt
Redirect https://www.talentplus.de/robots.txt
Redirect Domain www.talentplus.de
Redirect Base talentplus.de
Domain IPs 195.14.224.181
Redirect IPs 195.14.224.181
Response IP 195.14.224.181
Found Yes
Hash 104292cbb7bb8264cff5837e81c3cfe8c5be51bf3560ceeb9f73513e16aab985
SimHash a10e3fb0b802

Groups

ahrefsbot

Rule Path
Disallow /

mozilla/5.0 (compatible; ahrefsbot/5.0; +http://ahrefs.com/robot/)

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

amazonbot/0.1

Rule Path
Disallow /

mozilla/5.0 (macintosh; intel mac os x 10_10_1) applewebkit/600.2.5 (khtml, like gecko) version/8.0.2 safari/600.2.5 (amazonbot/0.1; +https://developer.amazon.com/support/amazonbot)

Rule Path
Disallow /

mozilla/5.0 (compatible; semrushbot/0.99~bl; +http://www.semrush.com/bot.html)

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

mozilla/5.0+(compatible;+baiduspider/2.0;++http://www.baidu.com/search/spider.html)

Rule Path
Disallow /

baiduspider-image+(+http://www.baidu.com/search/spider.htm)

Rule Path
Disallow /

iisbot/1.0 (+http://www.iis.net/iisbot.html)

Rule Path
Disallow /

iisbot/1.0 (+http://rvs.informatik.uni-leipzig.de/bot.php)

Rule Path
Disallow /

wotbox/2.01 (+http://www.wotbox.com/bot/)

Rule Path
Disallow /

*

Rule Path
Disallow /bilder/
Disallow /externe-downloads/
Disallow /externe-links/
Disallow /opencms/index.html
Disallow /shared/.content/externe-downloads/
Disallow /shared/.content/externe-links/
Disallow /shared/images/

Other Records

Field Value
sitemap https://www.talentplus.de/sitemap.xml