tdh.de
robots.txt

Robots Exclusion Standard data for tdh.de

Resource Scan

Scan Details

Site Domain tdh.de
Base Domain tdh.de
Scan Status Ok
Last Scan2024-09-15T02:13:34+00:00
Next Scan 2024-10-15T02:13:34+00:00

Last Scan

Scanned2024-09-15T02:13:34+00:00
URL https://tdh.de/robots.txt
Redirect https://www.tdh.de/robots.txt
Redirect Domain www.tdh.de
Redirect Base tdh.de
Domain IPs 188.94.250.192
Redirect IPs 188.94.250.192
Response IP 188.94.250.192
Found Yes
Hash 3cf0272b3dcf11a035ac9e84eeceed16dd84ca754dd84d4bab218eb0c8b5ae0d
SimHash 382c0a66b990

Groups

*

Rule Path
Disallow /typo3/
Disallow /typo3_src/

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

wi job roboter spider version 3

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.tdh.de/sitemap.xml

Comments

  • misbehaving bots (turning POST into GET, sending forms etc)
  • gready, useless bots (wasteing bandwidth)