thieme.com
robots.txt

Robots Exclusion Standard data for thieme.com

Resource Scan

Scan Details

Site Domain thieme.com
Base Domain thieme.com
Scan Status Ok
Last Scan2024-05-13T04:26:01+00:00
Next Scan 2024-06-12T04:26:01+00:00

Last Scan

Scanned2024-05-13T04:26:01+00:00
URL https://www.thieme.com/robots.txt
Domain IPs 2600:9000:2003:5200:19:6e59:8100:93a1, 2600:9000:2003:6000:19:6e59:8100:93a1, 2600:9000:2003:6600:19:6e59:8100:93a1, 2600:9000:2003:6c00:19:6e59:8100:93a1, 2600:9000:2003:a00:19:6e59:8100:93a1, 2600:9000:2003:a400:19:6e59:8100:93a1, 2600:9000:2003:ce00:19:6e59:8100:93a1, 2600:9000:2003:e800:19:6e59:8100:93a1, 52.84.229.116, 52.84.229.127, 52.84.229.20, 52.84.229.3
Response IP 52.84.229.116
Found Yes
Hash 68563dc438b50308448ed20efd436cb89e410aa4bc02e26d8df2d1ba46362151
SimHash 20514f026af1

Groups

*

Rule Path
Disallow /scrivito/

*

Rule Path
Disallow /de-de/testpage-content-all
Disallow /*?

searchmetricsbot
mj12bot
bleriot
qwantify
jobkicks
trendkite-akashic-crawler

Rule Path
Disallow /

Comments

  • german: