houghton.de
robots.txt

Robots Exclusion Standard data for houghton.de

Resource Scan

Scan Details

Site Domain houghton.de
Base Domain houghton.de
Scan Status Ok
Last Scan2025-12-03T23:11:23+00:00
Next Scan 2026-01-02T23:11:23+00:00

Last Scan

Scanned2025-12-03T23:11:23+00:00
URL http://houghton.de/robots.txt
Redirect http://www.houghtonglobal.com//robots.txt
Redirect Domain www.houghtonglobal.com
Redirect Base houghtonglobal.com
Domain IPs 2001:8d8:100f:f000::200, 217.160.0.80
Redirect IPs 104.21.27.217, 172.67.169.197, 2606:4700:3035::ac43:a9c5, 2606:4700:3037::6815:1bd9
Response IP 104.21.27.217
Found Yes
Hash f746e6bb107cdcef4d19a5903b1491098eedfcedd21f68bcdbc12311206c8195
SimHash 28187013c193

Groups

*

Rule Path
Disallow /fcmedianet.js
Disallow /__media__/js/templates.js
Disallow /cmedianet
Disallow /cmdynet
Disallow /mediamainlog.php

googlebot

Rule Path
Disallow

slurp

Rule Path
Disallow

msnbot

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

*

Rule Path
Disallow /