insel-mauritius.de
robots.txt

Robots Exclusion Standard data for insel-mauritius.de

Resource Scan

Scan Details

Site Domain insel-mauritius.de
Base Domain insel-mauritius.de
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-09-16T14:49:53+00:00
Next Scan 2025-12-15T14:49:53+00:00

Last Successful Scan

Scanned 2025-05-20T04:02:46+00:00
URL https://insel-mauritius.de/robots.txt
Domain IPs 178.254.0.102
Response IP 178.254.0.102
Found Yes
Hash 0948b020eb7fabfc08257d872519dcd3ab7f1814edb32490f162e74ab98a2404
SimHash a91f18968733

Groups

webreaper
webcopier
offline explorer
httrack
microsoft.url.control
emailcollector
penthesilea

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.insel-mauritius.de/sitemap_index.xml
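
The group and sitemap record above can be checked with a short script. This is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is an assumed reconstruction from the scan data, not the verbatim file, and `ExampleBot` and the checked URL are hypothetical names chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction from the scan data above; the real file's
# layout and comment placement may differ.
ROBOTS_TXT = """\
User-agent: webreaper
User-agent: webcopier
User-agent: offline explorer
User-agent: httrack
User-agent: microsoft.url.control
User-agent: emailcollector
User-agent: penthesilea
Disallow: /

Sitemap: https://www.insel-mauritius.de/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

page = "https://insel-mauritius.de/"  # hypothetical URL to check

# An agent named in the group is matched case-insensitively.
httrack_allowed = parser.can_fetch("HTTrack", page)

# "ExampleBot" is a hypothetical agent that is not listed in any group.
other_allowed = parser.can_fetch("ExampleBot", page)

# site_maps() requires Python 3.8 or later.
sitemaps = parser.site_maps()

print(httrack_allowed, other_allowed, sitemaps)
```

Under these assumptions the listed copiers and harvesters are refused every path, while an agent outside the group is allowed, because the file defines no catch-all `*` group.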

Comments

  • Website: https://www.insel-mauritius.de
  • ===================================
  • ===================================
  • Completely exclude the following spiders:
  • ===================================