instandhaltung.de
robots.txt

Robots Exclusion Standard data for instandhaltung.de

Resource Scan

Scan Details

Site Domain instandhaltung.de
Base Domain instandhaltung.de
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-08-09T12:07:25+00:00
Next Scan 2024-10-08T12:07:25+00:00

Last Successful Scan

Scanned2024-06-11T11:41:29+00:00
URL https://instandhaltung.de/robots.txt
Redirect https://www.instandhaltung.de:443/robots.txt
Redirect Domain www.instandhaltung.de
Redirect Base instandhaltung.de
Domain IPs 143.204.98.121, 143.204.98.14, 143.204.98.53, 143.204.98.6
Redirect IPs 13.33.30.129, 13.33.30.33, 13.33.30.47, 13.33.30.55
Response IP 13.33.30.33
Found Yes
Hash a2e0fed2b80bb0bbb7b8c3e643043915e5cbd0b7ed9512b42371925205235431
SimHash 796c0a342dbb

Groups

claudebot
searchmetricsbot

Rule Path
Disallow *

*

Rule Path
Disallow /check/
Disallow /contao/
Disallow /system/
Disallow /templates/
Disallow /vendor/
Disallow /share/index.php
Disallow /build.xml
Disallow /composer.json
Disallow /composer.lock
Disallow /README.md
Disallow /serp.html
Disallow /serp/serp-heftarchiv.html
Disallow /files/content/noindex/
Disallow /files/content/_heftarchiv/
Disallow /reader-preview/*
Disallow /oeffentliches-profil/letzte-kommentare/
Disallow /*?page*&page*
Disallow /*?page_i*
Disallow /api/
Disallow /login-ish.html
Disallow /login-ish/logout.html
Disallow /*?p=*
Disallow /*?cat=*
Disallow /*?s=*
Disallow /player/*
Disallow /*error.php*
Disallow /reader.html
Disallow /*?sc_*
Disallow /*%26sc_*
Disallow /*?emi=*
Disallow /*%26emi%3D*

Other Records

Field Value
sitemap https://www.instandhaltung.de/sitemap-index.xml

Comments

  • robots-ish.txt