canhack.de
robots.txt

Robots Exclusion Standard data for canhack.de

Resource Scan

Scan Details

Site Domain canhack.de
Base Domain canhack.de
Scan Status Ok
Last Scan2024-06-15T07:05:16+00:00
Next Scan 2024-06-22T07:05:16+00:00

Last Scan

Scanned2024-06-15T07:05:16+00:00
URL https://canhack.de/robots.txt
Redirect https://www.canhack.de/robots.txt
Redirect Domain www.canhack.de
Redirect Base canhack.de
Domain IPs 178.209.53.204, 2a02:418:6601::1
Redirect IPs 178.209.53.204, 2a02:418:6601::1
Response IP 178.209.53.204
Found Yes
Hash 8d762ab42af4a1a3e36a1a71c195298b324870153ad1ab0c1ae41885e79184e6
SimHash 320c9c82c8d1

Groups

*

Rule Path
Disallow /admin/
Disallow /attach_mod/
Disallow /db/
Disallow /files/
Disallow /includes/
Disallow /language/
Disallow /common.php
Disallow /config.php
Disallow /favorites.php
Disallow /groupcp.php
Disallow /karma.php
Disallow /karma_history.php
Disallow /memberlist.php
Disallow /modcp.php
Disallow /paywall.php
Disallow /posting.php
Disallow /privmsg.php
Disallow /progress.php
Disallow /redirect.php
Disallow /search.php
Disallow /uacp.php
Disallow /viewonline.php
Disallow /login.php
Disallow /*?printertopic=
Disallow /*%26share%3D
Disallow /*%26watch%3D

sistrix

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

crazywebcrawler-spider

Rule Path
Disallow /

crystalsemanticsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

sbsearch

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.canhack.de/sitemap_content.xml
sitemap https://www.canhack.de/sitemap_forums.xml