Robots Exclusion Standard data for canhack.de

Resource Scan

Scan Details

Site Domain canhack.de
Base Domain canhack.de
Scan Status Ok
Last Scan 2024-09-22T07:10:54+00:00
Next Scan 2024-09-29T07:10:54+00:00

Last Scan

Scanned 2024-09-22T07:10:54+00:00
URL https://canhack.de/robots.txt
Redirect https://www.canhack.de/robots.txt
Redirect Domain www.canhack.de
Redirect Base canhack.de
Domain IPs 178.209.53.204, 2a02:418:6601::1
Redirect IPs 178.209.53.204, 2a02:418:6601::1
Response IP 178.209.53.204
Found Yes
Hash bbb4b271f3b204551424da06f1f8a6a4418c4d8226b17f08f4842342b6f8756f
SimHash 720c9c02ccd1
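
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The scan does not say which algorithm was used or which bytes were hashed, so the following is only a sketch, assuming SHA-256 over the raw response body, of how a later fetch could be compared against the recorded value. The helper names and the sample body are hypothetical.

```python
import hashlib

# Hash value recorded by the scan (copied from the report above).
RECORDED_HASH = "bbb4b271f3b204551424da06f1f8a6a4418c4d8226b17f08f4842342b6f8756f"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value.

    Assumption: the scanner hashes the raw robots.txt bytes with SHA-256.
    """
    return sha256_hex(body) == recorded.lower()


# Placeholder content, NOT the real robots.txt of the scanned site.
sample_body = b"User-agent: *\nDisallow: /admin/\n"
digest = sha256_hex(sample_body)
same_length = len(digest) == len(RECORDED_HASH)
```

If the comparison fails for the real file, that may mean either that the file changed or that the assumption about the algorithm or input bytes is wrong.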

Groups

*

Rule Path
Disallow /admin/
Disallow /attach_mod/
Disallow /db/
Disallow /files/
Disallow /includes/
Disallow /language/
Disallow /common.php
Disallow /config.php
Disallow /favorites.php
Disallow /groupcp.php
Disallow /karma.php
Disallow /karma_history.php
Disallow /memberlist.php
Disallow /modcp.php
Disallow /paywall.php
Disallow /posting.php
Disallow /privmsg.php
Disallow /progress.php
Disallow /redirect.php
Disallow /search.php
Disallow /uacp.php
Disallow /viewonline.php
Disallow /login.php
Disallow /*?printertopic=
Disallow /*%26share%3D
Disallow /*%26watch%3D

sistrix

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

crazywebcrawler-spider

Rule Path
Disallow /

crystalsemanticsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

sbsearch

Rule Path
Disallow /

claudebot

Rule Path
Disallow /
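
As a rough illustration of how the groups above apply to a crawler, the sketch below feeds an abbreviated reconstruction of the rules to Python's standard `urllib.robotparser`. The robots.txt text is reassembled from the scan listing (the original file's exact wording and order are not shown), only a few rules are included, and `examplebot` is a hypothetical user agent that falls under the `*` group. Note that the standard-library parser matches paths by prefix and does not interpret `*` wildcards inside a path, so rules such as `/*?printertopic=` are left out here.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction of the scanned rules (assumption: the real
# file's exact text is not shown in the scan, only the parsed groups).
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Disallow: /search.php
Disallow: /login.php

User-agent: claudebot
Disallow: /

User-agent: semrushbot
Disallow: /
"""

BASE = "https://www.canhack.de"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = build_parser(ROBOTS_TXT)

# "examplebot" is a made-up agent with no group of its own, so the "*"
# group applies; "claudebot" has a dedicated group that blocks everything.
results = {
    ("examplebot", "/index.php"): rules.can_fetch("examplebot", BASE + "/index.php"),
    ("examplebot", "/admin/index.php"): rules.can_fetch("examplebot", BASE + "/admin/index.php"),
    ("examplebot", "/search.php"): rules.can_fetch("examplebot", BASE + "/search.php"),
    ("claudebot", "/index.php"): rules.can_fetch("claudebot", BASE + "/index.php"),
}

for (agent, path), allowed in results.items():
    print(f"{agent:12} {path:20} {'allowed' if allowed else 'blocked'}")
```

A crawler that needs wildcard support for the percent-encoded and query-string rules would have to use a parser that implements pattern matching rather than the prefix matching shown here.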

Other Records

Field Value
sitemap https://www.canhack.de/sitemap_content.xml
sitemap https://www.canhack.de/sitemap_forums.xml
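
The sitemap records can be read with the same standard-library parser. This is a minimal sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available) and a reconstructed robots.txt fragment that contains only the two sitemap lines from the table above.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment: only the sitemap records from the scan, plus a
# catch-all group so the text resembles a complete robots.txt file.
SITEMAP_FRAGMENT = """\
User-agent: *
Disallow: /admin/

Sitemap: https://www.canhack.de/sitemap_content.xml
Sitemap: https://www.canhack.de/sitemap_forums.xml
"""

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_FRAGMENT.splitlines())

# site_maps() returns a list of URLs, or None when no sitemap is declared.
sitemaps = sitemap_parser.site_maps() or []

for url in sitemaps:
    print(url)
```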