hardcore.lt
robots.txt

Robots Exclusion Standard data for hardcore.lt

Resource Scan

Scan Details

Site Domain hardcore.lt
Base Domain hardcore.lt
Scan Status Ok
Last Scan2024-10-29T13:01:35+00:00
Next Scan 2024-11-28T13:01:35+00:00

Last Scan

Scanned2024-10-29T13:01:35+00:00
URL https://hardcore.lt/robots.txt
Domain IPs 195.181.245.217
Response IP 195.181.245.217
Found Yes
Hash e212515b9cd125904037b8b46a90f6f25ea4cf87981e4bf6975f746539893b29
SimHash f762595ad9e3

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /c4p_log/
Disallow /cdn-cgi/
Disallow /wp-admin/
Disallow /?s=*
Disallow /t%3D*
Disallow /*/?share=*
Disallow /*?relatedposts=*
Disallow /?p=*
Disallow /go/
Disallow /download/
Disallow /?cat=*
Disallow /?author=*
Disallow /archives/
Disallow /recommended/
Allow /wp-admin/admin-ajax.php
Allow /wp-content/s/css/
Allow /wp-content/s/js/

teleport

Rule Path
Disallow /

teleportpro
emailcollector
emailsiphon
webbandit
webzip
webreaper
webstripper
web downloader
webcopier
offline explorer pro
httrack website copier
offline commander
leech
websnake
blackwidow
http weazel
sentibot

Rule Path
Disallow /calendar/action~posterboard/
Disallow /calendar/action~agenda/
Disallow /calendar/action~oneday/
Disallow /calendar/action~month/
Disallow /calendar/action~week/
Disallow /calendar/action~stream/
Disallow /calendar/action~undefined/
Disallow /calendar/action~http%3A/
Disallow /calendar/action~default/
Disallow /calendar/action~poster/
Disallow /calendar/action~*/
Disallow /*controller%3Dai1ec_exporter_controller*
Disallow /*/action~*/