hacker10.com
robots.txt

Robots Exclusion Standard data for hacker10.com

Resource Scan

Scan Details

Site Domain hacker10.com
Base Domain hacker10.com
Scan Status Ok
Last Scan 2025-09-01T08:58:28+00:00
Next Scan 2025-09-08T08:58:28+00:00

Last Scan

Scanned 2025-09-01T08:58:28+00:00
URL https://hacker10.com/robots.txt
Domain IPs 104.36.62.101
Response IP 104.36.62.101
Found Yes
Hash 1616c9c69d5d737d73f863344ebe75c0ab4b5790a1571bb8b7fc18305e2df3d8
SimHash 134b76c44c73

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-content/upgrade/
Disallow /wp-includes/
Allow /wp-content/uploads/

wget

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /
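As a rough illustration of how the groups above apply to a crawler, the sketch below feeds a partial reconstruction of the rules into Python's standard urllib.robotparser. The robots.txt text is an assumption rebuilt from the parsed groups (only the `*` and `wget` groups are included, and the original file's exact layout is not shown in this report). The `ExampleBot` agent name, the test paths, and the `build_parser` helper are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the scanned rules. The directive layout is an
# assumption: the report lists parsed groups, not the raw file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /wp-admin/
Disallow: /wp-content/plugins/
Disallow: /wp-content/themes/
Disallow: /wp-content/upgrade/
Disallow: /wp-includes/
Allow: /wp-content/uploads/

User-agent: wget
Disallow: /
"""

BASE = "https://hacker10.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text locally, without any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# A generic crawler (hypothetical name) falls under the "*" group.
admin_allowed = parser.can_fetch("ExampleBot", BASE + "/wp-admin/")
upload_allowed = parser.can_fetch("ExampleBot", BASE + "/wp-content/uploads/a.png")
home_allowed = parser.can_fetch("ExampleBot", BASE + "/")

# wget has its own group that disallows the whole site.
wget_allowed = parser.can_fetch("Wget/1.21", BASE + "/")

print(admin_allowed, upload_allowed, home_allowed, wget_allowed)
```

Note that urllib.robotparser applies the first matching rule within a group, which may differ from crawlers that use longest-match precedence. For the rules listed here the two approaches should agree, since no Allow path overlaps a Disallow path.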

Other Records

Field Value
sitemap https://www.hacker10.com/sitemap.xml.gz
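The sitemap record can be read back with the same standard-library parser. This is a minimal sketch, assuming the record appears in the file as a `Sitemap:` line and that Python 3.8 or later is available (`site_maps()` was added in that release).

```python
from urllib.robotparser import RobotFileParser

# Assumed raw form of the sitemap record shown in the table above.
lines = ["Sitemap: https://www.hacker10.com/sitemap.xml.gz"]

parser = RobotFileParser()
parser.parse(lines)

# site_maps() returns None when no Sitemap lines were found.
sitemaps = parser.site_maps() or []
print(sitemaps)
```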

Comments

  • Sitemap link (CHANGE for every site!!)
  • All crawling bots
  • Disallow Wordpress admin directories crawling
  • Allow Wordpress image crawling
  • Disallow wget
  • Disallow automatic downloaders