hacker10.com
robots.txt

Robots Exclusion Standard data for hacker10.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	hacker10.com
Base Domain	hacker10.com
Scan Status	Ok
Last Scan	2025-09-01T08:58:28+00:00
Next Scan	2025-09-08T08:58:28+00:00

Last Scan

Scanned	2025-09-01T08:58:28+00:00
URL	https://hacker10.com/robots.txt
Domain IPs	104.36.62.101
Response IP	104.36.62.101
Found	Yes
Hash	1616c9c69d5d737d73f863344ebe75c0ab4b5790a1571bb8b7fc18305e2df3d8
SimHash	134b76c44c73

Groups

*

Rule	Path
Disallow	/cgi-bin/
Disallow	/wp-admin/
Disallow	/wp-content/plugins/
Disallow	/wp-content/themes/
Disallow	/wp-content/upgrade/
Disallow	/wp-includes/
Allow	/wp-content/uploads/

Rule

Path

Disallow

/cgi-bin/

Disallow

/wp-admin/

Disallow

/wp-content/plugins/

Disallow

/wp-content/themes/

Disallow

/wp-content/upgrade/

Disallow

/wp-includes/

Allow

/wp-content/uploads/

wget

Rule	Path
Disallow	/

Rule

Path

Disallow

teleport

Rule	Path
Disallow	/

Rule

Path

Disallow

teleportpro

Rule	Path
Disallow	/

Rule

Path

Disallow

emailcollector

Rule	Path
Disallow	/

Rule

Path

Disallow

emailsiphon

Rule	Path
Disallow	/

Rule

Path

Disallow

webbandit

Rule	Path
Disallow	/

Rule

Path

Disallow

webzip

Rule	Path
Disallow	/

Rule

Path

Disallow

webreaper

Rule	Path
Disallow	/

Rule

Path

Disallow

webstripper

Rule	Path
Disallow	/

Rule

Path

Disallow

web downloader

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier

Rule	Path
Disallow	/

Rule

Path

Disallow

offline explorer pro

Rule	Path
Disallow	/

Rule

Path

Disallow

httrack website copier

Rule	Path
Disallow	/

Rule

Path

Disallow

offline commander

Rule	Path
Disallow	/

Rule

Path

Disallow

leech

Rule	Path
Disallow	/

Rule

Path

Disallow

websnake

Rule	Path
Disallow	/

Rule

Path

Disallow

blackwidow

Rule	Path
Disallow	/

Rule

Path

Disallow

http weazel

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://www.hacker10.com/sitemap.xml.gz

Field

Value

sitemap

https://www.hacker10.com/sitemap.xml.gz

Comments

Sitemap link (CHANGE for every site!!)
All crawling bots
Disallow Wordpress admin directories crawling
Allow Wordpress image crawling
Disallow wget
Disallow automatic downloaders

hacker10.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

wget

teleport

teleportpro

emailcollector

emailsiphon

webbandit

webzip

webreaper

webstripper

web downloader

webcopier

offline explorer pro

httrack website copier

offline commander

leech

websnake

blackwidow

http weazel

Other Records

Comments

hacker10.com
robots.txt