cwejman.net
robots.txt

Robots Exclusion Standard data for cwejman.net

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cwejman.net
Base Domain	cwejman.net
Scan Status	Ok
Last Scan	2024-10-30T01:52:19+00:00
Next Scan	2024-11-29T01:52:19+00:00

Last Scan

Scanned	2024-10-30T01:52:19+00:00
URL	https://cwejman.net/robots.txt
Redirect	https://www.nimr.org/robots.txt
Redirect Domain	www.nimr.org
Redirect Base	nimr.org
Domain IPs	104.21.81.158, 172.67.190.2, 2606:4700:3033::ac43:be02, 2606:4700:3037::6815:519e
Redirect IPs	104.18.30.10, 104.18.31.10, 2606:4700::6812:1e0a, 2606:4700::6812:1f0a
Response IP	104.18.31.10
Found	Yes
Hash	e1e6140409444a8f31efc9e0bff08d0fe62bddfca59ea26d8c2bcd73bb2c9c83
SimHash	ba4b78c26911

Groups

teleport

Rule	Path
Disallow	/

Rule

Path

Disallow

teleportpro

Rule	Path
Disallow	/

Rule

Path

Disallow

emailcollector

Rule	Path
Disallow	/

Rule

Path

Disallow

emailsiphon

Rule	Path
Disallow	/

Rule

Path

Disallow

webbandit

Rule	Path
Disallow	/

Rule

Path

Disallow

webzip

Rule	Path
Disallow	/

Rule

Path

Disallow

webreaper

Rule	Path
Disallow	/

Rule

Path

Disallow

webstripper

Rule	Path
Disallow	/

Rule

Path

Disallow

web downloader

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier

Rule	Path
Disallow	/

Rule

Path

Disallow

offline explorer pro

Rule	Path
Disallow	/

Rule

Path

Disallow

offline explorer

Rule	Path
Disallow	/

Rule

Path

Disallow

httrack website copier

Rule	Path
Disallow	/

Rule

Path

Disallow

offline commander

Rule	Path
Disallow	/

Rule

Path

Disallow

leech

Rule	Path
Disallow	/

Rule

Path

Disallow

websnake

Rule	Path
Disallow	/

Rule

Path

Disallow

blackwidow

Rule	Path
Disallow	/

Rule

Path

Disallow

http weazel

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Disallow	/wp-admin/
Disallow	/wp-includes/

Rule

Path

Disallow

/wp-admin/

Disallow

/wp-includes/

cwejman.netrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

teleport

teleportpro

emailcollector

emailsiphon

webbandit

webzip

webreaper

webstripper

web downloader

ahrefsbot

semrushbot

mj12bot

webcopier

offline explorer pro

offline explorer

httrack website copier

offline commander

leech

websnake

blackwidow

http weazel

*

cwejman.net
robots.txt