freetheocean.com
robots.txt

Robots Exclusion Standard data for freetheocean.com

Resource Scan

Scan Details

Site Domain	freetheocean.com
Base Domain	freetheocean.com
Scan Status	Ok
Last Scan	2024-11-12T18:05:34+00:00
Next Scan	2024-11-19T18:05:34+00:00

Last Scan

Scanned	2024-11-12T18:05:34+00:00
URL	https://freetheocean.com/robots.txt
Redirect	https://www.freetheocean.com/robots.txt
Redirect Domain	www.freetheocean.com
Redirect Base	freetheocean.com
Domain IPs	23.185.0.3, 2620:12a:8000::3, 2620:12a:8001::3
Redirect IPs	23.185.0.3, 2620:12a:8000::3, 2620:12a:8001::3
Response IP	23.185.0.3
Found	Yes
Hash	39e9970af95f6f182d06a04a9df303bd8c41279f72c4563f929703030b7e8a29
SimHash	2276dd40c68a

Groups

*

Rule	Path
Disallow	/cgi-bin/
Disallow	/tmp/
Disallow	/junk/
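
As a rough illustration of how a crawler might interpret the `*` group above, the following sketch feeds an excerpt to Python's standard `urllib.robotparser`. The excerpt text is reconstructed from the table rather than copied from the live file, and the `ExampleBot` agent name and the `is_allowed` helper are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the table above (an assumption, not the verbatim file).
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /junk/
"""

def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` under the excerpt's rules."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_EXCERPT.splitlines())
    # The scan reports a redirect to the www host, so that host is used here.
    return parser.can_fetch(agent, "https://www.freetheocean.com" + path)

if __name__ == "__main__":
    for path in ("/", "/tmp/file.txt", "/cgi-bin/test", "/junk/"):
        print(path, is_allowed(path))
```

Rules are matched as path prefixes, so anything under the three listed directories is refused while the rest of the site stays open to agents that fall under this group.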

ahrefsbot

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

blexbot

Rule	Path
Disallow	/

aspiegelbot

Rule	Path
Disallow	/

yandexbot

Rule	Path
Disallow	/

megaindex

Rule	Path
Disallow	/

spbot

Rule	Path
Disallow	/

seokicks-robot

Rule	Path
Disallow	/

ltx71

Rule	Path
Disallow	/

sistrix

Rule	Path
Disallow	/

linkfluence

Rule	Path
Disallow	/

petalbot

Rule	Path
Disallow	/

bubing

Rule	Path
Disallow	/

coccoc

Rule	Path
Disallow	/

exabot

Rule	Path
Disallow	/

grapeshotcrawler

Rule	Path
Disallow	/

proximic

Rule	Path
Disallow	/

sogou

Rule	Path
Disallow	/

seekport

Rule	Path
Disallow	/

baiduspider

Rule	Path
Disallow	/

twengabot

Rule	Path
Disallow	/

yeti

Rule	Path
Disallow	/

zumbot

Rule	Path
Disallow	/

wget

Rule	Path
Disallow	/

httrack

Rule	Path
Disallow	/

wget

Rule	Path
Disallow	/

curl

Rule	Path
Disallow	/

*

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Comments

Block Bad Bots
Block all user agents trying to access the site too frequently
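
The two comments suggest a file that blocks named bots outright and rate-limits everyone else. A minimal sketch of how those records could be read with Python's standard `urllib.robotparser` follows. The excerpt is an assumed reconstruction: only one of the blocked agents is shown, and attaching `Crawl-delay: 10` to the rule-less second `*` group is a guess based on the scan output, not something the report states. `ExampleBot` is a hypothetical agent name.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction; the real file lists many more blocked agents.
ROBOTS_SAMPLE = """\
User-agent: AhrefsBot
Disallow: /

User-agent: *
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(ROBOTS_SAMPLE.splitlines())

HOME = "https://www.freetheocean.com/"

# A named "bad bot" is refused everywhere.
blocked = not parser.can_fetch("AhrefsBot", HOME)

# Any other agent falls through to the * group: allowed, but asked to wait.
other_allowed = parser.can_fetch("ExampleBot", HOME)
delay_seconds = parser.crawl_delay("ExampleBot")

if __name__ == "__main__":
    print("AhrefsBot blocked:", blocked)
    print("ExampleBot allowed:", other_allowed)
    print("ExampleBot crawl delay:", delay_seconds)
```

A polite crawler would sleep for `delay_seconds` between requests. `Crawl-delay` is a non-standard extension, so not every crawler honors it.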
