fromocean.com
robots.txt

Robots Exclusion Standard data for fromocean.com

Resource Scan

Scan Details

Site Domain fromocean.com
Base Domain fromocean.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2025-06-15T17:41:17+00:00
Next Scan 2025-09-13T17:41:17+00:00

Last Successful Scan

Scanned2025-02-09T17:39:37+00:00
URL https://fromocean.com/robots.txt
Domain IPs 104.21.10.35, 172.67.189.228, 2606:4700:3035::6815:a23, 2606:4700:3035::ac43:bde4
Response IP 104.21.10.35
Found Yes
Hash a59f054a908e62b336af8db3901fef43329f21065612aafc799117ef8766ec7b
SimHash f87ad872ee33

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /*?add_to_wishlist=*
Disallow /*?add-to-cart=*
Disallow /cheap/?add-to-cart=*
Allow /wp-admin/admin-ajax.php

turnitinbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

bingbot/2.0

Rule Path
Disallow /*?add_to_wishlist=*
Disallow /*?add-to-cart=*
Disallow /*add-to-cart%3D*

*

Rule Path
Disallow /*add-to-cart%3D*

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

domainstatsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

spbot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://fromocean.com/sitemap_index.xml

Warnings

  • 2 invalid lines.