Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Hey everyone - Gus from Coiled (and formerly dku) here. We recently started discussions with @DataikuCarlyT and the Dataiku team about how DSS users might use Dask. Dask is used across end-to-end ML workloads as well more general scientific computing applications.
Based on my experience at Dataiku and Coiled, I wanted to offer the following recommendations to DSS Python users interested in Dask:
import coiled
cluster = coiled.Cluster(n_workers=10, name="dask_from_dku")
from dask.distributed import Client
client = Client(cluster)
print('Dashboard:', client.dashboard_link)
I think there is some exploration required to read/write with the dataiku dataset API. If completed, this work might be best packaged as a DSS plugin.
I look forward to discussions with the community