Posts tagged dask

Dask DataFrame is Fast Now

May 30, 2024

This work was engineered and supported by Coiled and NVIDIA. Thanks to Patrick Hoefler and Rick Zamora, in particular. Original version of this post appears on docs.coiled.io

Read more ...

High Level Query Optimization in Dask

Aug 25, 2023

This work was engineered and supported by Coiled and NVIDIA. Thanks to Patrick Hoefler and Rick Zamora, in particular. Original version of this post appears on blog.coiled.io

Read more ...

Upstream testing in Dask

Apr 18, 2023

Original version of this post appears on blog.coiled.io

Read more ...

Shuffling large data at constant memory in Dask

Mar 15, 2023

This work was engineered and supported by Coiled. In particular, thanks to Florian Jetter, Gabe Joseph, Hendrik Makait, and Matt Rocklin. Original version of this post appears on blog.coiled.io

Read more ...

Managing dask workloads with Flyte

Feb 13, 2023

It is now possible to manage dask workloads using Flyte 🎉!

Read more ...

Measuring Dask memory usage with dask-memusage

Mar 11, 2021

Using too much computing resources can get expensive when you’re scaling up in the cloud.

Read more ...

Comparing Dask-ML and Ray Tune's Model Selection Algorithms

Aug 06, 2020

Hyperparameter optimization is the process of deducing model parameters that can’t be learned from data. This process is often time- and resource-consuming, especially in the context of deep learning. A good description of this process can be found at “Tuning the hyper-parameters of an estimator,” and the issues that arise are concisely summarized in Dask-ML’s documentation of “Hyper Parameter Searches.”

Read more ...

DataFrame Groupby Aggregations

Oct 08, 2019

Document headings start at H2, not H1 [myst.header]

Read more ...

Dask on HPC

Jun 12, 2019

We analyze large datasets on HPC systems with Dask, a parallel computing library that integrates well with the existing Python software ecosystem, and works comfortably with native HPC hardware.

Read more ...

Composing Dask Array with Numba Stencils

Apr 09, 2019

In this post we explore four array computing technologies, and how they work together to achieve powerful results.

Read more ...

cuML and Dask hyperparameter optimization

Mar 27, 2019

Document headings start at H3, not H1 [myst.header]

Read more ...

Extension Arrays in Dask DataFrame

Jan 22, 2019

This work is supported by Anaconda Inc

Read more ...

Dask Version 1.0

Nov 29, 2018

We are pleased to announce the release of Dask version 1.0.0!

Read more ...

Refactor Documentation

Sep 27, 2018

This work is supported by Anaconda Inc

Read more ...

Dask Development Log

Sep 17, 2018

This work is supported by Anaconda Inc

Read more ...

Dask Release 0.19.0

Sep 05, 2018

This work is supported by Anaconda Inc.

Read more ...

High level performance of Pandas, Dask, Spark, and Arrow

Aug 28, 2018

This work is supported by Anaconda Inc

Read more ...

Building SAGA optimization for Dask arrays

Aug 07, 2018

This work is supported by ETH Zurich, Anaconda Inc, and the Berkeley Institute for Data Science

Read more ...

Dask Development Log

Aug 02, 2018

This work is supported by Anaconda Inc

Read more ...

Pickle isn't slow, it's a protocol

Jul 23, 2018

This work is supported by Anaconda Inc

Read more ...

Dask Development Log, Scipy 2018

Jul 17, 2018

This work is supported by Anaconda Inc

Read more ...

Who uses Dask?

Jul 16, 2018

This work is supported by Anaconda Inc

Read more ...

Dask Development Log

Jul 08, 2018

This work is supported by Anaconda Inc

Read more ...

Dask Scaling Limits

Jun 26, 2018

This work is supported by Anaconda Inc.

Read more ...

Dask Release 0.18.0

Jun 14, 2018

This work is supported by Anaconda Inc.

Read more ...

Beyond Numpy Arrays in Python

May 27, 2018

Document headings start at H2, not H1 [myst.header]

Read more ...

Dask Release 0.17.2

Mar 21, 2018

This work is supported by Anaconda Inc. and the Data Driven Discovery Initiative from the Moore Foundation.

Read more ...

Dask Release 0.17.0

Feb 12, 2018

This work is supported by Anaconda Inc. and the Data Driven Discovery Initiative from the Moore Foundation.

Read more ...

Pangeo: JupyterHub, Dask, and XArray on the Cloud

Jan 22, 2018

This work is supported by Anaconda Inc, the NSF EarthCube program, and UC Berkeley BIDS

Read more ...

Dask Development Log

Dec 06, 2017

This work is supported by Anaconda Inc and the Data Driven Discovery Initiative from the Moore Foundation

Read more ...

Dask Release 0.16.0

Nov 21, 2017

This work is supported by Anaconda Inc. and the Data Driven Discovery Initiative from the Moore Foundation.

Read more ...

Optimizing Data Structure Access in Python

Nov 03, 2017

This work is supported by Anaconda Inc and the Data Driven Discovery Initiative from the Moore Foundation

Read more ...

Streaming Dataframes

Oct 16, 2017

This work is supported by Anaconda Inc and the Data Driven Discovery Initiative from the Moore Foundation

Read more ...

Notes on Kafka in Python

Oct 10, 2017

Document headings start at H2, not H1 [myst.header]

Read more ...

Dask Release 0.15.3

Sep 24, 2017

This work is supported by Anaconda Inc. and the Data Driven Discovery Initiative from the Moore Foundation.

Read more ...

Fast GeoSpatial Analysis in Python

Sep 21, 2017

This work is supported by Anaconda Inc., the Data Driven Discovery Initiative from the Moore Foundation, and NASA SBIR NNX16CG43P

Read more ...

Dask on HPC - Initial Work

Sep 18, 2017

This work is supported by Anaconda Inc. and the NSF EarthCube program.

Read more ...

Dask Release 0.15.2

Aug 30, 2017

This work is supported by Anaconda Inc. and the Data Driven Discovery Initiative from the Moore Foundation.

Read more ...

Dask Benchmarks

Jul 03, 2017

This work is supported by Continuum Analytics and the Data Driven Discovery Initiative from the Moore Foundation.

Read more ...

Use Apache Parquet

Jun 28, 2017

This work is supported by Continuum Analytics and the Data Driven Discovery Initiative from the Moore Foundation.

Read more ...

Dask Release 0.15.0

Jun 15, 2017

This work is supported by Continuum Analytics and the Data Driven Discovery Initiative from the Moore Foundation.

Read more ...

Dask Release 0.14.3

May 08, 2017

This work is supported by Continuum Analytics and the Data Driven Discovery Initiative from the Moore Foundation.

Read more ...

Dask Release 0.14.1

Mar 23, 2017

This work is supported by Continuum Analytics, the XDATA Program, and the Data Driven Discovery Initiative from the Moore Foundation.

Read more ...

Dask Distributed Release 1.13.0

Sep 12, 2016

I’m pleased to announce a release of Dask’s distributed scheduler, dask.distributed, version 1.13.0.

Read more ...

Dask for Institutions

Aug 16, 2016

Read more ...

Dask and Scikit-Learn -- Model Parallelism

Jul 12, 2016

This post was written by Jim Crist. The original post lives at http://jcrist.github.io/dask-sklearn-part-1.html (with better styling)

Read more ...

Ad Hoc Distributed Random Forests

Apr 20, 2016

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Fast Message Serialization

Apr 14, 2016

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Distributed Dask Arrays

Feb 26, 2016

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Pandas on HDFS with Dask Dataframes

Feb 22, 2016

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Introducing Dask distributed

Feb 17, 2016

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Dask is one year old

Dec 21, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Distributed Prototype

Oct 09, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Caching

Aug 03, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Custom Parallel Workflows

Jul 23, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Write Complex Parallel Algorithms

Jun 26, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Distributed Scheduling

Jun 23, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

State of Dask

May 19, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Towards Out-of-core DataFrames

Mar 11, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Towards Out-of-core ND-Arrays -- Dask + Toolz = Bag

Feb 17, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Towards Out-of-core ND-Arrays -- Slicing and Stacking

Feb 13, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Towards Out-of-core ND-Arrays -- Spilling to Disk

Jan 16, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Towards Out-of-core ND-Arrays -- Benchmark MatMul

Jan 14, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Towards Out-of-core ND-Arrays -- Multi-core Scheduling

Jan 06, 2015

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Towards Out-of-core ND-Arrays -- Frontend

Dec 30, 2014

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...

Towards Out-of-core ND-Arrays

Dec 27, 2014

This work is supported by Continuum Analytics and the XDATA Program as part of the Blaze Project

Read more ...