<?xml version='1.0' encoding='UTF-8'?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
  <id>https://blog.dask.org</id>
  <title>Dask Working Notes - Posts by Mike McCarty (Capital One Center for Machine Learning) and Matthew Rocklin (Coiled Computing)</title>
  <updated>2026-03-05T15:05:20.239569+00:00</updated>
  <link href="https://blog.dask.org"/>
  <link href="https://blog.dask.org/blog/author/mike-mccarty-capital-one-center-for-machine-learning-and-matthew-rocklin-coiled-computing/atom.xml" rel="self"/>
  <generator uri="https://ablog.readthedocs.io/" version="0.11.12">ABlog</generator>
  <entry>
    <id>https://blog.dask.org/2020/04/28/dask-summit/</id>
    <title>Dask Summit</title>
    <updated>2020-04-28T00:00:00+00:00</updated>
    <author>
      <name>Mike McCarty (Capital One Center for Machine Learning) and Matthew Rocklin (Coiled Computing)</name>
    </author>
    <content type="html">&lt;p&gt;In late February, members of the Dask community gathered in Washington, DC.
This was a mix of open source project maintainers
and active users from a broad range of institutions.
This post shares a summary of what happened at this workshop,
including slides, images, and lessons learned.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Note: this event happened just before the widespread effects of the COVID-19
outbreak in the US and Europe. We were glad to see each other, but wouldn’t recommend doing this today.&lt;/em&gt;&lt;/p&gt;
&lt;section id="who-came"&gt;
&lt;h1&gt;Who came?&lt;/h1&gt;
&lt;p&gt;This was an invite-only event of fifty people, with a cap of three people per
organization. We intentionally invited an even mix: half self-identified as
open source maintainers, and half as institutional users. We had attendees
from academia, small startups, tech companies, government institutions, and
large enterprises. It was surprising how much we all had in common.
Attendees came from the following organizations:&lt;/p&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Anaconda&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Berkeley Institute for Data Science&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Blue Yonder&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Brookhaven National Lab&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Capital One&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Chan Zuckerberg Initiative&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Coiled Computing&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Columbia University&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;D. E. Shaw &amp;amp; Co.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Flatiron Health&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Howard Hughes Medical Institute, Janelia Research Campus&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Inria&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Kitware&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Lawrence Berkeley National Lab&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Los Alamos National Laboratory&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;MetroStar Systems&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Microsoft&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;NIMH&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;NVIDIA&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;National Center for Atmospheric Research (NCAR)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;National Energy Research Scientific Computing (NERSC) Center&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Prefect&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Quansight&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Related Sciences&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Saturn Cloud&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Smithsonian Institution&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;SymphonyRM&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;The HDF Group&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;USGS&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Ursa Labs&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/section&gt;
&lt;section id="objectives"&gt;
&lt;h1&gt;Objectives&lt;/h1&gt;
&lt;p&gt;The Dask community comes from a broad range of backgrounds.
It’s an odd bunch, all solving very different problems,
but all with a surprisingly common set of needs.
We’ve all known each other on GitHub for several years,
and have a long shared history, but many of us had never met in person.&lt;/p&gt;
&lt;p&gt;In hindsight, this workshop served two main purposes:&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;&lt;p&gt;It helped us to see that we were all struggling with the same problems
and so helped to form direction and motivate future work&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;It helped us to create social bonds and collaborations that help us manage
the day-to-day challenges of building and maintaining community software
across organizations&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
&lt;section id="structure"&gt;
&lt;h1&gt;Structure&lt;/h1&gt;
&lt;p&gt;We met for three days.&lt;/p&gt;
&lt;p&gt;On days 1-2 we started with quick talks from the attendees and followed with
afternoon working sessions.&lt;/p&gt;
&lt;p&gt;Talks were short, around 10-15 minutes each
(having only experts in the room meant that we could easily skip the introductory material),
and always had the same structure:&lt;/p&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p&gt;A brief description of the domain that they’re in and why it’s important&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Example: We look at seismic readings from thousands of measurement devices around
the world to understand and predict catastrophic earthquakes&lt;/em&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;How they use Dask to solve this problem&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Example: this means that we need to cross-correlate thousands of very
long timeseries. We use Xarray on AWS with some custom operations.&lt;/em&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;What is wrong with Dask, and what they would like to see improved&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Example: It turns out that our axes labels can grow larger than what
Xarray was designed for. Also, the task graph size for Dask can become a
limitation&lt;/em&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;These talks were structured into six sections:&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;&lt;p&gt;Workflow and pipelines&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Deployment&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Imaging&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;General data analysis&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Performance and tooling&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Xarray&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;We didn’t capture video, but we do have slides from each of the talks below.&lt;/p&gt;
&lt;/section&gt;
&lt;section id="workflow-and-pipelines"&gt;
&lt;h1&gt;1: Workflow and Pipelines&lt;/h1&gt;
&lt;section id="blue-yonder"&gt;
&lt;h2&gt;Blue Yonder&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: ETL Pipelines for Machine Learning&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Florian Jetter&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Also attending:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Nefta Kanilmaz&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Lucas Rademaker&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vSk2zAnSmzpbz5BgK70mpPmeQeV4h1IkCQh-EU8KXrZFJQGHmlMTuHvln3CmOQVTg/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="360" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;section id="prefect"&gt;
&lt;h2&gt;Prefect&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Prefect + Dask: Parallel / Distributed Workflows&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Chris White, CTO&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="//www.slideshare.net/slideshow/embed_code/key/4wiUwkDHmdzVTW" width="595" height="485" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" style="border:1px solid #CCC; border-width:1px; margin-bottom:5px; max-width: 100%;" allowfullscreen&gt; &lt;/iframe&gt; &lt;div style="margin-bottom:5px"&gt; &lt;strong&gt; &lt;a href="//www.slideshare.net/ChrisWhite249/dask-prefect" title="Dask + Prefect" target="_blank"&gt;Dask + Prefect&lt;/a&gt; &lt;/strong&gt; from &lt;strong&gt;&lt;a href="https://www.slideshare.net/ChrisWhite249" target="_blank"&gt;Chris White&lt;/a&gt;&lt;/strong&gt; &lt;/div&gt;
&lt;/section&gt;
&lt;section id="symphonyrm"&gt;
&lt;h2&gt;SymphonyRM&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Dask and Prefect for Data Science in Healthcare&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenter: Joe Schmid, CTO&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vSCDbXrXtrL9vmA0hQ1NNk5DY0-3Azpcf9FbYgjoLuKV79vf_nm7wdUZl1NsL5DZqRmlUTP--u9HM56/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="366" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;/section&gt;
&lt;section id="deployment"&gt;
&lt;h1&gt;2: Deployment&lt;/h1&gt;
&lt;section id="quansight"&gt;
&lt;h2&gt;Quansight&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Building Cloud-based Data Science Platforms with Dask&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Dharhas Pothina&lt;/p&gt;&lt;/li&gt;
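&lt;li&gt;&lt;p&gt;Also attending:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;James Bourbeau&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Dhavide Aruliah&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;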
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vSZ1fSrkWvzMPlx-f0qk7w2xj_uDp5q-Tg11S9UlynoohZV0VYjdFduDUrAdhptSYfpzFu9Wask1WSN/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="479" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;section id="nvidia-and-microsoft-azure"&gt;
&lt;h2&gt;NVIDIA and Microsoft/Azure&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Native Cloud Deployment with Dask-Cloudprovider&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Jacob Tomlinson, Tom Drabas, and Cody Peterson&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vT-B1c0r8MWMF8wvW4lNly-qmOCqhFqKdhshXnVql6UVkYQ-aGprY3Du0VH0PJBccOmM84ncw0lDV77/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="366" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;section id="inria"&gt;
&lt;h2&gt;Inria&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: HPC Deployments with Dask-Jobqueue&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Loïc Esteve&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://lesteve.github.io/talks/2020-dask-jobqueue-dask-workshop/slides.html" frameborder="0" width="1000" height="800"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;section id="anaconda"&gt;
&lt;h2&gt;Anaconda&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Title: Dask Gateway&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Jim Crist&lt;/p&gt;&lt;/li&gt;
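&lt;li&gt;&lt;p&gt;Also attending:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Tom Augspurger&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Eric Dill&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Jonathan Helmus&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;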
&lt;style&gt;
    iframe {
        overflow:hidden;
    }
&lt;/style&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="http://jcrist.github.io/talks/dask_summit_2020/slides.html" frameborder="1" width="600" height="355" scrolling="no"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;/section&gt;
&lt;section id="imaging"&gt;
&lt;h1&gt;3: Imaging&lt;/h1&gt;
&lt;section id="kitware"&gt;
&lt;h2&gt;Kitware&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Scientific Image Analysis and Visualization with ITK&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Matt McCormick&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vRz2SV2G-1LEXXCF0n9vugF13s7ABpLDT-yH3WtxQEOjt2FVHE7apl3nQhqkOiLeY9kSzM_Mrs6fJOk/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="366" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;section id="id1"&gt;
&lt;h2&gt;Kitware&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Image processing with X-rays and electrons&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Marcus Hanwell&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vRT--l76IcSPlIP_N6ClUtm2ECZaxkvIGrBNyyoFmJNQu6kS6CilWoleIMCur2FQ7ZpEkkCsw7UXnRd/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="366" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;section id="national-institutes-of-mental-health"&gt;
&lt;h2&gt;National Institute of Mental Health&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Brain imaging&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: John Lee&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vTH1X0cSjozmCDvSQ8CtcxPPYejkLROC_b92W6uwznG5litWq_MwKJzUMnAQi0Prw/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="366" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;section id="janelia-howard-hughes-medical-institute"&gt;
&lt;h2&gt;Janelia / Howard Hughes Medical Institute&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Spark, Dask, and FlyEM HPC&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Stuart Berg&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vSnZ-JgHAoAOUirqmLcI3GaKyC4oVo3vThZZ4oyx8vZjJ66An09JIhbcoy6k7ufTw/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="479" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;/section&gt;
&lt;section id="general-data-analysis"&gt;
&lt;h1&gt;4: General Data Analysis&lt;/h1&gt;
&lt;section id="brookhaven-national-labs"&gt;
&lt;h2&gt;Brookhaven National Labs&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Dask at DOE Light Sources&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Dan Allan&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vRd8PVHjW7Umjo1rUjR7XWDT95CcEoE_3jH-ceDHsN_lMv_4M2qnlFiFvtMl9SX0Eb1EFQTGkzUWCDy/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="366" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;section id="d-e-shaw-group"&gt;
&lt;h2&gt;D.E. Shaw Group&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Dask at D.E. Shaw&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Akihiro Matsukawa&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/section&gt;
&lt;section id="id2"&gt;
&lt;h2&gt;Anaconda&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Dask Dataframes and Dask-ML summary&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Tom Augspurger&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vTs6nNsMkV92Uj4QUns1VB8pKlKSsRgUAGwvcbTOPqMazSAhxtawVNgb04YmHVFmb0z8-no-cdS8mE8/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="366" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;/section&gt;
&lt;section id="performance-and-tooling"&gt;
&lt;h1&gt;5: Performance and Tooling&lt;/h1&gt;
&lt;section id="berkeley-institute-for-data-science"&gt;
&lt;h2&gt;Berkeley Institute for Data Science&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Numpy APIs&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Sebastian Berg&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/section&gt;
&lt;section id="ursa-labs"&gt;
&lt;h2&gt;Ursa Labs&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Arrow&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Joris Van den Bossche&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vQY3ubjCFkMcU_b8p2xmuXN8VVR1BxxSWZDe5Vy-ftnH2CstZILvTo2pRBv5R_VDk85rNjVoWew2AJl/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="366" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;section id="nvidia"&gt;
&lt;h2&gt;NVIDIA&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: RAPIDS&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Keith Kraus&lt;/p&gt;&lt;/li&gt;
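&lt;li&gt;&lt;p&gt;Also attending:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Mike Beaumont&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Richard Zamora&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;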
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vQiNrupzQlSqsu95AAHqIhU1V_iVUav_0WlIp4dXdSE6Izze1BL8mkFbIzg7p8CndEi9bjWaC2OVlyu/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="366" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;section id="id3"&gt;
&lt;h2&gt;NVIDIA&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: UCX&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Ben Zaitlen&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vRU-vsXsnXgeLKdmtWZkZVV_-mOojsNesCbQKJgmWkwSjxj5ZdwkmS6X4tOt3HpFrIOfNROSlV_8l84/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="366" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;/section&gt;
&lt;section id="xarray"&gt;
&lt;h1&gt;6: Xarray&lt;/h1&gt;
&lt;section id="usgs-and-ncar"&gt;
&lt;h2&gt;USGS and NCAR&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Dask in Pangeo&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Rich Signell and Anderson Banihirwe&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vStqGiQy6pJDYhRgF-BZylQussINK5BGlhnidOVCUECo_ebYqRH9cSY4e-2z7BfFFvTfvkqq_M1jXBX/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="366" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;section id="lbnl"&gt;
&lt;h2&gt;LBNL&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Accelerating Experimental Science with Dask&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Matt Henderson&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a class="reference external" href="https://drive.google.com/file/d/1DVVzYmhkDhO2xs0tmxpPCkxx5c4o63bO/view"&gt;Slides&lt;/a&gt; - File too large to preview&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/section&gt;
&lt;section id="lanl"&gt;
&lt;h2&gt;LANL&lt;/h2&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Title: Seismic Analysis&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Presenters: Jonathan MacCarthy&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;iframe src="https://docs.google.com/presentation/d/e/2PACX-1vSWAgKLxt1tBZxXjQfIRQNFPvAMFYZ-z0hkMy7euPnOHwO9pomH_gM8cKUTKXA68w/embed?start=false&amp;loop=false&amp;delayms=3000" frameborder="0" width="600" height="404" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"&gt;&lt;/iframe&gt;
&lt;/section&gt;
&lt;/section&gt;
&lt;section id="unstructured-time"&gt;
&lt;h1&gt;Unstructured Time&lt;/h1&gt;
&lt;p&gt;Rapid-fire talks in the morning followed by unstructured time in the
afternoon proved to be a productive combination. Below you’ll see pictures of
geo-scientists and quants talking about the same challenges, and library
maintainers from Pandas/Arrow/RAPIDS/Dask all working together on joint
solutions.&lt;/p&gt;
&lt;p&gt;&lt;img src="https://pbs.twimg.com/media/ERykEc9XUAEFq-L?format=jpg&amp;name=large"
     width="40%"&gt;
&lt;img src="https://pbs.twimg.com/media/ERzEcEeXkAU35sg?format=jpg&amp;name=large"
    width="40%"&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src="https://pbs.twimg.com/media/ERyz7B5X0AIrDkn?format=jpg&amp;name=large"
    width="40%"&gt;
&lt;img src="https://pbs.twimg.com/media/ERzXhHnWAAE_zDA?format=jpg&amp;name=large"
    width="40%"&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src="https://pbs.twimg.com/media/ERz3GDgXsAcE6Id?format=jpg&amp;name=large"
    width="40%"&gt;
&lt;img src="https://pbs.twimg.com/media/ERz4ur2WkAAGJwm?format=jpg&amp;name=large"
    width="40%"&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src="https://pbs.twimg.com/media/ER0sZceUYAAF5fW?format=jpg&amp;name=large"
    width="40%"&gt;
&lt;img src="https://pbs.twimg.com/media/ER0yY2rX0AEFfXi?format=jpg&amp;name=large"
    width="40%"&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src="https://pbs.twimg.com/media/ERyz98YWAAAmJbE?format=jpg&amp;name=large"
    width="40%"&gt;
&lt;img src="https://pbs.twimg.com/media/ERz5S2dWoAEhFHc?format=jpg&amp;name=large"
    width="40%"&gt;&lt;/p&gt;
&lt;p&gt;We would recommend this mix of short talks and unstructured time to
other technically diverse groups in the future. Engagement and productivity were
high throughout the workshop.&lt;/p&gt;
&lt;/section&gt;
&lt;section id="final-thoughts"&gt;
&lt;h1&gt;Final Thoughts&lt;/h1&gt;
&lt;p&gt;Dask’s strength comes from this broad community of stakeholders.&lt;/p&gt;
&lt;p&gt;An early technical focus on simplicity and pragmatism allowed the project to be
quickly adopted within many different domains. As a result, the practitioners
within these domains are largely the ones driving the project forward today.
This Community Driven Development brings an incredible diversity of technical
and cultural challenges and experience that force the project to quickly evolve
in a way that is constrained towards pragmatism.&lt;/p&gt;
&lt;p&gt;There is still plenty of work to do.
In the short term, this workshop brought up many technical challenges that are
shared by all (simpler deployments, scheduling under task constraints, active
memory management). In the longer term, we need to welcome more people into this
community, increasing both the diversity of domains and the diversity of individuals
(the vast majority of attendees were white men in their thirties from the US
and western Europe).&lt;/p&gt;
&lt;p&gt;We’re in a good position to effect this change.
Dask’s recent growth has captured the attention of many different institutions.
Now is a critical time to be intentional about the project’s growth to make sure
that the project and community continue to reflect a broad and ethical set of
principles.&lt;/p&gt;
&lt;/section&gt;
&lt;section id="acknowledgements"&gt;
&lt;h1&gt;Acknowledgements&lt;/h1&gt;
&lt;section id="sponsors"&gt;
&lt;h2&gt;Sponsors&lt;/h2&gt;
&lt;p&gt;Without the support of our sponsors, this workshop would not have been possible.
Thanks to Anaconda, Capital One and NVIDIA for their support and generous
donations toward this event.&lt;/p&gt;
&lt;/section&gt;
&lt;section id="organizers"&gt;
&lt;h2&gt;Organizers&lt;/h2&gt;
&lt;p&gt;Thank you very much to the organizers who took time from their busy schedules
and worked so hard to put together this event.&lt;/p&gt;
&lt;ul class="simple"&gt;
&lt;li&gt;&lt;p&gt;Brittany Treadway (Capital One)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Keith Kraus (NVIDIA)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Matthew Rocklin (Coiled Computing)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Mike Beaumont (NVIDIA)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Mike McCarty (Capital One)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Neia Woodson (Capital One)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Jake Schmitt (Capital One)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Jim Crist (Anaconda)&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/section&gt;
&lt;/section&gt;
</content>
    <link href="https://blog.dask.org/2020/04/28/dask-summit/"/>
    <summary>In late February, members of the Dask community gathered in Washington, DC.
This was a mix of open source project maintainers
and active users from a broad range of institutions.
This post shares a summary of what happened at this workshop,
including slides, images, and lessons learned.</summary>
    <published>2020-04-28T00:00:00+00:00</published>
  </entry>
</feed>
