Original post

Joe Doliner joined the show to talk about managing data lakes with Pachyderm, data containers, provenance, and other interesting projects and news.

Discuss on Changelog News

Join Changelog++ to support our work, get closer to the metal, and make the ads disappear!


  • Linode – Our cloud server of choice. Get one of the fastest, most efficient SSD cloud servers for only $5/mo. Use the code changelog2017 to get 4 months free!

  • Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform.

  • Toptal – Scale your team and hire from the top 3% of developers and designers with Toptal. Email adam@changelog.com for a personal introduction.

  • Backtrace – Reduce your time to resolution. Go beyond stacktraces and logs. Get to the root cause quickly with deep application introspection at your fingertips.


Notes and Links


Let’s build a modern Hadoop

Putting the science back in data science

Martin Fowler – DataLake

Wikipedia: Data Lake

Provenance: the Missing Feature for Rigorous Data Science. Now in Pachyderm 1.1

xkcd: Who were you DenverCoder9? What did you see?!

Pachyderm Users Slack Channel

Interesting Go Projects and News

GitLab.com Database Incident – 2017/01/31

Changelog Spotlight #8: Conversational Development and Controversy with Sid Sijbrandij

Wuzz (visual cURL)

Ozzo Validation

dep 101 – I Can Haz Downtime?

The State of Go – February 2017

Free Software Friday!

Each week on the show we give a shout out to an open source project or community that’s made an impact in our day to day developer lives.