Domino Data Science Blog

Data Science Trends, Tools, and Best Practices

Posts tagged with:   Reproducibility

Data Scientist? Programmer? Are They Mutually Exclusive?

This Domino Data Science Field Note blog post provides highlights of Hadley Wickham’s ACM Chicago talk, “You Can’t Do Data Science in a GUI” more

,   ,   ,   

The Machine Learning Reproducibility Crisis

Pete Warden is the Technical Lead on the TensorFlow Mobile Embedded Team at Google doing Deep Learning. He is formerly the CTO more

,   ,   

Managing Data Science as a Capability

Nick Elprin, CEO at Domino, presented a 3-hour training workshop, “Managing Data Science in the Enterprise”, that provided practical insights and interactive more

,   ,   

0.05 is an Arbitrary Cut Off: “Turning Fails into Wins”

Grace Tang, Data Scientist at Uber, presented insights, common pitfalls, and “best practices to ensure all experiments are useful” in her Strata more

,   ,   ,   ,   ,   

Reproducible Machine Learning with Jupyter and Quilt

In this guest blog post, Aneesh Karve, Co-founder and CTO of Quilt, demonstrates how Quilt works in conjunction with Domino's Reproducibility Engine more

,   ,   ,   ,   ,   

Reproducible dashboards and other great things to do with Jupyter

Mac Rogers, Research Engineer at Domino, presented best practices for creating Jupyter dashboards at a recent Domino Data Science PopUp. Session Summary more

,   ,   ,   

Principles of Collaboration in Data Science

Data science is no longer a specialization of a single person or small group. It is now a key source of competitive advantage, more

,   ,   ,   ,   ,   

Achieving Reproducibility with Conda and Domino Environments

Managing “environments” (i.e., the set of packages, configuration, etc.) is a critical capability of any Data Science Platform. Not only does environment more

,   ,   ,   ,   ,   ,   

Domino raises $10.5M in funding for collaborative, reproducible data science

Today we’re announcing that we have raised $10.5 million in a funding round led by Sequoia Capital. For us, fundraising is simply more

,   ,   

Reproducible Research in Computational Sciences

This guest post was written by Arnu Pretorius, a Masters student in Mathematical Statistics at the MIH Media Lab, Stellenbosch University. Arnu's more

,   ,   ,   ,   

Providing Digital Provenance: from Modeling through Production

At last week's useR! R User conference, I spoke on digital provenance, the importance of reproducible research, and how Domino has solved more

,   ,   ,   ,   

Building a High-Throughput Data Science Machine

Insights on process and culture from The Climate Corporation’s Erik Andrejko This post was originally published on the O'Reilly Radar blog. Scaling more