Providing Digital Provenance: from Modeling through Production

At last week's useR! R User conference, I spoke on digital provenance, the importance of reproducible research, and how Domino has solved many of the challenges...

Announcing Enhanced Apache Spark Support

Domino now offers data scientists a simple, yet incredibly powerful way to conduct quantitative work using Apache Spark. Apache Spark has captured the hearts and minds...

Ugly Little Bits of the Data Science Process

This morning there was a great conversation on Twitter, kicked off by Hadley Wickham, about one of the ugly little bits of the data science process....

The R Data I/O Shootout

We pit newcomer R data I/O package, feather, against popular packages data.table, readr, and the venerable saveRDS/writeRDS functions from base R. While feather fared well, it...

Building a High-Throughput Data Science Machine

Insights on process and culture from The Climate Corporation’s Erik Andrejko This post was originally published on the O'Reilly Radar blog. Scaling is hard. Scaling data...

Genomic Ranges: An Introduction to Working with Genomic Data

To view the code that generated this blog post, check out Jack Fu's project here on Domino Data Lab. This is a guest blog post by...