Latest

Intel’s Python Distribution is Smoking Fast, and Now it’s in Domino

Domino just finished benchmarking Intel’s Python Distribution, and it is fast, very fast. Intel’s Python distribution is available for use in Domino. Intel’s Python Distribution People...

Reproducible Machine Learning with Jupyter and Quilt

In this guest blog post, Aneesh Karve, Co-founder and CTO of Quilt, demonstrates how Quilt works in conjunction with Domino's Reproducibility Engine to make Jupyter notebooks...

Reproducible Dashboards and Other Great Things to do with Jupyter

Mac Rogers, Research Engineer at Domino, presented best practices for creating Jupyter dashboards at a recent Domino Data Science Pop-Up. Session Summary: In this Data Science...

Taking the Course: Practical Deep Learning for Coders

In this blog post, Lisa Green, Head of Domino for Good, describes the content, value, and experience of taking Lesson 1 of the Practical Deep Learning...

Multicore Data Science with R and Python

This article is an excerpt from the full video on Multicore Data Science in R and Python. Watch the full video to learn how to leverage...

Fitting Gaussian Process Models in Python

Written by Chris Fonnesbeck, Assistant Professor of Biostatistics, Vanderbilt University Medical Center. You can view, fork, and play with this project on the Domino data...

Using Monte Carlo Simulations in R to Test Methodological Advances in Social Policy Research

This is a guest post written by Kristin Porter, Senior Research Associate at MDRC. MDRC is a nonprofit, nonpartisan education and social policy research organization dedicated...

Python 3.6 with Domino in Minutes

For Pythonistas like me, the holidays started a little early with today's release of Python 3.6. In case you haven't heard, Python 3.6 has a number...

Python for SAS Users: The pandas Data Analysis Library

This post is a chapter from Randy Betancourt's Python for SAS Users quick start guide. Randy wrote this guide to familiarize SAS users with Python and...

R vs. Python for Data Science

While the elections are over, some debates continue. R and Python are both popular programming languages for data scientists. Each has its advantages for performing data...

A Quick Benchmark of Hashtable Implementations in R

UPDATE: I am humbled and thankful to have had so much feedback on this post! It started out as a quick and dirty benchmark but I...

A Summary of Using k-NN in Production

This week, Domino’s Chief Data Scientist, Eduardo Ariño de la Rubia, presented a webinar: An Introduction to Using k-NN in Production. If you missed the live...

nteract: An open-source, desktop-based, interactive computing application

Today, along with our partners at Plot.ly, we're announcing a significant investment in the open-source nteract project. nteract is a desktop-based, interactive computing application. At Domino,...