Latest

On Ingesting Kate Crawford’s “The Trouble with Bias”

Kate Crawford discussed bias at a recent SF-based City Arts and Lectures talk and a recording of the discussion will be broadcast, May 6th, on KQED and...

Data Science is more than Machine Learning 

This Domino Data Science Field Note provides highlights and video clips from Addhyan Pandey’s Domino Data Pop-Up talk, “Leveraging Data Science in the Automotive Industry”. Addhyan...

Data Scientist? Programmer? Are They Mutually Exclusive?

This Domino Data Science Field Note blog post provides highlights of Hadley Wickham’s ACM Chicago talk, “You Can’t Do Data Science in a GUI”. In his talk,...

The Machine Learning Reproducibility Crisis

Pete Warden is the Technical Lead on the TensorFlow Mobile Embedded Team at Google doing Deep Learning. He is formerly the CTO of Jetpac, which was...

0.05 is an Arbitrary Cut Off: “Turning Fails into Wins”

Grace Tang, Data Scientist at Uber, presented insights, common pitfalls, and “best practices to ensure all experiments are useful” in her Strata Singapore session, “Turning Fails...

Data Science Use Cases

In this post, Don Miner covers how to identify, evaluate, prioritize, and pick which data science problems to work on next. Don is a cofounder of...

Building a Domino Web App with Dash

Randi R. Ludwig, Data Scientist at Dell EMC and an organizer of Women in Data Science ATX, covers how to build a Domino web app with...

Racial Bias in Policing: An Analysis of Illinois Traffic Stop Data

Mollie Pettit, Data Scientist and D3.js Data Visualization Instructor with Metis, walks data scientists through analysis of Illinois police traffic stop data, presenting a story narrative...

Data Quality Analytics

Scott Murdoch, PhD, Director of Data Science at HealthJoy, presents how data scientists can use distribution and modeling techniques to understand the pitfalls in their data...

Intel’s Python Distribution is Smoking Fast, and Now it’s in Domino

Domino just finished benchmarking Intel’s Python Distribution, and it is fast, very fast. Intel’s Python distribution is available for use in Domino. Intel’s Python Distribution People...

Reproducible Machine Learning with Jupyter and Quilt

In this guest blog post, Aneesh Karve, Co-founder and CTO of Quilt, demonstrates how Quilt works in conjunction with Domino's Reproducibility Engine to make Jupyter notebooks...

Summertime Analytics: Predicting E. Coli and West Nile Virus

Gene Leynes (Senior Data Scientist) and Nick Lucius (Advanced Analytics) from the City of Chicago discussed two predictive analytics projects that forecasted potential risk involved with...

Using Bayesian Methods to Clean Up Human Labels

Derrick Higgins, AmFam Data Science & Analytics, discusses how Bayesian methods can be applied to improve the quality of annotated training sets. Session Summary Derrick Higgins,...