Latest

Machine Learning Projects: Challenges and Best Practices

Lukas Biewald is the founder of Weights & Biases. He was previously the founder of Figure Eight (formerly CrowdFlower). This blog post provides insights into why machine learning teams have challenges with managing machine learning projects....

Themes and Conferences per Pacoid, Episode 6

In Paco Nathan's latest column, he explores the role of curiosity in data science work as well as Rev 2, an upcoming summit for...

Reflections on the Data Science Platform Market

Reflections Before we get too far into 2019, I wanted to take a brief moment to reflect on some of the changes we’ve...

Data Science vs Engineering: Tension Points

This blog post provides highlights and a full written transcript from the panel, “Data Science Versus Engineering: Does It Really Have To Be...

On the Importance of Community-Led Open Source

Wes McKinney, Director of Ursa Labs and creator of pandas project, presented the keynote, "Advancing Data Science Through Open Source" at Rev. McKinney's...

Code

Creating Multi-language Pipelines with Apache Spark or Avoid Having to Rewrite spaCy into Java

In this guest post, Holden Karau, Apache Spark Committer, provides insights on how to create multi-language pipelines with Apache Spark and avoid rewriting...

Data Scientist? Programmer? Are They Mutually Exclusive?

This Domino Data Science Field Note blog post provides highlights of Hadley Wickham’s ACM Chicago talk, “You Can’t Do Data Science in a GUI”....

Featured

Growing Data Scientists Into Manager Roles

In this post, Ricky Chachra, Research Science Manager at Lyft, provides insight for companies looking to home-grow their promising individual contributors (ICs) into...

Featured

Model Management and the Era of the Model-Driven Business

Over the past few years, we’ve seen a new community of data science leaders emerge. Regardless of their industry, we have heard three...

Practical Techniques

Learn from the Reproducibility Crisis in Science

Key highlights from Clare Gollnick’s talk, “The limits of inference: what data scientists can learn from the reproducibility crisis in science”, are covered...

Building a Domino Web App with Dash

Randi R. Ludwig, Data Scientist at Dell EMC and an organizer of Women in Data Science ATX, covers how to build a Domino...

Leaders at Work

Collaboration Between Data Science and Data Engineering: True or False?

This blog post includes candid insights about addressing tension points that arise when people collaborate on developing and deploying models. Domino’s Head of...

Data Science Use Cases

In this post, Don Miner covers how to identify, evaluate, prioritize, and pick which data science problems to work on next. Don is...

Model Interpretability with TCAV (Testing with Concept Activation Vectors)

This Domino Data Science Field Note provides very distilled insights and excerpts from Been Kim’s recent MLConf 2018 talk and research about Testing with...

Trust in LIME: Yes, No, Maybe So? 

In this Domino Data Science Field Note, we briefly discuss an algorithm and framework for generating explanations, LIME (Local Interpretable Model-Agnostic Explanations), that...

Classify all the Things (with Multiple Labels)

Derrick Higgins of American Family Insurance presented a talk, “Classify all the Things (with multiple labels): The most common type of modeling task no...

Put Models at the Core of Business Processes

At Rev, Nick Elprin, Domino's CEO, continued to provide insights on managing data science based upon years of candid discussions with customers. He...

Domino 3.0: New Features and User Experiences to Help the World Run on Models

This blog post introduces new Domino 3.0 features. Akansh Murthy is a Technical Product Manager at Domino and previously worked as a software...

Docker, but for Data

Aneesh Karve, Co-founder and CTO of Quilt, visited the Domino MeetUp to discuss the evolution of data infrastructure. This blog post provides a...