Tag: Data Science

Analyzing Large P Small N Data – Examples from Microbiome

Guest Post by Bill Shannon, Co-Founder and Managing Partner of BioRankings Introduction High throughput screening technologies have been developed to measure all the molecules of interest...

Bringing ML to Agriculture: Transforming a Millennia-old Industry

Guest post by Jeff Melching from The Climate Corporation At The Climate Corporation, we aim to help farmers better understand their operations and make better decisions...

The curse of Dimensionality

Guest Post by Bill Shannon, Founder and Managing Partner of BioRankings Danger of Big Data Big data is the rage. This could be lots of rows...

Why models fail to deliver value and what you can do about it.

Building models requires a lot of time and effort. Data scientists can spend weeks just trying to find, capture and transform data into decent features for...

The importance of structure, coding style, and refactoring in notebooks

Notebooks are increasingly crucial in the data scientist's toolbox. Although considered relatively new, their history traces back to systems like Mathematica and MATLAB. This form of...

Domino Paves the Way for the Future of Enterprise Data Science with Latest Release

Today, we announced the latest release of Domino’s data science platform which represents a big step forward for enterprise data science teams. We’re introducing groundbreaking new features –...

Evaluating Ray: Distributed Python for Massive Scalability

Dean Wampler provides a distilled overview of Ray, an open source system for scaling Python systems from single machines to large clusters. If you are interested...

Evaluating Generative Adversarial Networks (GANs)

This article provides concise insights into GANs to help data scientists and researchers assess whether to investigate GANs further. If you are interested in a tutorial...

Announcement: Domino is fully Kubernetes native

Last week we announced that Domino is now fully Kubernetes native. This is great news for data science teams and IT organizations building modern DS platforms,...

Themes and Conferences per Pacoid, Episode 13

Paco Nathan's latest article covers data practices from the National Oceanic and Atmospheric Administration (NOAA) Environment Data Management (EDM) workshop as well as updates from the...

Deep Learning Illustrated: Building Natural Language Processing Models

Many thanks to Addison-Wesley Professional for providing the permissions to excerpt "Natural Language Processing" from the book, Deep Learning Illustrated by Krohn, Beyleveld, and Bassens. The...

Data Ethics: Contesting Truth and Rearranging Power

This Domino Data Science Field Note covers Chris Wiggins's recent data ethics seminar at Berkeley. The article focuses on 1) proposed frameworks for defining and designing...

Data Science, Past & Future

Paco Nathan presented, "Data Science, Past & Future", at Rev. This blog post provides a concise session summary, a video, and a written transcript. Session Summary...

Next page