    Latest

    Production Data Science: Delivering Models with R Markdown

    R Markdown is one of those indispensable tools in a data scientist’s toolbox that provides speed and flexibility with the last-mile problem of getting your work into...

    Getting Data with Beautiful Soup

    Data is all around us, from the spreadsheets we analyse on a daily basis, to the weather forecast we rely on every morning or the webpages we read....

    Accelerating model velocity through Snowflake Java UDF integration

    Over the next decade, the companies that will beat competitors will be “model-driven” businesses. These companies often undertake large data...

    The Curse of Dimensionality

    Guest Post by Bill Shannon, Founder and Managing Partner of BioRankings. Danger of Big Data: Big data is the rage. This could be lots of rows (samples)...

    Themes and Conferences per Pacoid, Episode 7

    Paco Nathan covers recent research on data infrastructure as well as adoption of machine learning and AI in the enterprise. Introduction: Welcome back...

    Creating Multi-language Pipelines with Apache Spark or Avoid Having to Rewrite spaCy into Java

    In this guest post, Holden Karau, Apache Spark Committer, provides insights on how to create multi-language pipelines with Apache Spark and avoid...

    Docker, but for Data

    Aneesh Karve, Co-founder and CTO of Quilt, visited the Domino MeetUp to discuss the evolution of data infrastructure. This blog post provides a...

    New G3 Instances in AWS - Worth it for Machine Learning?

    We benchmarked AWS’s new G3 instances for deep learning tasks and found they significantly outperform the older P2 instances. The new G3 instances...

    Data Scientists are Analysts are Software Engineers

    In this Data Science Popup session, W. Whipple Neely, Director of Data Science at Electronic Arts, explains why data scientists have responsibilities...

    Building a Model is the Least Important Part of Your Job

    In this Data Science Popup session, Kimberly Shenk, Director of Data Science Solutions at Domino Data Lab, explains why building models is the least...