Domino Data Science Blog

Data Science Trends, Tools, and Best Practices

Posts tagged with:   R

Data Scientist? Programmer? Are They Mutually Exclusive?

This Domino Data Science Field Note blog post provides highlights of Hadley Wickham’s ACM Chicago talk, “You Can’t Do Data Science in a GUI”....read more

,   ,   ,   

Summertime Analytics: Predicting E. Coli and West Nile Virus

Gene Leynes (Senior Data Scientist) and Nick Lucius (Advanced Analytics) from the City of Chicago discussed two predictive analytics projects that forecasted potential...read more

,   

Model Deployment Powered by Kubernetes

In this article we explain how we’re using Kubernetes to enable data scientists to deploy predictive models as production-grade APIs. Background Domino lets...read more

,   ,   ,   ,   

Horizontal Scaling for Parallel Experimentation

The amount of time data scientists spend waiting for experiment results is the difference between making incremental improvements and making significant advances. With...read more

,   ,   ,   ,   ,   

Multicore Data Science with R and Python

This article is an excerpt from the full video on Multicore Data Science in R and Python. Watch the full video to learn...read more

,   ,   

Using Monte Carlo Simulations in R to Test Methodological Advances in Social Policy Research

This is a guest post written by Kristin Porter, Senior Research Associate at MDRC. MDRC is a nonprofit, nonpartisan education and social policy...read more

,   ,   ,   

R vs. Python for Data Science

While the elections are over, some debates continue. R and Python are both popular programming languages for data scientists. Each has its advantages...read more

,   

A Quick Benchmark of Hashtable Implementations in R

UPDATE: I am humbled and thankful to have had so much feedback on this post! It started out as a quick and dirty...read more

,   

High-performance Computing with Amazon’s X1 Instance – Part II

When you have at your disposal 128 cores and 2TB of RAM, it’s hard not to experiment and attempt to find ways to...read more

,   ,   ,   ,   ,   ,   ,   

A Summary of Using k-NN in Production

This week, Domino’s Chief Data Scientist, Eduardo Ariño de la Rubia, presented a webinar: An Introduction to Using k-NN in Production. If you...read more

,   ,   ,   ,   

Join Us: An Introduction to Using k-NN in Production

Join us next Wednesday, October 5 for a webinar hosted by our Chief Data Scientist covering best practices for using k-NN in production....read more

,   ,   ,   ,   

An Introduction to Model-Based Machine Learning

This guest post was written by Daniel Emaasit, a Ph.D Student of Transportation Engineering at the University of Nevada, Las Vegas. Daniel's research...read more

,   ,   ,   

Providing Digital Provenance: from Modeling through Production

At last week's useR! R User conference, I spoke on digital provenance, the importance of reproducible research, and how Domino has solved many...read more

,   ,   ,   ,   

Announcing Enhanced Apache Spark Support

Domino now offers data scientists a simple, yet incredibly powerful way to conduct quantitative work using Apache Spark. Apache Spark has captured the...read more

,   ,   ,   ,   ,   

Ugly Little Bits of the Data Science Process

This morning there was a great conversation on Twitter, kicked off by Hadley Wickham, about one of the ugly little bits of the...read more

,   ,   ,   

Building and Delivering Risk Models to Global Insurance Companies

We’re excited to share our latest customer case study, about how KatRisk, a leading catastrophe risk modeling firm, used Domino to deploy its...read more

,   ,   ,   ,   

Using R and Python for Common SAS Functions

SAS is the recognized incumbent in the analytics, statistics and data science tool space. As the software celebrates its 50th birthday this year,...read more

,   ,   

The R Data I/O Shootout

We pit newcomer R data I/O package, feather, against popular packages data.table, readr, and the venerable saveRDS/writeRDS functions from base R. While feather...read more

,   

Visualizing Machine Learning with Plotly and Domino

This post was contributed by Chelsea Douglas, a Software Engineer at Plotly. Want to play with the code from this post? Visit the...read more

,   ,   

Open Source Winning Against Proprietary Data Science Vendors

With the recent publication of Gartner’s Magic Quadrant for Advanced Analytics, we wanted to know how proprietary data science software vendors were faring...read more

,   ,   ,   ,   ,   

Genomic Ranges: An Introduction to Working with Genomic Data

To view the code that generated this blog post, check out Jack Fu's project here on Domino Data Lab. This is a guest...read more

,   

Visual Logic Authoring vs Code

At some point in their careers, almost every data scientist has written code to perform a series of steps, and thought, “It would...read more

,   ,   

R in Ecology

This is a guest post from Auriel Fournier, a PhD Candidate with the Arkansas Cooperative Fish and Wildlife Research Unit at the University...read more

,   

Applied Spatial Data Science with R

Introduction I recently started working on my Ph.D dissertation which utilizes a vast amount of different spatial data types. During the process, I...read more

,   

Predicting winners of the Rugby World Cup

This a guest post by Arnu Pretorius For the sake of brevity, not all the relevant data and code are displayed in this...read more

,   

Better interactive data science with Beaker and Rodeo

Domino has offered support for IPython/Jupyter for a while, but we recently added support for two newer, up-and-coming tools for interactive data science:...read more

,   

Optimizing Chicago’s Services with the Power of Analytics

In an effort to reduce the public’s exposure to food-borne illness, the City of Chicago partnered with Allstate’s Quantitative Research & Analytics department...read more

,   

Geographic visualization with R’s ggmap

Have you ever crunched some numbers on data that involved spatial locations? If the answer is no, then boy are you missing out!...read more

,   

Deep Learning with h2o.ai

This post provides a brief history lesson and overview of deep learning, coupled with a quick "how to" guide for dipping your toes...read more

,   ,   ,   

Comparing Python and R for Data Science

A guest post by Martijn Theuwissen from DataCamp Both Python and R are popular open source languages for performing data science tasks. As...read more

,   
Next page