    About Manojit Nandi

    Density-Based Clustering

    Original content by Manojit Nandi, updated by Josh Poduska. Cluster analysis is an important problem in data analysis. Data scientists use...

    Recommender Systems through Collaborative Filtering

    This is a technical deep dive into the collaborative filtering algorithm and how to use it in practice. From Amazon recommending products you may be...

    Sampling Based Methods for Class Imbalance in Datasets

    Imagine you are a medical professional who is training a classifier to detect whether an individual has an extremely rare disease. You train your...

    A/B Testing with Hierarchical Models in Python

    In this post, I discuss a method for A/B testing using Beta-Binomial Hierarchical models to correct for a common pitfall when testing multiple...

    Faster Deep Learning with GPUs and Theano

    Domino recently added support for GPU instances. To celebrate this release, I will show you how to: Configure the Python library Theano to use the...

    Social Network Analysis with NetworkX

    Many types of real-world problems involve dependencies between records in the data. For example, sociologists are eager to understand how people...