Tag: Python

Fitting Support Vector Machines via Quadratic Programming

In this blog post we take a deep dive into the internals of Support Vector Machines. We derive a Linear SVM classifier, explain its advantages, and...

Credit Card Fraud Detection using XGBoost, SMOTE, and threshold moving

In this article, we'll discuss the challenge organizations face around fraud detection, how machine learning can be used to identify and spot anomalies that the human...

Ray for Data Science: Distributed Python tasks at scale

Editors Note: This article was originally posted on Patterson Consulting's blog and can be found at http://www.pattersonconsultingtn.com/blog/blog_index.html and has been republished with permission. Why Do We...

Enterprise-class NLP with spaCy v3

spaCy is a python library that provides capabilities to conduct advanced natural language processing analysis and build models that can underpin document analysis, chatbot capabilities, and...

How to supercharge data exploration with Pandas Profiling

Producing insights from raw data is a time-consuming process. Predictive modeling efforts rely on dataset profiles, whether consisting of summary statistics or descriptive charts. Pandas Profiling,...

PyCaret 2.2: Efficient Pipelines for Model Development

Data science is an exciting field, but it can be intimidating to get started, especially for those new to coding.  Even for experienced developers and data...

Density-Based Clustering

Original content by Manojit Nandi - Updated by Josh Poduska. Cluster Analysis is an important problem in data analysis. Data scientists use clustering to identify malfunctioning...

Evaluating Ray: Distributed Python for Massive Scalability

Dean Wampler provides a distilled overview of Ray, an open source system for scaling Python systems from single machines to large clusters. If you are interested...

Data Drift Detection for Image Classifiers

This article covers how to detect data drift for models that ingest image data as their input in order to prevent their silent degradation in production....

Techniques for Collecting, Prepping, and Plotting Data: Predicting Social Media-Influence in the NBA

This article provides insight on the mindset, approach, and tools to consider when solving a real-world ML problem. It covers questions to consider as well as...

Understanding Causal Inference

This article covers causal relationships and includes a chapter excerpt from the book Machine Learning in Production: Developing and Optimizing Data Science Workflows and Applications by...

Exploring US Real Estate Values with Python

This post covers data exploration using machine learning and interactive plotting. If interested in running the examples, there is a complementary Domino project available. Introduction Models...

Natural Language in Python using spaCy: An Introduction

This article provides a brief introduction to natural language using spaCy and related libraries in Python. The complementary Domino project is also available. Introduction This article...

Next page