Skip to content
    Latest

    A Guide to Natural Language Processing for Text and Speech

    While humans have been using language since we arose, a complete understanding of language is a lifelong pursuit that often comes short, even for experts. To task computer...

    Data Science

    The Importance of Machine Learning Model Validation and How It Works

    Model valuation is a core component of developing machine learning or artificial intelligence (ML/AI). While it’s separate from training and...

    MATLAB for Data Science and Machine Learning

    The opportunities to solve problems with the use of data are greater than ever, and as different industries embrace them, the available data has been...

    Data Exploration with Pandas Profiler and D-Tale

    We all have heard how data is the new oil. I always say that if that is the case, we need to go through some refinement process before that raw oil...

    Explaining black-box models using attribute importance, PDPs, and LIME

    In this article we cover explainability for black-box models and show how to use different methods from the Skater framework to provide insights...
    Code

    Getting Started with Ray

    In this blog post we give a quick introduction to Ray. We talk about the architecture and execution model, and present some of Ray's core paradigms...

    Building a Named Entity Recognition model using a BiLSTM-CRF network

    In this blog post we present the Named Entity Recognition problem and show how a BiLSTM-CRF model can be fitted using a freely available annotated...
    Machine Learning

    A Detailed Guide To Transfer Learning and How It Works

    For data science teams working with inadequate data or too much data and not enough time or resources to process it, transfer learning can represent...

    Machine Learning Model Training: What It Is and Why It’s Important

    Training a machine learning (ML) model is a process in which a machine learning algorithm is fed with training data from which it can learn. ML...
    Practical Techniques

    Fundamentals of Signal Processing

    Basics of digital signal processing A signal is defined as any physical quantity that varies with time, space or any other independent...

    Accelerating model velocity through Snowflake Java UDF integration

    Over the next decade, the companies that will beat competitors will be “model-driven” businesses. These companies often undertake large data...
    Leaders at Work

    Defining clear metrics to drive model adoption and value creation

    One of the biggest ironies of enterprise data science is that although data science teams are masters at using probabilistic models and diagnostic...

    Evaluating Ray: Distributed Python for Massive Scalability

    Dean Wampler provides a distilled overview of Ray, an open source system for scaling Python systems from single machines to large clusters. If you...
    Model Management

    How To Deploy and Monitor Machine Learning Models

    Machine-learning (ML) deployment involves placing a working ML model into an environment where it can do the work it was designed to do. The process...

    The Role of Containers on MLOps and Model Production

    Container technology has changed the way data science gets done. The original container use case for data science focused on what I call,...

    Evaluating Generative Adversarial Networks (GANs)

    This article provides concise insights into GANs to help data scientists and researchers assess whether to investigate GANs further. If you are...
    Engineering

    Production Data Science: Delivering Models with R Markdown

    R Markdown is one of those indispensable tools in a data scientist’s toolbox that provides speed and flexibility with the last-mile problem of...

    Techniques for Collecting, Prepping, and Plotting Data: Predicting Social Media-Influence in the NBA

    This article provides insight on the mindset, approach, and tools to consider when solving a real-world ML problem. It covers questions to consider...