Category: machine-learning

Do You Really Need a Foundation Model?

Do You Really Need a Foundation Model? LLM or custom model: how should you choose the right solution? The post Do You Really Need a Foundation Model? appeared first on Towards Data Science. Vincent Vandenbussche Go to original source

July 16, 2025
How to Ensure Reliability in LLM Applications

How to Ensure Reliability in LLM Applications Learn how to make your LLM applications more robust The post How to Ensure Reliability in LLM Applications appeared first on Towards Data Science. Eivind Kjosbakken Go to original source

July 16, 2025
From Equal Weights to Smart Weights: OTPO’s Approach to Better LLM Alignment

From Equal Weights to Smart Weights: OTPO’s Approach to Better LLM Alignment Using optimal transport to weight what matters most In LLM-generated responses The post From Equal Weights to Smart Weights: OTPO’s Approach to Better LLM Alignment appeared first on Towards Data Science. Sudheer Singh Go to original source

July 16, 2025
Topic Model Labelling with LLMs

Topic Model Labelling with LLMs Python tutorial for reproducible labeling of cutting-edge topic models with GPT4-o-mini. The post Topic Model Labelling with LLMs appeared first on Towards Data Science. Petr Koráb Go to original source

July 15, 2025
There and Back Again: An AI Career Journey

There and Back Again: An AI Career Journey A full circle moment 30 years in the making The post There and Back Again: An AI Career Journey appeared first on Towards Data Science. David Martin Go to original source

July 15, 2025
Are You Being Unfair to LLMs?

Are You Being Unfair to LLMs? They may deserve better. The post Are You Being Unfair to LLMs? appeared first on Towards Data Science. Julian Mendel Go to original source

July 12, 2025
Reducing Time to Value for Data Science Projects: Part 3

Reducing Time to Value for Data Science Projects: Part 3 Setting up a robust experimentation process The post Reducing Time to Value for Data Science Projects: Part 3 appeared first on Towards Data Science. Kristopher McGlinchey Go to original source

July 11, 2025
The Crucial Role of NUMA Awareness in High-Performance Deep Learning

The Crucial Role of NUMA Awareness in High-Performance Deep Learning PyTorch model performance analysis and optimization — Part 10 The post The Crucial Role of NUMA Awareness in High-Performance Deep Learning appeared first on Towards Data Science. Chaim Rand Go to original source

July 10, 2025
Recap of all types of LLM Agents

Recap of all types of LLM Agents Regular, ReAct, Chain-of-Thought, Reflexion, ToT, GoT, PoT The post Recap of all types of LLM Agents appeared first on Towards Data Science. Mauro Di Pietro Go to original source

July 10, 2025
How to Perform Effective Data Cleaning for Machine Learning

How to Perform Effective Data Cleaning for Machine Learning Learn how you can improve your machine learning models using effective data cleaning The post How to Perform Effective Data Cleaning for Machine Learning appeared first on Towards Data Science. Eivind Kjosbakken Go to original source

July 10, 2025
Build Interactive Machine Learning Apps with Gradio

Build Interactive Machine Learning Apps with Gradio Create a fun text-to-speech demo in minutes The post Build Interactive Machine Learning Apps with Gradio appeared first on Towards Data Science. Ehssan Khan Go to original source

July 9, 2025
Microsoft’s Revolutionary Diagnostic Medical AI, Explained

Microsoft’s Revolutionary Diagnostic Medical AI, Explained Microsoft’s latest paper discusses a path to medical superintelligence. How close are we, really? The post Microsoft’s Revolutionary Diagnostic Medical AI, Explained appeared first on Towards Data Science. Ryan D’Cunha Go to original source

July 9, 2025
The Five-Second Fingerprint: Inside Shazam’s Instant Song ID

The Five-Second Fingerprint: Inside Shazam’s Instant Song ID How Shazam recognizes songs in seconds The post The Five-Second Fingerprint: Inside Shazam’s Instant Song ID appeared first on Towards Data Science. Ashton Gribble Go to original source

July 8, 2025
Your Personal Analytics Toolbox

Your Personal Analytics Toolbox Leveraging MCP for automating your daily routine The post Your Personal Analytics Toolbox appeared first on Towards Data Science. Mariya Mansurova Go to original source

July 8, 2025
Build Algorithm-Agnostic ML Pipelines in a Breeze

Build Algorithm-Agnostic ML Pipelines in a Breeze The framework is now an open-source Python package for streamlined ML workflows The post Build Algorithm-Agnostic ML Pipelines in a Breeze appeared first on Towards Data Science. Mena Wang Go to original source

July 8, 2025
My Honest Advice for Aspiring Machine Learning Engineers

My Honest Advice for Aspiring Machine Learning Engineers What it really takes to become a machine learning engineer The post My Honest Advice for Aspiring Machine Learning Engineers appeared first on Towards Data Science. Egor Howell Go to original source

July 5, 2025
Explainable Anomaly Detection with RuleFit: An Intuitive Guide

Explainable Anomaly Detection with RuleFit: An Intuitive Guide Creating interpretable rules to characterize the identified anomalies The post Explainable Anomaly Detection with RuleFit: An Intuitive Guide appeared first on Towards Data Science. Shuai Guo Go to original source

July 5, 2025
Taking ResNet to the Next Level

Taking ResNet to the Next Level Understanding how ResNeXt improves upon ResNet, with a comprehensive PyTorch implementation guide The post Taking ResNet to the Next Level appeared first on Towards Data Science. Muhammad Ardi Go to original source

July 3, 2025
STOP Building Useless ML Projects – What Actually Works

STOP Building Useless ML Projects – What Actually Works How to find machine learning projects that will get you hired. The post STOP Building Useless ML Projects – What Actually Works appeared first on Towards Data Science. Egor Howell Go to original source

July 2, 2025
Revisiting Benchmarking of Tabular Reinforcement Learning Methods

Revisiting Benchmarking of Tabular Reinforcement Learning Methods Introducing a modular framework and improving model performance. The post Revisiting Benchmarking of Tabular Reinforcement Learning Methods appeared first on Towards Data Science. Oliver S Go to original source

July 2, 2025
Prescriptive Modeling Makes Causal Bets – Whether You Know it or Not!

Prescriptive Modeling Makes Causal Bets – Whether You Know it or Not! An explanation of the causal assumption implicit in prescriptive modeling and how to satisfy it. The post Prescriptive Modeling Makes Causal Bets – Whether You Know it or Not! appeared first on Towards Data Science. Jarom Hulet Go to original source

July 1, 2025
Lessons Learned After 6.5 Years Of Machine Learning

Lessons Learned After 6.5 Years Of Machine Learning Deep work, trends, data, and research The post Lessons Learned After 6.5 Years Of Machine Learning appeared first on Towards Data Science. Pascal Janetzky Go to original source

July 1, 2025
Become a Better Data Scientist with These Prompt Engineering Tips and Tricks

Become a Better Data Scientist with These Prompt Engineering Tips and Tricks Part 1: prompt engineering for planning, cleaning, and EDA The post Become a Better Data Scientist with These Prompt Engineering Tips and Tricks appeared first on Towards Data Science. Sara Nobrega Go to original source

July 1, 2025
A Caching Strategy for Identifying Bottlenecks on the Data Input Pipeline

A Caching Strategy for Identifying Bottlenecks on the Data Input Pipeline PyTorch model performance analysis and optimization — Part 8 The post A Caching Strategy for Identifying Bottlenecks on the Data Input Pipeline appeared first on Towards Data Science. Chaim Rand Go to original source

June 27, 2025
Pipelining AI/ML Training Workloads with CUDA Streams

Pipelining AI/ML Training Workloads with CUDA Streams PyTorch Model Performance Analysis and Optimization — Part 9 The post Pipelining AI/ML Training Workloads with CUDA Streams appeared first on Towards Data Science. Chaim Rand Go to original source

June 27, 2025
Use OpenAI Whisper for Automated Transcriptions

Use OpenAI Whisper for Automated Transcriptions Streamline your computer interactions using OpenAI’s Whisper model The post Use OpenAI Whisper for Automated Transcriptions appeared first on Towards Data Science. Eivind Kjosbakken Go to original source

June 26, 2025
Data Has No Moat!

Data Has No Moat! Only if you ignore data quality The post Data Has No Moat! appeared first on Towards Data Science. Fabiana Clemente Go to original source

June 25, 2025
Reinforcement Learning from Human Feedback, Explained Simply

Reinforcement Learning from Human Feedback, Explained Simply The one technique that made ChatGPT so smart The post Reinforcement Learning from Human Feedback, Explained Simply appeared first on Towards Data Science. Vyacheslav Efimov Go to original source

June 24, 2025
Programming, Not Prompting: A Hands-On Guide to DSPy

Programming, Not Prompting: A Hands-On Guide to DSPy A practical deep dive into declarative AI programming The post Programming, Not Prompting: A Hands-On Guide to DSPy appeared first on Towards Data Science. Mariya Mansurova Go to original source

June 24, 2025
Beyond Model Stacking: The Architecture Principles That Make Multimodal AI Systems Work

Beyond Model Stacking: The Architecture Principles That Make Multimodal AI Systems Work Transforming Independent Models into Collaborative Intelligence The post Beyond Model Stacking: The Architecture Principles That Make Multimodal AI Systems Work appeared first on Towards Data Science. Eric Chung Go to original source

June 20, 2025
LLM-as-a-Judge: A Practical Guide

LLM-as-a-Judge: A Practical Guide How to Scale LLM Evaluations Beyond Manual Review The post LLM-as-a-Judge: A Practical Guide appeared first on Towards Data Science. Shuai Guo Go to original source

June 20, 2025
What PyTorch Really Means by a Leaf Tensor and Its Grad

What PyTorch Really Means by a Leaf Tensor and Its Grad The secret life of leaves, gradients, and the mighty requires_grad flag The post What PyTorch Really Means by a Leaf Tensor and Its Grad appeared first on Towards Data Science. Maciej J. Mikulski Go to original source

June 20, 2025
Computer Vision’s Annotation Bottleneck Is Finally Breaking

Computer Vision’s Annotation Bottleneck Is Finally Breaking A Technical Deep Dive into Auto-Labeling The post Computer Vision’s Annotation Bottleneck Is Finally Breaking appeared first on Towards Data Science. TDS Brand Studio Go to original source

June 19, 2025
LLaVA on a Budget: Multimodal AI with Limited Resources

LLaVA on a Budget: Multimodal AI with Limited Resources Let’s get started with multimodality The post LLaVA on a Budget: Multimodal AI with Limited Resources appeared first on Towards Data Science. Marcello Politi Go to original source

June 18, 2025
Grad-CAM from Scratch with PyTorch Hooks

Grad-CAM from Scratch with PyTorch Hooks A hands-on look at an explainable AI (XAI) technique that helps reveal why a convolutional neural network (CNN) made a particular decision The post Grad-CAM from Scratch with PyTorch Hooks appeared first on Towards Data Science. Conor O’Sullivan Go to original source

June 17, 2025
Let’s Analyze OpenAI’s Claims About ChatGPT Energy Use

Let’s Analyze OpenAI’s Claims About ChatGPT Energy Use ChatGPT uses an average of 0.34 Wh per query, according to a blog post by Sam Altman. Does that figure hold up? The post Let’s Analyze OpenAI’s Claims About ChatGPT Energy Use appeared first on Towards Data Science. Kasper Groes Albin Ludvigsen Go to original source

June 17, 2025
Regularisation: A Deep Dive into Theory, Implementation, and Practical Insights

Regularisation: A Deep Dive into Theory, Implementation, and Practical Insights A detailed guide on controlling overfitting and increasing the stability of your models. The post Regularisation: A Deep Dive into Theory, Implementation, and Practical Insights appeared first on Towards Data Science. Sourav Mohile Go to original source

June 17, 2025
AI Is Not a Black Box (Relatively Speaking)

AI Is Not a Black Box (Relatively Speaking) Compared to the opacity around human intelligence, AI is more transparent in some very tangible ways. The post AI Is Not a Black Box (Relatively Speaking) appeared first on Towards Data Science. Piotr (Peter) Mardziel Go to original source

June 14, 2025
Connecting the Dots for Better Movie Recommendations

Connecting the Dots for Better Movie Recommendations Connecting the Dots for Better Movie Recommendations: Lightweight graph RAG on Rotten Tomatoes movie reviews The post Connecting the Dots for Better Movie Recommendations appeared first on Towards Data Science. Brian Godsey Go to original source

June 13, 2025
Design Smarter Prompts and Boost Your LLM Output: Real Tricks from an AI Engineer’s Toolbox

Design Smarter Prompts and Boost Your LLM Output: Real Tricks from an AI Engineer’s Toolbox Not just what you ask, but how you ask it. Practical techniques for prompt engineering that deliver The post Design Smarter Prompts and Boost Your LLM Output: Real Tricks from an AI Engineer’s Toolbox appeared first on Towards Data Science. Ugo Pradère…

June 13, 2025
Audio Spectrogram Transformers Beyond the Lab

Audio Spectrogram Transformers Beyond the Lab A recipe for building a portable soundscape monitoring app with AudioMoth, Raspberry Pi, and a decent dose of deep learning. The post Audio Spectrogram Transformers Beyond the Lab appeared first on Towards Data Science. Maciej Adamiak Go to original source

June 11, 2025
Automate Models Training: An MLOps Pipeline with Tekton and Buildpacks

Automate Models Training: An MLOps Pipeline with Tekton and Buildpacks A step-by-step guide to containerizing and orchestrating an ML training workflow without the Dockerfile headache, using a lightweight GPT-2 example. The post Automate Models Training: An MLOps Pipeline with Tekton and Buildpacks appeared first on Towards Data Science. Sylvain Kalache Go to original source

June 11, 2025
10,000x Faster Bayesian Inference: Multi-GPU SVI vs. Traditional MCMC

10,000x Faster Bayesian Inference: Multi-GPU SVI vs. Traditional MCMC Using GPU acceleration to speed up Bayesian Inference from months to minutes… The post 10,000x Faster Bayesian Inference: Multi-GPU SVI vs. Traditional MCMC appeared first on Towards Data Science. Derek Tran Go to original source

June 11, 2025
How to Transition From Data Analyst to Data Scientist

How to Transition From Data Analyst to Data Scientist Playbook on how data analysts can become data scientists The post How to Transition From Data Analyst to Data Scientist appeared first on Towards Data Science. Egor Howell Go to original source

June 10, 2025
Prescriptive Modeling Unpacked: A Complete Guide to Intervention With Bayesian Modeling.

Prescriptive Modeling Unpacked: A Complete Guide to Intervention With Bayesian Modeling. Learn how to move beyond prediction and actively make intervention through prescriptive modeling. This in-depth guide walks you through Bayesian approaches to system intervention, with practical examples in predictive maintenance. The post Prescriptive Modeling Unpacked: A Complete Guide to Intervention With Bayesian Modeling. appeared…

June 7, 2025
How I Automated My Machine Learning Workflow with Just 10 Lines of Python

How I Automated My Machine Learning Workflow with Just 10 Lines of Python Use LazyPredict and PyCaret to skip the grunt work and jump straight to performance. The post How I Automated My Machine Learning Workflow with Just 10 Lines of Python appeared first on Towards Data Science. Himanshu Sharma Go to original source

June 7, 2025
Data Drift Is Not the Actual Problem: Your Monitoring Strategy Is

Data Drift Is Not the Actual Problem: Your Monitoring Strategy Is Monitoring is easy; what to monitor is not. In the field of machine learning, data drift is just noise until you know what it means. The post Data Drift Is Not the Actual Problem: Your Monitoring Strategy Is appeared first on Towards Data Science.…

June 4, 2025
Pairwise Cross-Variance Classification

Pairwise Cross-Variance Classification Multi-class zero-shot embedding classification and error checking The post Pairwise Cross-Variance Classification appeared first on Towards Data Science. Doster Esh Go to original source

June 4, 2025
Landing your First Machine Learning Job: Startup vs Big Tech vs Academia

Landing your First Machine Learning Job: Startup vs Big Tech vs Academia A practical guide to landing your first Machine Learning job across startups, big tech, and academia. The post Landing your First Machine Learning Job: Startup vs Big Tech vs Academia appeared first on Towards Data Science. Piero Paialunga Go to original source

June 4, 2025
Decision Trees Natively Handle Categorical Data

Decision Trees Natively Handle Categorical Data But mean target encoding is their turbocharger The post Decision Trees Natively Handle Categorical Data appeared first on Towards Data Science. Vadim Arzamasov Go to original source

June 4, 2025
Evaluating LLMs for Inference, or Lessons from Teaching for Machine Learning

Evaluating LLMs for Inference, or Lessons from Teaching for Machine Learning It’s like grading papers, but your student is an LLM The post Evaluating LLMs for Inference, or Lessons from Teaching for Machine Learning appeared first on Towards Data Science. Stephanie Kirmer Go to original source

June 3, 2025
Your DNA Is a Machine Learning Model: It’s Already Out There

Your DNA Is a Machine Learning Model: It’s Already Out There Even if you never sequenced your genome, predictive systems already know a lot about it. Genomic inference has become a population-scale model, and you’re probably in it. The post Your DNA Is a Machine Learning Model: It’s Already Out There appeared first on Towards…

June 3, 2025
Hands-On Attention Mechanism for Time Series Classification, with Python

Hands-On Attention Mechanism for Time Series Classification, with Python This is how to use the attention mechanism in a time series classification framework The post Hands-On Attention Mechanism for Time Series Classification, with Python appeared first on Towards Data Science. Piero Paialunga Go to original source

May 31, 2025
LLM Optimization: LoRA and QLoRA

LLM Optimization: LoRA and QLoRA Scalable fine-tuning techniques for large language models The post LLM Optimization: LoRA and QLoRA appeared first on Towards Data Science. Vyacheslav Efimov Go to original source

May 31, 2025
GAIA: The LLM Agent Benchmark Everyone’s Talking About

GAIA: The LLM Agent Benchmark Everyone’s Talking About What practitioners need to know about this LLM agent benchmark The post GAIA: The LLM Agent Benchmark Everyone’s Talking About appeared first on Towards Data Science. Shuai Guo Go to original source

May 30, 2025
From Data to Stories: Code Agents for KPI Narratives

From Data to Stories: Code Agents for KPI Narratives HuggingFace’s smolagents framework in action The post From Data to Stories: Code Agents for KPI Narratives appeared first on Towards Data Science. Mariya Mansurova Go to original source

May 29, 2025
JAX: Is This Google’s NumPy killer?

JAX: Is This Google’s NumPy killer? Auto differentiation and JIT compilation make a compelling case. The post JAX: Is This Google’s NumPy killer? appeared first on Towards Data Science. Thomas Reid Go to original source

May 29, 2025
Detecting Malicious URLs Using LSTM and Google’s BERT Models

Detecting Malicious URLs Using LSTM and Google’s BERT Models A progressive approach to implementing AI-powered webpage detection applications into production The post Detecting Malicious URLs Using LSTM and Google’s BERT Models appeared first on Towards Data Science. Toluwase Babalola Go to original source

May 29, 2025
Tree of Thought Prompting: Teaching LLMs to Think Slowly

Tree of Thought Prompting: Teaching LLMs to Think Slowly Playing Minesweeper with Augmented Reasoning The post Tree of Thought Prompting: Teaching LLMs to Think Slowly appeared first on Towards Data Science. Shuyang Go to original source

May 29, 2025
Bayesian Optimization for Hyperparameter Tuning of Deep Learning Models

Bayesian Optimization for Hyperparameter Tuning of Deep Learning Models Explore how Bayesian Optimization outperforms Grid Search in efficiency and performance over binary classification tasks. The post Bayesian Optimization for Hyperparameter Tuning of Deep Learning Models appeared first on Towards Data Science. Kuriko Iwai Go to original source

May 28, 2025
Reinforcement Learning Made Simple: Build a Q-Learning Agent in Python

Reinforcement Learning Made Simple: Build a Q-Learning Agent in Python Inspired by AlphaGo’s Move 37 — learn how agents explore, exploit, and win The post Reinforcement Learning Made Simple: Build a Q-Learning Agent in Python appeared first on Towards Data Science. Sarah Schürch Go to original source

May 28, 2025
Why Regularization Isn’t Enough: A Better Way to Train Neural Networks with Two Objectives

Why Regularization Isn’t Enough: A Better Way to Train Neural Networks with Two Objectives Why splitting your objectives and your model might be the key to better performance and clearer trade-offs in deep learning. The post Why Regularization Isn’t Enough: A Better Way to Train Neural Networks with Two Objectives appeared first on Towards Data…

May 28, 2025
Code Agents: The Future of Agentic AI

Code Agents: The Future of Agentic AI HuggingFace smolagents framework in action The post Code Agents: The Future of Agentic AI appeared first on Towards Data Science. Mariya Mansurova Go to original source

May 27, 2025
The Best AI Books & Courses for Getting a Job

The Best AI Books & Courses for Getting a Job A comprehensive guide to the books and courses that helped me learn AI The post The Best AI Books & Courses for Getting a Job appeared first on Towards Data Science. Egor Howell Go to original source

May 27, 2025
Prototyping Gradient Descent in Machine Learning

Prototyping Gradient Descent in Machine Learning Mathematical theorem and credit transaction prediction using Stochastic / Batch GD The post Prototyping Gradient Descent in Machine Learning appeared first on Towards Data Science. Kuriko Iwai Go to original source

May 24, 2025
Estimating Product-Level Price Elasticities Using Hierarchical Bayesian

Estimating Product-Level Price Elasticities Using Hierarchical Bayesian Using one model to personalize ML results The post Estimating Product-Level Price Elasticities Using Hierarchical Bayesian appeared first on Towards Data Science. Derek Tran Go to original source

May 24, 2025
Top Machine Learning Jobs and How to Prepare For Them

Top Machine Learning Jobs and How to Prepare For Them These days, job titles like data scientist, machine learning engineer, and Ai Engineer are everywhere — and if you were anything like me, it can be hard to understand what each of them actually does if you are not working within the field. And then there are titles…

May 22, 2025
How to Set the Number of Trees in Random Forest

How to Set the Number of Trees in Random Forest Scientific publication T. M. Lange, M. Gültas, A. O. Schmitt & F. Heinrich (2025). optRF: Optimising random forest stability by determining the optimal number of trees. BMC bioinformatics, 26(1), 95. Follow this LINK to the original publication. Random Forest — A Powerful Tool for Anyone…

May 17, 2025
The Automation Trap: Why Low-Code AI Models Fail When You Scale

The Automation Trap: Why Low-Code AI Models Fail When You Scale In the beginning, building Machine Learning models was a skill only data scientists with knowledge of Python could master. However, low-code AI platforms have made things much easier now. Anyone can now directly make a model, link it to data, and publish it as…

May 17, 2025
Google’s AlphaEvolve Is Evolving New Algorithms — And It Could Be a Game Changer

Google’s AlphaEvolve Is Evolving New Algorithms — And It Could Be a Game Changer AlphaEvolve imagined as a genetic algorithm coupled to a large language model. Picture created by the author using various tools including Dall-E3 via ChatGPT. Large Language Models have undeniably revolutionized how many of us approach coding, but they’re often more like a super-powered…

May 16, 2025
Understanding Random Forest using Python (scikit-learn)

Understanding Random Forest using Python (scikit-learn) Decision trees are a popular supervised learning algorithm with benefits that include being able to be used for both regression and classification as well as being easy to interpret. However, decision trees aren’t the most performant algorithm and are prone to overfitting due to small variations in the training…

May 16, 2025
How to Learn the Math Needed for Machine Learning

How to Learn the Math Needed for Machine Learning Maths can be a scary topic for people. Many of you want to work in machine learning, but the maths skills needed may seem overwhelming. I am here to tell you that it’s nowhere as intimidating as you may think and to give you a roadmap, resources,…

May 16, 2025
How To Build a Benchmark for Your Models

How To Build a Benchmark for Your Models I’ve been working as a data science consultant for the past three years, and I’ve had the opportunity to work on multiple projects across various industries. Yet, I noticed one common denominator among most of the clients I worked with: They rarely have a clear idea of…

May 16, 2025
Strength in Numbers: Ensembling Models with Bagging and Boosting

Strength in Numbers: Ensembling Models with Bagging and Boosting Bagging and boosting are two powerful ensemble techniques in machine learning – they are must-knows for data scientists! After reading this article, you are going to have a solid understanding of how bagging and boosting work and when to use them. We’ll cover the following topics,…

May 15, 2025
Empowering LLMs to Think Deeper by Erasing Thoughts

Empowering LLMs to Think Deeper by Erasing Thoughts Introduction Recent large language models (LLMs) — such as OpenAI’s o1/o3, DeepSeek’s R1 and Anthropic’s Claude 3.7 — demonstrate that allowing the model to think deeper and longer at test time can significantly enhance model’s reasoning capability. The core approach underlying their deep thinking capability is called…

May 13, 2025
The Westworld Blunder

The Westworld Blunder We’re entering an interesting moment in AI development. AI systems are getting memory, reasoning chains, self-critiques, and long-context recall. These capabilities are exactly some of the things that I’ve previously written would be prerequisites for an AI system to be conscious. Just to be clear, I don’t believe today’s AI systems are self-aware, but…

May 13, 2025
Pause Your ML Pipelines for Human Review Using AWS Step Functions + Slack

Pause Your ML Pipelines for Human Review Using AWS Step Functions + Slack Have you ever wanted to pause an automated workflow to wait for a human decision? Maybe you need approval before provisioning cloud resources, promoting a machine learning model to production, or charging a customer’s credit card. In many data science and machine learning…

May 13, 2025
Clustering Eating Behaviors in Time: A Machine Learning Approach to Preventive Health

Clustering Eating Behaviors in Time: A Machine Learning Approach to Preventive Health It’s well known that what we eat matters — but what if when and how often we eat matters just as much? In the midst of ongoing scientific debate around the benefits of intermittent fasting, this question becomes even more intriguing. As someone passionate about machine learning and healthy living,…

May 9, 2025
Model Compression: Make Your Machine Learning Models Lighter and Faster

Model Compression: Make Your Machine Learning Models Lighter and Faster Introduction Whether you’re preparing for interviews or building Machine Learning systems at your job, model compression has become a must-have skill. In the era of LLMs, where models are getting larger and larger, the challenges around compressing these models to make them more efficient, smaller,…

May 9, 2025
The Shadow Side of AutoML: When No-Code Tools Hurt More Than Help

The Shadow Side of AutoML: When No-Code Tools Hurt More Than Help Automl has become the gateway drug to machine learning for many organizations. It promises exactly what teams under pressure want to hear: you bring the data, and we’ll handle the modeling. There are no pipelines to manage, no hyperparameters to tune, and no…

May 9, 2025
A Practical Guide to BERTopic for Transformer-Based Topic Modeling

A Practical Guide to BERTopic for Transformer-Based Topic Modeling Topic modeling has a wide range of use cases in the natural language processing (NLP) domain, such as document tagging, survey analysis, and content organization. It falls under the realm of unsupervised learning technique, making it a very cost-effective technique that reduces the resources required to…

May 8, 2025
From RGB to HSV — and Back Again

From RGB to HSV — and Back Again Introduction A fundamental concept in Computer Vision is understanding how images are stored and represented. On disk, image files are encoded in various ways, from lossy, compressed JPEG files to lossless PNG files. Once you load an image into a program and decode it from the respective…

May 8, 2025
Retrieval Augmented Classification: Improving Text Classification with External Knowledge

Retrieval Augmented Classification: Improving Text Classification with External Knowledge Text Classification stands as one of the most basic yet most important applications of natural language processing. It has a vital role in many real-world applications that go from filtering unwanted emails like spam, detecting product categories or classifying user intent in a chat-bot application. The…

May 7, 2025
Fine-Tuning vLLMs for Document Understanding

Fine-Tuning vLLMs for Document Understanding In this article, I discuss how you can fine-tune VLMs (visual large language models, often called vLLMs) like Qwen 2.5 VL 7B. I will introduce you to a dataset of handwritten digits, which the base version of Qwen 2.5 VL struggles with. We will then inspect the dataset, annotate it,…

May 6, 2025
Why I stopped Using Cursor and Reverted to VSCode

Why I stopped Using Cursor and Reverted to VSCode Introduction In December 2024, I wrote an article sharing my experience using VSCode (GitHub Copilot) and Cursor (Claude 3.5 Sonnet) from the perspective of a Data Scientist. Should you switch from VSCode to Cursor? I concluded the article by stating: After using Cursor for the past two…

May 3, 2025
How Would I Learn to Code with ChatGPT if I Had to Start Again

How Would I Learn to Code with ChatGPT if I Had to Start Again Coding has been a part of my life since I was 10. From modifying HTML & CSS for my Friendster profile during the simple internet days to exploring SQL injections for the thrill, building a three-legged robot for fun, and lately…

May 1, 2025
Why Are Convolutional Neural Networks Great For Images?

Why Are Convolutional Neural Networks Great For Images? The Universal Approximation Theorem states that a neural network with a single hidden layer and a nonlinear activation function can approximate any continuous function. Practical issues aside, such that the number of neurons in this hidden layer would grow enormously large, we do not need other network architectures. A simple…

May 1, 2025
Beyond Glorified Curve Fitting: Exploring the Probabilistic Foundations of Machine Learning

Beyond Glorified Curve Fitting: Exploring the Probabilistic Foundations of Machine Learning You see a math formula you don’t immediately understand. Your instinct? Stop reading. Don’t. That’s exactly what I told myself when I started reading Probabilistic Machine Learning – An Introduction by Kevin P. Murphy. And it was absolutely worth it. It changed how I…

May 1, 2025
Reinforcement Learning from One Example?

Reinforcement Learning from One Example? Prompt engineering alone won’t get us to production. Fine-tuning is expensive. And reinforcement learning? That’s been reserved for well-funded labs with massive datasets until now. New research from Microsoft and academic collaborators has overturned that assumption. Using Reinforcement Learning with Verifiable Rewards (RLVR) and just a single training example, researchers…

May 1, 2025
From FOMO to Opportunity: Analytical AI in the Era of LLM Agents

From FOMO to Opportunity: Analytical AI in the Era of LLM Agents Are you feeling “fear of missing out” (FOMO) when it comes to LLM agents? Well, that was the case for me for quite a while. In recent months, it feels like my online feeds have been completely bombarded by “LLM Agents”: every other…

April 30, 2025
If I Wanted to Become a Machine Learning Engineer, I’d Do This

If I Wanted to Become a Machine Learning Engineer, I’d Do This If I wanted to become a machine learning engineer again, this is the exact process I would follow. Let’s get into it! First become a data scientist or software engineer I’ve said it before, but a machine learning engineer is not exactly an entry-level position.…

April 29, 2025
How to Ensure Your AI Solution Does What You Expect iI to Do

How to Ensure Your AI Solution Does What You Expect iI to Do Generative AI (GenAI) is evolving fast — and it’s no longer just about fun chatbots or impressive image generation. 2025 is the year where the focus is on turning the AI hype into real value. Companies everywhere are looking into ways to…

April 29, 2025
LLM Evaluations: from Prototype to Production

LLM Evaluations: from Prototype to Production Evaluation is the cornerstone of any machine learning product. Investing in quality measurement delivers significant returns. Let’s explore the potential business benefits. As management consultant and writer Peter Drucker once said, “If you can’t measure it, you can’t improve it.” Building a robust evaluation system helps you identify areas…

April 26, 2025
Choose the Right One: Evaluating Topic Models for Business Intelligence

Choose the Right One: Evaluating Topic Models for Business Intelligence Topic models are used in businesses to classify brand-related text datasets (such as product and site reviews, surveys, and social media comments) and to track how customer satisfaction metrics change over time. There is a myriad of recent topic models one can choose from: the…

April 25, 2025
Predicting the NBA Champion with Machine Learning

Predicting the NBA Champion with Machine Learning Every NBA season, 30 teams compete for something only one will achieve: the legacy of a championship. From power rankings to trade deadline chaos and injuries, fans and analysts alike speculate endlessly about who will raise the Larry O’Brien Trophy. But what if we could go beyond the hot…

April 25, 2025
How to Benchmark DeepSeek-R1 Distilled Models on GPQA Using Ollama and OpenAI’s simple-evals

How to Benchmark DeepSeek-R1 Distilled Models on GPQA Using Ollama and OpenAI’s simple-evals The recent launch of the DeepSeek-R1 model sent ripples across the global AI community. It delivered breakthroughs on par with the reasoning models from Meta and OpenAI, achieving this in a fraction of the time and at a significantly lower cost. Beyond…

April 24, 2025
Exporting MLflow Experiments from Restricted HPC Systems

Exporting MLflow Experiments from Restricted HPC Systems Many High-Performance Computing (HPC) environments, especially in research and educational institutions, restrict communications to outbound TCP connections. Running a simple command-line ping or curl with the MLflow tracking URL on the HPC bash shell to check packet transfer can be successful. However, communication fails and times out while…

April 24, 2025
Explained: How Does L1 Regularization Perform Feature Selection?

Explained: How Does L1 Regularization Perform Feature Selection? Feature Selection is the process of selecting an optimal subset of features from a given set of features; an optimal feature subset is the one which maximizes the performance of the model on the given task. Feature selection can be a manual or rather explicit process when…

April 23, 2025
When Physics Meets Finance: Using AI to Solve Black-Scholes

When Physics Meets Finance: Using AI to Solve Black-Scholes DISCLAIMER: This is not financial advice. I’m a PhD in Aerospace Engineering with a strong focus on Machine Learning: I’m not a financial advisor. This article is intended solely to demonstrate the power of Physics-Informed Neural Networks (PINNs) in a financial context. When I was 16,…

April 18, 2025
Google’s New AI System Outperforms Physicians in Complex Diagnoses

Google’s New AI System Outperforms Physicians in Complex Diagnoses Imagine going to the doctor with a baffling set of symptoms. Getting the right diagnosis quickly is crucial, but sometimes even experienced physicians face challenges piecing together the puzzle. Sometimes it might not be something serious at all; others a deep investigation might be required. No…

April 18, 2025