Category: ai

Architecting GPUaaS for Enterprise AI On-Prem

Architecting GPUaaS for Enterprise AI On-Prem Multi-tenancy, scheduling, and cost modeling on Kubernetes The post Architecting GPUaaS for Enterprise AI On-Prem appeared first on Towards Data Science. Joe Sasson Go to original source

February 22, 2026
Can AI Solve Failures in Your Supply Chain?

Can AI Solve Failures in Your Supply Chain? When your warehouse and transportation teams blame each other for late deliveries, who’s right? We can ask an agent connected to the data settle the debate. The post Can AI Solve Failures in Your Supply Chain? appeared first on Towards Data Science. Samir Saci Go to original…

February 19, 2026
The Hidden Opportunity in AI Workflow Automation with n8n for Low-Tech Companies

The Hidden Opportunity in AI Workflow Automation with n8n for Low-Tech Companies How to use n8n with multimodal AI and optimisation tools to help companies with low data maturity accelerate their digital transformation. The post The Hidden Opportunity in AI Workflow Automation with n8n for Low-Tech Companies appeared first on Towards Data Science. Samir Saci…

January 19, 2026
How AI Can Become Your Personal Language Tutor

How AI Can Become Your Personal Language Tutor How I used n8n to build AI study partners for learning Mandarin: vocabulary, listening, and pronunciation correction. The post How AI Can Become Your Personal Language Tutor appeared first on Towards Data Science. Samir Saci Go to original source

January 13, 2026
Deploy Your AI Assistant to Monitor and Debug n8n Workflows Using Claude and MCP

Deploy Your AI Assistant to Monitor and Debug n8n Workflows Using Claude and MCP Use Claude AI to monitor, analyse, and troubleshoot your n8n automation workflows through natural conversation. The post Deploy Your AI Assistant to Monitor and Debug n8n Workflows Using Claude and MCP appeared first on Towards Data Science. Samir Saci Go to…

November 13, 2025
Water Cooler Small Talk, Ep. 9: What “Thinking” and “Reasoning” Really Mean in AI and LLMs

Water Cooler Small Talk, Ep. 9: What “Thinking” and “Reasoning” Really Mean in AI and LLMs Understanding how AI models “reason” and why it’s not what humans do when we think The post Water Cooler Small Talk, Ep. 9: What “Thinking” and “Reasoning” Really Mean in AI and LLMs appeared first on Towards Data Science.…

October 29, 2025
STAT+: FDA will convene Digital Health Advisory Committee in November

STAT+: FDA will convene Digital Health Advisory Committee in November The Food and Drug Administration will convene a meeting of external digital health advisors on Nov. 6, according to an email reviewed by STAT. The gathering comes as the agency wrestles with myriad issues related to regulating artificial intelligence and other technologies in health care.…

September 5, 2025
Designing Trustworthy ML Models: Alan & Aida Discover Monotonicity in Machine Learning

Designing Trustworthy ML Models: Alan & Aida Discover Monotonicity in Machine Learning Accuracy alone doesn’t guarantee trustworthiness. Monotonicity ensures predictions align with common sense and business rules. The post Designing Trustworthy ML Models: Alan & Aida Discover Monotonicity in Machine Learning appeared first on Towards Data Science. Mehdi Mohammadi Go to original source

August 22, 2025
Work Data Is the Next Frontier for GenAI

Work Data Is the Next Frontier for GenAI 9 reasons why work data is the single most valuable data source for LLM training, uniquely capable of propelling LLM performance to unprecedented heights. The post Work Data Is the Next Frontier for GenAI appeared first on Towards Data Science. Zsombor Varnagy-Toth Go to original source

July 10, 2025
Stop Building AI Platforms

Stop Building AI Platforms When small and medium companies achieve success in building Data and ML platforms, building AI platforms is now profoundly challenging The post Stop Building AI Platforms appeared first on Towards Data Science. Ming Gao Go to original source

June 14, 2025
How to Build an AI Journal with LlamaIndex

How to Build an AI Journal with LlamaIndex This post will share how to build an AI journal with the LlamaIndex. We will cover one essential function of this AI journal: asking for advice. We will start with the most basic implementation and iterate from there. We can see significant improvements for this function when…

May 17, 2025
Former FDA head Robert Califf says RFK Jr.’s vaccine rhetoric is ‘doing harm’ to Americans

Former FDA head Robert Califf says RFK Jr.’s vaccine rhetoric is ‘doing harm’ to Americans SAN FRANCISCO — Former Food and Drug Administration Commissioner Robert Califf said health secretary Robert F. Kennedy Jr.’s hesitance to endorse vaccines is harming the American public. “Fundamental vaccination saves millions of lives every year already, and he’s busy eroding public…

May 14, 2025
STAT+: Insitro, an AI biotech founded by Daphne Koller, lays off staff

STAT+: Insitro, an AI biotech founded by Daphne Koller, lays off staff Insitro, the closely watched AI biotech founded by the Stanford professor and entrepreneur Daphne Koller, said Thursday it was laying off 22% of its staff, or around 60 people. It’s part of a series of cuts that have hit well-backed startups over the…

May 8, 2025
Circuit Tracing: A Step Closer to Understanding Large Language Models

Circuit Tracing: A Step Closer to Understanding Large Language Models Context Over the years, Transformer-based large language models (LLMs) have made substantial progress across a wide range of tasks evolving from simple information retrieval systems to sophisticated agents capable of coding, writing, conducting research, and much more. But despite their capabilities, these models are still largely…

April 9, 2025
How I Became A Machine Learning Engineer (No CS Degree, No Bootcamp)

How I Became A Machine Learning Engineer (No CS Degree, No Bootcamp) Machine learning and AI are among the most popular topics nowadays, especially within the tech space. I am fortunate enough to work and develop with these technologies every day as a machine learning engineer! In this article, I will walk you through my…

February 15, 2025
I Tried Making my Own (Bad) LLM Benchmark to Cheat in Escape Rooms

I Tried Making my Own (Bad) LLM Benchmark to Cheat in Escape Rooms Recently, DeepSeek announced their latest model, R1, and article after article came out praising its performance relative to cost, and how the release of such open-source models could genuinely change the course of LLMs forever. That is really exciting! And also, too…

February 8, 2025
Neural Networks – Intuitively and Exhaustively Explained

Neural Networks – Intuitively and Exhaustively Explained An in-depth exploration of the most fundamental architecture in modern AI “The Thinking Part” by Daniel Warfield using MidJourney. All images by the author unless otherwise specified. Article originally made available on Intuitively and Exhaustively Explained. In this article we’ll form a thorough understanding of the neural network,…

February 4, 2025
The Cultural Backlash Against Generative AI

The Cultural Backlash Against Generative AI What’s making many people resent generative AI, and what impact does that have on the companies responsible? Photo by Joshua Hoehne on Unsplash The recent reveal of DeepSeek-R1, the large scale LLM developed by a Chinese company (also named DeepSeek), has been a very interesting event for those of us…

February 3, 2025
Rapid Data Visualization with Copilot and Plotly

Rapid Data Visualization with Copilot and Plotly Code visualizations quickly and efficiently with Copilot, Plotly, and Streamlit Continue reading on Towards Data Science » Alan Jones Go to original source

February 2, 2025
Fine-tuning Multimodal Embedding Models

Fine-tuning Multimodal Embedding Models Adapting CLIP to YouTube Data (with Python Code) This is the 4th article in a larger series on multimodal AI. In the previous post, we discussed multimodal RAG systems, which can retrieve and synthesize information from different data modalities (e.g. text, images, audio). There, we saw how we could implement such a…

February 1, 2025
Data Pruning MNIST: How I Hit 99% Accuracy Using Half the Data

Data Pruning MNIST: How I Hit 99% Accuracy Using Half the Data How much data does AI really need? TLDR: Data-centric AI can create more efficient and accurate models. I experimented with data pruning on MNIST¹ to classify handwritten digits. Best runs for “furthest-from-centroid” selection compared to full dataset. Image by author. What if I told you…

January 31, 2025
Great Books for AI Engineering

Great Books for AI Engineering 10 books with valuable insights about AI science and engineering Great books for AI Engineering — Plus ‘Brave New Words’ (Image is Author’s own work) A few years ago I recommended 21 books in Great Books for Data Science and Great Books for Data Science 2. Since then a lot has changed. While…

January 30, 2025
Understanding Emergent Capabilities in LLMs: Lessons from Biological Systems

Understanding Emergent Capabilities in LLMs: Lessons from Biological Systems How natural systems fundamental laws help explain AI’s unexpected abilities Continue reading on Towards Data Science » Javier Marin Go to original source

January 25, 2025
How to Utilize ModernBERT and Synthetic Data for Robust Text Classification

How to Utilize ModernBERT and Synthetic Data for Robust Text Classification Learn how to fine-tune ModernBERT and create augmentations of text samples Continue reading on Towards Data Science » Eivind Kjosbakken Go to original source

January 23, 2025
Large Language Models: A Short Introduction

Large Language Models: A Short Introduction And why you should care about LLMs Image by author. There’s an acronym you’ve probably heard non-stop for the past few years: LLM, which stands for Large Language Model. In this article we’re going to take a brief look at what LLMs are, why they’re an extremely exciting piece of technology, why…

January 22, 2025
Why Generative-AI Apps’ Quality Often Sucks and What to Do About It

Why Generative-AI Apps’ Quality Often Sucks and What to Do About It How to get from PoCs to tested high-quality applications in production Image licensed from elements.envato.com, edit by Marcel Müller, 2025 The generative AI hype has rolled through the business world in the past two years. This technology can make business process executions more efficient,…

January 21, 2025
How to Log Your Data with MLflow

How to Log Your Data with MLflow MLflow, MLOps, Data Science Mastering data logging in MLOps for your AI workflow Photo by Chris Liverani on Unsplash Preface Data is one of the most critical components of the machine learning process. In fact, the quality of the data used in training a model often determines the success or failure…

January 20, 2025
Machine Learning + openAI: solving a text classification problem

Machine Learning + openAI: solving a text classification problem How I migrated an old solution to a more elegant, robust and scalable solution using text classification from openAI Continue reading on Towards Data Science » Ricardo Ribas Go to original source

January 12, 2025
What to Do If the Logit Decision Boundary Fails?

What to Do If the Logit Decision Boundary Fails? Feature engineering for classification models using Bayesian Machine Learning Continue reading on Towards Data Science » Lukasz Gatarek Go to original source

January 10, 2025
AI Agents Hype, Explained — What You Really Need to Know to Get Started

AI Agents Hype, Explained — What You Really Need to Know to Get Started I’ll set the record straight — AI Agents are not new but advanced. Learn how they’ve evolved and where to get started. Continue reading on Towards Data Science » Marc Nehme Go to original source

January 7, 2025
The Cultural Impact of AI Generated Content: Part 2

The Cultural Impact of AI Generated Content: Part 2 What can we do about the increasingly sophisticated AI generated content in our lives? Photo by Meszárcsek Gergely on Unsplash In my prior column, I established how AI generated content is expanding online, and described scenarios to illustrate why it’s occurring. (Please read that before you go on…

January 4, 2025
GDD: Generative Driven Design

GDD: Generative Driven Design Reflective generative AI software components as a development paradigm Nowhere has the proliferation of generative AI tooling been more aggressive than in the world of software development. It began with GitHub Copilot’s supercharged autocomplete, then exploded into direct code-along integrated tools like Aider and Cursor that allow software engineers to dictate…

January 2, 2025
Top 12 Skills Data Scientists Need to Succeed in 2025

Top 12 Skills Data Scientists Need to Succeed in 2025 It’s (not) all about LLMs and AI tools Continue reading on Towards Data Science » Benjamin Bodner Go to original source

January 1, 2025
Probability Distributions: Poisson vs. Binomial Distribution

Probability Distributions: Poisson vs. Binomial Distribution Using Soccer to Understand the Difference Between Poisson & Binomial: Probability for Data Science Series (3) Continue reading on Towards Data Science » Sunghyun Ahn Go to original source

December 25, 2024
What Every Aspiring Machine Learning Engineer Must Know to Succeed

What Every Aspiring Machine Learning Engineer Must Know to Succeed Your Guide to Avoiding Critical Errors with Machine Learning in Production Continue reading on Towards Data Science » Claudia Ng Go to original source

December 23, 2024
Should you switch from VSCode to Cursor?

Should you switch from VSCode to Cursor? My experience using VSCode (GitHub Copilot) and Cursor (Claude 3.5 Sonnet) as a Data Scientist. Continue reading on Towards Data Science » Marc Matterson Go to original source

December 22, 2024
Top 3 Strategies to Search Your Data

Top 3 Strategies to Search Your Data Strategies from traditional index seek to AI based semantic search that every software engineer should know! Continue reading on Towards Data Science » Shawn Shi Go to original source

December 22, 2024
Introduction to TensorFlow’s Functional API

Introduction to TensorFlow’s Functional API Learn what the Functional API is, and how to build complex keras models using it Continue reading on Towards Data Science » Javier Martínez Ojeda Go to original source

December 19, 2024
The Invisible Bug That Broke My Automation: How OCR Changed The Game

The Invisible Bug That Broke My Automation: How OCR Changed The Game The evolution of AI in test automation: from locators to generative AI (Part 3) Continue reading on Towards Data Science » Abdelkader HASSINE Go to original source

December 18, 2024
Data Valuation — A Concise Overview

Data Valuation — A Concise Overview Understanding the Value of your Data: Challenges, Methods, and Applications ChatGPT and similar LLMs were trained on insane amounts of data. OpenAI and Co. scraped the internet, collecting books, articles, and social media posts to train their models. It’s easy to imagine that some of the texts (like scientific or news…

December 16, 2024
How I’d Learn AI in 2025 (If I Knew Nothing)

How I’d Learn AI in 2025 (If I Knew Nothing) A 5-step roadmap for today’s landscape Today, more people than ever are trying to learn AI. Although there are countless free learning resources online, navigating this rapidly evolving landscape can be overwhelming (especially as a beginner). In this article, I discuss how I’d approach learning…

December 14, 2024
Why “AI Can’t Reason” Is a Bias

Why “AI Can’t Reason” Is a Bias We humans are proud creatures Continue reading on Towards Data Science » Rafe Brena, Ph.D. Go to original source

December 13, 2024
How to Integrate AI and Data Science into Your Business Strategy

How to Integrate AI and Data Science into Your Business Strategy DATA SCIENCE CONSULTING Insider consulting guide to conducting a successful 2-day executive workshop Image by author using Canva “Our industry does not respect tradition — it only respects innovation.” — Satya Nadella, CEO Microsoft, Letter to employees in 2014 While not all industries are as competitive and cutthroat as the…

December 7, 2024
Multimodal RAG: Process Any File Type with AI

Multimodal RAG: Process Any File Type with AI A beginner-friendly guide with example (Python) code This is the third article in a larger series on multimodal AI. In the previous posts, we discussed multimodal LLMs and embedding models, respectively. In this article, we will combine these ideas to enable the development of multimodal RAG systems. I’ll…

December 6, 2024
How to Build a General-Purpose LLM Agent

How to Build a General-Purpose LLM Agent A Step-by-Step Guide High-level Overview of an LLM Agent. (Image by author) Why build a general-purpose agent? Because it’s an excellent tool to prototype your use cases and lays the groundwork for designing your own custom agentic architecture. Before we dive in, let’s quickly introduce LLM agents. Feel free…

December 5, 2024
What Teaching AI Taught me About Data Skills & People

What Teaching AI Taught me About Data Skills & People Three key lessons from my journey as a corporate AI educator Photo by Mikhail Nilov. As an AI Educator, my job was to equip corporate teams with the data & AI skills they needed to thrive. But looking back, I realized that I learned far more from…

December 5, 2024
Bird’s-Eye View of Linear Algebra: Left, Right Inverse => Injective, Surjective Maps

Bird’s-Eye View of Linear Algebra: Left, Right Inverse => Injective, Surjective Maps If matrix multiplication isn’t commutative, then why don’t we have left and right inverses? Continue reading on Towards Data Science » Rohit Pandey Go to original source

December 4, 2024
The Cultural Impact of AI Generated Content: Part 1

The Cultural Impact of AI Generated Content: Part 1 What happens when AI generated media becomes ubiquitous in our lives? How does this relate to what we’ve experienced before, and how does it change us? Photo by Annie Spratt on Unsplash This is the first part of a two part series I’m writing analyzing how people and…

December 4, 2024
Smaller is smarter

Smaller is smarter Concerns about the environmental impacts of Large Language Models (LLMs) are growing. Although detailed information about the actual costs of LLMs can be difficult to find, let’s attempt to gather some facts to understand the scale. Generated with ChatGPT-4o Since comprehensive data on ChatGPT-4 is not readily available, we can consider Llama 3.1…

December 2, 2024
How Did Open Food Facts Fix OCR-Extracted Ingredients Using Open-Source LLMs?

How Did Open Food Facts Fix OCR-Extracted Ingredients Using Open-Source LLMs? Delve into an end-to-end Machine Learning project to improve the quality of the Open Food Facts database Image generated with Flux1 Open Food Facts’ purpose is to create the largest open-source food database in the world. To this day, it has collected over 3 millions products…

November 30, 2024
Why Internal Company Chatbots Fail and How to Use Generative AI in Enterprise with Impact

Why Internal Company Chatbots Fail and How to Use Generative AI in Enterprise with Impact Start with the problem and not with the solution Background licensed from elements.envato.com, edit by Marcel Müller 2024 The most common disillusion that many organizations have is the following: They get excited about generative AI with ChatGPT or Microsoft Co-Pilot, read some…

November 30, 2024
How to Develop an Effective AI-Powered Legal Assistant

How to Develop an Effective AI-Powered Legal Assistant Create a machine-learning-based search into legal decisions Continue reading on Towards Data Science » Eivind Kjosbakken Go to original source

November 28, 2024