Category: genai

Claude Skills and Subagents: Escaping the Prompt Engineering Hamster Wheel

Claude Skills and Subagents: Escaping the Prompt Engineering Hamster Wheel How reusable, lazy-loaded instructions solve the context bloat problem in AI-assisted development. The post Claude Skills and Subagents: Escaping the Prompt Engineering Hamster Wheel appeared first on Towards Data Science. Ruben Broekx Go to original source

March 1, 2026
From Amnesia to Awareness: Giving Retrieval-Only Chatbots Memory

From Amnesia to Awareness: Giving Retrieval-Only Chatbots Memory Achieve natural multi-turn conversations without sacrificing content control. The post From Amnesia to Awareness: Giving Retrieval-Only Chatbots Memory appeared first on Towards Data Science. Nicole Ren Go to original source

September 19, 2025
Should We Use LLMs As If They Were Swiss Knives?

Should We Use LLMs As If They Were Swiss Knives? A logic game performance comparison between popular LLMs and a custom-made algorithm The post Should We Use LLMs As If They Were Swiss Knives? appeared first on Towards Data Science. Nicolas Garcia Aramouni Go to original source

September 5, 2025
From Pixels to Plots

From Pixels to Plots How I built an AI-powered prototype to turn images into insights The post From Pixels to Plots appeared first on Towards Data Science. Jens Winkelmann Go to original source

July 1, 2025
How to Ensure Your AI Solution Does What You Expect iI to Do

How to Ensure Your AI Solution Does What You Expect iI to Do Generative AI (GenAI) is evolving fast — and it’s no longer just about fun chatbots or impressive image generation. 2025 is the year where the focus is on turning the AI hype into real value. Companies everywhere are looking into ways to…

April 29, 2025
Deep Dive into KV-Caching In Mistral

Deep Dive into KV-Caching In Mistral Ever wondered why the time to first token in LLMs is high but subsequent tokens are super fast? In this post, I dive into the details of KV-Caching used in Mistral, a topic I initially found quite daunting. However, as I delved deeper, it became a fascinating subject, especially when…

January 15, 2025
Evaluation-Driven Development for agentic applications using PydanticAI

Evaluation-Driven Development for agentic applications using PydanticAI An open-source, model-agnostic agentic framework that supports dependency injection Ideally, you can evaluate agentic applications even as you are developing them, instead of evaluation being an afterthought. For this to work, though, you need to be able to mock both internal and external dependencies of the agent you…

December 22, 2024
The Anatomy of an Autonomous Agent

The Anatomy of an Autonomous Agent A blueprint for autonomous agents in an Agentic Mesh ecosystem. Continue reading on Towards Data Science » Eric Broda Go to original source

December 18, 2024
How I’d Learn AI in 2025 (If I Knew Nothing)

How I’d Learn AI in 2025 (If I Knew Nothing) A 5-step roadmap for today’s landscape Today, more people than ever are trying to learn AI. Although there are countless free learning resources online, navigating this rapidly evolving landscape can be overwhelming (especially as a beginner). In this article, I discuss how I’d approach learning…

December 14, 2024
The Name That Broke ChatGPT: Who is David Mayer?

The Name That Broke ChatGPT: Who is David Mayer? AI, privacy, human bias, prompting, the future of content, and how to hack a chatbot Continue reading on Towards Data Science » Cassie Kozyrkov Go to original source

December 4, 2024
Google Gemini Is Entering the Advent of Code Challenge

Google Gemini Is Entering the Advent of Code Challenge An open-source project to explore the capabilities and limitations of LLMs on coding challenges Image by author (created with Flux 1.1 Pro) What is this about? If 2024 taught us anything in the realm of Generative AI, then it is that coding is one of the most promising…

December 3, 2024
Optimizing Transformer Models for Variable-Length Input Sequences

Optimizing Transformer Models for Variable-Length Input Sequences How PyTorch NestedTensors, FlashAttention2, and xFormers can Boost Performance and Reduce AI Costs Photo by Tanja Zöllner on Unsplash As generative AI (genAI) models grow in both popularity and scale, so do the computational demands and costs associated with their training and deployment. Optimizing these models is crucial for enhancing…

November 27, 2024