Category: programming

  • How We Reduced LLM Costs by 90% with 5 Lines of Code

    How We Reduced LLM Costs by 90% with 5 Lines of Code When clean code hides inefficiencies: what we learned from fixing a few lines of code and saving 90% in LLM cost. The post How We Reduced LLM Costs by 90% with 5 Lines of Code appeared first on Towards Data Science. Uri Peled Go to…

  • Building a Modern Dashboard with Python and Tkinter

    Building a Modern Dashboard with Python and Tkinter Create polished GUIs and data dashboards with this versatile library The post Building a Modern Dashboard with Python and Tkinter appeared first on Towards Data Science. Thomas Reid Go to original source

  • Mastering NLP with spaCy – Part 3

    Mastering NLP with spaCy – Part 3 Rule-based matching for information extraction The post Mastering NLP with spaCy – Part 3 appeared first on Towards Data Science. Marcello Politi Go to original source

  • How to Create Powerful LLM Applications with Context Engineering

    How to Create Powerful LLM Applications with Context Engineering Improve your LLM by optimizing its context The post How to Create Powerful LLM Applications with Context Engineering appeared first on Towards Data Science. Eivind Kjosbakken Go to original source

  • Reducing Time to Value for Data Science Projects: Part 4

    Reducing Time to Value for Data Science Projects: Part 4 Embrace your inner software developer The post Reducing Time to Value for Data Science Projects: Part 4 appeared first on Towards Data Science. Kristopher McGlinchey Go to original source

  • Agentic AI: On Evaluations

    Agentic AI: On Evaluations Metrics to track for RAG and agents, plus the frameworks that help The post Agentic AI: On Evaluations appeared first on Towards Data Science. Ida Silfverskiöld Go to original source

  • Introducing Server-Sent Events in Python

    Introducing Server-Sent Events in Python A simpler path to coding real-time web applications. The post Introducing Server-Sent Events in Python appeared first on Towards Data Science. Thomas Reid Go to original source

  • Does the Code Work or Not? 

    Does the Code Work or Not? A common misconception about the working state of code in data, AI or software engineering fields. The post Does the Code Work or Not? appeared first on Towards Data Science. Marina Tosic Go to original source

  • How to Benchmark LLMs – ARC AGI 3

    How to Benchmark LLMs – ARC AGI 3 Learn how LLMs are benchmarked, and try out the newly released ARC AGI 3 The post How to Benchmark LLMs – ARC AGI 3 appeared first on Towards Data Science. Eivind Kjosbakken Go to original source

  • The ONLY Data Science Roadmap You Need to Get a Job

    The ONLY Data Science Roadmap You Need to Get a Job Are you looking to become a data scientist and don’t know where to start? In this article, I want to provide you with a straightforward, no-nonsense learning roadmap that you can follow to break into the industry. By the end, you’ll finally have a clear…

  • Automated Testing: A Software Engineering Concept Data Scientists Must Know To Succeed

    Automated Testing: A Software Engineering Concept Data Scientists Must Know To Succeed Why you should read this article Most data scientists whip up a Jupyter Notebook, play around in some cells, and then maintain entire data processing and model training pipelines in the same notebook. The code is tested once when the notebook was first…

  • End-to-End AWS RDS Setup with Bastion Host Using Terraform

    End-to-End AWS RDS Setup with Bastion Host Using Terraform Learn how to automate secure AWS infrastructure using Terraform — including VPC, public/private subnets, a MySQL RDS database, and a Bastion host for secure access. The post End-to-End AWS RDS Setup with Bastion Host Using Terraform appeared first on Towards Data Science. Yagmur Gulec Go to…

  • NumPy API on a GPU?

    NumPy API on a GPU? It’s here already from Nvidia and it’s called cuNumeric. The post NumPy API on a GPU? appeared first on Towards Data Science. Thomas Reid Go to original source

  • Exploratory Data Analysis: Gamma Spectroscopy in Python (Part 2)

    Exploratory Data Analysis: Gamma Spectroscopy in Python (Part 2) Let’s observe the matter on the atomic level The post Exploratory Data Analysis: Gamma Spectroscopy in Python (Part 2) appeared first on Towards Data Science. Dmitrii Eliuseev Go to original source

  • Deploy a Streamlit App to AWS

    Deploy a Streamlit App to AWS Using the Elastic Beanstalk service The post Deploy a Streamlit App to AWS appeared first on Towards Data Science. Thomas Reid Go to original source

  • Run Your Python Code up to 80x Faster Using the Cython Library

    Run Your Python Code up to 80x Faster Using the Cython Library A four-step plan for C language speed where it matters most The post Run Your Python Code up to 80x Faster Using the Cython Library appeared first on Towards Data Science. Thomas Reid Go to original source

  • Software Engineering in the LLM Era

    Software Engineering in the LLM Era On growing new software engineers, even when it’s inefficient The post Software Engineering in the LLM Era appeared first on Towards Data Science. Stephanie Kirmer Go to original source

  • Interactive Data Exploration for Computer Vision Projects with Rerun

    Interactive Data Exploration for Computer Vision Projects with Rerun Analyse dynamic signals in a computer vision pipeline in Python using OpenCV and Rerun The post Interactive Data Exploration for Computer Vision Projects with Rerun appeared first on Towards Data Science. Florian Trautweiler Go to original source

  • STOP Building Useless ML Projects – What Actually Works

    STOP Building Useless ML Projects – What Actually Works How to find machine learning projects that will get you hired. The post STOP Building Useless ML Projects – What Actually Works appeared first on Towards Data Science. Egor Howell Go to original source

  • An Introduction to Remote Model Context Protocol Servers

    An Introduction to Remote Model Context Protocol Servers Writing, testing and using them. The post An Introduction to Remote Model Context Protocol Servers appeared first on Towards Data Science. Thomas Reid Go to original source

  • A Gentle Introduction to Backtracking

    A Gentle Introduction to Backtracking Conceptual overview and hands-on examples The post A Gentle Introduction to Backtracking appeared first on Towards Data Science. Chinmay Kakatkar Go to original source

  • A Developer’s Guide to Building Scalable AI: Workflows vs Agents

    A Developer’s Guide to Building Scalable AI: Workflows vs Agents A practical guide to choosing between AI agents and workflows for production systems, covering the hidden costs, architectural trade-offs, and decision framework that can save you thousands in deployment mistakes. Includes real-world examples and a scoring system to determine which approach fits your specific use…

  • Data Science: From School to Work, Part V

    Data Science: From School to Work, Part V How to profile your Python project The post Data Science: From School to Work, Part V appeared first on Towards Data Science. Vincent Margot Go to original source

  • Use OpenAI Whisper for Automated Transcriptions

    Use OpenAI Whisper for Automated Transcriptions Streamline your computer interactions using OpenAI’s Whisper model The post Use OpenAI Whisper for Automated Transcriptions appeared first on Towards Data Science. Eivind Kjosbakken Go to original source

  • Building A Modern Dashboard with Python and Taipy

    Building A Modern Dashboard with Python and Taipy A guide to building a front-end data application. The post Building A Modern Dashboard with Python and Taipy appeared first on Towards Data Science. Thomas Reid Go to original source

  • Abstract Classes: A Software Engineering Concept Data Scientists Must Know To Succeed

    Abstract Classes: A Software Engineering Concept Data Scientists Must Know To Succeed Simple concepts that differentiate a professional from amateurs. The post Abstract Classes: A Software Engineering Concept Data Scientists Must Know To Succeed appeared first on Towards Data Science. Benjamin Lee Go to original source

  • LLaVA on a Budget: Multimodal AI with Limited Resources

    LLaVA on a Budget: Multimodal AI with Limited Resources Let’s get started with multimodality The post LLaVA on a Budget: Multimodal AI with Limited Resources appeared first on Towards Data Science. Marcello Politi Go to original source

  • User Authorisation in Streamlit With OIDC and Google

    User Authorisation in Streamlit With OIDC and Google Log in to a Streamlit app with a Google email account The post User Authorisation in Streamlit With OIDC and Google appeared first on Towards Data Science. Thomas Reid Go to original source

  • Mobile App Development with Python

    Mobile App Development with Python Build iOS & Android Apps with Kivy The post Mobile App Development with Python appeared first on Towards Data Science. Mauro Di Pietro Go to original source

  • How to Transition From Data Analyst to Data Scientist

    How to Transition From Data Analyst to Data Scientist Playbook on how data analysts can become data scientists The post How to Transition From Data Analyst to Data Scientist appeared first on Towards Data Science. Egor Howell Go to original source

  • How I Automated My Machine Learning Workflow with Just 10 Lines of Python

    How I Automated My Machine Learning Workflow with Just 10 Lines of Python Use LazyPredict and PyCaret to skip the grunt work and jump straight to performance. The post How I Automated My Machine Learning Workflow with Just 10 Lines of Python appeared first on Towards Data Science. Himanshu Sharma Go to original source

  • The Journey from Jupyter to Programmer: A Quick-Start Guide

    The Journey from Jupyter to Programmer: A Quick-Start Guide Explore the real benefits of ditching the notebook The post The Journey from Jupyter to Programmer: A Quick-Start Guide appeared first on Towards Data Science. Lucy Dickinson Go to original source

  • Building a Modern Dashboard with Python and Gradio

    Building a Modern Dashboard with Python and Gradio Data insights made simple The post Building a Modern Dashboard with Python and Gradio appeared first on Towards Data Science. Thomas Reid Go to original source

  • Reducing Time to Value for Data Science Projects: Part 2

    Reducing Time to Value for Data Science Projects: Part 2 Leveraging automation and parallelism to scale out experiments The post Reducing Time to Value for Data Science Projects: Part 2 appeared first on Towards Data Science. Kristopher McGlinchey Go to original source

  • Inside Google’s Agent2Agent (A2A) Protocol: Teaching AI Agents to Talk to Each Other

    Inside Google’s Agent2Agent (A2A) Protocol: Teaching AI Agents to Talk to Each Other Exploring how Google’s A2A enables plug-and-play communication between LLM-powered agents across frameworks The post Inside Google’s Agent2Agent (A2A) Protocol: Teaching AI Agents to Talk to Each Other appeared first on Towards Data Science. Hailey Quach Go to original source

  • JAX: Is This Google’s NumPy killer?

    JAX: Is This Google’s NumPy killer? Auto differentiation and JIT compilation make a compelling case. The post JAX: Is This Google’s NumPy killer? appeared first on Towards Data Science. Thomas Reid Go to original source

  • The Best AI Books & Courses for Getting a Job

    The Best AI Books & Courses for Getting a Job A comprehensive guide to the books and courses that helped me learn AI The post The Best AI Books & Courses for Getting a Job appeared first on Towards Data Science. Egor Howell Go to original source

  • Estimating Product-Level Price Elasticities Using Hierarchical Bayesian

    Estimating Product-Level Price Elasticities Using Hierarchical Bayesian Using one model to personalize ML results The post Estimating Product-Level Price Elasticities Using Hierarchical Bayesian appeared first on Towards Data Science. Derek Tran Go to original source

  • Do More with NumPy Array Type Hints: Annotate & Validate Shape & Dtype

    Do More with NumPy Array Type Hints: Annotate & Validate Shape & Dtype Improve static analysis and run-time validation with full generic specification The post Do More with NumPy Array Type Hints: Annotate & Validate Shape & Dtype appeared first on Towards Data Science. Christopher Ariza Go to original source

  • Inheritance: A Software Engineering Concept Data Scientists Must Know To Succeed

    Inheritance: A Software Engineering Concept Data Scientists Must Know To Succeed Coding concepts that distinguish an amateur from a professional data scientist The post Inheritance: A Software Engineering Concept Data Scientists Must Know To Succeed appeared first on Towards Data Science. Benjamin Lee Go to original source

  • Use PyTorch to Easily Access Your GPU

    Use PyTorch to Easily Access Your GPU Let’s say you are lucky enough to have access to a system with an Nvidia Graphics Processing Unit (GPU). Did you know there is an absurdly easy method to use your GPU’s capabilities using a Python library intended and predominantly used for machine learning (ML) applications? Don’t worry…

  • I Teach Data Viz with a Bag of Rocks

    I Teach Data Viz with a Bag of Rocks Last Thursday, my co-instructor and I showed up to the Data Visualization course we teach at the University of Washington with a bag of rocks. The bag consisted of a fairly diverse collection that I myself put together across a set of treks in various regions…

  • Google’s AlphaEvolve Is Evolving New Algorithms — And It Could Be a Game Changer

    Google’s AlphaEvolve Is Evolving New Algorithms — And It Could Be a Game Changer Large Language Models have undeniably revolutionized how many of us approach coding, but they’re often more like a super-powered…

  • Get Started with Rust: Installation and Your First CLI Tool – A Beginner’s Guide

    Get Started with Rust: Installation and Your First CLI Tool – A Beginner’s Guide Rust has become a popular programming language in recent years as it combines security and high performance and can be used in many applications. It combines the positive characteristics of C and C++ with the modern syntax and simplicity of other…

  • Running Python Programs in Your Browser

    Running Python Programs in Your Browser In recent years, WebAssembly (often abbreviated as WASM) has emerged as an interesting technology that extends web browsers’ capabilities far beyond the traditional realms of HTML, CSS, and JavaScript.  As a Python developer, one particularly exciting application is the ability to run Python code directly in the browser. In this…

  • Log Link vs Log Transformation in R — The Difference that Misleads Your Entire Data Analysis

    Log Link vs Log Transformation in R — The Difference that Misleads Your Entire Data Analysis Although normal distributions are the most commonly used, a lot of real-world data unfortunately is not normal. When faced with extremely skewed data, it’s tempting for us to utilize log transformations to normalize the distribution and stabilize the variance. I…

  • Why I stopped Using Cursor and Reverted to VSCode

    Why I stopped Using Cursor and Reverted to VSCode Introduction In December 2024, I wrote an article sharing my experience using VSCode (GitHub Copilot) and Cursor (Claude 3.5 Sonnet) from the perspective of a Data Scientist. Should you switch from VSCode to Cursor? I concluded the article by stating: After using Cursor for the past two…

  • Agentic AI 101: Starting Your Journey Building AI Agents

    Agentic AI 101: Starting Your Journey Building AI Agents Introduction The Artificial Intelligence industry is moving fast. It is impressive and many times overwhelming. I have been studying, learning, and building my foundations in this area of Data Science because I believe that the future of Data Science is strongly correlated with the development of…

  • Rust for Python Developers: Why You Should Take a Look at the Rust Programming Language

    Rust for Python Developers: Why You Should Take a Look at the Rust Programming Language The programming language Rust is now appearing in many feeds as it offers a performant and secure way to write programs and places great emphasis on performance. If you come from the Python world of Pandas, Jupyter or Flask, you might think that…

  • How Would I Learn to Code with ChatGPT if I Had to Start Again

    How Would I Learn to Code with ChatGPT if I Had to Start Again Coding has been a part of my life since I was 10. From modifying HTML & CSS for my Friendster profile during the simple internet days to exploring SQL injections for the thrill, building a three-legged robot for fun, and lately…

  • How to Level Up Your Technical Skills in This AI Era

    How to Level Up Your Technical Skills in This AI Era AI-assisted coding is here to stay. Tools like Cursor, V0, and Lovable have dramatically lowered the barrier to entry — building dashboards, pipelines, or entire apps can now be done in a fraction of the time. I use these tools daily, and they’ve definitely made me…

  • NumExpr: The “Faster than Numpy” Library Most Data Scientists Have Never Used

    NumExpr: The “Faster than Numpy” Library Most Data Scientists Have Never Used Browsing GitHub the other day, I came across a library I’d never heard of before. It was called NumExpr. I was immediately interested because of some claims made about the library. In particular, it stated that for some complex numerical calculations, it was…

  • Exporting MLflow Experiments from Restricted HPC Systems

    Exporting MLflow Experiments from Restricted HPC Systems Many High-Performance Computing (HPC) environments, especially in research and educational institutions, restrict communications to outbound TCP connections. Running a simple command-line ping or curl with the MLflow tracking URL on the HPC bash shell to check packet transfer can be successful. However, communication fails and times out while…

  • Data Science: From School to Work, Part IV

    Data Science: From School to Work, Part IV Introduction Let’s start with a simple example that will appeal to most of us. If you want to check if the blinkers of your car are working properly, you sit in the car, turn on the ignition and test a turn signal to see if the front…

  • Beyond the Code: Unconventional Lessons from Empathetic Interviewing

    Beyond the Code: Unconventional Lessons from Empathetic Interviewing Recently, I’ve been interviewing Computer Science students applying for data science and engineering internships with a 4-day turnaround from CV vetting to final decisions. With a small local office of 10 and no in-house HR, hiring managers handle the entire process. This article reflects on the lessons…

  • Deb8flow: Orchestrating Autonomous AI Debates with LangGraph and GPT-4o

    Deb8flow: Orchestrating Autonomous AI Debates with LangGraph and GPT-4o Introduction I’ve always been fascinated by debates—the strategic framing, the sharp retorts, and the carefully timed comebacks. Debates aren’t just entertaining; they’re structured battles of ideas, driven by logic and evidence. Recently, I started wondering: could we replicate that dynamic using AI agents—having them debate each…

  • How to Optimize your Python Program for Slowness

    How to Optimize your Python Program for Slowness Also available: A Rust version of this article. Everyone talks about making Python programs faster [1, 2, 3], but what if we pursue the opposite goal? Let’s explore how to make them slower — absurdly slower. Along the way, we’ll examine the nature of computation, the role of memory,…

  • How I Would Learn To Code (If I Could Start Over)

    How I Would Learn To Code (If I Could Start Over) According to various sources, the average salary for Coding jobs is ~£47.5k in the UK, which is ~35% higher than the median salary of about £35k. So, coding is a very valuable skill that will earn you more money, not to mention it’s really fun.…

  • PyScript vs. JavaScript: A Battle of Web Titans

    PyScript vs. JavaScript: A Battle of Web Titans We’re delving into frontend web development today, and you might be thinking: what does this have to do with Data Science? Why is Towards Data Science publishing a post related to web dev? Well, because data science isn’t only about building powerful models, engaging in advanced analytics,…

  • 4 Levels of GitHub Actions: A Guide to Data Workflow Automation

    4 Levels of GitHub Actions: A Guide to Data Workflow Automation Automation has become an indispensable element for ensuring operational efficiency and reliability in modern software development. GitHub Actions, an integrated Continuous Integration and Continuous Deployment (CI/CD) tool within GitHub, has established its position in the software development industry by providing a comprehensive platform for…

  • Agentic AI: Single vs Multi-Agent Systems

    Agentic AI: Single vs Multi-Agent Systems Over the last few years, we’ve seen a shift from building rigid programming systems to natural language-driven workflows, all made possible by more advanced large language models. One of the interesting areas in these agentic AI systems is the difference between building a single versus multi-agent workflow, or perhaps the…

  • Data Science: From School to Work, Part III

    Data Science: From School to Work, Part III Introduction Writing code is about solving problems, but not every problem is predictable. In the real world, your software will encounter unexpected situations: missing files, invalid user inputs, network timeouts, or even hardware failures. This is why handling errors isn’t just a nice-to-have; it’s a critical part…

  • A Clear Intro to MCP (Model Context Protocol) with Code Examples

    A Clear Intro to MCP (Model Context Protocol) with Code Examples As the race to move AI agents from prototype to production heats up, the need for a standardized way for agents to call tools across different providers is pressing. This transition to a standardized approach to agent tool calling is similar to what we…

  • What Germany Currently Is Up To, Debt-Wise

    What Germany Currently Is Up To, Debt-Wise €1,600 per second. That’s how much interest Germany has to pay for its debts. In total, the German state has debts ranging into the trillions — more than a thousand billion Euros. And the government is planning to make even more, up to one trillion additional debt is…

  • Nine Pico PIO Wats with Rust (Part 2)

    Nine Pico PIO Wats with Rust (Part 2) This is Part 2 of an exploration into the unexpected quirks of programming the Raspberry Pi Pico PIO with Micropython. If you missed Part 1, we uncovered four Wats that challenge assumptions about register count, instruction slots, the behavior of pull noblock, and smart yet cheap hardware.…

  • How to Develop Complex DAX Expressions

    How to Develop Complex DAX Expressions At some point or another, any Power BI developer must write complex DAX expressions to analyze data. But nobody tells you how to do it. What’s the process for doing it? What is the best way to do it, and how supportive can a development process be? These are the questions…

  • Comprehensive Guide to Dependency Management in Python

    Comprehensive Guide to Dependency Management in Python Introduction When learning Python, many beginners focus solely on the language and its libraries while completely ignoring virtual environments. As a result, managing Python projects can become a mess: dependencies installed for different projects may have conflicting versions, leading to compatibility issues. Even when I studied Python, nobody…

  • Practical SQL Puzzles That Will Level Up Your Skill

    Practical SQL Puzzles That Will Level Up Your Skill There are some SQL patterns that, once you know them, you start seeing everywhere. The solutions to the puzzles that I will show you today are actually very simple SQL queries, but understanding the concept behind them will surely unlock new solutions to the queries…

  • Data Science: From School to Work, Part II

    Data Science: From School to Work, Part II In my previous article, I highlighted the importance of effective project management in Python development. Now, let’s shift our focus to the code itself and explore how to write clean, maintainable code — an essential practice in professional and collaborative environments.  Readability & Maintainability: Well-structured code is easier to…

  • Nine Rules for SIMD Acceleration of Your Rust Code (Part 1)

    Nine Rules for SIMD Acceleration of Your Rust Code (Part 1) Thanks to Ben Lichtman (B3NNY) at the Seattle Rust Meetup for pointing me in the right direction on SIMD. SIMD (Single Instruction, Multiple Data) operations have been a feature of Intel/AMD and ARM CPUs since the early 2000s. These operations enable you to, for example,…

  • Is Python Set to Surpass Its Competitors?

    Is Python Set to Surpass Its Competitors? A soufflé is a baked egg dish that originated in France in the 18th century. The process of making an elegant and delicious French soufflé is complex, and in the past, it was typically only prepared by professional French pastry chefs. However, with pre-made soufflé mixes now widely…

  • Efficient Data Handling in Python with Arrow

    Efficient Data Handling in Python with Arrow 1. Introduction We’re all used to working with CSVs, JSON files… With the traditional libraries and for large datasets, these can be extremely slow to read, write and operate on, leading to performance bottlenecks (been there). It’s precisely with big amounts of data that being efficient handling the…

  • Breaking the Bottleneck: GPU-Optimised Video Processing for Deep Learning

    Breaking the Bottleneck: GPU-Optimised Video Processing for Deep Learning Deep Learning (DL) applications often require processing video data for tasks such as object detection, classification, and segmentation. However, conventional video processing pipelines are typically inefficient for deep learning inference, leading to performance bottlenecks. In this post, we will leverage PyTorch and FFmpeg with NVIDIA hardware acceleration…

  • Reinforcement Learning with PDEs

    Reinforcement Learning with PDEs Previously we discussed applying reinforcement learning to Ordinary Differential Equations (ODEs) by integrating ODEs within gymnasium. ODEs are a powerful tool that can describe a wide range of systems but are limited to a single variable. Partial Differential Equations (PDEs) are differential equations involving derivatives of multiple variables that can cover…

  • How to Use an LLM-Powered Boilerplate for Building Your Own Node.js API

    How to Use an LLM-Powered Boilerplate for Building Your Own Node.js API For a long time, one of the common ways to start new Node.js projects was using boilerplate templates. These templates help developers reuse familiar code structures and implement standard features, such as access to cloud file storage. With the latest developments in LLM,…

  • Don’t Let Conda Eat Your Hard Drive

    Don’t Let Conda Eat Your Hard Drive If you’re an Anaconda user, you know that conda environments help you manage package dependencies, avoid compatibility conflicts, and share your projects with others. Unfortunately, they can also take over your computer’s hard drive. I write lots of computer tutorials and to keep them organized, each has a dedicated folder…

  • AI Agents from Zero to Hero – Part 1

    AI Agents from Zero to Hero – Part 1 Intro AI Agents are autonomous programs that perform tasks, make decisions, and communicate with others. Normally, they use a set of tools to help complete tasks. In GenAI applications, these Agents process sequential reasoning and can use external tools (like web searches or database queries) when…

  • Why Data Scientists Should Care about Containers — and Stand Out with This Knowledge

    Why Data Scientists Should Care about Containers — and Stand Out with This Knowledge “I train models, analyze data and create dashboards — why should I care about Containers?” Many people who are new to the world of data science ask themselves this question. But imagine you have trained a model that runs perfectly on…

  • Advanced Time Intelligence in DAX with Performance in Mind

    Advanced Time Intelligence in DAX with Performance in Mind We all know the usual Time Intelligence functions based on years, quarters, months, and days. But sometimes, we need to perform more exotic time intelligence calculations. When we do, we should not forget to consider performance while programming the measures. Introduction There are many DAX functions in Power BI…

  • Multimodal Search Engine Agents Powered by BLIP-2 and Gemini

    Multimodal Search Engine Agents Powered by BLIP-2 and Gemini This post was co-authored with Rafael Guedes. Introduction Traditional models can only process a single type of data, such as text, images, or tabular data. Multimodality is a trending concept in the AI research community, referring to a model’s ability to learn from multiple types of…

  • Zero Human Code: What I Learned from Forcing AI to Build (and Fix) Its Own Code for 27 Straight Days

    Zero Human Code: What I Learned from Forcing AI to Build (and Fix) Its Own Code for 27 Straight Days 27 days, 1,700+ commits, 99.9% AI-generated code The narrative around AI development tools has become increasingly detached from reality. YouTube is filled with claims of building complex applications in hours using AI assistants. The…

  • Learning How to Play Atari Games Through Deep Neural Networks

    Learning How to Play Atari Games Through Deep Neural Networks In July 1959, Arthur Samuel developed one of the first agents to play the game of checkers. What constitutes an agent that plays checkers can be best described in Samuel’s own words, “…a computer [that] can be programmed so that it will learn to play…

  • Publish Interactive Data Visualizations for Free with Python and Marimo

    Publish Interactive Data Visualizations for Free with Python and Marimo Working in Data Science, it can be hard to share insights from complex datasets using only static figures. All the facets that describe the shape and meaning of interesting data are not always captured in a handful of pre-generated figures. While we have powerful technologies…

  • Manage Environment Variables with Pydantic

    Manage Environment Variables with Pydantic Introduction Developers work on applications that are supposed to be deployed on some server in order to allow anyone to use those. Typically in the machine where these apps live, developers set up environment variables that allow the app to run. These variables can be API keys of external services,…

  • 4-Dimensional Data Visualization: Time in Bubble Charts

    4-Dimensional Data Visualization: Time in Bubble Charts Bubble Charts elegantly compress large amounts of information into a single visualization, with bubble size adding a third dimension. However, comparing “before” and “after” states is often crucial. To address this, we propose adding a transition between these states, creating an intuitive user experience. Since we couldn’t find…

  • Introduction to Minimum Cost Flow Optimization in Python

    Introduction to Minimum Cost Flow Optimization in Python Minimum cost flow optimization minimizes the cost of moving flow through a network of nodes and edges. Nodes include sources (supply) and sinks (demand), with different costs and capacity limits. The aim is to find the least costly way to move volume from sources to sinks while…

  • How to Find Seasonality Patterns in Time Series

    Using Fourier Transforms to detect seasonal components. In my professional life as a data scientist, I have encountered time series multiple times. Most of my knowledge comes from my academic experience, specifically my courses in Econometrics (I have a degree in Economics), where we studied statistical properties…

  • Are Data Scientists at Risk in 2025?

    The impact of AI on data science jobs. By Natassha Selvaraj.

  • 2-Bit VPTQ: 6.5x Smaller LLMs While Preserving 95% Accuracy

    Very accurate 2-bit quantization for running 70B LLMs on a 24 GB GPU. By Benjamin Marie.

  • Optimising Budgets With Marketing Mix Models In Python

    Part 3 of a hands-on guide to help you master MMM in pymc. What is this series about? Welcome to part 3 of my series on marketing mix modelling (MMM), a hands-on guide to help you master MMM. Throughout this series, we’ll cover key…

  • A Derivation and Application of Restricted Boltzmann Machines (2024 Nobel Prize)

    Investigating Geoffrey Hinton’s Nobel Prize-winning work and building it from scratch using PyTorch. One recipient of the 2024 Nobel Prize in Physics was Geoffrey Hinton for his contributions in the field of AI and machine learning. A lot of people know he worked on neural…

  • Satellite Image Classification with Deep Learning — Complete Project

    A Comprehensive Guide Using PyTorch and CNNs. By Leo Anello.

  • Water Cooler Small Talk: Benford’s Law

    A look into the strange first digit distribution of naturally occurring datasets. By Maria Mouschoutzi, PhD.

  • llama.cpp: Writing A Simple C++ Inference Program for GGUF LLM Models

    Exploring llama.cpp internals and a basic chat program flow. llama.cpp has revolutionized the space of LLM inference through wide adoption and simplicity. It has enabled enterprises and individual developers to deploy LLMs on devices ranging from SBCs…

  • 3 Powerful Examples of the Python Re Library

    Explore the power of regex and save time in data analysis. By Suraj Gurav.

  • Advanced SQL Techniques for Unstructured Data Handling

    Everything you need to know to get started with text mining. By Jiayan Yin.

  • How to Securely Connect Microsoft Fabric to Azure Databricks SQL API

    Integration architecture focusing on security and access control. Microsoft Fabric and Azure Databricks are both powerhouses in the data analytics field. These platforms can be used end-to-end in a medallion architecture, from data ingestion to creating data…

  • Predicting a Ball Trajectory

    Polynomial Fit in Python with NumPy. By Florian Trautweiler.

  • How to Tell Among Two Regression Models with Statistical Significance

    Diving into the F-test for nested models with algorithms, examples and code. By LucianoSphere (Luciano Abriata, PhD).

  • Master Bots Before Starting with AI Agents: Simple Steps to Create a Mastodon Bot with Python

    I recently published a post on Mastodon that was shared by six other accounts within two minutes. Curious, I visited the profiles and… By Sarah Lea.