Category: aimldsaimlds

How Agent Handoffs Work in Multi-Agent Systems

How Agent Handoffs Work in Multi-Agent Systems Understanding how LLM agents transfer control to each other in a multi-agent system with LangGraph The post How Agent Handoffs Work in Multi-Agent Systems appeared first on Towards Data Science. Kenneth Leung Go to original source

December 12, 2025
Online Inference of Constrained Optimization: Primal-Dual Optimality and Sequential Quadratic Programming

Online Inference of Constrained Optimization: Primal-Dual Optimality and Sequential Quadratic Programming arXiv:2512.08948v1 Announce Type: new Abstract: We study online statistical inference for the solutions of stochastic optimization problems with equality and inequality constraints. Such problems are prevalent in statistics and machine learning, encompassing constrained $M$-estimation, physics-informed models, safe reinforcement learning, and algorithmic fairness. We develop…

December 11, 2025
WTNN: Weibull-Tailored Neural Networks for survival analysis

WTNN: Weibull-Tailored Neural Networks for survival analysis arXiv:2512.09163v1 Announce Type: new Abstract: The Weibull distribution is a commonly adopted choice for modeling the survival of systems subject to maintenance over time. When only proxy indicators and censored observations are available, it becomes necessary to express the distribution’s parameters as functions of time-dependent covariates. Deep neural…

December 11, 2025
Robust and Sparse Estimation of Unbounded Density Ratio under Heavy Contamination

Robust and Sparse Estimation of Unbounded Density Ratio under Heavy Contamination arXiv:2512.09266v1 Announce Type: new Abstract: We examine the non-asymptotic properties of robust density ratio estimation (DRE) in contaminated settings. Weighted DRE is the most promising among existing methods, exhibiting doubly strong robustness from an asymptotic perspective. This study demonstrates that Weighted DRE achieves sparse…

December 11, 2025
Impact of Positional Encoding: Clean and Adversarial Rademacher Complexity for Transformers under In-Context Regression

Impact of Positional Encoding: Clean and Adversarial Rademacher Complexity for Transformers under In-Context Regression arXiv:2512.09275v1 Announce Type: new Abstract: Positional encoding (PE) is a core architectural component of Transformers, yet its impact on the Transformer’s generalization and robustness remains unclear. In this work, we provide the first generalization analysis for a single-layer Transformer under in-context…

December 11, 2025
Estimation of Stochastic Optimal Transport Maps

Estimation of Stochastic Optimal Transport Maps arXiv:2512.09499v1 Announce Type: new Abstract: The optimal transport (OT) map is a geometry-driven transformation between high-dimensional probability distributions which underpins a wide range of tasks in statistics, applied probability, and machine learning. However, existing statistical theory for OT map estimation is quite restricted, hinging on Brenier’s theorem (quadratic cost,…

December 11, 2025
The Machine Learning “Advent Calendar” Day 10: DBSCAN in Excel

The Machine Learning “Advent Calendar” Day 10: DBSCAN in Excel DBSCAN shows how far we can go with a very simple idea: count how many neighbors live close to each point. It finds clusters and marks anomalies without any probabilistic model, and it works beautifully in Excel. But because it relies on one fixed radius,…

December 11, 2025
How to Maximize Agentic Memory for Continual Learning

How to Maximize Agentic Memory for Continual Learning Learn how to become an effective engineer with continual learning LLMs The post How to Maximize Agentic Memory for Continual Learning appeared first on Towards Data Science. Eivind Kjosbakken Go to original source

December 11, 2025
Don’t Build an ML Portfolio Without These Projects

Don’t Build an ML Portfolio Without These Projects What recruiters are looking for in machine learning portfolios The post Don’t Build an ML Portfolio Without These Projects appeared first on Towards Data Science. Egor Howell Go to original source

December 11, 2025
Optimizing PyTorch Model Inference on AWS Graviton

Optimizing PyTorch Model Inference on AWS Graviton Tips for accelerating AI/ML on CPU — Part 2 The post Optimizing PyTorch Model Inference on AWS Graviton appeared first on Towards Data Science. Chaim Rand Go to original source

December 11, 2025
Functional Random Forest with Adaptive Cost-Sensitive Splitting for Imbalanced Functional Data Classification

Functional Random Forest with Adaptive Cost-Sensitive Splitting for Imbalanced Functional Data Classification arXiv:2512.07888v1 Announce Type: new Abstract: Classification of functional data where observations are curves or trajectories poses unique challenges, particularly under severe class imbalance. Traditional Random Forest algorithms, while robust for tabular data, often fail to capture the intrinsic structure of functional observations and…

December 10, 2025
Provable Diffusion Posterior Sampling for Bayesian Inversion

Provable Diffusion Posterior Sampling for Bayesian Inversion arXiv:2512.08022v1 Announce Type: new Abstract: This paper proposes a novel diffusion-based posterior sampling method within a plug-and-play (PnP) framework. Our approach constructs a probability transport from an easy-to-sample terminal distribution to the target posterior, using a warm-start strategy to initialize the particles. To approximate the posterior score, we…

December 10, 2025
Worst-case generation via minimax optimization in Wasserstein space

Worst-case generation via minimax optimization in Wasserstein space arXiv:2512.08176v1 Announce Type: new Abstract: Worst-case generation plays a critical role in evaluating robustness and stress-testing systems under distribution shifts, in applications ranging from machine learning models to power grids and medical prediction systems. We develop a generative modeling framework for worst-case generation for a pre-specified risk,…

December 10, 2025
Heuristics for Combinatorial Optimization via Value-based Reinforcement Learning: A Unified Framework and Analysis

Heuristics for Combinatorial Optimization via Value-based Reinforcement Learning: A Unified Framework and Analysis arXiv:2512.08601v1 Announce Type: new Abstract: Since the 1990s, considerable empirical work has been carried out to train statistical models, such as neural networks (NNs), as learned heuristics for combinatorial optimization (CO) problems. When successful, such an approach eliminates the need for experts…

December 10, 2025
Bayesian Optimization for Function-Valued Responses under Min-Max Criteria

Bayesian Optimization for Function-Valued Responses under Min-Max Criteria arXiv:2512.07868v1 Announce Type: cross Abstract: Bayesian optimization is widely used for optimizing expensive black box functions, but most existing approaches focus on scalar responses. In many scientific and engineering settings the response is functional, varying smoothly over an index such as time or wavelength, which makes classical…

December 10, 2025
The Machine Learning “Advent Calendar” Day 9: LOF in Excel

The Machine Learning “Advent Calendar” Day 9: LOF in Excel In this article, we explore LOF through three simple steps: distances and neighbors, reachability distances, and the final LOF score. Using tiny datasets, we see how two anomalies can look obvious to us but completely different to different algorithms. This reveals the key idea of…

December 10, 2025
Personal, Agentic Assistants: A Practical Blueprint for a Secure, Multi-User, Self-Hosted Chatbot

Personal, Agentic Assistants: A Practical Blueprint for a Secure, Multi-User, Self-Hosted Chatbot Build a self-hosted, end-to-end platform that gives each user a personal, agentic chatbot that can autonomously vector-search through files that the user explicitly allows it to access. The post Personal, Agentic Assistants: A Practical Blueprint for a Secure, Multi-User, Self-Hosted Chatbot appeared first…

December 10, 2025
How to Develop AI-Powered Solutions, Accelerated by AI

How to Develop AI-Powered Solutions, Accelerated by AI From idea to impact : using AI as your accelerating copilot The post How to Develop AI-Powered Solutions, Accelerated by AI appeared first on Towards Data Science. Anna Via Go to original source

December 10, 2025
GraphRAG in Practice: How to Build Cost-Efficient, High-Recall Retrieval Systems

GraphRAG in Practice: How to Build Cost-Efficient, High-Recall Retrieval Systems Smarter retrieval strategies that outperform dense graphs — with hybrid pipelines and lower cost The post GraphRAG in Practice: How to Build Cost-Efficient, High-Recall Retrieval Systems appeared first on Towards Data Science. Partha Sarkar Go to original source

December 10, 2025
A Realistic Roadmap to Start an AI Career in 2026

A Realistic Roadmap to Start an AI Career in 2026 How to learn AI in 2026 through real, usable projects The post A Realistic Roadmap to Start an AI Career in 2026 appeared first on Towards Data Science. Sabrine Bendimerad Go to original source

December 10, 2025
Contextual Strongly Convex Simulation Optimization: Optimize then Predict with Inexact Solutions

Contextual Strongly Convex Simulation Optimization: Optimize then Predict with Inexact Solutions arXiv:2512.06270v1 Announce Type: new Abstract: In this work, we study contextual strongly convex simulation optimization and adopt an “optimize then predict” (OTP) approach for real-time decision making. In the offline stage, simulation optimization is conducted across a set of covariates to approximate the optimal-solution…

December 9, 2025
Modeling Spatio-temporal Extremes via Conditional Variational Autoencoders

Modeling Spatio-temporal Extremes via Conditional Variational Autoencoders arXiv:2512.06348v1 Announce Type: new Abstract: Extreme weather events are widely studied in fields such as agriculture, ecology, and meteorology. The spatio-temporal co-occurrence of extreme events can strengthen or weaken under changing climate conditions. In this paper, we propose a novel approach to model spatio-temporal extremes by integrating climate…

December 9, 2025
Canonical Tail Dependence for Soft Extremal Clustering of Multichannel Brain Signals

Canonical Tail Dependence for Soft Extremal Clustering of Multichannel Brain Signals arXiv:2512.06435v1 Announce Type: new Abstract: We develop a novel characterization of extremal dependence between two cortical regions of the brain when its signals display extremely large amplitudes. We show that connectivity in the tails of the distribution reveals unique features of extreme events (e.g.,…

December 9, 2025
ADAM Optimization with Adaptive Batch Selection

ADAM Optimization with Adaptive Batch Selection arXiv:2512.06795v1 Announce Type: new Abstract: Adam is a widely used optimizer in neural network training due to its adaptive learning rate. However, because different data samples influence model updates to varying degrees, treating them equally can lead to inefficient convergence. To address this, a prior work proposed adapting the…

December 9, 2025
Latent Nonlinear Denoising Score Matching for Enhanced Learning of Structured Distributions

Latent Nonlinear Denoising Score Matching for Enhanced Learning of Structured Distributions arXiv:2512.06615v1 Announce Type: new Abstract: We present latent nonlinear denoising score matching (LNDSM), a novel training objective for score-based generative models that integrates nonlinear forward dynamics with the VAE-based latent SGM framework. This combination is achieved by reformulating the cross-entropy term using the approximate…

December 9, 2025
Bridging the Silence: How LEO Satellites and Edge AI Will Democratize Connectivity

Bridging the Silence: How LEO Satellites and Edge AI Will Democratize Connectivity Why on-device intelligence and low-orbit constellations are the only viable path to universal accessibility The post Bridging the Silence: How LEO Satellites and Edge AI Will Democratize Connectivity appeared first on Towards Data Science. Aakash Goswami Go to original source

December 9, 2025
The Machine Learning “Advent Calendar” Day 8: Isolation Forest in Excel

The Machine Learning “Advent Calendar” Day 8: Isolation Forest in Excel Isolation Forest may look technical, but its idea is simple: isolate points using random splits. If a point is isolated quickly, it is an anomaly; if it takes many splits, it is normal. Using the tiny dataset 1, 2, 3, 9, we can see…

December 9, 2025
The AI Bubble Will Pop — And Why That Doesn’t Matter

The AI Bubble Will Pop — And Why That Doesn’t Matter How history’s biggest tech bubble explains where AI is headed next The post The AI Bubble Will Pop — And Why That Doesn’t Matter appeared first on Towards Data Science. Michael Malin Go to original source

December 9, 2025
How to Create an ML-Focused Newsletter

How to Create an ML-Focused Newsletter Learn how to make a newsletter with AI tools The post How to Create an ML-Focused Newsletter appeared first on Towards Data Science. Eivind Kjosbakken Go to original source

December 9, 2025
Optimizing PyTorch Model Inference on CPU

Optimizing PyTorch Model Inference on CPU Flyin’ Like a Lion on Intel Xeon The post Optimizing PyTorch Model Inference on CPU appeared first on Towards Data Science. Chaim Rand Go to original source

December 9, 2025
How to Tame Your LLM: Semantic Collapse in Continuous Systems

How to Tame Your LLM: Semantic Collapse in Continuous Systems arXiv:2512.05162v1 Announce Type: new Abstract: We develop a general theory of semantic dynamics for large language models by formalizing them as Continuous State Machines (CSMs): smooth dynamical systems whose latent manifolds evolve under probabilistic transition operators. The associated transfer operator $P: L^2(M,mu) to L^2(M,mu)$ encodes…

December 8, 2025
One-Step Diffusion Samplers via Self-Distillation and Deterministic Flow

One-Step Diffusion Samplers via Self-Distillation and Deterministic Flow arXiv:2512.05251v1 Announce Type: new Abstract: Sampling from unnormalized target distributions is a fundamental yet challenging task in machine learning and statistics. Existing sampling algorithms typically require many iterative steps to produce high-quality samples, leading to high computational costs. We introduce one-step diffusion samplers which learn a step-conditioned…

December 8, 2025
Symmetric Linear Dynamical Systems are Learnable from Few Observations

Symmetric Linear Dynamical Systems are Learnable from Few Observations arXiv:2512.05337v1 Announce Type: new Abstract: We consider the problem of learning the parameters of a $N$-dimensional stochastic linear dynamics under both full and partial observations from a single trajectory of time $T$. We introduce and analyze a new estimator that achieves a small maximum element-wise error…

December 8, 2025
Design-marginal calibration of Gaussian process predictive distributions: Bayesian and conformal approaches

Design-marginal calibration of Gaussian process predictive distributions: Bayesian and conformal approaches arXiv:2512.05611v1 Announce Type: new Abstract: We study the calibration of Gaussian process (GP) predictive distributions in the interpolation setting from a design-marginal perspective. Conditioning on the data and averaging over a design measure mu, we formalize mu-coverage for central intervals and mu-probabilistic calibration through…

December 8, 2025
Do We Really Even Need Data? A Modern Look at Drawing Inference with Predicted Data

Do We Really Even Need Data? A Modern Look at Drawing Inference with Predicted Data arXiv:2512.05456v1 Announce Type: new Abstract: As artificial intelligence and machine learning tools become more accessible, and scientists face new obstacles to data collection (e.g., rising costs, declining survey response rates), researchers increasingly use predictions from pre-trained algorithms as substitutes for…

December 8, 2025
Inferential Statistics on long-form census data from stats can

Inferential Statistics on long-form census data from stats can I am using the following tool https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=9810065601 to query Statistics Canada and get data from the long-form census. However, since it’s a census of 25% of the population, there is a need for inferential statistics. That being said in order to do inferential statistics on the…

December 8, 2025
Weekly Entering & Transitioning – Thread 08 Dec, 2025 – 15 Dec, 2025

Weekly Entering & Transitioning – Thread 08 Dec, 2025 – 15 Dec, 2025 Welcome to this week’s entering & transitioning thread! This thread is for any questions about getting started, studying, or transitioning into the data science field. Topics include: Learning resources (e.g. books, tutorials, videos) Traditional education (e.g. schools, degrees, electives) Alternative education (e.g.…

December 8, 2025
Lost and Feel Like a Fraud

Lost and Feel Like a Fraud This might not be the appropriate place to say this, but I honestly feel like the biggest fraud ever. If I could go back, I don’t think I would have went into data science. I did my undergraduate in biology, and then did a masters in data science. I’ve…

December 8, 2025
MLE coding rounds (UK)

MLE coding rounds (UK) I’m a data scientist transitioning to ML-AI Engineer roles. What kind of coding questions-rounds should I expect? I’ve heard that it’s a mixed bag, can be leetcode, can be Pytorch,tf for all ML related I’ve also heard about building ML concepts-algos from scratch using numpy etc. Or even an ML pipeline…

December 8, 2025
DS audiobook recommendations?

DS audiobook recommendations? I have a very, very long road trip ahead of me. I would like recommendations for a DS audiobook that can help make the ride easier. submitted by /u/Cross_examination [link] [comments] /u/Cross_examination Go to original source

December 8, 2025
How to Climb the Hidden Career Ladder of Data Science

How to Climb the Hidden Career Ladder of Data Science The behaviors that get you promoted The post How to Climb the Hidden Career Ladder of Data Science appeared first on Towards Data Science. Greg Rafferty Go to original source

December 8, 2025
The Machine Learning “Advent Calendar” Day 7: Decision Tree Classifier

The Machine Learning “Advent Calendar” Day 7: Decision Tree Classifier In Day 6, we saw how a Decision Tree Regressor finds its optimal split by minimizing the Mean Squared Error. Today, for Day 7 of the Machine Learning “Advent Calendar”, we switch to classification. With just one numerical feature and two classes, we explore how…

December 8, 2025
Artificial Intelligence, Machine Learning, Deep Learning, and Generative AI — Clearly Explained

Artificial Intelligence, Machine Learning, Deep Learning, and Generative AI — Clearly Explained Understanding AI in 2026 — from machine learning to generative models The post Artificial Intelligence, Machine Learning, Deep Learning, and Generative AI — Clearly Explained appeared first on Towards Data Science. Sabrine Bendimerad Go to original source

December 8, 2025
Reading Research Papers in the Age of LLMs

Reading Research Papers in the Age of LLMs How I keep up with papers with a mix of manual and AI-assisted reading The post Reading Research Papers in the Age of LLMs appeared first on Towards Data Science. Parul Pandey Go to original source

December 7, 2025
The Machine Learning “Advent Calendar” Day 6: Decision Tree Regressor

The Machine Learning “Advent Calendar” Day 6: Decision Tree Regressor During the first days of this Machine Learning Advent Calendar, we explored models based on distances. Today, we switch to a completely different way of learning: Decision Trees. With a simple one-feature dataset, we can see how a tree chooses its first split. The idea…

December 7, 2025
How We Are Testing Our Agents in Dev

How We Are Testing Our Agents in Dev Testing that your AI agent is performing as expected is not easy. Here are a few strategies we learned the hard way. The post How We Are Testing Our Agents in Dev appeared first on Towards Data Science. Michael Segner Go to original source

December 7, 2025
The Step-by-Step Process of Adding a New Feature to My IOS App with Cursor

The Step-by-Step Process of Adding a New Feature to My IOS App with Cursor Cursor is great at writing code but not as good when it comes to design The post The Step-by-Step Process of Adding a New Feature to My IOS App with Cursor appeared first on Towards Data Science. Soner Yıldırım Go to…

December 6, 2025
The Machine Learning “Advent Calendar” Day 5: GMM in Excel

The Machine Learning “Advent Calendar” Day 5: GMM in Excel This article introduces the Gaussian Mixture Model as a natural extension of k-Means, by improving how distance is measured through variances and the Mahalanobis distance. Instead of assigning points to clusters with hard boundaries, GMM uses probabilities learned through the Expectation–Maximization algorithm – the general…

December 6, 2025
A Product Data Scientist’s Take on LinkedIn Games After 500 Days of Play

A Product Data Scientist’s Take on LinkedIn Games After 500 Days of Play What a simple puzzle game reveals about experimentation, product thinking, and data science The post A Product Data Scientist’s Take on LinkedIn Games After 500 Days of Play appeared first on Towards Data Science. Yu Dong Go to original source

December 6, 2025
YOLOv1 Paper Walkthrough: The Day YOLO First Saw the World

YOLOv1 Paper Walkthrough: The Day YOLO First Saw the World A detailed walkthrough of the YOLOv1 architecture and its PyTorch implementation from scratch The post YOLOv1 Paper Walkthrough: The Day YOLO First Saw the World appeared first on Towards Data Science. Muhammad Ardi Go to original source

December 6, 2025
On the Challenge of Converting TensorFlow Models to PyTorch

On the Challenge of Converting TensorFlow Models to PyTorch How to upgrade and optimize legacy AI/ML models The post On the Challenge of Converting TensorFlow Models to PyTorch appeared first on Towards Data Science. Chaim Rand Go to original source

December 6, 2025
Informative missingness and its implications in semi-supervised learning

Informative missingness and its implications in semi-supervised learning arXiv:2512.04392v1 Announce Type: new Abstract: Semi-supervised learning (SSL) constructs classifiers using both labelled and unlabelled data. It leverages information from labelled samples, whose acquisition is often costly or labour-intensive, together with unlabelled data to enhance prediction performance. This defines an incomplete-data problem, which statistically can be formulated…

December 5, 2025
Recurrent Neural Networks with Linear Structures for Electricity Price Forecasting

Recurrent Neural Networks with Linear Structures for Electricity Price Forecasting arXiv:2512.04690v1 Announce Type: new Abstract: We present a novel recurrent neural network architecture designed explicitly for day-ahead electricity price forecasting, aimed at improving short-term decision-making and operational management in energy systems. Our combined forecasting model embeds linear structures, such as expert models and Kalman filters,…

December 5, 2025
Provable FDR Control for Deep Feature Selection: Deep MLPs and Beyond

Provable FDR Control for Deep Feature Selection: Deep MLPs and Beyond arXiv:2512.04696v1 Announce Type: new Abstract: We develop a flexible feature selection framework based on deep neural networks that approximately controls the false discovery rate (FDR), a measure of Type-I error. The method applies to architectures whose first layer is fully connected. From the second…

December 5, 2025
Learning Causality for Longitudinal Data

Learning Causality for Longitudinal Data arXiv:2512.04980v1 Announce Type: new Abstract: This thesis develops methods for causal inference and causal representation learning (CRL) in high-dimensional, time-varying data. The first contribution introduces the Causal Dynamic Variational Autoencoder (CDVAE), a model for estimating Individual Treatment Effects (ITEs) by capturing unobserved heterogeneity in treatment response driven by latent risk…

December 5, 2025
Towards a unified framework for guided diffusion models

Towards a unified framework for guided diffusion models arXiv:2512.04985v1 Announce Type: new Abstract: Guided or controlled data generation with diffusion modelsblfootnote{Partial preliminary results of this work appeared in International Conference on Machine Learning 2025 citep{li2025provable}.} has become a cornerstone of modern generative modeling. Despite substantial advances in diffusion model theory, the theoretical understanding of guided…

December 5, 2025
Do Labels Make AI Blind? Self-Supervision Solves the Age-Old Binding Problem

Do Labels Make AI Blind? Self-Supervision Solves the Age-Old Binding Problem A new NeurIPS 2025 paper shows how self-supervised learning imbues ViT with better image understanding than supervised learning The post Do Labels Make AI Blind? Self-Supervision Solves the Age-Old Binding Problem appeared first on Towards Data Science. Jonathan Williford Go to original source

December 5, 2025
The Machine Learning “Advent Calendar” Day 4: k-Means in Excel

The Machine Learning “Advent Calendar” Day 4: k-Means in Excel How to implement a training algorithm that finally looks like “real” machine learning The post The Machine Learning “Advent Calendar” Day 4: k-Means in Excel appeared first on Towards Data Science. angela shi Go to original source

December 5, 2025
Build and Deploy Your First Supply Chain App in 20 Minutes

Build and Deploy Your First Supply Chain App in 20 Minutes A factory operator that discovered happiness by switching from notebook to streamlit – (Image Generated with GPT-5.1 by Samir Saci) The post Build and Deploy Your First Supply Chain App in 20 Minutes appeared first on Towards Data Science. Samir Saci Go to original…

December 5, 2025
Bootstrap a Data Lakehouse in an Afternoon

Bootstrap a Data Lakehouse in an Afternoon Using Apache Iceberg on AWS with Athena, Glue/Spark and DuckDB The post Bootstrap a Data Lakehouse in an Afternoon appeared first on Towards Data Science. Thomas Reid Go to original source

December 5, 2025
The Best Data Scientists are Always Learning

The Best Data Scientists are Always Learning Why continuous learning matters & how to come up with topics to study The post The Best Data Scientists are Always Learning appeared first on Towards Data Science. Jarom Hulet Go to original source

December 5, 2025
A note on the impossibility of conditional PAC-efficient reasoning in large language models

A note on the impossibility of conditional PAC-efficient reasoning in large language models arXiv:2512.03057v1 Announce Type: new Abstract: We prove an impossibility result for conditional Probably Approximately Correct (PAC)-efficient reasoning in large language models. While recent work has established marginal PAC efficiency guarantees for composite models that switch between expensive expert models and cheaper fast…

December 4, 2025
Uncertainty Quantification for Large Language Model Reward Learning under Heterogeneous Human Feedback

Uncertainty Quantification for Large Language Model Reward Learning under Heterogeneous Human Feedback arXiv:2512.03208v1 Announce Type: new Abstract: We study estimation and statistical inference for reward models used in aligning large language models (LLMs). A key component of LLM alignment is reinforcement learning from human feedback (RLHF), where humans compare pairs of model-generated answers and their…

December 4, 2025
Novelty detection on path space

Novelty detection on path space arXiv:2512.03243v1 Announce Type: new Abstract: We frame novelty detection on path space as a hypothesis testing problem with signature-based test statistics. Using transportation-cost inequalities of Gasteratos and Jacquier (2023), we obtain tail bounds for false positive rates that extend beyond Gaussian measures to laws of RDE solutions with smooth bounded…

December 4, 2025
Iterative Tilting for Diffusion Fine-Tuning

Iterative Tilting for Diffusion Fine-Tuning arXiv:2512.03234v1 Announce Type: new Abstract: We introduce iterative tilting, a gradient-free method for fine-tuning diffusion models toward reward-tilted distributions. The method decomposes a large reward tilt $exp(lambda r)$ into $N$ sequential smaller tilts, each admitting a tractable score update via first-order Taylor expansion. This requires only forward evaluations of the…

December 4, 2025
Colored Markov Random Fields for Probabilistic Topological Modeling

Colored Markov Random Fields for Probabilistic Topological Modeling arXiv:2512.03727v1 Announce Type: new Abstract: Probabilistic Graphical Models (PGMs) encode conditional dependencies among random variables using a graph -nodes for variables, links for dependencies- and factorize the joint distribution into lower-dimensional components. This makes PGMs well-suited for analyzing complex systems and supporting decision-making. Recent advances in topological…

December 4, 2025
The Architecture Behind Web Search in AI Chatbots

The Architecture Behind Web Search in AI Chatbots And what this means for generative engine optimization (GEO) The post The Architecture Behind Web Search in AI Chatbots appeared first on Towards Data Science. Ida Silfverskiöld Go to original source

December 4, 2025
Overcoming the Hidden Performance Traps of Variable-Shaped Tensors: Efficient Data Sampling in PyTorch

Overcoming the Hidden Performance Traps of Variable-Shaped Tensors: Efficient Data Sampling in PyTorch PyTorch Model Performance Analysis and Optimization — Part 11 The post Overcoming the Hidden Performance Traps of Variable-Shaped Tensors: Efficient Data Sampling in PyTorch appeared first on Towards Data Science. Chaim Rand Go to original source

December 4, 2025
The Machine Learning “Advent Calendar” Day 3: GNB, LDA and QDA in Excel

The Machine Learning “Advent Calendar” Day 3: GNB, LDA and QDA in Excel From local distance to global probability The post The Machine Learning “Advent Calendar” Day 3: GNB, LDA and QDA in Excel appeared first on Towards Data Science. angela shi Go to original source

December 4, 2025
How to Turn Your LLM Prototype into a Production-Ready System

How to Turn Your LLM Prototype into a Production-Ready System The most famous applications of LLMs are the ones that I like to call the “wow effect LLMs.” There are plenty of viral LinkedIn posts about them, and they all sound like this: “I built [x] that does [y] in [z] minutes using AI.” Where:…

December 4, 2025
Multi-Agent Arena: Insights from London Great Agent Hack 2025

Multi-Agent Arena: Insights from London Great Agent Hack 2025 What mattered: robust agents, glass-box reasoning, and red-team resilience The post Multi-Agent Arena: Insights from London Great Agent Hack 2025 appeared first on Towards Data Science. Erika G. Gonçalves Go to original source

December 4, 2025
Bayesian Physics-Informed Neural Networks for Inverse Problems (BPINN-IP): Application in Infrared Image Processing

Bayesian Physics-Informed Neural Networks for Inverse Problems (BPINN-IP): Application in Infrared Image Processing arXiv:2512.02495v1 Announce Type: new Abstract: Inverse problems arise across scientific and engineering domains, where the goal is to infer hidden parameters or physical fields from indirect and noisy observations. Classical approaches, such as variational regularization and Bayesian inference, provide well established theoretical…

December 3, 2025
Laplace Approximation For Tensor Train Kernel Machines In System Identification

Laplace Approximation For Tensor Train Kernel Machines In System Identification arXiv:2512.02532v1 Announce Type: new Abstract: To address the scalability limitations of Gaussian process (GP) regression, several approximation techniques have been proposed. One such method is based on tensor networks, which utilizes an exponential number of basis functions without incurring exponential computational cost. However, extending this…

December 3, 2025
Revisiting Theory of Contrastive Learning for Domain Generalization

Revisiting Theory of Contrastive Learning for Domain Generalization arXiv:2512.02831v1 Announce Type: new Abstract: Contrastive learning is among the most popular and powerful approaches for self-supervised representation learning, where the goal is to map semantically similar samples close together while separating dissimilar ones in the latent space. Existing theoretical methods assume that downstream task classes are…

December 3, 2025
EcoCast: A Spatio-Temporal Model for Continual Biodiversity and Climate Risk Forecasting

EcoCast: A Spatio-Temporal Model for Continual Biodiversity and Climate Risk Forecasting arXiv:2512.02260v1 Announce Type: cross Abstract: Increasing climate change and habitat loss are driving unprecedented shifts in species distributions. Conservation professionals urgently need timely, high-resolution predictions of biodiversity risks, especially in ecologically diverse regions like Africa. We propose EcoCast, a spatio-temporal model designed for continual…

December 3, 2025
Spatiotemporal Pyramid Flow Matching for Climate Emulation

Spatiotemporal Pyramid Flow Matching for Climate Emulation arXiv:2512.02268v1 Announce Type: cross Abstract: Generative models have the potential to transform the way we emulate Earth’s changing climate. Previous generative approaches rely on weather-scale autoregression for climate emulation, but this is inherently slow for long climate horizons and has yet to demonstrate stable rollouts under nonstationary forcings.…

December 3, 2025
The Machine Learning “Advent Calendar” Day 2: k-NN Classifier in Excel

The Machine Learning “Advent Calendar” Day 2: k-NN Classifier in Excel Exploring the k-NN classifier with its variants and improvements The post The Machine Learning “Advent Calendar” Day 2: k-NN Classifier in Excel appeared first on Towards Data Science. angela shi Go to original source

December 3, 2025
JSON Parsing for Large Payloads: Balancing Speed, Memory, and Scalability

JSON Parsing for Large Payloads: Balancing Speed, Memory, and Scalability Benchmarking JSON libraries for large payloads The post JSON Parsing for Large Payloads: Balancing Speed, Memory, and Scalability appeared first on Towards Data Science. Subha Ganapathi Go to original source

December 3, 2025
How to Use Simple Data Contracts in Python for Data Scientists

How to Use Simple Data Contracts in Python for Data Scientists Stop your pipelines from breaking on Friday afternoons using simple, open-source validation with Pandera. The post How to Use Simple Data Contracts in Python for Data Scientists appeared first on Towards Data Science. Eirik Berge Go to original source

December 3, 2025
How to Generate QR Codes in Python

How to Generate QR Codes in Python A beginner-friendly tutorial exploring the Python “qrcode” Package The post How to Generate QR Codes in Python appeared first on Towards Data Science. Mahnoor Javed Go to original source

December 3, 2025
DAISI: Data Assimilation with Inverse Sampling using Stochastic Interpolants

DAISI: Data Assimilation with Inverse Sampling using Stochastic Interpolants arXiv:2512.00252v1 Announce Type: new Abstract: Data assimilation (DA) is a cornerstone of scientific and engineering applications, combining model forecasts with sparse and noisy observations to estimate latent system states. Classical DA methods, such as the ensemble Kalman filter, rely on Gaussian approximations and heuristic tuning (e.g.,…

December 2, 2025
An RKHS Perspective on Tree Ensembles

An RKHS Perspective on Tree Ensembles arXiv:2512.00397v1 Announce Type: new Abstract: Random Forests and Gradient Boosting are among the most effective algorithms for supervised learning on tabular data. Both belong to the class of tree-based ensemble methods, where predictions are obtained by aggregating many randomized regression trees. In this paper, we develop a theoretical framework…

December 2, 2025
No-Regret Gaussian Process Optimization of Time-Varying Functions

No-Regret Gaussian Process Optimization of Time-Varying Functions arXiv:2512.00517v1 Announce Type: new Abstract: Sequential optimization of black-box functions from noisy evaluations has been widely studied, with Gaussian Process bandit algorithms such as GP-UCB guaranteeing no-regret in stationary settings. However, for time-varying objectives, it is known that no-regret is unattainable under pure bandit feedback unless strong and…

December 2, 2025
Statistical-computational gap in multiple Gaussian graph alignment

Statistical-computational gap in multiple Gaussian graph alignment arXiv:2512.00610v1 Announce Type: new Abstract: We investigate the existence of a statistical-computational gap in multiple Gaussian graph alignment. We first generalize a previously established informational threshold from Vassaux and Massouli’e (2025) to regimes where the number of observed graphs $p$ may also grow with the number of nodes…

December 2, 2025
Self-sufficient Independent Component Analysis via KL Minimizing Flows

Self-sufficient Independent Component Analysis via KL Minimizing Flows arXiv:2512.00665v1 Announce Type: new Abstract: We study the problem of learning disentangled signals from data using non-linear Independent Component Analysis (ICA). Motivated by advances in self-supervised learning, we propose to learn self-sufficient signals: A recovered signal should be able to reconstruct a missing value of its own…

December 2, 2025
The Machine Learning Lessons I’ve Learned This Month

The Machine Learning Lessons I’ve Learned This Month Christmas connections, Copilot’s costs, careful (no-)choices The post The Machine Learning Lessons I’ve Learned This Month appeared first on Towards Data Science. Pascal Janetzky Go to original source

December 2, 2025
The Machine Learning “Advent Calendar” Day 1: k-NN Regressor in Excel

The Machine Learning “Advent Calendar” Day 1: k-NN Regressor in Excel This first day of the Advent Calendar introduces the k-NN regressor, the simplest distance-based model. Using Excel, we explore how predictions rely entirely on the closest observations, why feature scaling matters, and how heterogeneous variables can make distances meaningless. Through examples with continuous and…

December 2, 2025
The Problem with AI Browsers: Security Flaws and the End of Privacy

The Problem with AI Browsers: Security Flaws and the End of Privacy How Atlas and most current AI-powered browsers fail on three aspects: privacy, security, and censorship The post The Problem with AI Browsers: Security Flaws and the End of Privacy appeared first on Towards Data Science. Mike Huls Go to original source

December 2, 2025
Learning, Hacking, and Shipping ML

Learning, Hacking, and Shipping ML Vyacheslav Efimov on AI hackathons, data science roadmaps, and how AI meaningfully changed day-to-day ML Engineer work The post Learning, Hacking, and Shipping ML appeared first on Towards Data Science. TDS Editors Go to original source

December 2, 2025
Why AI Alignment Starts With Better Evaluation

Why AI Alignment Starts With Better Evaluation You can’t align what you don’t evaluate The post Why AI Alignment Starts With Better Evaluation appeared first on Towards Data Science. Hailey Quach Go to original source

December 2, 2025
Algorithms and Scientific Software for Quasi-Monte Carlo, Fast Gaussian Process Regression, and Scientific Machine Learning

Algorithms and Scientific Software for Quasi-Monte Carlo, Fast Gaussian Process Regression, and Scientific Machine Learning arXiv:2511.21915v1 Announce Type: new Abstract: Most scientific domains elicit the development of efficient algorithms and accessible scientific software. This thesis unifies our developments in three broad domains: Quasi-Monte Carlo (QMC) methods for efficient high-dimensional integration, Gaussian process (GP) regression for…

December 1, 2025
Sparse Multiple Kernel Learning: Alternating Best Response and Semidefinite Relaxations

Sparse Multiple Kernel Learning: Alternating Best Response and Semidefinite Relaxations arXiv:2511.21890v1 Announce Type: new Abstract: We study Sparse Multiple Kernel Learning (SMKL), which is the problem of selecting a sparse convex combination of prespecified kernels for support vector binary classification. Unlike prevailing l1 regularized approaches that approximate a sparsifying penalty, we formulate the problem by…

December 1, 2025
A Sensitivity Approach to Causal Inference Under Limited Overlap

A Sensitivity Approach to Causal Inference Under Limited Overlap arXiv:2511.22003v1 Announce Type: new Abstract: Limited overlap between treated and control groups is a key challenge in observational analysis. Standard approaches like trimming importance weights can reduce variance but introduce a fundamental bias. We propose a sensitivity framework for contextualizing findings under limited overlap, where we…

December 1, 2025
On the Effect of Regularization on Nonparametric Mean-Variance Regression

On the Effect of Regularization on Nonparametric Mean-Variance Regression arXiv:2511.22004v1 Announce Type: new Abstract: Uncertainty quantification is vital for decision-making and risk assessment in machine learning. Mean-variance regression models, which predict both a mean and residual noise for each data point, provide a simple approach to uncertainty quantification. However, overparameterized mean-variance models struggle with signal-to-noise…

December 1, 2025
Support Vector Machine Classifier with Rescaled Huberized Pinball Loss

Support Vector Machine Classifier with Rescaled Huberized Pinball Loss arXiv:2511.22065v1 Announce Type: new Abstract: Support vector machines are widely used in machine learning classification tasks, but traditional SVM models suffer from sensitivity to outliers and instability in resampling, which limits their performance in practical applications. To address these issues, this paper proposes a novel rescaled…

December 1, 2025
Weekly Entering & Transitioning – Thread 01 Dec, 2025 – 08 Dec, 2025

Weekly Entering & Transitioning – Thread 01 Dec, 2025 – 08 Dec, 2025 Welcome to this week’s entering & transitioning thread! This thread is for any questions about getting started, studying, or transitioning into the data science field. Topics include: Learning resources (e.g. books, tutorials, videos) Traditional education (e.g. schools, degrees, electives) Alternative education (e.g.…

December 1, 2025
MSE-DS or OMSCS?

MSE-DS or OMSCS? I’ve gotten a lot of mixed responses about this on other subreddits, so I wanted to ask here I was recently accepted to UPenn’s online part-time MSE-DS program. I graduated from college this past May from a top 20 school with a degree in data science. To be honest, I originally applied…

December 1, 2025
Everyone Can ‘Code’ with AI Now, According to Google—But Tech Workers Aren’t Fully Convinced

Everyone Can ‘Code’ with AI Now, According to Google—But Tech Workers Aren’t Fully Convinced Have any data scientists here worked with AI for coding? Do you agree with experts’ skepticism in using it for high-level tasks? submitted by /u/disforwork [link] [comments] /u/disforwork Go to original source

December 1, 2025
ggplotly – A Grammar of Graphics implementation in Python/Plotly

ggplotly – A Grammar of Graphics implementation in Python/Plotly https://github.com/bbcho/ggplotly As a fun project, I decided to try and replicate ggplot2 in plotly and python. I know that plotnine exists, but I like the interactivity of plotly. Let me know what you think. Coverage isn’t 100% but you can do most things. I tried to…

December 1, 2025
Shap or LGBM gain for feature selection?

Shap or LGBM gain for feature selection? Which one do you use during recursive feature elimination or forward/backward selection? I’ve always used gain and only used shap for analytics on model predictions, but came across some shap values recommendations. Bonus question: have you used “null importance” / permutation method? Fitting models with shuffled targets to…

December 1, 2025