Category: cs.LG

Testing for correlation between network structure and high-dimensional node covariates

Testing for correlation between network structure and high-dimensional node covariates arXiv:2509.03772v1 Announce Type: new Abstract: In many application domains, networks are observed with node-level features. In such settings, a common problem is to assess whether or not nodal covariates are correlated with the network structure itself. Here, we present four novel methods for addressing this…

September 5, 2025
Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series

Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series arXiv:2509.03898v1 Announce Type: new Abstract: This paper develops dimension reduction techniques for accelerating diffusion model inference in the context of synthetic data generation. The idea is to integrate compressed sensing into diffusion models: (i) compress the data into a latent…

September 5, 2025
Batched Stochastic Matching Bandits

Batched Stochastic Matching Bandits arXiv:2509.04194v1 Announce Type: new Abstract: In this study, we introduce a novel bandit framework for stochastic matching based on the Multi-nomial Logit (MNL) choice model. In our setting, $N$ agents on one side are assigned to $K$ arms on the other side, where each arm stochastically selects an agent from its…

September 5, 2025
An invertible generative model for forward and inverse problems

An invertible generative model for forward and inverse problems arXiv:2509.03910v1 Announce Type: new Abstract: We formulate the inverse problem in a Bayesian framework and aim to train a generative model that allows us to simulate (i.e., sample from the likelihood) and do inference (i.e., sample from the posterior). We review the use of triangular normalizing…

September 5, 2025
Fast kernel methods: Sobolev, physics-informed, and additive models

Fast kernel methods: Sobolev, physics-informed, and additive models arXiv:2509.02649v1 Announce Type: new Abstract: Kernel methods are powerful tools in statistical learning, but their cubic complexity in the sample size n limits their use on large-scale datasets. In this work, we introduce a scalable framework for kernel regression with O(n log n) complexity, fully leveraging GPU…

September 4, 2025
Gaussian process surrogate with physical law-corrected prior for multi-coupled PDEs defined on irregular geometry

Gaussian process surrogate with physical law-corrected prior for multi-coupled PDEs defined on irregular geometry arXiv:2509.02617v1 Announce Type: new Abstract: Parametric partial differential equations (PDEs) are fundamental mathematical tools for modeling complex physical systems, yet their numerical evaluation across parameter spaces remains computationally intensive when using conventional high-fidelity solvers. To address this challenge, we propose a…

September 4, 2025
Scale-Adaptive Generative Flows for Multiscale Scientific Data

Scale-Adaptive Generative Flows for Multiscale Scientific Data arXiv:2509.02971v1 Announce Type: new Abstract: Flow-based generative models can face significant challenges when modeling scientific data with multiscale Fourier spectra, often producing large errors in fine-scale features. We address this problem within the framework of stochastic interpolants, via principled design of noise distributions and interpolation schedules. The key…

September 4, 2025
Bayesian Additive Regression Trees for functional ANOVA model

Bayesian Additive Regression Trees for functional ANOVA model arXiv:2509.03317v1 Announce Type: new Abstract: Bayesian Additive Regression Trees (BART) is a powerful statistical model that leverages the strengths of Bayesian inference and regression trees. It has received significant attention for capturing complex non-linear relationships and interactions among predictors. However, the accuracy of BART often comes at…

September 4, 2025
Understanding and Improving the Shampoo Optimizer via Kullback-Leibler Minimization

Understanding and Improving the Shampoo Optimizer via Kullback-Leibler Minimization arXiv:2509.03378v1 Announce Type: new Abstract: As an adaptive method, Shampoo employs a structured second-moment estimation, and its effectiveness has attracted growing attention. Prior work has primarily analyzed its estimation scheme through the Frobenius norm. Motivated by the natural connection between the second moment and a covariance…

September 4, 2025
Simulation-based inference of yeast centromeres

Simulation-based inference of yeast centromeres arXiv:2509.00200v1 Announce Type: new Abstract: The chromatin folding and the spatial arrangement of chromosomes in the cell play a crucial role in DNA replication and genes expression. An improper chromatin folding could lead to malfunctions and, over time, diseases. For eukaryotes, centromeres are essential for proper chromosome segregation and folding.…

September 3, 2025
Probit Monotone BART

Probit Monotone BART arXiv:2509.00263v1 Announce Type: new Abstract: Bayesian Additive Regression Trees (BART) of Chipman et al. (2010) has proven to be a powerful tool for nonparametric modeling and prediction. Monotone BART (Chipman et al., 2022) is a recent development that allows BART to be more precise in estimating monotonic functions. We further these developments…

September 3, 2025
Assessing One-Dimensional Cluster Stability by Extreme-Point Trimming

Assessing One-Dimensional Cluster Stability by Extreme-Point Trimming arXiv:2509.00258v1 Announce Type: new Abstract: We develop a probabilistic method for assessing the tail behavior and geometric stability of one-dimensional n i.i.d. samples by tracking how their span contracts when the most extreme points are trimmed. Central to our approach is the diameter-shrinkage ratio, that quantifies the relative…

September 3, 2025
The Nondecreasing Rank

The Nondecreasing Rank arXiv:2509.00265v1 Announce Type: new Abstract: In this article the notion of the nondecreasing (ND) rank of a matrix or tensor is introduced. A tensor has an ND rank of r if it can be represented as a sum of r outer products of vectors, with each vector satisfying a monotonicity constraint. It…

September 3, 2025
Partial Functional Dynamic Backdoor Diffusion-based Causal Model

Partial Functional Dynamic Backdoor Diffusion-based Causal Model arXiv:2509.00472v1 Announce Type: new Abstract: We introduce a Partial Functional Dynamic Backdoor Diffusion-based Causal Model (PFD-BDCM), specifically designed for causal inference in the presence of unmeasured confounders with spatial heterogeneity and temporal dependency. The proposed PFD-BDCM framework addresses the restrictions of the existing approaches by uniquely integrating models…

September 3, 2025
Quantum-inspired probability metrics define a complete, universal space for statistical learning

Quantum-inspired probability metrics define a complete, universal space for statistical learning arXiv:2508.21086v1 Announce Type: new Abstract: Comparing probability distributions is a core challenge across the natural, social, and computational sciences. Existing methods, such as Maximum Mean Discrepancy (MMD), struggle in high-dimensional and non-compact domains. Here we introduce quantum probability metrics (QPMs), derived by embedding probability…

September 1, 2025
Weighted Support Points from Random Measures: An Interpretable Alternative for Generative Modeling

Weighted Support Points from Random Measures: An Interpretable Alternative for Generative Modeling arXiv:2508.21255v1 Announce Type: new Abstract: Support points summarize a large dataset through a smaller set of representative points that can be used for data operations, such as Monte Carlo integration, without requiring access to the full dataset. In this sense, support points offer…

September 1, 2025
Adaptive generative moment matching networks for improved learning of dependence structures

Adaptive generative moment matching networks for improved learning of dependence structures arXiv:2508.21531v1 Announce Type: new Abstract: An adaptive bandwidth selection procedure for the mixture kernel in the maximum mean discrepancy (MMD) for fitting generative moment matching networks (GMMNs) is introduced, and its ability to improve the learning of copula random number generators is demonstrated. Based…

September 1, 2025
Privacy Auditing Synthetic Data Release through Local Likelihood Attacks

Privacy Auditing Synthetic Data Release through Local Likelihood Attacks arXiv:2508.21146v1 Announce Type: cross Abstract: Auditing the privacy leakage of synthetic data is an important but unresolved problem. Most existing privacy auditing frameworks for synthetic data rely on heuristics and unreasonable assumptions to attack the failure modes of generative models, exhibiting limited capability to describe and…

September 1, 2025
Stochastic Gradients under Nuisances

Stochastic Gradients under Nuisances arXiv:2508.20326v1 Announce Type: new Abstract: Stochastic gradient optimization is the dominant learning paradigm for a variety of scenarios, from classical supervised learning to modern self-supervised learning. We consider stochastic gradient algorithms for learning problems whose objectives rely on unknown nuisance parameters, and establish non-asymptotic convergence guarantees. Our results show that, while…

August 29, 2025
Towards Trustworthy Amortized Bayesian Model Comparison

Towards Trustworthy Amortized Bayesian Model Comparison arXiv:2508.20614v1 Announce Type: new Abstract: Amortized Bayesian model comparison (BMC) enables fast probabilistic ranking of models via simulation-based training of neural surrogates. However, the reliability of neural surrogates deteriorates when simulation models are misspecified – the very case where model comparison is most needed. Thus, we supplement simulation-based training…

August 29, 2025
Polynomial Chaos Expansion for Operator Learning

Polynomial Chaos Expansion for Operator Learning arXiv:2508.20886v1 Announce Type: new Abstract: Operator learning (OL) has emerged as a powerful tool in scientific machine learning (SciML) for approximating mappings between infinite-dimensional functional spaces. One of its main applications is learning the solution operator of partial differential equations (PDEs). While much of the progress in this area…

August 29, 2025
Transfer Learning for Classification under Decision Rule Drift with Application to Optimal Individualized Treatment Rule Estimation

Transfer Learning for Classification under Decision Rule Drift with Application to Optimal Individualized Treatment Rule Estimation arXiv:2508.20942v1 Announce Type: new Abstract: In this paper, we extend the transfer learning classification framework from regression function-based methods to decision rules. We propose a novel methodology for modeling posterior drift through Bayes decision rules. By exploiting the geometric…

August 29, 2025
Discovering equations from data: symbolic regression in dynamical systems

Discovering equations from data: symbolic regression in dynamical systems arXiv:2508.20257v1 Announce Type: cross Abstract: The process of discovering equations from data lies at the heart of physics and in many other areas of research, including mathematical ecology and epidemiology. Recently, machine learning methods known as symbolic regression have automated this process. As several methods are…

August 29, 2025
Fractal Flow: Hierarchical and Interpretable Normalizing Flow via Topic Modeling and Recursive Strategy

Fractal Flow: Hierarchical and Interpretable Normalizing Flow via Topic Modeling and Recursive Strategy arXiv:2508.19750v1 Announce Type: new Abstract: Normalizing Flows provide a principled framework for high-dimensional density estimation and generative modeling by constructing invertible transformations with tractable Jacobian determinants. We propose Fractal Flow, a novel normalizing flow architecture that enhances both expressiveness and interpretability through…

August 28, 2025
Conditional Normalizing Flow Surrogate for Monte Carlo Prediction of Radiative Properties in Nanoparticle-Embedded Layers

Conditional Normalizing Flow Surrogate for Monte Carlo Prediction of Radiative Properties in Nanoparticle-Embedded Layers arXiv:2508.19841v1 Announce Type: new Abstract: We present a probabilistic, data-driven surrogate model for predicting the radiative properties of nanoparticle embedded scattering media. The model uses conditional normalizing flows, which learn the conditional distribution of optical outputs, including reflectance, absorbance, and transmittance,…

August 28, 2025
The Information Dynamics of Generative Diffusion

The Information Dynamics of Generative Diffusion arXiv:2508.19897v1 Announce Type: new Abstract: Generative diffusion models have emerged as a powerful class of models in machine learning, yet a unified theoretical understanding of their operation is still developing. This perspective paper provides an integrated perspective on generative diffusion by connecting their dynamic, information-theoretic, and thermodynamic properties under…

August 28, 2025
Track Component Failure Detection Using Data Analytics over existing STDS Track Circuit data

Track Component Failure Detection Using Data Analytics over existing STDS Track Circuit data arXiv:2508.11693v1 Announce Type: cross Abstract: Track Circuits (TC) are the main signalling devices used to detect the presence of a train on a rail track. It has been used since the 19th century and nowadays there are many types depending on the…

August 28, 2025
Physics-Informed Regression: Parameter Estimation in Parameter-Linear Nonlinear Dynamic Models

Physics-Informed Regression: Parameter Estimation in Parameter-Linear Nonlinear Dynamic Models arXiv:2508.19249v1 Announce Type: cross Abstract: We present a new efficient hybrid parameter estimation method based on the idea, that if nonlinear dynamic models are stated in terms of a system of equations that is linear in terms of the parameters, then regularized ordinary least squares can…

August 28, 2025
Deterministic Coreset Construction via Adaptive Sensitivity Trimming

Deterministic Coreset Construction via Adaptive Sensitivity Trimming arXiv:2508.18340v1 Announce Type: new Abstract: We develop a rigorous framework for deterministic coreset construction in empirical risk minimization (ERM). Our central contribution is the Adaptive Deterministic Uniform-Weight Trimming (ADUWT) algorithm, which constructs a coreset by excising points with the lowest sensitivity bounds and applying a data-dependent uniform weight…

August 27, 2025
Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems

Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems arXiv:2508.18604v1 Announce Type: new Abstract: Follow-the-Regularized-Leader (FTRL) policies have achieved Best-of-Both-Worlds (BOBW) results in various settings through hybrid regularizers, whereas analogous results for Follow-the-Perturbed-Leader (FTPL) remain limited due to inherent analytical challenges. To advance the analytical foundations of FTPL, we revisit classical FTRL-FTPL duality for unbounded perturbations…

August 27, 2025
Efficient Best-of-Both-Worlds Algorithms for Contextual Combinatorial Semi-Bandits

Efficient Best-of-Both-Worlds Algorithms for Contextual Combinatorial Semi-Bandits arXiv:2508.18768v1 Announce Type: new Abstract: We introduce the first best-of-both-worlds algorithm for contextual combinatorial semi-bandits that simultaneously guarantees $widetilde{mathcal{O}}(sqrt{T})$ regret in the adversarial regime and $widetilde{mathcal{O}}(ln T)$ regret in the corrupted stochastic regime. Our approach builds on the Follow-the-Regularized-Leader (FTRL) framework equipped with a Shannon entropy regularizer, yielding…

August 27, 2025
Sparse minimum Redundancy Maximum Relevance for feature selection

Sparse minimum Redundancy Maximum Relevance for feature selection arXiv:2508.18901v1 Announce Type: new Abstract: We propose a feature screening method that integrates both feature-feature and feature-target relationships. Inactive features are identified via a penalized minimum Redundancy Maximum Relevance (mRMR) procedure, which is the continuous version of the classic mRMR penalized by a non-convex regularizer, and where…

August 27, 2025
Echoes of the past: A unified perspective on fading memory and echo states

Echoes of the past: A unified perspective on fading memory and echo states arXiv:2508.19145v1 Announce Type: new Abstract: Recurrent neural networks (RNNs) have become increasingly popular in information processing tasks involving time series and temporal data. A fundamental property of RNNs is their ability to create reliable input/output responses, often linked to how the network…

August 27, 2025
GraphPPD: Posterior Predictive Modelling for Graph-Level Inference

GraphPPD: Posterior Predictive Modelling for Graph-Level Inference arXiv:2508.16995v1 Announce Type: new Abstract: Accurate modelling and quantification of predictive uncertainty is crucial in deep learning since it allows a model to make safer decisions when the data is ambiguous and facilitates the users’ understanding of the model’s confidence in its predictions. Along with the tremendously increasing…

August 26, 2025
Limitations of refinement methods for weak to strong generalization

Limitations of refinement methods for weak to strong generalization arXiv:2508.17018v1 Announce Type: new Abstract: Standard techniques for aligning large language models (LLMs) utilize human-produced data, which could limit the capability of any aligned LLM to human level. Label refinement and weak training have emerged as promising strategies to address this superalignment problem. In this work,…

August 26, 2025
CP4SBI: Local Conformal Calibration of Credible Sets in Simulation-Based Inference

CP4SBI: Local Conformal Calibration of Credible Sets in Simulation-Based Inference arXiv:2508.17077v1 Announce Type: new Abstract: Current experimental scientists have been increasingly relying on simulation-based inference (SBI) to invert complex non-linear models with intractable likelihoods. However, posterior approximations obtained with SBI are often miscalibrated, causing credible regions to undercover true parameters. We develop $texttt{CP4SBI}$, a model-agnostic…

August 26, 2025
Neural Stochastic Differential Equations on Compact State-Spaces

Neural Stochastic Differential Equations on Compact State-Spaces arXiv:2508.17090v1 Announce Type: new Abstract: Many modern probabilistic models rely on SDEs, but their adoption is hampered by instability, poor inductive bias outside bounded domains, and reliance on restrictive dynamics or training tricks. While recent work constrains SDEs to compact spaces using reflected dynamics, these approaches lack continuous…

August 26, 2025
Rao Differential Privacy

Rao Differential Privacy arXiv:2508.17135v1 Announce Type: new Abstract: Differential privacy (DP) has recently emerged as a definition of privacy to release private estimates. DP calibrates noise to be on the order of an individuals contribution. Due to the this calibration a private estimate obscures any individual while preserving the utility of the estimate. Since the…

August 26, 2025
Interpretable Kernels

Interpretable Kernels arXiv:2508.15932v1 Announce Type: new Abstract: The use of kernels for nonlinear prediction is widespread in machine learning. They have been popularized in support vector machines and used in kernel ridge regression, amongst others. Kernel methods share three aspects. First, instead of the original matrix of predictor variables or features, each observation is mapped…

August 25, 2025
Optimal Dynamic Regret by Transformers for Non-Stationary Reinforcement Learning

Optimal Dynamic Regret by Transformers for Non-Stationary Reinforcement Learning arXiv:2508.16027v1 Announce Type: new Abstract: Transformers have demonstrated exceptional performance across a wide range of domains. While their ability to perform reinforcement learning in-context has been established both theoretically and empirically, their behavior in non-stationary environments remains less understood. In this study, we address this gap…

August 25, 2025
A Sharp KL-Convergence Analysis for Diffusion Models under Minimal Assumptions

A Sharp KL-Convergence Analysis for Diffusion Models under Minimal Assumptions arXiv:2508.16306v1 Announce Type: new Abstract: Diffusion-based generative models have emerged as highly effective methods for synthesizing high-quality samples. Recent works have focused on analyzing the convergence of their generation process with minimal assumptions, either through reverse SDEs or Probability Flow ODEs. The best known guarantees,…

August 25, 2025
Deep Intrinsic Coregionalization Multi-Output Gaussian Process Surrogate with Active Learning

Deep Intrinsic Coregionalization Multi-Output Gaussian Process Surrogate with Active Learning arXiv:2508.16434v1 Announce Type: new Abstract: Deep Gaussian Processes (DGPs) are powerful surrogate models known for their flexibility and ability to capture complex functions. However, extending them to multi-output settings remains challenging due to the need for efficient dependency modeling. We propose the Deep Intrinsic Coregionalization…

August 25, 2025
Underdamped Langevin MCMC with third order convergence

Underdamped Langevin MCMC with third order convergence arXiv:2508.16485v1 Announce Type: new Abstract: In this paper, we propose a new numerical method for the underdamped Langevin diffusion (ULD) and present a non-asymptotic analysis of its sampling error in the 2-Wasserstein distance when the $d$-dimensional target distribution $p(x)propto e^{-f(x)}$ is strongly log-concave and has varying degrees of…

August 25, 2025
Kernel-based Equalized Odds: A Quantification of Accuracy-Fairness Trade-off in Fair Representation Learning

Kernel-based Equalized Odds: A Quantification of Accuracy-Fairness Trade-off in Fair Representation Learning arXiv:2508.15084v1 Announce Type: new Abstract: This paper introduces a novel kernel-based formulation of the Equalized Odds (EO) criterion, denoted as $EO_k$, for fair representation learning (FRL) in supervised settings. The central goal of FRL is to mitigate discrimination regarding a sensitive attribute $S$…

August 22, 2025
Bayesian Inference and Learning in Nonlinear Dynamical Systems: A Framework for Incorporating Explicit and Implicit Prior Knowledge

Bayesian Inference and Learning in Nonlinear Dynamical Systems: A Framework for Incorporating Explicit and Implicit Prior Knowledge arXiv:2508.15345v1 Announce Type: new Abstract: Accuracy and generalization capabilities are key objectives when learning dynamical system models. To obtain such models from limited data, current works exploit prior knowledge and assumptions about the system. However, the fusion of…

August 22, 2025
Bayesian Optimization with Expected Improvement: No Regret and the Choice of Incumbent

Bayesian Optimization with Expected Improvement: No Regret and the Choice of Incumbent arXiv:2508.15674v1 Announce Type: new Abstract: Expected improvement (EI) is one of the most widely used acquisition functions in Bayesian optimization (BO). Despite its proven empirical success in applications, the cumulative regret upper bound of EI remains an open question. In this paper, we…

August 22, 2025
Tree-like Pairwise Interaction Networks

Tree-like Pairwise Interaction Networks arXiv:2508.15678v1 Announce Type: new Abstract: Modeling feature interactions in tabular data remains a key challenge in predictive modeling, for example, as used for insurance pricing. This paper proposes the Tree-like Pairwise Interaction Network (PIN), a novel neural network architecture that explicitly captures pairwise feature interactions through a shared feed-forward neural network…

August 22, 2025
Can synthetic data reproduce real-world findings in epidemiology? A replication study using tree-based generative AI

Can synthetic data reproduce real-world findings in epidemiology? A replication study using tree-based generative AI arXiv:2508.14936v1 Announce Type: cross Abstract: Generative artificial intelligence for synthetic data generation holds substantial potential to address practical challenges in epidemiology. However, many current methods suffer from limited quality, high computational demands, and complexity for non-experts. Furthermore, common evaluation strategies…

August 22, 2025
Comparing Model-agnostic Feature Selection Methods through Relative Efficiency

Comparing Model-agnostic Feature Selection Methods through Relative Efficiency arXiv:2508.14268v1 Announce Type: new Abstract: Feature selection and importance estimation in a model-agnostic setting is an ongoing challenge of significant interest. Wrapper methods are commonly used because they are typically model-agnostic, even though they are computationally intensive. In this paper, we focus on feature selection methods related…

August 21, 2025
Evaluation and Optimization of Leave-one-out Cross-validation for the Lasso

Evaluation and Optimization of Leave-one-out Cross-validation for the Lasso arXiv:2508.14368v1 Announce Type: new Abstract: I develop an algorithm to produce the piecewise quadratic that computes leave-one-out cross-validation for the lasso as a function of its hyperparameter. The algorithm can be used to find exact hyperparameters that optimize leave-one-out cross-validation either globally or locally, and its…

August 21, 2025
The C-index Multiverse

The C-index Multiverse arXiv:2508.14821v1 Announce Type: new Abstract: Quantifying out-of-sample discrimination performance for time-to-event outcomes is a fundamental step for model evaluation and selection in the context of predictive modelling. The concordance index, or C-index, is a widely used metric for this purpose, particularly with the growing development of machine learning methods. Beyond differences between…

August 21, 2025
Noise Robust One-Class Intrusion Detection on Dynamic Graphs

Noise Robust One-Class Intrusion Detection on Dynamic Graphs arXiv:2508.14192v1 Announce Type: cross Abstract: In the domain of network intrusion detection, robustness against contaminated and noisy data inputs remains a critical challenge. This study introduces a probabilistic version of the Temporal Graph Network Support Vector Data Description (TGN-SVDD) model, designed to enhance detection accuracy in the…

August 21, 2025
Optimal Subspace Embeddings: Resolving Nelson-Nguyen Conjecture Up to Sub-Polylogarithmic Factors

Optimal Subspace Embeddings: Resolving Nelson-Nguyen Conjecture Up to Sub-Polylogarithmic Factors arXiv:2508.14234v1 Announce Type: cross Abstract: We give a proof of the conjecture of Nelson and Nguyen [FOCS 2013] on the optimal dimension and sparsity of oblivious subspace embeddings, up to sub-polylogarithmic factors: For any $ngeq d$ and $epsilongeq d^{-O(1)}$, there is a random $tilde O(d/epsilon^2)times…

August 21, 2025
Preference Models assume Proportional Hazards of Utilities

Preference Models assume Proportional Hazards of Utilities arXiv:2508.13189v1 Announce Type: new Abstract: Approaches for estimating preferences from human annotated data typically involves inducing a distribution over a ranked list of choices such as the Plackett-Luce model. Indeed, modern AI alignment tools such as Reward Modelling and Direct Preference Optimization are based on the statistical assumptions…

August 20, 2025
Flow Matching-Based Generative Modeling for Efficient and Scalable Data Assimilation

Flow Matching-Based Generative Modeling for Efficient and Scalable Data Assimilation arXiv:2508.13313v1 Announce Type: new Abstract: Data assimilation (DA) is the problem of sequentially estimating the state of a dynamical system from noisy observations. Recent advances in generative modeling have inspired new approaches to DA in high-dimensional nonlinear settings, especially the ensemble score filter (EnSF). However,…

August 20, 2025
Structural Foundations for Leading Digit Laws: Beyond Probabilistic Mixtures

Structural Foundations for Leading Digit Laws: Beyond Probabilistic Mixtures arXiv:2508.13237v1 Announce Type: new Abstract: This article presents a modern deterministic framework for the study of leading significant digit distributions in numerical data. Rather than relying on traditional probabilistic or mixture-based explanations, we demonstrate that the observed frequencies of leading digits are determined by the underlying…

August 20, 2025
Smooth Flow Matching

Smooth Flow Matching arXiv:2508.13831v1 Announce Type: new Abstract: Functional data, i.e., smooth random functions observed over a continuous domain, are increasingly available in areas such as biomedical research, health informatics, and epidemiology. However, effective statistical analysis for functional data is often hindered by challenges such as privacy constraints, sparse and irregular sampling, infinite dimensionality, and…

August 20, 2025
Online Conformal Selection with Accept-to-Reject Changes

Online Conformal Selection with Accept-to-Reject Changes arXiv:2508.13838v1 Announce Type: new Abstract: Selecting a subset of promising candidates from a large pool is crucial across various scientific and real-world applications. Conformal selection offers a distribution-free and model-agnostic framework for candidate selection with uncertainty quantification. While effective in offline settings, its application to online scenarios, where data…

August 20, 2025
BaMANI: Bayesian Multi-Algorithm causal Network Inference

BaMANI: Bayesian Multi-Algorithm causal Network Inference arXiv:2508.11741v1 Announce Type: new Abstract: Improved computational power has enabled different disciplines to predict causal relationships among modeled variables using Bayesian network inference. While many alternative algorithms have been proposed to improve the efficiency and reliability of network prediction, the predicted causal networks reflect the generative process but also…

August 19, 2025
Dropping Just a Handful of Preferences Can Change Top Large Language Model Rankings

Dropping Just a Handful of Preferences Can Change Top Large Language Model Rankings arXiv:2508.11847v1 Announce Type: new Abstract: We propose a method for evaluating the robustness of a widely used LLM ranking system — the Bradley–Terry ranking system — to dropping a worst-case very small fraction of evaluation data. Our approach is computationally fast and…

August 19, 2025
Robust Data Fusion via Subsampling

Robust Data Fusion via Subsampling arXiv:2508.12048v1 Announce Type: new Abstract: Data fusion and transfer learning are rapidly growing fields that enhance model performance for a target population by leveraging other related data sources or tasks. The challenges lie in the various potential heterogeneities between the target and external data, as well as various practical concerns…

August 19, 2025
An Introduction to Sliced Optimal Transport

An Introduction to Sliced Optimal Transport arXiv:2508.12519v1 Announce Type: new Abstract: Sliced Optimal Transport (SOT) is a rapidly developing branch of optimal transport (OT) that exploits the tractability of one-dimensional OT problems. By combining tools from OT, integral geometry, and computational statistics, SOT enables fast and scalable computation of distances, barycenters, and kernels for probability…

August 19, 2025
Non-asymptotic convergence bound of conditional diffusion models

Non-asymptotic convergence bound of conditional diffusion models arXiv:2508.10944v1 Announce Type: new Abstract: Learning and generating various types of data based on conditional diffusion models has been a research hotspot in recent years. Although conditional diffusion models have made considerable progress in improving acceleration algorithms and enhancing generation quality, the lack of non-asymptotic properties has hindered…

August 18, 2025
Counterfactual Survival Q Learning for Longitudinal Randomized Trials via Buckley James Boosting

Counterfactual Survival Q Learning for Longitudinal Randomized Trials via Buckley James Boosting arXiv:2508.11060v1 Announce Type: new Abstract: We propose a Buckley James (BJ) Boost Q learning framework for estimating optimal dynamic treatment regimes under right censored survival data, tailored for longitudinal randomized clinical trial settings. The method integrates accelerated failure time models with iterative boosting…

August 18, 2025
Uniform convergence for Gaussian kernel ridge regression

Uniform convergence for Gaussian kernel ridge regression arXiv:2508.11274v1 Announce Type: new Abstract: This paper establishes the first polynomial convergence rates for Gaussian kernel ridge regression (KRR) with a fixed hyperparameter in both the uniform and the $L^{2}$-norm. The uniform convergence result closes a gap in the theoretical understanding of KRR with the Gaussian kernel, where…

August 18, 2025
ADMIRE-BayesOpt: Accelerated Data MIxture RE-weighting for Language Models with Bayesian Optimization

ADMIRE-BayesOpt: Accelerated Data MIxture RE-weighting for Language Models with Bayesian Optimization arXiv:2508.11551v1 Announce Type: new Abstract: Determining the optimal data mixture for large language model training remains a challenging problem with an outsized impact on performance. In practice, language model developers continue to rely on heuristic exploration since no learning-based approach has emerged as a…

August 18, 2025
Nonparametric learning of stochastic differential equations from sparse and noisy data

Nonparametric learning of stochastic differential equations from sparse and noisy data arXiv:2508.11597v1 Announce Type: new Abstract: The paper proposes a systematic framework for building data-driven stochastic differential equation (SDE) models from sparse, noisy observations. Unlike traditional parametric approaches, which assume a known functional form for the drift, our goal here is to learn the entire…

August 18, 2025
Prediction-Powered Inference with Inverse Probability Weighting

Prediction-Powered Inference with Inverse Probability Weighting arXiv:2508.10149v1 Announce Type: new Abstract: Prediction-powered inference (PPI) is a recent framework for valid statistical inference with partially labeled data, combining model-based predictions on a large unlabeled set with bias correction from a smaller labeled subset. We show that PPI can be extended to handle informative labeling by replacing…

August 15, 2025
Mo’ Memory, Mo’ Problems: Stream-Native Machine Unlearning

Mo’ Memory, Mo’ Problems: Stream-Native Machine Unlearning arXiv:2508.10193v1 Announce Type: new Abstract: Machine unlearning work assumes a static, i.i.d training environment that doesn’t truly exist. Modern ML pipelines need to learn, unlearn, and predict continuously on production streams of data. We translate the notion of the batch unlearning scenario to the online setting using notions…

August 15, 2025
An Iterative Algorithm for Differentially Private $k$-PCA with Adaptive Noise

An Iterative Algorithm for Differentially Private $k$-PCA with Adaptive Noise arXiv:2508.10879v1 Announce Type: new Abstract: Given $n$ i.i.d. random matrices $A_i in mathbb{R}^{d times d}$ that share a common expectation $Sigma$, the objective of Differentially Private Stochastic PCA is to identify a subspace of dimension $k$ that captures the largest variance directions of $Sigma$, while…

August 15, 2025
A pseudo-inverse of a line graph

A pseudo-inverse of a line graph arXiv:2508.09412v1 Announce Type: new Abstract: Line graphs are an alternative representation of graphs where each vertex of the original (root) graph becomes an edge. However not all graphs have a corresponding root graph, hence the transformation from graphs to line graphs is not invertible. We investigate the case when…

August 14, 2025
Scalable h-adaptive probabilistic solver for time-independent and time-dependent systems

Scalable h-adaptive probabilistic solver for time-independent and time-dependent systems arXiv:2508.09623v1 Announce Type: new Abstract: Solving partial differential equations (PDEs) within the framework of probabilistic numerics offers a principled approach to quantifying epistemic uncertainty arising from discretization. By leveraging Gaussian process regression and imposing the governing PDE as a constraint at a finite set of collocation…

August 14, 2025
Structured Kernel Regression VAE: A Computationally Efficient Surrogate for GP-VAEs in ICA

Structured Kernel Regression VAE: A Computationally Efficient Surrogate for GP-VAEs in ICA arXiv:2508.09721v1 Announce Type: new Abstract: The interpretability of generative models is considered a key factor in demonstrating their effectiveness and controllability. The generated data are believed to be determined by latent variables that are not directly observable. Therefore, disentangling, decoupling, decomposing, causal inference,…

August 14, 2025
Objective Soups: Multilingual Multi-Task Modeling for Speech Processing

Objective Soups: Multilingual Multi-Task Modeling for Speech Processing arXiv:2508.09228v1 Announce Type: cross Abstract: Training a single model for multilingual, multi-task speech processing (MSP) is severely hampered by conflicting objectives between tasks like speech recognition and translation. While multi-objective optimization (MOO) aims to align gradient updates, its effectiveness diminishes as the number of tasks grows, making…

August 14, 2025
On Experiments

On Experiments arXiv:2508.08288v1 Announce Type: new Abstract: The scientific process is a means for turning the results of experiments into knowledge about the world in which we live. Much research effort has been directed toward automating this process. To do this, one needs to formulate the scientific process in a precise mathematical language. This paper…

August 13, 2025
Projection-based multifidelity linear regression for data-scarce applications

Projection-based multifidelity linear regression for data-scarce applications arXiv:2508.08517v1 Announce Type: new Abstract: Surrogate modeling for systems with high-dimensional quantities of interest remains challenging, particularly when training data are costly to acquire. This work develops multifidelity methods for multiple-input multiple-output linear regression targeting data-limited applications with high-dimensional outputs. Multifidelity methods integrate many inexpensive low-fidelity model evaluations…

August 13, 2025
In-Context Learning as Nonparametric Conditional Probability Estimation: Risk Bounds and Optimality

In-Context Learning as Nonparametric Conditional Probability Estimation: Risk Bounds and Optimality arXiv:2508.08673v1 Announce Type: new Abstract: This paper investigates the expected excess risk of In-Context Learning (ICL) for multiclass classification. We model each task as a sequence of labeled prompt samples and a query input, where a pre-trained model estimates the conditional class probabilities of…

August 13, 2025
Hierarchical Variable Importance with Statistical Control for Medical Data-Based Prediction

Hierarchical Variable Importance with Statistical Control for Medical Data-Based Prediction arXiv:2508.08724v1 Announce Type: new Abstract: Recent advances in machine learning have greatly expanded the repertoire of predictive methods for medical imaging. However, the interpretability of complex models remains a challenge, which limits their utility in medical applications. Recently, model-agnostic methods have been proposed to measure…

August 13, 2025
Bio-Inspired Artificial Neural Networks based on Predictive Coding

Bio-Inspired Artificial Neural Networks based on Predictive Coding arXiv:2508.08762v1 Announce Type: new Abstract: Backpropagation (BP) of errors is the backbone training algorithm for artificial neural networks (ANNs). It updates network weights through gradient descent to minimize a loss function representing the mismatch between predictions and desired outputs. BP uses the chain rule to propagate the…

August 13, 2025
Federated Online Learning for Heterogeneous Multisource Streaming Data

Federated Online Learning for Heterogeneous Multisource Streaming Data arXiv:2508.06652v1 Announce Type: new Abstract: Federated learning has emerged as an essential paradigm for distributed multi-source data analysis under privacy concerns. Most existing federated learning methods focus on the “static” datasets. However, in many real-world applications, data arrive continuously over time, forming streaming datasets. This introduces additional…

August 12, 2025
MOCA-HESP: Meta High-dimensional Bayesian Optimization for Combinatorial and Mixed Spaces via Hyper-ellipsoid Partitioning

MOCA-HESP: Meta High-dimensional Bayesian Optimization for Combinatorial and Mixed Spaces via Hyper-ellipsoid Partitioning arXiv:2508.06847v1 Announce Type: new Abstract: High-dimensional Bayesian Optimization (BO) has attracted significant attention in recent research. However, existing methods have mainly focused on optimizing in continuous domains, while combinatorial (ordinal and categorical) and mixed domains still remain challenging. In this paper, we…

August 12, 2025
Membership Inference Attacks with False Discovery Rate Control

Membership Inference Attacks with False Discovery Rate Control arXiv:2508.07066v1 Announce Type: new Abstract: Recent studies have shown that deep learning models are vulnerable to membership inference attacks (MIAs), which aim to infer whether a data record was used to train a target model or not. To analyze and study these vulnerabilities, various MIA methods have…

August 12, 2025
Statistical Inference for Autoencoder-based Anomaly Detection after Representation Learning-based Domain Adaptation

Statistical Inference for Autoencoder-based Anomaly Detection after Representation Learning-based Domain Adaptation arXiv:2508.07049v1 Announce Type: new Abstract: Anomaly detection (AD) plays a vital role across a wide range of domains, but its performance might deteriorate when applied to target domains with limited data. Domain Adaptation (DA) offers a solution by transferring knowledge from a related source…

August 12, 2025
Stochastic dynamics learning with state-space systems

Stochastic dynamics learning with state-space systems arXiv:2508.07876v1 Announce Type: new Abstract: This work advances the theoretical foundations of reservoir computing (RC) by providing a unified treatment of fading memory and the echo state property (ESP) in both deterministic and stochastic settings. We investigate state-space systems, a central model class in time series learning, and establish…

August 12, 2025
Random Walk Learning and the Pac-Man Attack

Random Walk Learning and the Pac-Man Attack arXiv:2508.05663v1 Announce Type: new Abstract: Random walk (RW)-based algorithms have long been popular in distributed systems due to low overheads and scalability, with recent growing applications in decentralized learning. However, their reliance on local interactions makes them inherently vulnerable to malicious behavior. In this work, we investigate an…

August 11, 2025
Stochastic Trace Optimization of Parameter Dependent Matrices Based on Statistical Learning Theory

Stochastic Trace Optimization of Parameter Dependent Matrices Based on Statistical Learning Theory arXiv:2508.05764v1 Announce Type: new Abstract: We consider matrices $boldsymbol{A}(boldsymboltheta)inmathbb{R}^{mtimes m}$ that depend, possibly nonlinearly, on a parameter $boldsymboltheta$ from a compact parameter space $Theta$. We present a Monte Carlo estimator for minimizing $text{trace}(boldsymbol{A}(boldsymboltheta))$ over all $boldsymbolthetainTheta$, and determine the sampling amount so that…

August 11, 2025
Reduction Techniques for Survival Analysis

Reduction Techniques for Survival Analysis arXiv:2508.05715v1 Announce Type: new Abstract: In this work, we discuss what we refer to as reduction techniques for survival analysis, that is, techniques that “reduce” a survival task to a more common regression or classification task, without ignoring the specifics of survival data. Such techniques particularly facilitate machine learning-based survival…

August 11, 2025
Lightweight Auto-bidding based on Traffic Prediction in Live Advertising

Lightweight Auto-bidding based on Traffic Prediction in Live Advertising arXiv:2508.06069v1 Announce Type: new Abstract: Internet live streaming is widely used in online entertainment and e-commerce, where live advertising is an important marketing tool for anchors. An advertising campaign hopes to maximize the effect (such as conversions) under constraints (such as budget and cost-per-click). The mainstream…

August 11, 2025
Decorrelated feature importance from local sample weighting

Decorrelated feature importance from local sample weighting arXiv:2508.06337v1 Announce Type: new Abstract: Feature importance (FI) statistics provide a prominent and valuable method of insight into the decision process of machine learning (ML) models, but their effectiveness has well-known limitations when correlation is present among the features in the training data. In this case, the FI…

August 11, 2025
Differentially Private Model-X Knockoffs via Johnson-Lindenstrauss Transform

Differentially Private Model-X Knockoffs via Johnson-Lindenstrauss Transform arXiv:2508.04800v1 Announce Type: new Abstract: We introduce a novel privatization framework for high-dimensional controlled variable selection. Our framework enables rigorous False Discovery Rate (FDR) control under differential privacy constraints. While the Model-X knockoff procedure provides FDR guarantees by constructing provably exchangeable “negative control” features, existing privacy mechanisms like…

August 8, 2025
The Cosine Schedule is Fisher-Rao-Optimal for Masked Discrete Diffusion Models

The Cosine Schedule is Fisher-Rao-Optimal for Masked Discrete Diffusion Models arXiv:2508.04884v1 Announce Type: new Abstract: In this work, we study the problem of choosing the discretisation schedule for sampling from masked discrete diffusion models in terms of the information geometry of the induced probability path. Specifically, we show that the optimal schedule under the Fisher-Rao…

August 8, 2025
L1-Regularized Functional Support Vector Machine

L1-Regularized Functional Support Vector Machine arXiv:2508.05567v1 Announce Type: new Abstract: In functional data analysis, binary classification with one functional covariate has been extensively studied. We aim to fill in the gap of considering multivariate functional covariates in classification. In particular, we propose an $L_1$-regularized functional support vector machine for binary classification. An accompanying algorithm is…

August 8, 2025
High-Dimensional Differentially Private Quantile Regression: Distributed Estimation and Statistical Inference

High-Dimensional Differentially Private Quantile Regression: Distributed Estimation and Statistical Inference arXiv:2508.05212v1 Announce Type: new Abstract: With the development of big data and machine learning, privacy concerns have become increasingly critical, especially when handling heterogeneous datasets containing sensitive personal information. Differential privacy provides a rigorous framework for safeguarding individual privacy while enabling meaningful statistical analysis. In…

August 8, 2025
High-Order Error Bounds for Markovian LSA with Richardson-Romberg Extrapolation

High-Order Error Bounds for Markovian LSA with Richardson-Romberg Extrapolation arXiv:2508.05570v1 Announce Type: new Abstract: In this paper, we study the bias and high-order error bounds of the Linear Stochastic Approximation (LSA) algorithm with Polyak-Ruppert (PR) averaging under Markovian noise. We focus on the version of the algorithm with constant step size $alpha$ and propose a…

August 8, 2025
Reliable Programmatic Weak Supervision with Confidence Intervals for Label Probabilities

Reliable Programmatic Weak Supervision with Confidence Intervals for Label Probabilities arXiv:2508.03896v1 Announce Type: new Abstract: The accurate labeling of datasets is often both costly and time-consuming. Given an unlabeled dataset, programmatic weak supervision obtains probabilistic predictions for the labels by leveraging multiple weak labeling functions (LFs) that provide rough guesses for labels. Weak LFs commonly…

August 7, 2025
Reinforcement Learning in MDPs with Information-Ordered Policies

Reinforcement Learning in MDPs with Information-Ordered Policies arXiv:2508.03904v1 Announce Type: new Abstract: We propose an epoch-based reinforcement learning algorithm for infinite-horizon average-cost Markov decision processes (MDPs) that leverages a partial order over a policy class. In this structure, $pi’ leq pi$ if data collected under $pi$ can be used to estimate the performance of $pi’$,…

August 7, 2025
Deep Neural Network-Driven Adaptive Filtering

Deep Neural Network-Driven Adaptive Filtering arXiv:2508.04258v1 Announce Type: new Abstract: This paper proposes a deep neural network (DNN)-driven framework to address the longstanding generalization challenge in adaptive filtering (AF). In contrast to traditional AF frameworks that emphasize explicit cost function design, the proposed framework shifts the paradigm toward direct gradient acquisition. The DNN, functioning as…

August 7, 2025
Negative binomial regression and inference using a pre-trained transformer

Negative binomial regression and inference using a pre-trained transformer arXiv:2508.04111v1 Announce Type: new Abstract: Negative binomial regression is essential for analyzing over-dispersed count data in in comparative studies, but parameter estimation becomes computationally challenging in large screens requiring millions of comparisons. We investigate using a pre-trained transformer to produce estimates of negative binomial regression parameters…

August 7, 2025
The Relative Instability of Model Comparison with Cross-validation

The Relative Instability of Model Comparison with Cross-validation arXiv:2508.04409v1 Announce Type: new Abstract: Existing work has shown that cross-validation (CV) can be used to provide an asymptotic confidence interval for the test error of a stable machine learning algorithm, and existing stability results for many popular algorithms can be applied to derive positive instances where…

August 7, 2025
Hedging with memory: shallow and deep learning with signatures

Hedging with memory: shallow and deep learning with signatures arXiv:2508.02759v1 Announce Type: new Abstract: We investigate the use of path signatures in a machine learning context for hedging exotic derivatives under non-Markovian stochastic volatility models. In a deep learning setting, we use signatures as features in feedforward neural networks and show that they outperform LSTMs…

August 6, 2025