Printable PDF
Department of Mathematics,
University of California San Diego

****************************

Defense Talk

Varun Khurana

University of California, San Diego

Learning With Measure-Valued Data

Abstract:

This talk discusses computationally feasible machine learning methods, based on optimal transport and neural network theory, applied to measure-valued data.  We first analyze linearized optimal transport (LOT), which essentially embeds measure-valued data into an $L^2$ space, where out-of-the-box machine learning techniques are available.  We analyze the situations when LOT provides an isometric embedding with respect to the Wasserstein-2 distance and provide necessary bounds when we can achieve a pre-specified linear separation level in the LOT embedding space.  Second, we produce a computationally feasible algorithm to recover low-dimensional structures in measure-valued data by using the LOT embedding along with dimensionality reduction techniques.  Using computational methods for solving optimal transport problems such as the Sinkhorn algorithm or linear programming, we provide approximation guarantees in terms of the sampling rates.  Third, we study structured approximations of measures in Wasserstein space by a scaled Voronoi partition of $\mathbb{R}^d$ generated from a full rank lattice.  We show that these structured approximations match rates of optimal quantizers and empirical measure approximation in most instances.  We then extend these results for noncompactly supported measures that decay fast enough.  Finally, we study methods for comparing probability measures by analyzing a neural network two-sample test.  In particular, we perform time-analysis on a related neural tangent kernel (NTK) two-sample test and extend the analysis to the neural network two-sample test with a small-time training regime.  We also show the amount of time needed before the two-sample test detects a deviation $\epsilon > 0$ in the case the probability measures considered are different versus when they are the same.

Advisor: Alex Cloninger

May 29, 2024

12:30 PM

APM 6402

Research Areas

Statistics

****************************