Wasserstein barycenters and applications in image processing

From Optimal Transport Wiki
Revision as of 06:36, 12 February 2022 by Gsarajian (talk | contribs)
Jump to navigation Jump to search

In optimal transport, a Wasserstein barycenter [1] is a probability measure that acts as a center of mass between two or more probability measures. It generalizes the notions of physical barycenters and geometric centroids.


Introduction

Motivation

Barycenters in physics and geometry are points that represent a notion of a mean of a number of objects. In astronomy, the barycenter is the center of mass of two or more objects that orbit each other, and in geometry, a centroid is the arithmetic mean of all the points in an object. Given countably many points in with nonnegative weights , the weighted barycenter of the points is the unique point minimizing . Wasserstein barycenters attempt to capture this concept for probability measures by replacing the Euclidean distance with the Wasserstein distance of two probability measures, .

Definition

Let be a domain and  be the set of probability measures on . Given a collection of probability measures and nonnegative weights , we define a weighted barycenter of as any probability measure  that minimizes  over the space . Here denotes the 2-Wasserstein distance, which may be replaced with the -Wasserstein distance, , though this is not always as convenient.

The minimization problem above was originally introduced by Agueh and Carlier [2], who also proposed an alternative formulation of the problem above when there are finitely many measures . Instead of considering the sum over all probability measures , one can equivalently consider the optimization problem over all multi-marginal transport plans whose push forwards satisfy for and for some unspecified probability measure .

Existence and Uniqueness

Agueh and Carlier [3] demonstrated that when at least one of the measures is absolutely continuous, the Wasserstein barycenter exists and is unique.

Examples[4]

In the case where all of the measures are finitely supported measures, computing the Wasserstein barycenter reduces to a problem in linear programming.

When our family of measures consists of only one finitely supported measure and we restrict the input probability measure to finitely supported probability measures whose supports have at most points, the problem of finding the Wasserstein barycenter is equivalent to the -means clustering problem.

Applications

Barycenters in Image processing

Generalizations

Wasserstein barycenters are examples of Karcher and Fréchet means where the distance function used is the Wasserstein distance.

References

  1. Santambrogio, Filippo. Optimal Transport for Applied Mathematicians. Birkhäuser., 2015.
  2. Martial Agueh, Guillaume Carlier. Barycenters in the Wasserstein space. SIAM Journal on Mathematical Analysis, Society for Industrial and Applied Mathematics, 2011, 43 (2), pp.904-924. ff10.1137/100805741ff. ffhal-00637399f
  3. Martial Agueh, Guillaume Carlier. Barycenters in the Wasserstein space. SIAM Journal on Mathematical Analysis, Society for Industrial and Applied Mathematics, 2011, 43 (2), pp.904-924. ff10.1137/100805741ff. ffhal-00637399f
  4. Peyré, Gabriel and Marco Cuturi. “Computational Optimal Transport.” Found. Trends Mach. Learn. 11 (2019): 355-607.