Latest revision as of 04:36, 28 February 2022

Introduction

There are many ways that we can describe Wasserstein metric. One of them is to characterize absolutely continuos curves (AC)(p.188^[1]) and provide a dynamic formulation of the special case $W_{2}^{2}$ Namely, it is possible to see $W_{2}^{2}(\mu ,\nu )$ as an infimum of the lengts of curves that satisfy Continuity equation.

Geodesics in general metric spaces

First, we will introduce definition of the geodesic in general metric space $(X,d)$ . In the following sections. we are going to follow a presentation from the book by Santambrogio^[1] with some digression, here and there.

For the starting point, we need to introduce length of the curve in our metric space $(X,d)$ .

Definition. A length of the curve

\omega :[0,1]\rightarrow X

is defined by

                   $L(\omega )=\sup\{\sum _{j=0}^{n-1}d(\omega (t_{j}),\omega (t_{j+1}))|\quad n\geq 2,\quad 0=t_{0}<t_{1}<...<t_{n-1}=1\}$

Secondly, we use the definition of length of a curve to introduce a geodesic curve.

Definition. A curve

c:[0,1]\rightarrow X

is said to be geodesic between

x

and

y

in

X

if it minimizes the length

L(\omega )

among all the curves

\omega :[0,1]\rightarrow X

such that

x=\omega (0)

and

y=\omega (1)

.

Since we have a definition of a geodesic in the general metric space, it is natural to think of Riemannian structure. It can be formally defined. More about this topic can be seen in the following article Formal Riemannian Structure of the Wasserstein_metric.

Now, we proceed with necessary definitions in order to be able to understand Wasserstein metric in a different way.

Definition. A metric space

(X,d)

is called a length space if it holds

                     $d(x,y)=\inf\{L(\omega )|\quad \omega \in AC(X),\quad \omega (0)=x\quad \omega (1)=y\}.$

A space $(X,d)$ is called geodesic space if the distance $d(x,y)$ is attained for some curve $\omega$ .

Definition. In a length space, a curve

\omega :[0,1]\rightarrow X

is said to be constant speed geodesic between

\omega (0)

and

\omega (1)

in

X

if it satisfies

                     $d(\omega (s),\omega (t))=|t-s|d(\omega (0),\omega (1))$  for all  $t,s\in [0,1]$

It is clear that constant-speed geodesic curve $\omega$ connecting $x$ and $y$ is a geodesic curve. This is very important definition since we have that every constant-speed geodesic $\omega$ is also in $AC(X)$ where $|\omega '(t)|=d(\omega (0),\omega (1))$ almost everywhere in $[0,1]$ .
In addition, minimum of the set $\{\int _{0}^{1}|c'(t)|^{p}dt|c:[0,1]\rightarrow X,c(0)=x,c(1)=y\}$ is attained by our constant-speed geodesic curve $\omega .$ Last fact is important since it is connected to Wasserstein $p$ metric. For more information, please take a look at Wasserstein metric.

For more information on constant-speed geodesics, especially how they depend on uniqueness of the plan that is induced by transport and characterization of a constant-speed geodesic look at the book by L.Ambrosio, N.Gilgi, G.Savaré ^[2] or the book by Santambrogio^[1].

Dynamic formulation of Wasserstein distance

Finally, we can rephrase Wasserstein metrics in dynamic language as mentioned in the Introduction.

Whenever $\Omega \subseteq {\mathcal {R}}^{d}$ is convex set, $W_{p}(\Omega )$ is a geodesic space. Proof can be found in the book by Santambrogio^[1].

Theorem.^[1] Let

\mu ,\nu \in {\mathcal {P}}_{2}(R^{d})

. Then

       $W_{p}^{p}(\mu ,\nu )=\inf _{(\mu (t).\nu (t))}\{\int _{0}^{1}|v(,t)|_{L^{p}(\mu (t))}^{p}dt\quad |\quad \partial _{t}\mu +\nabla \cdot (v\mu )=0,\quad \mu (0)=\mu ,\quad \mu (1)=\nu \}.$

In special case, when $\Omega$ is compact, infimum is attained by some constant-speed geodesic.

Generalized geodesics

There are many ways to generalize this fact. We will talk about a special case $p=2$ and a displacement convexity. Here we follow again book by Santambrogio^[3].

In general, the functional $\mu \rightarrow W_{2}^{2}(\mu ,\nu )$ is not a displacement convex. We can fix this by introducing a generalized geodesic.

Definition. Let

\rho \in {\mathcal {P}}(\Omega )

be an absolutely continuous measure and

\mu _{0}

and

\mu _{1}

probability measures in

{\mathcal {P}}(\Omega )

. We say that

\mu _{t}=((1-t)T_{0}+tT_{1})\#\rho

is a generalized geodesic in

{\mathcal {W}}_{2}(\Omega )

with base

\rho

, where

T_{0}

is the optimal transport plan from

\rho

to

\mu _{0}

and

T_{1}

is the optimal transport plan from

\rho

to

\mu _{1}

.

By calculation, we have the following $W_{2}^{2}(\mu _{t},\rho )\leq (1-t)W_{2}^{2}(\mu _{0},\rho )+tW_{2}^{2}(\mu _{1},\rho ).$

Therefore, along the generalized geodesic, the functional $t\rightarrow W_{2}^{2}(\mu _{t},\rho )$ is convex.

This fact is very important in establishing uniqueness and existence theorems in the geodesic flows.

References

↑ ^1.0 ^1.1 ^1.2 ^1.3 ^1.4 F. Santambrogio, Optimal Transport for Applied Mathematicians, Chapter 1, pages 202-207
↑ [https://link.springer.com/book/10.1007/b137080 L.Ambrosio, N.Gilgi, G.Savaré, Gradient Flows in Metric Spaces and in the Space of Probability Measures, Chapter 7.2., pages 158-160]
↑ F. Santambrogio, Optimal Transport for Applied Mathematicians, Chapter 1, pages 269-276

[Santambrogio-1] 1.0 ^1.1 ^1.2 ^1.3 ^1.4 F. Santambrogio, Optimal Transport for Applied Mathematicians, Chapter 1, pages 202-207

[Ambrosio-2] [https://link.springer.com/book/10.1007/b137080 L.Ambrosio, N.Gilgi, G.Savaré, Gradient Flows in Metric Spaces and in the Space of Probability Measures, Chapter 7.2., pages 158-160]

[Santambrogio1-3] F. Santambrogio, Optimal Transport for Applied Mathematicians, Chapter 1, pages 269-276

[1]

[2]

[3]

@@ Line 1: / Line 1: @@
 == Introduction ==
-There are many ways that we can describe [https://en.wikipedia.org/wiki/Wasserstein_metric Wasserstein metric]. One of them is to characterize absolutely continuos curves (AC)(p.188<ref name=Santambrogio />) and provide a dynamic formulation of the special case <math> W_{2}^{2} </math> Namely, it is possible to see <math> W_{2}^{2}(\mu, \nu) </math> as an infimum of the lengts of curves that satisfy [https://en.wikipedia.org/wiki/Continuity_equation Continuity equation] <br> (<math> \partial_{t}\mu+\nabla(v\mu)=0 </math>).
+There are many ways that we can describe [https://en.wikipedia.org/wiki/Wasserstein_metric Wasserstein metric]. One of them is to characterize absolutely continuos curves (AC)(p.188<ref name=Santambrogio />) and provide a dynamic formulation of the special case <math> W_{2}^{2} </math> Namely, it is possible to see <math> W_{2}^{2}(\mu, \nu) </math> as an infimum of the lengts of curves that satisfy [https://en.wikipedia.org/wiki/Continuity_equation Continuity equation].
 == Geodesics in general metric spaces ==
-First, we will introduce definition of the geodesic in general metric space <math> X </math>. We are going to follow presentation from the book by Santambrogio<ref name="Santambrogio" />.
+First, we will introduce definition of the geodesic in general metric space <math> (X,d) </math>. In the following sections. we are going to follow a presentation from the book by Santambrogio<ref name="Santambrogio" /> with some digression, here and there.
-: '''Definition.''' A curve <math> c:[0,1] \rightarrow X</math> is said to be geodesic in <math> X </math> if it minimizes the length <math> L(\omega)</math> of all  the curves <math> \omega:[0,1] \rightarrow X</math> <br> such that <math> c(0)=\omega(0)</math> and <math> c(1)=\omega(1)</math>.
+For the starting point, we need to introduce length of the curve in our metric space <math> (X,d) </math>.
-Since we have a definition of a geodesic in the general space, it is natural to think of Riemannian structure. More about it can be seen in the following article [http://34.106.105.83/wiki/ Formal Riemannian Structure of the Wasserstein_metric].
+: '''Definition.''' A length of the curve <math> \omega:[0,1] \rightarrow X</math> is defined by
+                   <math> L(\omega)=\sup\{ \sum_{j=0}^{n-1} d(\omega(t_{j}),\omega(t_{j+1})) | \quad n \geq 2,\quad 0=t_{0}<t_{1}<...<t_{n-1}=1 \} </math>
-Now, we proceed with necessary definitions in order to be able to understand Wasserstein metric.
+Secondly, we use the definition of length of a curve to introduce a geodesic curve.
+: '''Definition.''' A curve <math> c:[0,1] \rightarrow X</math> is said to be geodesic between <math> x </math> and <math> y </math> in <math> X </math> if it minimizes the length <math> L(\omega)</math> among all the curves <math> \omega:[0,1] \rightarrow X</math> <br> such that <math> x=\omega(0)</math> and <math> y=\omega(1)</math>.
+Since we have a definition of a geodesic in the general metric space, it is natural to think of Riemannian structure. It can be formally defined. More about this topic can be seen in the following article [http://34.106.105.83/wiki/ Formal Riemannian Structure of the Wasserstein_metric].
+Now, we proceed with necessary definitions in order to be able to understand Wasserstein metric in a different way.
 : '''Definition.''' A metric space <math> (X,d) </math> is called a length space if it holds
-                      <math> d(x,y)=\inf \{L(\omega) | \quad  \omega \in AC(X), \quad \omega(0)=x \quad \omega(1)=y \}.</math>
+                      <math> d(x,y)=\inf \{ L(\omega) | \quad  \omega \in AC(X), \quad \omega(0)=x \quad \omega(1)=y \}.</math>
+A space <math> (X,d) </math> is called geodesic space if the distance <math> d(x,y) </math> is attained for some curve <math> \omega </math>.
 : '''Definition.''' In a length space, a curve <math> \omega:[0,1]\rightarrow X </math> is said to be constant speed geodesic between <math> \omega(0)</math> and <math> \omega(1)</math> in <math> X </math> if it satisfies
@@ Line 20: / Line 29: @@
                       <math> d(\omega(s),\omega(t))=|t-s|d(\omega(0),\omega(1)) </math> for all <math> t,s \in [0,1]</math>
-It is clear that constant speed geodesic curve is geodesic curve.
+It is clear that constant-speed geodesic curve <math> \omega </math> connecting <math> x  </math> and <math> y </math> is a geodesic curve. This is very important definition since we have that every constant-speed geodesic <math> \omega </math> is also in <math> AC(X) </math> where <math> |\omega'(t)|=d(\omega(0),\omega(1)) </math> almost everywhere in <math> [0,1] </math>. <br>
+In addition, minimum of the set <math> \{ \int_{0}^{1}|c'(t)|^{p}dt |  c:[0,1]\rightarrow X, c(0)=x, c(1)=y \} </math> is attained by our constant-speed geodesic curve <math> \omega.</math> Last fact is important since it is connected to Wasserstein <math>p</math> metric. For more information, please take a look at [https://en.wikipedia.org/wiki/Wasserstein_metric Wasserstein metric].
+For more information on constant-speed geodesics, especially how they depend on uniqueness of the plan that is induced by transport and characterization of a constant-speed geodesic look at the book by L.Ambrosio, N.Gilgi, G.Savaré <ref name="Ambrosio" /> or the book by Santambrogio<ref name="Santambrogio" />.
+== Dynamic formulation of Wasserstein distance ==
+Finally, we can rephrase Wasserstein metrics in dynamic language as mentioned in the Introduction.
+Whenever <math> \Omega \subseteq \mathcal{R}^{d} </math> is convex set, <math> W_{p}(\Omega) </math> is a geodesic space. Proof can be found in the book by Santambrogio<ref name="Santambrogio" />.
+: '''Theorem.'''<ref name=Santambrogio /> Let <math> \mu, \nu \in \mathcal{P}_{2}(R^{d}) </math>. Then
+       <math> W_{p}^{p}(\mu, \nu)=\inf_{(\mu(t).\nu(t))} \{\int_{0}^{1} |v(,t)|_{L^{p}(\mu(t))}^{p}dt \quad | \quad \partial_{t}\mu+\nabla\cdot(v\mu)=0,\quad \mu(0)=\mu,\quad \mu(1)=\nu \}. </math>
+In special case, when <math> \Omega </math> is compact, infimum is attained by some constant-speed geodesic.
+== Generalized geodesics ==
-== Statement of Theorem==
+There are many ways to generalize this fact. We will talk about a special case <math> p=2 </math> and a displacement convexity.
+Here we follow again book by Santambrogio<ref name="Santambrogio1" />.
-Now, we can rephrase Wasserstein metrics in dynamic language. In special case, for <math> p=2 </math>:
+In general, the functional <math> \mu \rightarrow W_{2}^{2}(\mu,\nu) </math> is not a displacement convex. We can fix this by introducing a generalized geodesic.
-: '''Theorem.'''(Benamou-Brenier)<ref name=Santambrogio /> Let <math> \mu, \nu \in P_{2}(R^{d}) </math>. Then we have
+: '''Definition.''' Let <math> \rho \in \mathcal{P}(\Omega) </math> be an absolutely continuous measure and <math> \mu_{0} </math> and <math> \mu_{1} </math> probability measures in <math> \mathcal{P}(\Omega) </math>. We say that <math> \mu_{t} = ((1-t)T_{0}+tT_{1})\#\rho </math> <br> is a generalized geodesic in <math> \mathcal{W}_{2}(\Omega) </math> with base <math> \rho </math>, where <math> T_{0} </math> is the optimal transport plan from <math> \rho </math> to <math> \mu_{0} </math> and <math> T_{1} </math> is the optimal transport plan from <math> \rho </math> to <math> \mu_{1} </math>.
-       <math> W_{2}^{2}(\mu, \nu)=\inf_{(\mu(t).\nu(t))} \{\int_{0}^{1} |v(,t)|_{L^{2}(\mu(t))}^{2}dt, \quad \partial_{t}\mu+\nabla(v\mu)=0, \mu(0)=\mu, \mu(1)=\nu \} </math>
-== Generalization ==
+By calculation, we have the following <math> W_{2}^{2}(\mu_{t},\rho) \leq (1-t)W_{2}^{2}(\mu_{0},\rho) + tW_{2}^{2}(\mu_{1},\rho). </math>
-It is possible to generalize the previous theorem and theory to <math>W_{p} </math> metrics. More about that could be seen in the book <ref name="Ambrosio" />. <br>
+Therefore, along the generalized geodesic, the functional <math> t \rightarrow W_{2}^{2}(\mu_{t},\rho) </math> is convex.
-However, it is possible to generalize theorem for a different kind of geodesics <ref name="Santambrogio1" />.
+This fact is very important in establishing uniqueness and existence theorems in the geodesic flows.
 = References =
@@ Line 41: / Line 66: @@
 <ref name="Santambrogio"> [https://link-springer-com.proxy.library.ucsb.edu:9443/book/10.1007/978-3-319-20828-2 F. Santambrogio, ''Optimal Transport for Applied Mathematicians'', Chapter 1, pages 202-207] </ref>
-<ref name="Santambrogio1"> [https://link-springer-com.proxy.library.ucsb.edu:9443/book/10.1007/978-3-319-20828-2 F. Santambrogio, ''Optimal Transport for Applied Mathematicians'', Chapter 1, pages 275-276] </ref>
+<ref name="Santambrogio1"> [https://link-springer-com.proxy.library.ucsb.edu:9443/book/10.1007/978-3-319-20828-2 F. Santambrogio, ''Optimal Transport for Applied Mathematicians'', Chapter 1, pages 269-276] </ref>
 <ref name="Ambrosio"> [https://link.springer.com/book/10.1007/b137080 L.Ambrosio, N.Gilgi, G.Savaré, ''

Geodesics and generalized geodesics: Difference between revisions

Latest revision as of 04:36, 28 February 2022

Contents

Introduction

Geodesics in general metric spaces

Dynamic formulation of Wasserstein distance

Generalized geodesics

References

Navigation menu

Geodesics and generalized geodesics: Difference between revisions

Latest revision as of 04:36, 28 February 2022

Introduction

Geodesics in general metric spaces

Dynamic formulation of Wasserstein distance

Generalized geodesics

References

Navigation menu

Search