Latest revision as of 04:36, 28 February 2022

The Kantorovich Dual Problem is one of the minimization problems in Optimal Transport. It is a dual problem of the Kantorovich Problem.

The Shipper's Problem

One of the ways to understand this problem is stated by Caffarelli. The statement is presented in the book by Villani ^[1]. We will provide the modern rephrase of his statement.

During the COVID-19 pandemic, people are in the lockdown and it is the best time to enjoy a coffee time. All in all, it costs Amazon $c(x,y)$ dollars to ship one box of necessary espresso capsules from place $x$ to place $y$ , i.e. from warehouses to homes. We want to optimize this expensive habit and consequently to solve appropriate Monge-Kantorovich problem. The mathematicians come to Amazon and propose the new kind of payment. For every box at place $x$ they will charge $\varphi (x)$ dollars and $\psi (y)$ dollars to deliver at place $y$ . However, mathematicians will not reveal their shipping routes. Of course, in order for Amazon to accept this offer, the price $\varphi (x)+\psi (y)\leq c(x,y).$ The moral is that if the mathematicians are smart enough, they will be capable to make this shipment cheaper. This is provided by Kantorovich duality theorem. Take care that in the same cases, mathematicians will also give negative prices, if it is necessary!

Statement of Theorem

This is the statement of the Theorem in the book "Topics in Optimal Transportation", by Cedric Villani.

Theorem.^[1] Let X and Y be Polish spaces, let

\mu \in {\mathcal {P}}(X)

and

\nu \in {\mathcal {P}}(Y)

, and let a cost function

c:X\times Y\rightarrow [0,+\infty ]

be lower semi-continuous.

Whenever $\pi \in {\mathcal {P}}(X\times Y)$ and $(\varphi ,\psi )\in L^{1}(d\mu )\times L^{1}(d\nu )$ , define

$I[\pi ]=\int _{X\times Y}c(x,y)d\pi (x,y),\quad J(\varphi ,\psi )=\int _{X}\varphi (x)d\mu (x)+\int _{Y}\psi (y)d\nu (y)$ .

Define $\Pi (\mu ,\nu )$ to be the set of Borel probability measures $\pi$ on $X\times Y$ such that for all measurable sets $A\subset X$ and $B\subset Y$ , $\pi [A\times Y]=\mu (A)$ , $\pi [X\times B]=\nu (B)$ , and define $\Phi _{c}$ to be the set of all measurable functions $(\varphi ,\psi )\in L^{1}(d\mu )\times L^{1}(d\nu )$ satisfying $\varphi (x)+\psi (y)\leq c(x,y)$ for $d\mu$ almost everywhere in X and $d\nu$ almost everywhere in Y.

Then $\inf _{\Pi (\mu ,\nu )}I[\pi ]=\sup _{\Phi _{c}}J(\varphi ,\psi )$ .

Moreover, the infimum $\inf _{\Pi (\mu ,\nu )}I[\pi ]$ is attained. In addition it is possible to restrict $\varphi$ and $\psi$ to be continuous and bounded.

Ideas and the techniques used in the proof

First, we assume that our spaces $X$ and $Y$ are compact and that the cost function $c(x,y)$ is continuous. The general case follows by an approximation argument.

The main idea is to use minimax principle, i.e. interchanging inf sup with sup inf in the proof. For this, we need some basic convex analysis techniques, namely Legendre-Fenchel transform and Theorem on Fenchel-Rockafellar Duality (its proof is based on Hahn-Banach theorem [1] consequence on separating convex sets). The required statements can be found in the book by Rockafellar^[2] and the book by Bauschke and Combettes^[3].

Take a note that at some point we use Arzela-Ascoli Theorem [2]. In a non-compact space this is not possible. In order to evade compactness property, we have to use Prokhorov's theorem.

Theorem.^[4] Let

\mu _{n}

be a sequence of tight probability measures on Polish space

X

. Then, there exists

\mu \in P(X)

and convergent subsequence

\mu _{n_{k}}

such that

\mu _{n_{k}}\rightarrow \mu

in the dual of

C_{b}(X)

. Conversely, every sequence

\mu _{n}\rightarrow \mu

is tight.

The proof of the previous Theorem can be found in ^[4]. For more information on $C_{b}(X)$ duality, take a look at Dual space of C_0(x) vs C_b(x).

C-concave functions

There are a few alternative proofs of the above Theorem. First, we will discuss the conclusion of the Theorem. Again, we will follow the path given in the book by Villani ^[1].

In Kantorovich Duality Theorem, the left-hand side of the last equality, the infimum $\inf _{\Pi (\mu ,\nu )}I[\pi ]$ is attained. We do not know anything similar about the right-hand side. However, when cost function $c(x,y)$ is bounded we can restrict $\sup _{\Phi _{c}}J(\varphi ,\psi )$ to pairs $(\varphi ^{cc},\varphi ^{c})$ where $\varphi$ is bounded and

$\varphi ^{c}(y)=\inf _{y\in Y}[c(x,y)-\varphi (x)],\quad \varphi ^{cc}(x)=\inf _{y\in Y}[c(x,y)-\varphi ^{c}(y)].$

The pair $(\varphi ^{cc},\varphi ^{c})$ is called a pair of conjugate c-concave functions. It is known that $(\varphi ^{cc})^{c}=\varphi ^{c}$ and that $\varphi ^{c}$ is measurable. The proof can be found in the book ^[4](Chapter 1, p.27).

In addition, it is possible to give a proof of Kantorovich Duality theorem using c-concave functions. Namely, we can find c-concave function $\varphi$ such that
$(x,y)\in \Gamma \implies \varphi (x)+\varphi ^{c}(y)=c(x,y).$ Here, $\Gamma$ is the support for the Kantorovich Problem (it has to be non-empty). The proof of this fact can also be found in the book ^[4](Chapter 1, p.11).

References

↑ ^1.0 ^1.1 ^1.2 C. Villani, Topics in Optimal Transportation, Chapter 1, pages 17-21
↑ R.T. Rockafellar,Convex Analysis, Princeton University Press, Princeton, 1970
↑ Heinz H. Bauschke, Patrick L. Combettes, Convex Analysis and Monotone Operator Theory in Hilbert Spaces
↑ ^4.0 ^4.1 ^4.2 ^4.3 F. Santambrogio, Optimal Transport for Applied Mathematicians, Chapter 1, pages 9-16

[Villani-1] 1.0 ^1.1 ^1.2 C. Villani, Topics in Optimal Transportation, Chapter 1, pages 17-21

[Rockafellar-2] R.T. Rockafellar,Convex Analysis, Princeton University Press, Princeton, 1970

[BandC-3] Heinz H. Bauschke, Patrick L. Combettes, Convex Analysis and Monotone Operator Theory in Hilbert Spaces

[Santambrogio-4] 4.0 ^4.1 ^4.2 ^4.3 F. Santambrogio, Optimal Transport for Applied Mathematicians, Chapter 1, pages 9-16

[1]

[2]

[3]

[4]

@@ Line 1: / Line 1: @@
-==Introduction==
+The Kantorovich Dual Problem is one of the minimization problems in [http://34.106.105.83/wiki/Main_Page Optimal Transport]. It is a dual problem of the [http://34.106.105.83/wiki/ Kantorovich Problem].
-The main advantage of Kantorovich Problem, in comparison to Monge problem, is in the convex constraint property. It is possible to formulate the dual problem. It is formulated in the very general metric spaces called Polish spaces, i.e. complete separable.
 ==The Shipper's Problem==
-Type of this problem is stated by Caffarelli. We will provide the modern one.
+One of the ways to understand this problem is stated by Caffarelli. The statement is presented in the book by Villani <ref name=Villani />. We will provide the modern rephrase of his statement.
-During the pandemic, people are in the lockdown and it is the best time to enjoy a coffee time. All in all, it costs Amazon <math> c(x,y) </math> dollars to ship one box of necessary espresso capsules from place <math> x </math> to  place <math> y </math>, i.e. from warehouses to homes. We want to optimize this expensive habit and consequently to solve appropriate Monge-Kantorovich problem. The mathematicians come to Amazon and propose the new kind of payment. For every box at place <math> x </math> they will charge <math> \varphi(x) </math> dollars and <math> \psi(y) </math> dollars to deliver at place <math> y </math>. However, mathematicians will not reveal their shipping routes. Of course, in order for Amazon to accept this offer, the price <math> \varphi(x)+\psi(y) \leq c(x,y).</math>  The moral is that if the mathematicians are smart enough, they will be capable to make this shipment cheaper. This is provided by Kantorovich duality theorem. Take care that in the same cases, mathematicians will also give negative prices!
+During the [https://en.wikipedia.org/wiki/ COVID-19 pandemic], people are in the lockdown and it is the best time to enjoy a coffee time. All in all, it costs Amazon <math> c(x,y) </math> dollars to ship one box of necessary espresso capsules from place <math> x </math> to  place <math> y </math>, i.e. from warehouses to homes. We want to optimize this expensive habit and consequently to solve appropriate Monge-Kantorovich problem. The mathematicians come to Amazon and propose the new kind of payment. For every box at place <math> x </math> they will charge <math> \varphi(x) </math> dollars and <math> \psi(y) </math> dollars to deliver at place <math> y </math>. However, mathematicians will not reveal their shipping routes. Of course, in order for Amazon to accept this offer, the price <math> \varphi(x)+\psi(y) \leq c(x,y).</math>  The moral is that if the mathematicians are smart enough, they will be capable to make this shipment cheaper. This is provided by Kantorovich duality theorem. Take care that in the same cases, mathematicians will also give negative prices, if it is necessary!
 ==Statement of Theorem==
+This is the statement of the Theorem in the book "Topics in Optimal Transportation", by Cedric Villani.
 : '''Theorem.'''<ref name=Villani /> Let X and Y be Polish spaces, let <math>\mu \in \mathcal{P}(X)</math> and <math>\nu \in \mathcal{P}(Y)</math>, and let a cost function <math> c:X \times Y \rightarrow[0,+\infty] </math> be lower semi-continuous.
@@ Line 16: / Line 16: @@
 <math> I[\pi]= \int_{X\times Y} c(x,y) d\pi(x,y), \quad J(\varphi,\psi)=\int_{X}\varphi(x)d\mu(x)+\int_{Y}\psi(y) d\nu(y) </math>.
-Define <math> \Pi(\mu,\nu) </math> to be the set of Borel probability measures <math> \pi </math> on <math> X\times Y </math> such that for all measurable sets <math> A \subset X </math> and <math> B \subset Y </math>, <br>
+Define <math> \Pi(\mu,\nu) </math> to be the set of Borel probability measures <math> \pi </math> on <math> X\times Y </math> such that for all measurable sets <math> A \subset X </math> and <math> B \subset Y </math>, <math> \pi[A\times Y]=\mu(A) </math>, <math> \pi[X\times B]=\nu(B) </math>, and define <math> \Phi_{c} </math> to be the set of all measurable functions <math> (\varphi, \psi) \in L^{1}(d\mu) \times L^{1}(d\nu) </math> satisfying <math> \varphi(x)+\psi(y) \leq c(x,y) </math> for <math> d\mu </math> almost everywhere in X and <math> d\nu </math> almost everywhere in Y. <br>
-<math> \pi[A\times Y]=\mu(A) </math>, <math> \pi[X\times B]=\nu(B) </math>, <br>
+Then <math> \inf_{\Pi(\mu,\nu)} I[\pi] = \sup_{\Phi_{c}} J(\varphi,\psi) </math>. <br>
-and define <math> \Phi_{c} </math> to be the set of all measurable functions <math> (\varphi, \psi) \in L^{1}(d\mu) \times L^{1}(d\nu) </math> satisfying <math> \varphi(x)+\psi(y) \leq c(x,y) </math> for <math> d\mu </math> almost everywhere in X and <math> d\nu </math> almost everywhere in Y. <br>
+Moreover, the infimum <math> \inf_{\Pi(\mu,\nu)} I[\pi] </math> is attained. In addition it is possible to restrict <math> \varphi </math> and <math> \psi </math> to be continuous and bounded.
-Then <math> inf_{\Pi(\mu,\nu)} I[\pi] = sup_{\Phi_{c}} J(\varphi,\psi) </math>. <br>
+==Ideas and the techniques used in the proof==
-Moreover, the infimum <math> inf_{\Pi(\mu,\nu)} I[\pi] </math> is attained. In addition it is possible to restrict <math> \varphi </math> and <math> \psi </math> to be continuous and bounded.
-==Outline of the Proof==
 First, we assume that our spaces <math> X </math> and <math> Y </math> are compact and that the cost function <math> c(x,y) </math> is continuous. The general case follows by an approximation argument.
 The main idea is to use minimax principle, i.e. interchanging inf sup with sup inf in the proof.
-For this, we need some basic convex analysis techniques, namely Legendre-Fenchel transform (qoute needed) and Theorem on Fenchel-Rockafellar Duality (its proof is based on Hahn-Banach theorem consequence on separating convex sets.)
+For this, we need some basic convex analysis techniques, namely Legendre-Fenchel transform and Theorem on Fenchel-Rockafellar Duality (its proof is based on Hahn-Banach theorem [https://en.wikipedia.org/wiki/Hahn%E2%80%93Banach_theorem] consequence on separating convex sets). The required statements can be found in the book by Rockafellar<ref name=Rockafellar /> and the book by Bauschke and Combettes<ref name=BandC />.
-Take a note that at some point we use Arzela-Ascoli Theorem. In a non-compact space this is not possible.
+Take a note that at some point we use Arzela-Ascoli Theorem [https://en.wikipedia.org/wiki/Arzel%C3%A0%E2%80%93Ascoli_theorem]. In a non-compact space this is not possible.
 In order to evade compactness property, we have to use Prokhorov's theorem.
+: '''Theorem.'''<ref name=Santambrogio /> Let <math> \mu_{n} </math> be a sequence of tight probability measures on Polish space <math> X </math>. Then, there exists <math> \mu \in P(X) </math> and convergent subsequence <math> \mu_{n_{k}}</math> such that <math> \mu_{n_{k}} \rightarrow \mu </math> in the dual of <math> C_{b}(X) </math>. Conversely, every sequence <math> \mu_{n} \rightarrow \mu </math> is tight.
+The proof of the previous Theorem can be found in <ref name=Santambrogio />. For more information on <math> C_{b}(X) </math> duality, take a look at [http://34.106.105.83/wiki/ Dual space of C_0(x) vs C_b(x)].
 ==C-concave functions==
-In Kantorovich Duality Theorem, the left-hand side of the last equality, the infimum <math> inf_{\Pi(\mu,\nu)} I[\pi] </math> is attained. We do not know something similar about the right-hand side. However, when cost function <math> c(x,y) </math> is bounded we can restrict <math >sup_{\Phi_{c}} J(\varphi,\psi) </math> to pairs <math> (\varphi^{cc},\varphi^{c}) </math> where <math> \varphi </math> is bounded and <br>
+There are a few alternative proofs of the above Theorem. First, we will discuss the conclusion of the Theorem. Again, we will follow the path given in the book by Villani <ref name=Villani />.
+In Kantorovich Duality Theorem, the left-hand side of the last equality, the infimum <math> \inf_{\Pi(\mu,\nu)} I[\pi] </math> is attained. We do not know anything similar about the right-hand side. However, when cost function <math> c(x,y) </math> is bounded we can restrict <math >\sup_{\Phi_{c}} J(\varphi,\psi) </math> to pairs <math> (\varphi^{cc},\varphi^{c}) </math> where <math> \varphi </math> is bounded and <br>
+<math> \varphi^{c}(y)=\inf_{y \in Y} [c(x,y) - \varphi(x)], \quad \varphi^{cc}(x)=\inf_{y \in Y} [c(x,y) - \varphi^{c}(y)]. </math>
+The pair <math> (\varphi^{cc},\varphi^{c}) </math> is called a pair of conjugate c-concave functions. It is known that <math> (\varphi^{cc})^{c}=\varphi^{c} </math> and that <math> \varphi^{c} </math> is measurable. The proof can be found in the book <ref name=Santambrogio />(Chapter 1, p.27).
-<math> \varphi^{c}(y)=inf_{y \in Y} [c(x,y) - \varphi(x)], \quad \varphi^{cc}(x)=inf_{y \in Y} [c(x,y) - \varphi^{c}(y)]. </math>
+In addition, it is possible to give a proof of Kantorovich Duality theorem using c-concave functions. Namely, we can find c-concave function <math> \varphi </math> such that <br> <math> (x,y) \in \Gamma \implies \varphi(x)+\varphi^{c}(y)=c(x,y). </math> Here, <math> \Gamma </math> is the support for the Kantorovich Problem (it has to be non-empty). The proof of this fact can also be found in the book <ref name=Santambrogio />(Chapter 1, p.11).
-The pair <math> (\varphi^{cc},\varphi^{c}) </math> is called a pair of conjugate c-concave functions. It is known that <math> (\varphi^{cc})^{c}=\varphi^{c} </math> and that <math> \varphi^{c} </math> is measurable.
+= References =
-In addition, it is possible to give a proof of Kantorovich Duality theorem using c-concave functions. Namely, we can find c-concave function <math> \varphi </math> such that <br> <math> (x,y) \in \Gamma \implies \varphi(x)+\varphi^{c}(y)=c(x,y). </math> This should lead to much faster proof.
+<references>
-==References==
+<ref name="Villani">[https://people.math.gatech.edu/~gangbo/Cedric-Villani.pdf C. Villani, ''Topics in Optimal Transportation'', Chapter 1, pages 17-21]</ref>
-<references />
+<ref name="Santambrogio"> [https://link-springer-com.proxy.library.ucsb.edu:9443/book/10.1007/978-3-319-20828-2 F. Santambrogio, ''Optimal Transport for Applied Mathematicians'', Chapter 1, pages 9-16] </ref>
-<ref name="Villani">[https://people.math.gatech.edu/~gangbo/Cedric-Villani.pdf C. Villani, ''Topics in Optimal Transportation'', Chapter 1.] (pages 17-21)</ref>
+<ref name="BandC"> [https://link.springer.com/book/10.1007/978-3-319-48311-5 Heinz H. Bauschke, Patrick L. Combettes, ''Convex Analysis and Monotone Operator Theory in Hilbert Spaces''] </ref>
-<ref name="Santambrogio">https://link-springer-com.proxy.library.ucsb.edu:9443/book/10.1007/978-3-319-20828-2 F. Santambrogio, ''Optimal Transport for Applied Mathematicians'', Chapter 1.] (pages 9-16)</ref>
+<ref name="Rockafellar"> R.T. Rockafellar,''Convex Analysis'', Princeton University Press, Princeton, 1970  </ref>
-</ references>
+</references>

Kantorovich Dual Problem (for general costs): Difference between revisions

Latest revision as of 04:36, 28 February 2022

Contents

The Shipper's Problem

Statement of Theorem

Ideas and the techniques used in the proof

C-concave functions

References

Navigation menu

Kantorovich Dual Problem (for general costs): Difference between revisions

Latest revision as of 04:36, 28 February 2022

The Shipper's Problem

Statement of Theorem

Ideas and the techniques used in the proof

C-concave functions

References

Navigation menu

Search