Kantorovich Problem: Difference between revisions
No edit summary |
|||
(62 intermediate revisions by 4 users not shown) | |||
Line 1: | Line 1: | ||
The Kantorovich | The Kantorovich problem <ref>Villani, Cedric. Topics In Optimal Transportation. American Mathematical Soc., 2003.</ref> is one of the two essential minimization problems in [https://en.wikipedia.org/wiki/Transportation_theory_(mathematics) optimal transport] (the other being the [http://34.106.105.83/wiki/Monge_Problem Monge problem]). It is named after Russian mathematician and Nobel Laureate [https://en.wikipedia.org/wiki/Leonid_Kantorovich Leonid Kantorovich]. | ||
[[File:Kantorovich Problem Image.png|300px|thumb|right|Kantorovich Problem. The Kantorovich Problem (like the Monge Problem) can be visualized as distributing piles of sand into holes (of equal volume) in an optimal fashion. <ref>Mertens, Stephan. A New Approach to the Matching Problem. Physics, 7, 77. 21 July 2014. </ref> ]] | |||
==Introduction== | ==Introduction== | ||
The Kantrovich problem admits a dual because it is a linear minimization problem with convex constraints. | In contrast to the Monge problem, the Kantorovich problem involves working with a non-empty minimization set, a convex constraint set, and a convex effort functional. The Kantrovich problem admits a [https://en.wikipedia.org/wiki/Duality_(mathematics) dual] because it is a linear minimization problem with convex constraints. | ||
===Shipping problem=== | |||
The intuition behind the Kantorovich problem can be given by an explanation of optimizing shipments. Suppose there is a merchant who is attempting to ship items from one place to another. The merchant can hire trucks at some cost <math>c(x, y)</math> for each unit of merchandise which is shipped from point <math>x </math>to point <math>y</math>. Now the shipper is approached by a mathematician, who claims that prices can be set such that they align with the shipper's financial interests <ref>Carlier, Guillame. Optimal Transportation and Economic Applications. IMA. New Mathematical Models in Economics and Finance. 2010 </ref>. This would be achieved by setting the price <math>\phi(x)</math> and <math>\phi(y)</math> such that the sum of <math>\phi(x)</math> and <math>\phi(y)</math> is always less than the cost <math>c(x, y)</math>. This may even involve setting negative prices in certain cases. However, it can be shown that the shipper will spend almost as much as they would have if instead they opted for the original pricing method <ref>Paris, Quininio. An Economic Interpretations ofLinear Programming. Springer. 29 April 2016 </ref>. | |||
==Kantorovich Optimal Transport Problem== | |||
=== | ===Transport Plans=== | ||
The Monge problem was about the optimal way to rearrange mass <ref> Craig, Katy. The Monge Problem. Math 260L. Univ. of Ca. at Santa Barbara. Spring 2020 </ref>. Note that in the Monge formulation of the optimal transport problem, the mass cannot be split and thus it is mapped <math>x \mapsto T(x)</math>. When considering discrete cases, this results in problems when trying to establish maps T such that <math>T_{\#} \mu=\nu</math>. Kantorovich made the observation that the mass in question could be split, which makes the problem much easier to model <ref> Craig, Katy. The Kantorovich Problem. Math 260L. Univ. of Ca. at Santa Barbara. Spring 2020 </ref>. Allowing the mass to be split results in a relaxation of the problem (e.g. half of the mass from <math>x_1</math> can go to <math>y_1</math> and half can go to <math>y_2</math>, and so on). To model this consider <math>d \pi(x, y)</math>, which denotes the mass transported from x to y. This allows the mass to be moved to multiple places. Also consider <math>\mu(A)</math> and <math>\nu(B)</math>: where the total mass taken from measurable set <math>A \in X</math> must be equal to <math>\mu(A)</math> and the total mass taken from measurable set <math>B \in Y</math> must equal <math>\nu(B)</math>. | |||
The constraints of the problem can be written in the following manner: | |||
== | <div style="text-align: center;"> | ||
<math>\pi(A \times Y)=\mu(A)</math> | |||
<math> \quad \pi(X \times B)=\nu(B) \quad</math> | |||
</div> | |||
for all measurable sets <math>A \subseteq X, B \subseteq Y</math>. As such, we can interpret <math> \pi( A\times B) </math> as representing the amount of mass from <math> \mu (A)</math> that is directed to <math> \nu (B)</math> | |||
If we have a measure <math>\pi</math> that satisfies these constraints, then the set of such <math>\pi</math> is referred to as <math>\Pi(\mu, \nu)</math> -- the set of transport plans between <math>\mu</math> and <math>\nu</math>. Notice again that now we are dealing with transport plans instead of the transport maps that are used in the Monge formulation of the problem <ref> Craig, Katy. The Kantorovich Problem. Math 260L. Univ. of Ca. at Santa Barbara. Spring 2020 </ref>. | |||
===Problem Statement=== | |||
Given <math>\mu \in \mathcal{P}(X)</math> and <math>\nu \in \mathcal{P}(Y)</math>, solve | |||
<div style="text-align: center;"> | |||
<math> | <math>\operatorname{min} \mathbb{K}(\pi):= \operatorname{min} \int_{X \times Y} c(x, y) \mathrm{d} \pi(x, y)</math> | ||
\ | </div> | ||
</math> | |||
over all such <math>\pi \in \Pi(\mu, \nu)</math> | |||
<math> | |||
</math> | |||
== | Assuming there is a transport map <math>T^{\dagger}: X \rightarrow Y</math> for the Monge problem, we define <math>\mathrm{d} \pi(x, y)=\mathrm{d} \mu(x) \delta_{y=T^{\dagger}(x)}</math>. Using this we can see that: | ||
<div style="text-align: center;"> | |||
<math>\begin{aligned} \pi(A \times Y) &=\int_{A} \delta_{T^{\dagger}(x) \in Y} \mathrm{d} \mu(x)=\mu(A) \\ \pi(X \times B) &=\int_{X} \delta_{T^{\dagger}(x) \in B} \mathrm{d} \mu(x)=T_{\#}^{\dagger} \mu(B)=\nu(B) \end{aligned}</math> | |||
</div> | |||
We can see that <math>\int_{X \times Y} c(x, y) \mathrm{d} \pi(x, y)=\int_{X} c\left(x, T^{\dagger}(x)\right) \mathrm{d} \mu(x)</math> | |||
thus <math>\inf \mathbb{K}(\pi) \leq \inf \mathbb{M}(T)</math>. | |||
===Kantorovich Duality=== | |||
Since the Kantorovich problem is a linear minimization problem with convex constraints it admits a [http://34.106.105.83/wiki/Kantorovich_Dual_Problem_(for_general_costs) dual problem]. The astute reader may notice that this is a linear programming problem -- Kantorovich is also considered to be the founder of linear programming. | |||
== | ===Calculus of Variations Approach=== | ||
Under the right setting, one can show the Kantovorich problem indeed has a minimizer using the direct method of the calculus of variations. More specifically, if one turns to the narrow topology, then it turns out that we get compactness of the constraint set. Moreover, such a topology ensures us that our objective function is lower semi-continuous. | |||
===Knott-Smith Optimality Criterion=== | |||
One useful result we have that allows us to connect both the Monge and Kantorovich problems is the so-called the Knott-Smith Optimality Criterion (see below). | |||
==Statement:== | |||
Suppose <math> X\subset \mathbb{R}^d </math> is compact and <math> \mu, \nu \in \mathcal{P}(X)</math>. Let <math> c(x,y) = |x-y|^2 </math>. Then, | |||
(i) There exists <math> f_* \in L^1(\mu) </math> proper, lower semi-continuous, and convex such that | |||
< | (a) <math> \sup_{\phi, \psi \in C_{b}(\mathbb{R}^d),\ \phi \bigoplus \psi \leq c} \int \phi\ d\mu + \int \psi d\nu = -P_o = \int |x|^2 - 2f_*(x) d\mu(x) + \int |x|^2 - 2f_*^*(x) d\nu(x)</math> | ||
(b) For any optimal transport plan <math> \pi_* </math>, we have that <math> y\in \partial f_*(x)</math> <math>\pi_*</math>-almost-everywhere <math> (x,y)</math>. | |||
< | (ii) Conversely, if <math> \pi \in \Pi(\mu, \nu)</math> and <math> f\in L^1(\mu)</math> is proper, lower semi-continuous, and convex for which <math> y\in \partial f_*(x) </math> <math> \pi </math>-almost-everywhere <math>(x,y)</math>, then | ||
< | (a) <math> \pi </math> is optimal | ||
(b) <math> -P_o = \int|x|^2-f_*(x)d\mu(x) + \int |x|^2-2f_*^*(x)d\nu(x).</math> | |||
===References=== |
Latest revision as of 06:15, 17 March 2022
The Kantorovich problem [1] is one of the two essential minimization problems in optimal transport (the other being the Monge problem). It is named after Russian mathematician and Nobel Laureate Leonid Kantorovich.
Introduction
In contrast to the Monge problem, the Kantorovich problem involves working with a non-empty minimization set, a convex constraint set, and a convex effort functional. The Kantrovich problem admits a dual because it is a linear minimization problem with convex constraints.
Shipping problem
The intuition behind the Kantorovich problem can be given by an explanation of optimizing shipments. Suppose there is a merchant who is attempting to ship items from one place to another. The merchant can hire trucks at some cost for each unit of merchandise which is shipped from point to point . Now the shipper is approached by a mathematician, who claims that prices can be set such that they align with the shipper's financial interests [3]. This would be achieved by setting the price and such that the sum of and is always less than the cost . This may even involve setting negative prices in certain cases. However, it can be shown that the shipper will spend almost as much as they would have if instead they opted for the original pricing method [4].
Kantorovich Optimal Transport Problem
Transport Plans
The Monge problem was about the optimal way to rearrange mass [5]. Note that in the Monge formulation of the optimal transport problem, the mass cannot be split and thus it is mapped . When considering discrete cases, this results in problems when trying to establish maps T such that . Kantorovich made the observation that the mass in question could be split, which makes the problem much easier to model [6]. Allowing the mass to be split results in a relaxation of the problem (e.g. half of the mass from can go to and half can go to , and so on). To model this consider , which denotes the mass transported from x to y. This allows the mass to be moved to multiple places. Also consider and : where the total mass taken from measurable set must be equal to and the total mass taken from measurable set must equal .
The constraints of the problem can be written in the following manner:
for all measurable sets . As such, we can interpret as representing the amount of mass from that is directed to
If we have a measure that satisfies these constraints, then the set of such is referred to as -- the set of transport plans between and . Notice again that now we are dealing with transport plans instead of the transport maps that are used in the Monge formulation of the problem [7].
Problem Statement
Given and , solve
over all such
Assuming there is a transport map for the Monge problem, we define . Using this we can see that:
We can see that
thus .
Kantorovich Duality
Since the Kantorovich problem is a linear minimization problem with convex constraints it admits a dual problem. The astute reader may notice that this is a linear programming problem -- Kantorovich is also considered to be the founder of linear programming.
Calculus of Variations Approach
Under the right setting, one can show the Kantovorich problem indeed has a minimizer using the direct method of the calculus of variations. More specifically, if one turns to the narrow topology, then it turns out that we get compactness of the constraint set. Moreover, such a topology ensures us that our objective function is lower semi-continuous.
Knott-Smith Optimality Criterion
One useful result we have that allows us to connect both the Monge and Kantorovich problems is the so-called the Knott-Smith Optimality Criterion (see below).
Statement:
Suppose is compact and . Let . Then,
(i) There exists proper, lower semi-continuous, and convex such that
(a) (b) For any optimal transport plan , we have that -almost-everywhere .
(ii) Conversely, if and is proper, lower semi-continuous, and convex for which -almost-everywhere , then
(a) is optimal (b)
References
- ↑ Villani, Cedric. Topics In Optimal Transportation. American Mathematical Soc., 2003.
- ↑ Mertens, Stephan. A New Approach to the Matching Problem. Physics, 7, 77. 21 July 2014.
- ↑ Carlier, Guillame. Optimal Transportation and Economic Applications. IMA. New Mathematical Models in Economics and Finance. 2010
- ↑ Paris, Quininio. An Economic Interpretations ofLinear Programming. Springer. 29 April 2016
- ↑ Craig, Katy. The Monge Problem. Math 260L. Univ. of Ca. at Santa Barbara. Spring 2020
- ↑ Craig, Katy. The Kantorovich Problem. Math 260L. Univ. of Ca. at Santa Barbara. Spring 2020
- ↑ Craig, Katy. The Kantorovich Problem. Math 260L. Univ. of Ca. at Santa Barbara. Spring 2020