Optimal Transport in One Dimension: Difference between revisions
(37 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
In this article, we explore the optimal transport problem on the real line along with some examples. | In this article, we briefly explore the optimal transport problem on the real line along with some examples. | ||
== Linear Cost Example == | == Linear Cost Example == | ||
For this example, consider the cost function <math> c(x,y) = L(x-y) </math> along with a given linear map <math> A: \mathbb{R}^d \rightarrow \mathbb{R} </math>. Moreover, if let <math> \gamma </math> be any transport plan, then by direct computation we see that | For this example, consider the cost function <math> c(x,y) = L(x-y) </math> along with a given linear map <math> A: \mathbb{R}^d \rightarrow \mathbb{R} </math>. Moreover, if let <math> \gamma </math> be any transport plan, then by direct computation we see that | ||
<math> \int L(x-y) d \gamma = \int L(x) d \gamma - \int L(y) d \gamma = \int L(x) d \mu - \int L(y) d \nu </math> | |||
which suggests that this result only depends on the marginals of <math> \gamma </math> (wherein <math> \mu </math> and <math>\nu </math> are compactly supported probability measures). In fact, in such cases, every transport plan/map is optimal. | which suggests that this result only depends on the marginals of <math> \gamma </math> (wherein <math> \mu </math> and <math>\nu </math> are compactly supported probability measures). In fact, in such cases, every transport plan/map is optimal. | ||
Line 11: | Line 11: | ||
==Book Shifting Example== | ==Book Shifting Example== | ||
Consider the cost function <math> c(x,y) = |x-y|</math> along with <math> \mu = \frac{1}{2} \lambda_{[0,2]} </math> and <math> \nu = \frac{1}{2}\lambda_{[1,3]}</math> (where <math> \lambda </math> is the one-dimensional Lebesgue measure). | Consider the cost function <math> c(x,y) = |x-y|</math> along with <math> \mu = \frac{1}{2} \lambda_{[0,2]} </math> and <math> \nu = \frac{1}{2}\lambda_{[1,3]}</math> (where <math> \lambda </math> is the one-dimensional Lebesgue measure). A (monotone) transport plan that rearranges <math> \mu </math> to look like <math> \nu </math> is given by <math> T_0(x) = x+1 </math> and its corresponding cost is | ||
<math> M(T_0)= \int |T_0(x)-x| d\mu \equiv 1 </math>. | |||
Furthermore, notice that the piecewise map <math> T_1</math> given by <math> T_1(x) = x+2 </math> (for <math> x\leq 1</math>) and <math> T_1(x) = x </math> (for <math> x>1</math>) satisfies <math> T_1 \# \mu = \nu </math>, i.e. <math> T_1 </math> is a transport map from <math> \mu </math> to <math> \nu </math>; moreover, the corresponding cost is | |||
<math> M(T_1) = \int |T_1(x)-x| d\mu = \frac{1}{2}\int_0^2 2dx \equiv 1 </math> | |||
and so we conclude that <math> T_1</math> is indeed optimal as well. | |||
==Quadratic Cost== | |||
'''Theorem:''' Let <math> \mu, \nu </math> be probability measures on <math> \mathbb{R}</math> with cumulative distribution functions (CDFs) <math> F</math> and <math> G</math>, respectively. Also, let <math> \pi </math> be the probability measure on <math> \mathbb{R}^2</math> with the CDF <math> H(x,y) = \min (F(x), G(y))</math>. Then, <math> \pi \in \Gamma(\mu, \nu)</math> and is optimal (in the Kantorovich problem setting) between <math> \mu </math> and <math> \nu </math> for the (quadratic) cost function <math> c(x,y) = |x-y|^2 </math>, and the corresponding cost is | |||
<math> T_2(\mu, \nu) = \int_0^1 |F^{-1}(t) - G^{-1}(t)|^2dt </math> | |||
where <math> F^{-1}</math> and <math> G^{-1}</math> are the pseudo-inverses of the respective CDFs. | |||
===Ideas and Remarks for the Proof=== | |||
Note the proof from Cedric-Villani gives this result in arbitrary dimensions. Below is a rough outline of proof, and the full details can be found in "Topics in Optimal Transportation" (Villani, cite later). Moreover, the measure <math> \pi </math> constructed in the theorem indeed is optimal provided that the cost function <math> c(x,y)</math> is of the form <math> c(x-y)</math> where <math> c</math> is a convex, nonnegative symmetric function on <math> \mathbb{R}</math>. | |||
One of the first major steps in proving this theorem is showing that <math>supp(\pi) \subset \{(x,y)\in \mathbb{R}^2: F(x^-) \leq G(y) \text{ and } G(y^-) \leq F(x)\}</math> by considering specific cases. Upon showing this, we may conclude that <math> \pi</math> is supported in a monotone subset of <math> \mathbb{R}^2</math> and hence also supported in the sub-differential of some lower semi-continuous convex function. From here, we make use of the Knott-Smith optimality criterion (Villani pg 66) which establishes that <math> \pi </math> is an optimal transference plan. Then, upon showing that <math> \pi = (F^{-1}\times G^{-1})\# \lambda_{[0,1]} </math>, we see that for any nonnegative, measurable function <math> u</math> on <math> \mathbb{R}^2 </math> | |||
<math> \int_{\mathbb{R}^2}u(x,y)d\pi (x,y) =\int_0^1 u(F^{-1}(t), G^{-1}(t))dt </math>. | |||
This then immediately yields the cost <math> T_2(\mu, \nu)</math> and completes the proof. |
Latest revision as of 06:52, 12 February 2022
In this article, we briefly explore the optimal transport problem on the real line along with some examples.
Linear Cost Example
For this example, consider the cost function along with a given linear map . Moreover, if let be any transport plan, then by direct computation we see that
which suggests that this result only depends on the marginals of (wherein and are compactly supported probability measures). In fact, in such cases, every transport plan/map is optimal.
Distance Cost Example
Consider the cost function along with probability measures (on ) and . Then, for any we see that , which then immediately puts us back in the linear cost position, so any transport map/plan is also optimal for such costs.
Book Shifting Example
Consider the cost function along with and (where is the one-dimensional Lebesgue measure). A (monotone) transport plan that rearranges to look like is given by and its corresponding cost is
.
Furthermore, notice that the piecewise map given by (for ) and (for ) satisfies , i.e. is a transport map from to ; moreover, the corresponding cost is
and so we conclude that is indeed optimal as well.
Quadratic Cost
Theorem: Let be probability measures on with cumulative distribution functions (CDFs) and , respectively. Also, let be the probability measure on with the CDF . Then, and is optimal (in the Kantorovich problem setting) between and for the (quadratic) cost function , and the corresponding cost is
where and are the pseudo-inverses of the respective CDFs.
Ideas and Remarks for the Proof
Note the proof from Cedric-Villani gives this result in arbitrary dimensions. Below is a rough outline of proof, and the full details can be found in "Topics in Optimal Transportation" (Villani, cite later). Moreover, the measure constructed in the theorem indeed is optimal provided that the cost function is of the form where is a convex, nonnegative symmetric function on .
One of the first major steps in proving this theorem is showing that by considering specific cases. Upon showing this, we may conclude that is supported in a monotone subset of and hence also supported in the sub-differential of some lower semi-continuous convex function. From here, we make use of the Knott-Smith optimality criterion (Villani pg 66) which establishes that is an optimal transference plan. Then, upon showing that , we see that for any nonnegative, measurable function on
.
This then immediately yields the cost and completes the proof.