The Ky-Fan Metric

We shall discuss an exam question of my Probability-3 course and see a proof as to why the given distance function is a metric over the space of random variables and a few related properties.
Published on: 14 June, 2026 | Author: Pragyan Pranay
probabilityanalysisky-fanmetric-spaces

Probability-3 course was a nightmare for me and I even had to appear for backpaper for this course. This question originally came in the end-semester paper for B.Stat 2nd Year in session 2025-26. Yeah, this one was exceptionally hard to be done in exam conditions, although with the right tools, this isn’t very difficult, but producing the details with this much perfection is still quite hard. Our professor won’t leave us if we wrote even a single half-baked argument 🫠

Table of Contents

πŸ” Problem Statement

Problem (The Ky-Fan Metric) :

Let T\mathcal{T} be the set of all real-valued random variables defined on a given probability space (Ξ©,F,P)(\Omega, \mathcal{F} ,\mathbb{P})(where two random variables are said to be equivalent if they are equal with probability 11). Define

d(X,Y)≔inf⁑{Ο΅β‰₯0:P(∣Xβˆ’Y∣>Ο΅)≀ϡ},X,Y∈Td(X,Y) \coloneqq \inf\{\epsilon \ge 0 : \mathbb{P}(|X-Y| > \epsilon) \le \epsilon \}, \quad X,Y \in \mathcal{T}
  1. Show that the infimum in the definition of dd is attained.
  2. Show that dd is a metric on T\mathcal{T}.
  3. Let X∈TX \in \mathcal{T} and {Xn}nβ‰₯1\{X_n\}_{n \ge 1} be a sequence with Xn∈TX_n \in \mathcal{T} for nβ‰₯1n \ge 1. Show that

    Xnβ†’pXβ€…β€ŠβŸΊβ€…β€Šd(Xn,X)β†’0X_n \overset{p}{\to} X \iff d(X_n, X) \to 0

    as nβ†’βˆžn \to \infty.

  4. Show that T\mathcal{T} is complete with respect to metric dd.

πŸ’‘Solution

We will solve the problem part by part. Also, for the entire solution, define

DX,Y≔{Ο΅β‰₯0:P(∣Xβˆ’Y∣>Ο΅)≀ϡ}.D_{X,Y}\coloneqq \{\epsilon \ge 0 : \mathbb{P}(|X-Y| > \epsilon) \le \epsilon \}.

Part A

DX,YD_{X,Y} is non-empty since 1∈DX,Y1 \in D_{X,Y} and DX,YD_{X,Y} is bounded below by 00 by definition. Thus, the infimum exists. Let Ξ³=inf⁑DX,Y\gamma = \inf D_{X,Y}, which means there exists a real sequence anβ†˜Ξ³a_n \searrow \gamma where an∈DX,Ya_n \in D_{X,Y}. Thus,

P(∣Xβˆ’Y∣>an)≀anforΒ allΒ nβ‰₯1.\mathbb{P}(|X-Y| > a_n) \le a_n \quad \text{for all } n \ge 1.

Write P(∣Xβˆ’Y∣>an)=1βˆ’P(∣Xβˆ’Yβˆ£β‰€an)\mathbb{P}(|X-Y| > a_n) = 1 - \mathbb{P}(|X-Y| \le a_n) and let Z=∣Xβˆ’Y∣Z = |X-Y|. Then we get

1βˆ’an≀P(∣Xβˆ’Yβˆ£β‰€an)=FZ(an)1 - a_n \le \mathbb{P}(|X-Y| \le a_n) = F_Z(a_n)

Taking nβ†’βˆžn \to \infty, then by the right continuity of FZF_Z, we get

1βˆ’Ξ³β‰€FZ(Ξ³)=P(∣Xβˆ’Yβˆ£β‰€Ξ³)=1βˆ’P(∣Xβˆ’Y∣>Ξ³).1 - \gamma \le F_Z(\gamma) = \mathbb{P}(|X-Y| \le \gamma) = 1 - \mathbb{P}(|X-Y| > \gamma).

Rearranging, we get

P(∣Xβˆ’Y∣>Ξ³)β‰€Ξ³β€…β€ŠβŸΉβ€…β€ŠΞ³βˆˆDX,Y\mathbb{P}(|X-Y| > \gamma) \le \gamma \implies \gamma \in D_{X,Y}

which completes the proof to the first part. β– \blacksquare

Part B

This one is trickier, lets first recall the definition of a metric space for our context, where the parent set is T\mathcal{T} and distance function is dd.

Definition : (Metric Spaces)

A pair (T,d)(\mathcal{T}, d) is a metric space (where d ⁣:TΓ—Tβ†’Rd\colon \mathcal{T} \times \mathcal{T} \to \mathbb{R}) if dd satisifies:1

  1. d(X,Y)=0β€…β€ŠβŸΊβ€…β€ŠX=Y  a.sd(X,Y) = 0 \iff X = Y\,\,\text{a.s};

  2. d(X,Y)=d(Y,X)d(X,Y) = d(Y,X) for all X,Y∈TX,Y \in \mathcal{T};  [Symmetry]

  3. d(X,Z)≀d(X,Y)+d(Y,Z)d(X,Z) \le d(X,Y) + d(Y,Z) for all X,Y,Z∈TX,Y,Z \in \mathcal{T}. [Triangle Inequality]

We will start with the first point. Assume that d(X,Y)=0d(X,Y) = 0. By part a), we know that the infimum is attained, thus we have

P(∣Xβˆ’Y∣>0)≀0.\mathbb{P}\left( |X-Y| > 0 \right) \le 0.

Since P(βˆ™)β‰₯0\mathbb{P}(\bullet) \ge 0, we get

P(∣Xβˆ’Y∣>0)=0β€…β€ŠβŸΉβ€…β€ŠP(∣Xβˆ’Yβˆ£β‰€0)=1.\begin{align*} \mathbb{P}\left(|X-Y| > 0 \right) &= 0 \\ \implies \mathbb{P}\left( |X-Y| \le 0 \right) &= 1. \end{align*}

Since ∣r∣β‰₯0|r| \ge 0 for any r∈Rr \in \mathbb{R}, we get P(0β‰€βˆ£Xβˆ’Yβˆ£β‰€0)=1β€…β€ŠβŸΉβ€…β€ŠP(Xβˆ’Y=0)=1\mathbb{P}(0 \le |X-Y| \le 0) = 1 \implies P(X-Y = 0) = 1. Thus

P(X=Y)=1,\mathbb{P}(X = Y) = 1,

which shows d(X,Y)=0β€…β€ŠβŸΉβ€…β€ŠX=Y  a.s.d(X,Y) = 0 \implies X = Y\,\,\text{a.s.} Now assume that X=Y  a.sX = Y\,\, \text{a.s}, then

0∈DX,Yβ€…β€ŠβŸΊβ€…β€ŠP(∣Xβˆ’Y∣>0)≀0β€…β€ŠβŸΊβ€…β€ŠP(∣Xβˆ’Y∣>0)=0β€…β€ŠβŸΊβ€…β€ŠP(∣Xβˆ’Yβˆ£β‰€0)=1.0 \in D_{X,Y} \iff \mathbb{P}(|X-Y| > 0) \le 0 \iff \mathbb{P}(|X-Y| > 0) = 0 \iff \mathbb{P}(|X-Y| \le 0) = 1.

Finally, this means 0∈DX,Yβ€…β€ŠβŸΊβ€…β€ŠP(Xβˆ’Y=0)=1β€…β€ŠβŸΊβ€…β€ŠX=Y  a.s.0 \in D_{X,Y} \iff \mathbb{P}(X - Y = 0) = 1 \iff X = Y\,\,\text{a.s.} Recall that 0≀t0 \le t for all t∈DX,Yt \in D_{X,Y}, and now we have shown 0∈DX,Y0 \in D_{X,Y}, which means d(X,Y)=inf⁑DX,Y=0d(X,Y) = \inf D_{X,Y} = 0 as desired.

Proving symmetry is simple. Just note that

DX,Y={Ο΅β‰₯0:P(∣Xβˆ’Y∣>Ο΅)≀ϡ}={Ο΅β‰₯0:P(∣Yβˆ’X∣>Ο΅)≀ϡ}=DY,XD_{X,Y} = \{\epsilon \ge 0 : \mathbb{P}(|X-Y| > \epsilon) \le \epsilon \} = \{\epsilon \ge 0 : \mathbb{P}(|Y-X| > \epsilon) \le \epsilon \} = D_{Y,X}

where the equality of probabilities follows from the simple fact that ∣r∣=βˆ£βˆ’r∣|r| = |-r| for all r∈Rr \in \mathbb{R}. This gives inf⁑DX,Y=inf⁑DY,Xβ€…β€ŠβŸΉβ€…β€Šd(X,Y)=d(Y,X)\inf D_{X,Y} = \inf D_{Y,X} \implies d(X,Y) = d(Y,X) and done.

We shall now prove the triangle inequality. This is one is quite tricky. For X,Y,Z∈TX,Y,Z \in \mathcal{T}, we shall prove

d(X,Y)≀d(X,Z)+d(Z,Y)β€…β€ŠβŸΊβ€…β€Šc≀a+bd(X,Y) \le d(X,Z) + d(Z,Y) \iff c \le a + b

where c=d(X,Y)c = d(X,Y), a=d(X,Z)a = d(X,Z) and b=d(Z,Y)b = d(Z,Y). Define

A≔{Ο‰:∣X(Ο‰)βˆ’Y(Ο‰)∣>a+b}B≔{Ο‰:∣X(Ο‰)βˆ’Z(Ο‰)∣>a}C≔{Ο‰:∣Z(Ο‰)βˆ’Y(Ο‰)∣>b}\begin{align*} A &\coloneqq \{\omega : |X(\omega) - Y(\omega)| > a+b \} \\ B &\coloneqq \{\omega : |X(\omega) - Z(\omega)| > a \} \\ C &\coloneqq \{\omega : |Z(\omega) - Y(\omega)| > b \} \end{align*}

We claim the following.

Claim β€”
AβŠ†BβˆͺCA \subseteq B \cup C.

Let α∈A\alpha \in A. We wish to show that α∈BβˆͺC\alpha \in B \cup C. Assume otherwise, α∉BβˆͺC\alpha \not\in B \cup C if and only if α∈(BβˆͺC)β€²=Bβ€²βˆ©Cβ€²\alpha \in (B \cup C)' = B' \cap C'. Since α∈Bβ€²βˆ©Cβ€²\alpha \in B' \cap C', we must have

∣X(Ξ±)βˆ’Z(Ξ±)βˆ£β‰€a∣Z(Ξ±)βˆ’Y(Ξ±)βˆ£β‰€b.\begin{align*} |X(\alpha) - Z(\alpha)| &\le a \\ |Z(\alpha) - Y(\alpha)| &\le b. \end{align*}

Adding the above equations and using triangle inequality in R\mathbb{R} gives

∣X(Ξ±)βˆ’Y(Ξ±)βˆ£β‰€a+b.|X(\alpha) - Y(\alpha)| \le a + b.

This is a contradiction since α∈A\alpha \in A gives ∣X(Ξ±)βˆ’Y(Ξ±)∣>a+b|X(\alpha) - Y(\alpha)| > a + b. Hence, the claim is proved. β–‘\square

With the above claim and with the union bound P(A)≀P(BβˆͺC)≀P(B)+P(C)\mathbb{P}(A) \le \mathbb{P}(B \cup C) \le \mathbb{P}(B) + \mathbb{P}(C) we get

P(∣Xβˆ’Y∣>a+b)=P(A)≀P(∣Xβˆ’Z∣>a)+P(∣Zβˆ’Y∣>b)=a+b\mathbb{P}(|X-Y| > a + b) = \mathbb{P}(A) \le \mathbb{P}(|X-Z| > a) + \mathbb{P}(|Z-Y| > b) = a + b

which ultimately shows a+b∈DX,Yβ€…β€ŠβŸΉβ€…β€Šc=d(X,Y)=inf⁑DX,Y≀a+ba+b \in D_{X,Y} \implies c = d(X,Y) = \inf D_{X,Y} \le a + b completing the proof to Part B. β– \blacksquare

Part C

Assume Xnβ†’pXX_n \overset{p}{\to} X. Fix Ξ΅>0\varepsilon > 0, we now need to show that there exists N∈NN \in \mathbb{N} such that d(Xn,X)=∣d(Xn,X)βˆ’0∣<Ξ΅d(X_n, X) = |d(X_n, X) - 0| < \varepsilon for all n>Nn > N. Since Xnβ†’pXX_n \overset{p}{\to} X, for all Ξ·>0\eta > 0 there exists N1∈NN_1 \in \mathbb{N} such that

P(∣Xnβˆ’X∣>Ξ΅/2)<Ξ·βˆ€n>N1β€…β€ŠβŸΉβ€…β€ŠP(∣Xnβˆ’X∣>Ξ΅/2)≀ηforΒ allΒ n>N1\mathbb{P}(|X_n - X| > \varepsilon/2) < \eta \quad \forall n > N_1 \implies \mathbb{P}(|X_n - X| > \varepsilon/2) \le \eta \quad \text{for all }n > N_1

Since Ξ·\eta is free, put Ξ·=Ξ΅/2\eta = \varepsilon/2 to get

P(∣Xnβˆ’X∣>Ξ΅/2)≀Ρ2forΒ allΒ n>N1β€…β€ŠβŸΉβ€…β€ŠΞ΅/2∈DXn,XforΒ allΒ n>N1.\mathbb{P}(|X_n - X| > \varepsilon/2) \le \frac{\varepsilon}{2} \quad \text{for all } n > N_1 \implies \varepsilon/2 \in D_{X_n, X} \quad \text{for all }n > N_1.

To finish it off, this means (inf⁑DXn,X)≀Ρ/2(\inf D_{X_n, X}) \le \varepsilon/2 for all n>N1n > N_1, which gives

d(Xn,X)≀Ρ2<Ξ΅forΒ allΒ n>N1d(X_n, X) \le \frac{\varepsilon}{2} < \varepsilon \quad \text{for all }n > N_1

which proves lim⁑nβ†’βˆžd(Xn,X)=0\displaystyle\lim_{n \to \infty} d(X_n, X) = 0 as needed.

The other direction is quite easy as well, assume lim⁑nβ†’βˆžd(Xn,X)=0\displaystyle\lim_{n \to \infty} d(X_n, X) = 0 and define an≔d(Xn,X)a_n \coloneqq d(X_n,X). By definition of ana_n, we know that P(∣Xnβˆ’X∣>an)≀an\mathbb{P}(|X_n-X| > a_n) \le a_n for all n∈Nn \in \mathbb{N}. Fix Ξ΅,Ξ·>0\varepsilon, \eta > 0. Since an>0a_n > 0 and anβ†’0a_n \to 0, we can find N1∈NN_1 \in \mathbb{N} such that an<Ξ΅a_n < \varepsilon for n>N1n > N_1. Moreover, we can also find N2∈NN_2 \in \mathbb{N} such that an<Ξ·a_n < \eta for n>N2n > N_2. For n>max⁑(N1,N2)n > \max(N_1,N_2) we have

P(∣Xnβˆ’X∣>Ξ΅)≀P(∣Xnβˆ’X∣>an)≀an<Ξ·β€…β€ŠβŸΉβ€…β€ŠP(∣Xnβˆ’X∣>Ξ΅)<Ξ·forΒ allΒ n>max⁑(N1,N2)\begin{align*} \mathbb{P}(|X_n - X| > \varepsilon) \le \mathbb{P}(|X_n - X| > a_n) \le a_n < \eta \\ \implies \mathbb{P}(|X_n - X| > \varepsilon) < \eta \quad \text{for all } n > \max(N_1,N_2) \end{align*}

which proves that Xn→pXX_n \overset{p}{\to} X. This completes the proof to Part C. ■\blacksquare

Part D

This one will use a few tools, specifically the Cauchy Criteria for convergence in probability of random variables. We shall now define what cauchy in probability means and then prove the theorem which will trivialize the problem.

Definition : (Cauchy in Probability)

We say a that a sequence of random variables {Xn}nβ‰₯1\{X_n\}_{n \ge 1} is Cauchy in Probability if for all Ξ΅,Ξ·>0\varepsilon, \eta > 0 there exists N∈NN \in \mathbb{N} such that

P(∣Xmβˆ’Xn∣>Ξ΅)<Ξ·\mathbb{P}(|X_m - X_n| > \varepsilon) < \eta

for all m,n>Nm,n > N.

Theorem : (Cauchy Criteria for Convergence in Probability)

For a sequence of real-valued random variables {Xn}nβ‰₯1\{X_n\}_{n \ge 1}

Xnβ†’pXβ€…β€ŠβŸΊβ€…β€Š{Xn}nβ‰₯1  isΒ CauchyΒ inΒ Probability.X_n \overset{p}{\to} X \iff \{X_n\}_{n \ge 1}\,\,\text{is Cauchy in Probability.}

Assume Xnβ†’pXX_n \overset{p}{\to} X. Fix Ξ΅,Ξ·>0\varepsilon, \eta > 0. By convergence in probability, we know there exists N∈NN \in \mathbb{N} such that

P(∣Xmβˆ’X∣>Ξ΅/2)<Ξ·2\mathbb{P}(|X_m - X| > \varepsilon/2) < \frac \eta 2

for all m>Nm > N. Define

A≔{Ο‰:∣Xm(Ο‰)βˆ’Xn(Ο‰)∣>Ξ΅}B≔{Ο‰:∣Xm(Ο‰)βˆ’X(Ο‰)∣>Ξ΅2}C≔{Ο‰:∣Xn(Ο‰)βˆ’X(Ο‰)∣>Ξ΅2}\begin{align*} A &\coloneqq \{\omega : |X_m(\omega) - X_n(\omega)| > \varepsilon \} \\ B &\coloneqq \left\{\omega : |X_m(\omega) - X(\omega)| > \frac\varepsilon 2\right\} \\ C &\coloneqq \left\{\omega : |X_n(\omega) - X(\omega)| > \frac\varepsilon 2 \right\} \end{align*}
Claim β€”
AβŠ†BβˆͺC.A \subseteq B \cup C.

We ommit the proof, since its analogous to the claim in Part B. For m,n>Nm,n > N, using the claim in conjunction with the union bound2 like in Part B we get

P(∣Xmβˆ’Xn∣>Ξ΅)≀P(∣Xmβˆ’X∣>Ξ΅2)+P(∣Xnβˆ’X∣>Ξ΅2)<Ξ·/2+Ξ·/2=Ξ·\mathbb{P}(|X_m-X_n| > \varepsilon) \le \mathbb{P}\left(|X_m - X| > \frac\varepsilon 2\right) + \mathbb{P}\left(|X_n - X| > \frac \varepsilon 2 \right) < \eta/2 + \eta/2 = \eta

which proves that {Xn}nβ‰₯1\{X_n\}_{n \ge 1} is cauchy in probability as desired.


The other direction is much harder. I’d produce the proof as it is done by our Professor with some commentary as the proof is quite involved. Assume that {Xn}nβ‰₯1\{X_n\}_{n \ge 1} is cauchy in probability.


Step 1 – Construct a rapidly Cauchy subsequence.
Since {Xn}\{X_n\} is Cauchy in probability, for each j∈Nj\in\mathbb{N} we can choose an index njn_j (with n1<n2<…n_1 < n_2 < \dots) such that

βˆ€β€‰r,sβ‰₯nj,P(∣Xrβˆ’Xs∣>2βˆ’j)<2βˆ’j.\forall\, r,s \ge n_j, \qquad \mathbb{P}\bigl(|X_r - X_s| > 2^{-j}\bigr) < 2^{-j}.

In particular, for every jj we have

P(∣Xnj+1βˆ’Xnj∣>2βˆ’j)<2βˆ’j.\mathbb{P}\bigl(|X_{n_{j+1}} - X_{n_j}| > 2^{-j}\bigr) < 2^{-j}.

Step 2 – Apply Borel–Cantelli.
Because βˆ‘j=1∞2βˆ’j<∞\sum_{j=1}^{\infty} 2^{-j} < \infty, the first Borel–Cantelli lemma gives

P(∣Xnj+1βˆ’Xnj∣>2βˆ’ji.o.Β inΒ j)=0.\mathbb{P}\bigl( |X_{n_{j+1}} - X_{n_j}| > 2^{-j} \text{i.o. in $j$} \bigr) = 0.

Define the event

A={Ο‰:∣Xnj+1(Ο‰)βˆ’Xnj(Ο‰)∣>2βˆ’jΒ forΒ onlyΒ finitelyΒ manyΒ j}.A = \bigl\{ \omega : |X_{n_{j+1}}(\omega) - X_{n_j}(\omega)| > 2^{-j} \text{ for only finitely many } j \bigr\}.

Then P(A)=1\mathbb{P}(A) = 1.


Step 3 – Almost sure convergence of the subsequence.
For any Ο‰βˆˆA\omega \in A, there exists J(Ο‰)J(\omega) such that for all jβ‰₯J(Ο‰)j \ge J(\omega),

∣Xnj+1(Ο‰)βˆ’Xnj(Ο‰)βˆ£β‰€2βˆ’j.|X_{n_{j+1}}(\omega) - X_{n_j}(\omega)| \le 2^{-j}.

For k>β„“β‰₯J(Ο‰)k > \ell \ge J(\omega) we have the telescoping estimate

∣Xnk(Ο‰)βˆ’Xnβ„“(Ο‰)βˆ£β‰€βˆ‘i=β„“kβˆ’1∣Xni+1(Ο‰)βˆ’Xni(Ο‰)βˆ£β‰€βˆ‘i=β„“βˆž2βˆ’i=2βˆ’β„“+1.|X_{n_k}(\omega) - X_{n_\ell}(\omega)| \le \sum_{i=\ell}^{k-1} |X_{n_{i+1}}(\omega) - X_{n_i}(\omega)| \le \sum_{i=\ell}^{\infty} 2^{-i} = 2^{-\ell+1}.

Hence {Xnj(Ο‰)}\{X_{n_j}(\omega)\} is a Cauchy sequence in R\mathbb{R} and therefore converges. Define

X(Ο‰)={lim⁑jβ†’βˆžXnj(Ο‰),Ο‰βˆˆA,0,Ο‰βˆ‰A.X(\omega) = \begin{cases} \displaystyle \lim_{j\to\infty} X_{n_j}(\omega), & \omega \in A,\\[4pt] 0, & \omega \notin A. \end{cases}

Then XX is a random variable (the limit of measurable functions on AA, and constant on AcA^c). Moreover, Xnj→a.s.XX_{n_j} \xrightarrow{\text{a.s.}} X, which implies Xnj→pXX_{n_j} \xrightarrow{p} X.


Step 4 – From the subsequence to the whole sequence.
Fix Ξ΅>0\varepsilon > 0 and Ξ·>0\eta > 0. Because Xnjβ†’pXX_{n_j} \xrightarrow{p} X, there exists JJ such that for all jβ‰₯Jj \ge J,

P(∣Xnjβˆ’X∣>Ξ΅/2)<Ξ·/2.\mathbb{P}\bigl(|X_{n_j} - X| > \varepsilon/2\bigr) < \eta/2.

Since {Xn}\{X_n\} is Cauchy in probability, we can also choose KK such that for all m,nβ‰₯Km,n \ge K,

P(∣Xmβˆ’Xn∣>Ξ΅/2)<Ξ·/2.\mathbb{P}\bigl(|X_m - X_n| > \varepsilon/2\bigr) < \eta/2.

Now take N=max⁑(nJ,K)N = \max(n_J, K). For any nβ‰₯Nn \ge N, pick a subsequence index njn_j with njβ‰₯Nn_j \ge N (possible because njβ†’βˆžn_j \to \infty). Then

P(∣Xnβˆ’X∣>Ξ΅)≀P(∣Xnβˆ’Xnj∣>Ξ΅/2)+P(∣Xnjβˆ’X∣>Ξ΅/2)<Ξ·/2+Ξ·/2=Ξ·.\begin{aligned} \mathbb{P}\bigl(|X_n - X| > \varepsilon\bigr) &\le \mathbb{P}\bigl(|X_n - X_{n_j}| > \varepsilon/2\bigr) + \mathbb{P}\bigl(|X_{n_j} - X| > \varepsilon/2\bigr) \\ &< \eta/2 + \eta/2 = \eta. \end{aligned}

Thus Xn→pXX_n \xrightarrow{p} X, completing the proof. ░\quad\square


Returning back to the problem, let {Yn}nβ‰₯1\{Y_n\}_{n \ge 1} be a sequence of random variables with Yn∈TY_n \in \mathcal{T} for all n∈Nn \in \mathbb{N} which dβˆ’d-cauchy(that is cauchy with the metric dd). We will prove that {Yn}nβ‰₯1\{Y_n\}_{n \ge 1} is _cauchy in probability. Define am,n≔d(Ym,Yn)a_{m,n} \coloneqq d(Y_m,Y_n). Fix Ξ΅,Ξ·>0\varepsilon,\eta > 0, then by definition there exists some N1,N2∈NN_1, N_2 \in \mathbb{N} such that

am,n=d(Ym,Yn)<Ξ΅forΒ allΒ m,n>N1;am,n=d(Ym,Yn)<Ξ·2forΒ allΒ m,n>N2.\begin{align*} a_{m,n} &= d(Y_m,Y_n) < \varepsilon \quad \text{for all } m,n > N_1;\\ a_{m,n} &= d(Y_m,Y_n) < \frac{\eta}{2} \quad \text{for all } m,n > N_2. \end{align*}

For all m,n>max⁑(N1,N2)m,n > \max(N_1, N_2), we must have

P(∣Ymβˆ’Yn∣>Ξ΅)≀P(∣Ymβˆ’Yn∣>am,n)≀am,n≀η2<Ξ·\mathbb{P}(|Y_m-Y_n| > \varepsilon) \le \mathbb{P}(|Y_m-Y_n| > a_{m,n}) \le a_{m,n} \le \frac{\eta}{2} < \eta

where the second inequality is by definition of am,na_{m,n}. This shows that {Yn}nβ‰₯1\{Y_n\}_{n \ge 1} is cauchy in probability. Invoking the cited theorem, the proof to Part D is β€˜complete’. β– \blacksquare

Footnotes

Footnotes

  1. We have excluded the non-negativity condition which is given in a lot of books. Regardless, if you assume all these conditions, then the non-negativity of dd follows. You can see this stackexchange thread. In particular, see the reply to the question by Hagen von Eitzen. In our case, proving non-negative is easy, since DX,YD_{X,Y} is bounded below by 00(!) ↩

  2. Union bound simply means that for any two measurable events A,BA,B we have P(AβˆͺB)≀P(A)+P(B).\mathbb{P}(A \cup B) \le \mathbb{P}(A) + \mathbb{P}(B). ↩