Isometric embedding of metric space
This post summarizes the proof for Theorem 7.9 in Royden's "Real Analysis".
Theorem 9 If $\langle X, \rho \rangle$ is an incomplete metric space, it is possible to find a complete metric space $X^*$ in which $X$ is isometrically embedded as a dense subset. If $X$ is contained in an arbitrary complete space $Y$, then $X^*$ is isometric with the closure of $X$ in $Y$.
Analysis: The proof about the completion of $X$ is not straightforward because this problem is given in an abstract way that there is nothing to manipulate or we cannot construct a specific example as the completion of $X$ for facilitating our understanding. Hence a new structure must be introduced to fulfill this purpose, which is the set of equivalence classes $X^*$ derived from the collection of all Cauchy sequences in $X$. Then it is to be proved that $X$ is identified with $F(X)$ in $X^*$, where $F$ is an isometry from $X$ to $F(X)$ and $F(X)$ is dense in $X^*$.
The proof of this theorem is divided into 5 steps as suggested by Exercise 17.
-
If $\{x_n\}_{n \geq 1}$ and $\{y_n\}_{n \geq 1}$ are Cauchy sequences from a metric space $X$, then $\{\rho(x_n, y_n)\}_{n \geq 1}$ converges.
Proof: Because $\langle X, \rho \rangle$ is an incomplete space, the two sequences $\{x_n\}_{n \geq 1}$ and $\{y_n\}_{n \geq 1}$ do not necessarily have their limits within $X$. However, this proposition indicates that the inter-distance between $x_n$ and $y_n$ does converge when $n$ approaches to $\infty$.
Because $\rho(x_n, y_n) \in \mathbb{R}$ and $\mathbb{R}$ is complete, to prove $\{\rho(x_n, y_n)\}_{n \geq 1}$ converges, we need to show that it is a Cauchy sequence.
From the given condition, for any $\varepsilon > 0$, there exists $N \in \mathbb{N}$ such that when $m_1, m_2 > N$, $\abs{x_{m_1} - x_{m_2}} < \frac{\varepsilon}{2}$ and when $n_1, n_2 > N$, $\abs{y_{n_1} - y_{n_2}} < \frac{\varepsilon}{2}$.
For any $m, n > N$,
$$ \abs{\rho(x_n, y_n) - \rho(x_m ,y_m)} = \abs{\rho(x_n, y_n) - \rho(x_n, y_m) + \rho(x_n, y_m) - \rho(x_m, y_m)}. $$Due to the triangle inequality satisfied by the metric $\rho$, we have
$$ \rho(x_n, y_n) \leq \rho(x_n, y_m) + \rho(y_m, y_n) $$and
$$ \rho(x_n, y_m) \leq \rho(x_n, y_n) + \rho(y_n, y_m). $$Hence,
$$ \rho(x_n, y_n) - \rho(x_n, y_m) \leq \rho(y_m, y_n) $$and
$$ \rho(x_n, y_m) - \rho(x_n, y_n) \leq \rho(y_m, y_n), $$which is actually
$$ \abs{\rho(x_n, y_n) - \rho(x_n, y_m)} \leq \rho(y_m, y_n). $$This is just a variation of the triangle inequality for a metric which says that the difference between the lengths of two edges of a triangle is smaller than or equal to the length of the third edge. Similarly, we have
$$ \abs{\rho(x_n, y_m) - \rho(x_m, y_m)} \leq \rho(x_m, x_n). $$Then
$$ \begin{aligned} \abs{\rho(x_n, y_n) - \rho(x_m, y_m)} &= \abs{\rho(x_n, y_n) - \rho(x_n, y_m) + \rho(x_n, y_m) - \rho(x_m, y_m)} \\ & \leq \abs{\rho(x_n, y_n) - \rho(x_n, y_m)} + \abs{\rho(x_n, y_m) - \rho(x_m, y_m)} \\ & \leq \rho(y_m, y_n) + \rho(x_m, x_n) \\ & < \frac{\varepsilon}{2} + \frac{\varepsilon}{2} \\ & = \varepsilon. \end{aligned} $$Therefore, $\{\rho(x_n, y_n)\}_{n \geq 1}$ is a Cauchy sequence in $\mathbb{R}$ and converges to some $a$ in $\mathbb{R}$.
-
The set of all Cauchy sequences from a metric space $X$ becomes a pseudometric space if $\rho^*(\{x_n\}_{n \geq 1}, \{y_n\}_{n \geq 1}) = \lim_{n \rightarrow \infty} \rho(x_n, y_n)$
Proof: Because $\{\rho(x_n, y_n)\}_{n \geq 1}$ is convergent, the above definition of $\rho^*(\{x_n\}_{n \geq 1}, \{y_n\}_{n \geq 1})$ is meaningful. It can be verified that $\rho^*$ satisfies the conditions of positiveness, symmetry and triangle inequality, which are derived from those of $\rho$. Then we only need to find two different Cauchy sequences, the distance between which is zero, so that $\rho^*$ is a pseudometric.
Let $\{z_k\}_{k \geq 1}$ be a Cauchy sequence. Let $x_n = z_{2n-1}$ and $y_n = z_{2n}$. Then $\{x_n\}_{n \geq 1}$ and $\{y_n\}_{n \geq 1}$ are two different Cauchy sequences with $\rho^*(\{x_n\}_{n \geq 1}, \{y_n\}_{n \geq 1}) = \lim_{n \rightarrow \infty} \rho(x_n, y_n) = 0$. Therefore, $\rho^*$ defined on the collection of all Cauchy sequences in $X$ is a pseudometric.
-
This pseudometric space becomes a metric space $X^*$ when we identify elements for which $\rho^* = 0$ and $X$ is isometrically embedded in $X^*$.
Proof: According to Exercise 3 in Section 1, by letting $R := \{\rho^*(\{x_n\}_{n \geq 1}, \{y_n\}_{n \geq 1}) = 0\}$ be the equivalence condition on the set of all Cauchy sequences in $X$, the obtained collection of equivalence classes $X^*$ is a metric space. This can be verified as below.
Let $\mathcal{X}$ and $\mathcal{Y}$ be two different equivalence classes in $X^*$. Let $x_0$ be the representative element of $\mathcal{X}$ and $y_0$ be that of $\mathcal{Y}$. Then for any $x$ in $\mathcal{X}$ and any $y$ in $\mathcal{Y}$, we have $\rho^*(x, y) = \rho^*(x_0, y_0) = \rho^*(\mathcal{X}, \mathcal{Y}) \geq 0$. If $\rho^*(\mathcal{X}, \mathcal{Y}) = 0$, $\rho^*(x_0, y_0) = 0$ and for any $y$ in $\mathcal{Y}$, $\rho^*(x_0, y) = 0$. Because of the equivalence relation $R$, $y$ belongs to $\mathcal{X}$. Similarly, for any $x$ in $\mathcal{X}$, $x$ belongs to $\mathcal{Y}$. Therefore, $\rho^*(\mathcal{X}, \mathcal{Y}) = 0$ implies $\mathcal{X} = \mathcal{Y}$. On the other hand, when $\mathcal{X} = \mathcal{Y}$, $\rho^*(\mathcal{X}, \mathcal{Y}) = \rho^*(x_0, x_0) = 0$. So $\rho^*$ has the property of positive definitiveness.
The commutativity of $\rho^*$ is obvious, which is derived from that of $\rho$.
Finally, for $\mathcal{X}$, $\mathcal{Y}$, $\mathcal{Z}$ in $X^*$ with their respective representative elements $x_0$, $y_0$ and $z_0$, $\rho(\mathcal{X}, \mathcal{Y}) = \rho(x_0, y_0)$, $\rho(\mathcal{X}, \mathcal{Z}) = \rho(x_0, z_0)$ and $\rho(\mathcal{Z}, \mathcal{Y}) = \rho(z_0, y_0)$. Because $\rho(x_0, y_0) \leq \rho(x_0, z_0) + \rho(z_0, y_0)$, we have the triangle inequality for $\rho^*$. Therefore, $X^*$ with $\rho^*$ is a metric space.
Next, let $F: X \rightarrow X^*$ which associates each $x$ in $X$ with the equivalence class in $X^*$ that contains the Cauchy sequence $\{x, x, \cdots\}$. It is easy to see that for any $x$, $y$ in $X$,
$$ \rho^*(F(x), F(y)) = \rho^*(\{x, x, \cdots \}, \{y, y, \cdots\}) = \lim_{n \rightarrow \infty} \rho(x, y) = \rho(x, y). $$Meanwhile, if $F(x) = F(y)$, we have $\rho^*(F(x), F(y)) = \rho(x, y) =\rho(x, x) = 0$. Because $\rho$ is a standard metric, $x = y$. Hence, $F$ is injective. For any open ball $B(\{x, x, \cdots\}, \varepsilon)$ with a radius $\varepsilon$ in $F(X)$, its inverse image under $F^{-1}$ is $B(x, \varepsilon)$ in $X$, which is open in $X$. Then $F$ is a continuous map. Similarly, $F^{-1}$ is also continuous. Therefore, $F$ is a homeomorphism between $X$ and $F(X)$. Moreover, because $\rho^*(F(x), F(y)) = \rho(x, y)$, $F$ is an isometry.
Then, we will prove $X$ is isometrically embedded in $X^*$ as a dense subset.
Let $B(\{x_n\}_{n \geq 1}, \varepsilon)$ be an open ball in $X^*$, which is centered at an arbitrary element $\{x_n\}_{n \geq 1}$ in $X^*$. Because $\{x_n\}_{n \geq 1}$ is a Cauchy sequence, there exists $N$ in $\mathbb{N}$ such that when $m, n > N$, $\abs{x_m - x_n} < \varepsilon$. Then $\rho^*(\{x_n\}_{n \geq 1}, \{x_m, x_m, \cdots\}) = \lim_{n \rightarrow \infty} \abs{x_n - x_m} < \varepsilon$. Hence $\{x_m, x_m, \cdots\}$ belongs to $B(\{x_n\}_{n \geq 1}, \varepsilon)$ and $F(X)$ is dense in $X^*$. Because $F$ is an isometry from $X$ to $F(X)$, $X$ can be identified with $F(X)$. Therefore, $X$ is isometrically embedded in $X^*$.
-
The metric space $\langle X^*, \rho^* \rangle$ is complete. (N.B. What is convergent here is a sequence of Cauchy sequences.)
Proof: Let $\{x_n\}_{n \geq 1}$ be any Cauchy sequence in $X$. We can extract a subsequence from it as $\{x_{n_k}\}_{k \geq 1}$ such that $\rho(x_{n_k}, x_{n_{k+1}}) < 2^{-k}$. This subsequence can be rewritten as $\{\tilde{x}_k\}_{k \geq 1}$ with the condition $\rho(\tilde{x}_k, \tilde{x}_{k+1}) < 2^{-k}$. Then we select an arbitrary Cauchy sequence of such sequences as $\{S_m\}_{m \geq 1}$ with $S_m = \{\tilde{x}_{k,m}\}_{k \geq 1}$ satisfying for any $\varepsilon > 0$, there exists $N$ in $\mathbb{N}$ such that when $m, n > N$, $\rho^*(S_m, S_n) = \lim_{k \rightarrow \infty} \rho(\tilde{x}_{k,m}, \tilde{x}_{k,n}) < \varepsilon$. This suggests that there exists $K$ in $\mathbb{N}$ such that when $k > K$, $\rho(\tilde{x}_{k,k}, \tilde{x}_{k,n}) < \varepsilon$. This Cauchy sequence of Cauchy sequences can be illustrated as a 2-dimensional matrix with infinite length as below,
$$ \begin{pmatrix} \tilde{x}_{1,1} & \tilde{x}_{1,2} & \tilde{x}_{1,3} & \cdots \\ \tilde{x}_{2,1} & \tilde{x}_{2,2} & \tilde{x}_{2,3} & \cdots \\ \tilde{x}_{3,1} & \tilde{x}_{3,2} & \tilde{x}_{3,3} & \cdots \\ \vdots & \vdots & \vdots & \vdots \end{pmatrix}, $$from which we extract the diagonal elements to construct a new sequence $S^* = \{\tilde{x}_{k,k}\}_{k \geq 1}$. For any $\varepsilon > 0$, there exists $N$ in $\mathbb{N}$ such that when $m, n > N$, $\rho(\tilde{x}_{m,m}, \tilde{x}_{n,n}) < \varepsilon$. Hence $S^*$ is a Cauchy sequence so it belongs to $X^*$. The distance between $S_m$ and $S^*$ is $\rho^*(S_m, S^*) = \lim_{k \rightarrow \infty} \rho(\tilde{x}_{k,m}, \tilde{x}_{k,k})$. It is obvious that for any $\varepsilon > 0$, when $m > N$ and $k > K$, $\rho(\tilde{x}_{k,m}, \tilde{x}_{k,k}) < \varepsilon$. Therefore, $\lim_{m \rightarrow \infty} \rho^*(S_m, S^*) = 0$ and $\langle X^*, \rho^* \rangle$ is complete.
Up to now, the first part of Theorem 9 is proved, i.e. we have found the completion of $X$ as $X^*$ in which $X$ is isometrically embedded as a dense subset.
-
The above isometry $F$ from $X$ to $F(X)$ in $X^*$ is uniformly continuous, which is because for any $x$ and $y$ in $X$ such that $\rho(x, y) < \varepsilon$, $\rho^*(F(x), F(y)) = \lim_{n \rightarrow \infty} \rho(x, y) = \rho(x, y) < \varepsilon$. Then according to Proposition 11 in Section 5 of this Chapter, viewing $X$ as a subset of $Y$, $F$ is a uniformly continuous mapping from $X$ into the complete space $X^*$. Then there exists a unique continuous extension $G$ of $F$ from $X$ to $\overline{X}$ with $\overline{X}$ being the closure of $X$ with respect to the standard topology induced by the metric. Because $Y$ is also complete with respect to this topology, $\overline{X}$ is contained within $Y$. Also note that, for any $x$ in $\overline{X}$ but not in $X$, there exists a Cauchy sequence $\{x_n\}_{n \geq 1}$ in $X$ convergent to $x$. Then the value $G(x)$ only depends on $x$, i.e. $G(x) = \lim_{n \rightarrow \infty} F(x_n)$.
Due to Proposition 10 in Section 5, when $\{x_n\}_{n \geq 1}$ is a Cauchy sequence in $X$, $\{F(x_n)\}_{n \geq 1}$ is also a Cauchy sequence because $F$ is uniformly continuous. Therefore, $G(x)$ belongs to the closure of $F(X)$ in $X^*$, i.e. $\overline{F(X)}$. Meanwhile, for any $y$ in $\overline{F(X)}$, there exists a Cauchy sequence $\{y_n\}_{n \geq 1}$ in $F(X)$ and $\{x_n\}_{n \geq 1}$ in $X$ such that $x_n = F^{-1}(y_n)$. Because $F$ is an isometry from $X$ to $F(X)$, $F^{-1}$ is an isometry from $F(X)$ to $X$ and hence $F^{-1}$ is also uniformly continuous. Therefore, $\{x_n\}_{n \geq 1}$ is a Cauchy sequence in $X$. Then, according to the definition of $G$, let $x$ in $\overline{X}$ and $x = \lim_{n \rightarrow \infty} x_n$, we have $G(x) = y$. This means that the actual range of $G$ is $\overline{F(X)} = X^*$.
On the other hand, viewing $F(X)$ as a subset of $X^*$, $F^{-1}$ is an isometry from $F(X)$ to $X \subset \overline{X}$, which is also uniformly continuous. Then there exists a unique extension $H$ of $F^{-1}$ from $F(X)$ to $\overline{F(X)} = X^*$. So $H$ is a map from $X^*$ into $Y$. With a similar analysis as that for $G$, the actual range of $H$ is $\overline{X}$.
Then we have $H \circ G = {\rm id}_{\overline{X}}$ and $G \circ H = {\rm id}_{X^*}$. Therefore, $G$ is the inverse of $H$ and vice versa. Because $G$ is uniformly continuous, $G$ is a homeomorphism. Then we need to prove $G$ is isometric. We already know that when $G$ constrained to $X$, $G\vert_X = F$ is isometric. Furthermore, for any $x_1$ and $x_2$ in $\overline{X}$, we should prove $\rho(x_1, x_2) = \rho^*(\{a_n\}_{n \geq 1}, \{b_n\}_{n \geq 1})$ where $a_n \rightarrow x_1$ and $b_n \rightarrow x_2$, which is quite obvious: $\rho^*(\{a_n\}_{n \geq 1}, \{b_n\}_{n \geq 1}) = \lim_{n \rightarrow \infty} \rho(a_n, b_n) = \rho(x_1, x_2)$. Hence, $G$ is an isometry between $\overline{X}$ and $X^*$.