Overview
This post examines Slepian’s Lemma (1962), a fundamental comparison principle in the theory of Gaussian processes. It formalizes the intuition that the more independent a set of Gaussian variables is, the larger their expected maximum will be. We explore the theorem’s rigorous statement, its generalization via the Sudakov-Fernique Inequality, and provide a detailed proof using the Gaussian Interpolation Method.
🏷️ The Intuition: Correlation and Supremum
Consider a collection of Gaussian random variables $X_1, \dots, X_n$. We are interested in the behavior of the supremum $\mathbb{E}\max_{i \le n} X_i$.
The Correlation Effect
- i.i.d. Case: If the variables are independent and $X_i \sim \mathcal{N}(0,1)$, the expected maximum scales as $\mathbb{E}\max_{i \le n} X_i \approx \sqrt{2\ln n}$.
- Perfect Correlation: If $X_1 = X_2 = \dots = X_n$, they move as a single unit, and $\mathbb{E}\max_{i \le n} X_i = \mathbb{E}X_1 = 0$.
Slepian’s insight was that positive correlation between variables “pulls” them together, effectively reducing the effective volume they cover and thus shrinking the expected maximum.
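This shrinking effect is easy to see numerically. Below is a minimal Monte Carlo sketch; the equicorrelated construction and the helper name `expected_max` are illustrative choices, not part of the theory:

```python
import numpy as np

rng = np.random.default_rng(0)
n, trials = 50, 20000

def expected_max(rho):
    """Monte Carlo estimate of E[max_i X_i] for an equicorrelated
    Gaussian vector X_i = sqrt(rho)*Z + sqrt(1-rho)*W_i, which has
    unit variances and pairwise correlation rho."""
    Z = rng.standard_normal((trials, 1))   # shared component
    W = rng.standard_normal((trials, n))   # independent components
    X = np.sqrt(rho) * Z + np.sqrt(1 - rho) * W
    return X.max(axis=1).mean()

for rho in [0.0, 0.5, 0.9, 1.0]:
    print(f"rho = {rho:.1f}:  E[max] ~ {expected_max(rho):.3f}")
```

As the correlation rises, the estimate drops from roughly $\sqrt{2\ln n}$ toward $0$, exactly the interpolation between the two extreme cases above.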
🏷️ Slepian’s Inequality
Slepian’s Lemma (Slepian, 1962) provides a formal comparison between two Gaussian processes based on their covariance structures.
Slepian’s Lemma (1962)
Let $X = (X_1, \dots, X_n)$ and $Y = (Y_1, \dots, Y_n)$ be centered Gaussian random vectors in $\mathbb{R}^n$ such that for all $i$:
$$\mathbb{E}[X_i^2] = \mathbb{E}[Y_i^2],$$
and for all $i \neq j$:
$$\mathbb{E}[X_i X_j] \ge \mathbb{E}[Y_i Y_j].$$
Then for any real numbers $\tau_1, \dots, \tau_n$:
$$\mathbb{P}\Big(\bigcup_{i=1}^n \{X_i > \tau_i\}\Big) \le \mathbb{P}\Big(\bigcup_{i=1}^n \{Y_i > \tau_i\}\Big).$$
In particular, this implies $\mathbb{E}\max_{i \le n} X_i \le \mathbb{E}\max_{i \le n} Y_i$.
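Both conclusions can be observed numerically before diving into the proof. The equicorrelated sampler `sample_equicorr` and the threshold $\tau = 2$ below are my own choices for illustration:

```python
import numpy as np

rng = np.random.default_rng(1)
n, trials = 20, 50_000

def sample_equicorr(rho, size):
    """Centered Gaussian vectors with unit variances and pairwise
    correlation rho, so the equal-variance hypothesis holds."""
    Z = rng.standard_normal((size, 1))
    W = rng.standard_normal((size, n))
    return np.sqrt(rho) * Z + np.sqrt(1 - rho) * W

X = sample_equicorr(0.6, trials)   # more correlated process
Y = sample_equicorr(0.1, trials)   # less correlated process

tau = 2.0
p_X = (X.max(axis=1) > tau).mean()
p_Y = (Y.max(axis=1) > tau).mean()
print(f"P(max X > {tau}) ~ {p_X:.3f}  vs  P(max Y > {tau}) ~ {p_Y:.3f}")
print(f"E[max X] ~ {X.max(axis=1).mean():.3f}  vs  E[max Y] ~ {Y.max(axis=1).mean():.3f}")
```

The more correlated process loses on both counts: smaller exceedance probability and smaller expected maximum, as the lemma predicts.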
🏷️ Detailed Proof: The Interpolation Method
The most powerful proof of Slepian-type inequalities relies on Gaussian Interpolation, a technique that allows us to continuously deform one process into another while tracking the evolution of the expectation.
Proof: The Smart Path
Let $f:\mathbb{R}^n \to \mathbb{R}$ be a smooth approximation of the maximum function. We define the interpolated process for $t \in [0,1]$ as:
$$Z(t) = \sqrt{t}\,X + \sqrt{1-t}\,Y,$$
where $X$ and $Y$ are independent. We study the function $\varphi(t) = \mathbb{E}[f(Z(t))]$.
1. The Derivative: By the chain rule:
$$\varphi'(t) = \sum_{i=1}^n \mathbb{E}\left[\partial_i f(Z(t))\left(\frac{X_i}{2\sqrt{t}} - \frac{Y_i}{2\sqrt{1-t}}\right)\right].$$
2. Gaussian Integration by Parts (Stein’s Lemma): For any centered Gaussian vector $g$ and smooth $F$, $\mathbb{E}[g_i F(g)] = \sum_j \mathbb{E}[g_i g_j]\,\mathbb{E}[\partial_j F(g)]$. Applying this to the $X_i$ and $Y_i$ components:
$$\mathbb{E}\big[X_i\,\partial_i f(Z(t))\big] = \sqrt{t}\,\sum_{j}\mathbb{E}[X_i X_j]\;\mathbb{E}\big[\partial_j\partial_i f(Z(t))\big],\qquad
\mathbb{E}\big[Y_i\,\partial_i f(Z(t))\big] = \sqrt{1-t}\,\sum_{j}\mathbb{E}[Y_i Y_j]\;\mathbb{E}\big[\partial_j\partial_i f(Z(t))\big].$$
3. Combining Terms: Substituting these back into $\varphi'(t)$:
$$\varphi'(t) = \frac{1}{2}\sum_{i,j}\Big(\mathbb{E}[X_i X_j] - \mathbb{E}[Y_i Y_j]\Big)\,\mathbb{E}\big[\partial_i\partial_j f(Z(t))\big].$$
4. The Sign Analysis:
- For $i = j$: The term is zero because $\mathbb{E}[X_i^2] = \mathbb{E}[Y_i^2]$.
- For $i \neq j$: By assumption, $\mathbb{E}[X_i X_j] - \mathbb{E}[Y_i Y_j] \ge 0$.
- The Max Function: For smooth approximations of the maximum function $f(x) = \max_i x_i$, the second derivatives $\partial_i\partial_j f$ are non-positive for $i \neq j$.
Consequently, $\varphi'(t) \le 0$, which implies $\varphi(1) \le \varphi(0)$, i.e. $\mathbb{E}[f(X)] \le \mathbb{E}[f(Y)]$. Passing to the limit in the smooth approximation proves $\mathbb{E}\max_i X_i \le \mathbb{E}\max_i Y_i$.
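The Hessian sign fact can be checked explicitly for the usual smooth surrogate $f_\beta(x) = \beta^{-1}\log\sum_i e^{\beta x_i}$, whose Hessian is $\beta(\operatorname{diag}(p) - pp^\top)$ for the softmax weights $p$. The choice of $f_\beta$ and of the test point below are illustrative:

```python
import numpy as np

def smooth_max_hessian(x, beta=5.0):
    """Hessian of f_beta(x) = log(sum(exp(beta*x))) / beta.
    Its off-diagonal entries are -beta * p_i * p_j <= 0, which is
    exactly the sign fact the interpolation argument uses."""
    p = np.exp(beta * (x - x.max()))   # stabilized softmax weights
    p /= p.sum()
    return beta * (np.diag(p) - np.outer(p, p))

x = np.array([0.3, -1.2, 0.8, 0.0])
H = smooth_max_hessian(x)
off_diag = H[~np.eye(len(x), dtype=bool)]
print(off_diag.max() <= 0)   # all mixed second derivatives non-positive
print(abs(H.sum()) < 1e-10)  # rows sum to 0, since sum_i d_i f = 1
```

The zero row sums reflect the shift invariance $f_\beta(x + c\mathbf{1}) = f_\beta(x) + c$, a property that becomes important in the Sudakov-Fernique generalization below.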
🏷️ Generalization: Sudakov-Fernique Inequality
A significant limitation of Slepian’s original result is the requirement of equal variances. The Sudakov-Fernique Inequality (Fernique, 1975) generalizes this to compare increments directly.
Sudakov-Fernique Inequality
Let $(X_t)_{t \in T}$ and $(Y_t)_{t \in T}$ be centered Gaussian processes such that for all $s, t \in T$:
$$\mathbb{E}\big[(X_s - X_t)^2\big] \le \mathbb{E}\big[(Y_s - Y_t)^2\big].$$
Then $\mathbb{E}\sup_{t \in T} X_t \le \mathbb{E}\sup_{t \in T} Y_t$.
Proof Insight: Variational Reformulation
The proof of Sudakov-Fernique follows the same interpolation logic but handles the variance terms by recognizing that:
$$\mathbb{E}[X_i X_j] - \mathbb{E}[Y_i Y_j] = \frac{1}{2}\big(\mathbb{E}X_i^2 - \mathbb{E}Y_i^2\big) + \frac{1}{2}\big(\mathbb{E}X_j^2 - \mathbb{E}Y_j^2\big) + \frac{1}{2}\Big(\mathbb{E}\big[(Y_i - Y_j)^2\big] - \mathbb{E}\big[(X_i - X_j)^2\big]\Big).$$
Substituting this into the formula for $\varphi'(t)$, the variance terms drop out: the smooth max satisfies $f(x + c\mathbf{1}) = f(x) + c$, hence $\sum_j \partial_i\partial_j f = 0$, so only the incremental-variance terms survive. By assumption each $\mathbb{E}[(Y_i - Y_j)^2] - \mathbb{E}[(X_i - X_j)^2] \ge 0$, and paired with the non-positive mixed derivatives this maintains the non-positivity of the derivative for the supremum functional.
🌊 Advanced Application: Random Matrix Theory
The Sudakov-Fernique inequality provides a remarkably simple proof for bounding the expected operator norm of a Gaussian random matrix.
Expected Operator Norm of a Gaussian Matrix
Let $G$ be an $m \times n$ matrix with i.i.d. $\mathcal{N}(0,1)$ entries. The operator norm is:
$$\|G\|_{\mathrm{op}} = \sup_{u \in S^{m-1},\, v \in S^{n-1}} u^\top G v.$$
1. The $X$-process: Define $X_{u,v} = u^\top G v$. The increments are
$$\mathbb{E}\big[(X_{u,v} - X_{u',v'})^2\big] = \|uv^\top - u'v'^\top\|_F^2 \le \|u - u'\|_2^2 + \|v - v'\|_2^2.$$
2. The $Y$-process: Let $g \sim \mathcal{N}(0, I_m)$ and $h \sim \mathcal{N}(0, I_n)$ be independent Gaussian vectors. Define $Y_{u,v} = \langle g, u \rangle + \langle h, v \rangle$. The increments are
$$\mathbb{E}\big[(Y_{u,v} - Y_{u',v'})^2\big] = \|u - u'\|_2^2 + \|v - v'\|_2^2.$$
3. Comparison: By Sudakov-Fernique,
$$\mathbb{E}\|G\|_{\mathrm{op}} \le \mathbb{E}\sup_{u} \langle g, u\rangle + \mathbb{E}\sup_{v} \langle h, v\rangle = \mathbb{E}\|g\|_2 + \mathbb{E}\|h\|_2 \le \sqrt{m} + \sqrt{n}.$$
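A quick simulation agrees with this bound; the dimensions and trial count below are arbitrary choices for illustration:

```python
import numpy as np

rng = np.random.default_rng(2)
m, n, trials = 40, 60, 200

# Spectral (operator) norm of i.i.d. N(0,1) matrices, ord=2 for 2-D arrays.
norms = [np.linalg.norm(rng.standard_normal((m, n)), ord=2)
         for _ in range(trials)]
bound = np.sqrt(m) + np.sqrt(n)
print(f"E||G|| ~ {np.mean(norms):.2f}  vs  sqrt(m) + sqrt(n) = {bound:.2f}")
```

The empirical mean sits just below $\sqrt{m} + \sqrt{n}$, consistent with the fact that the bound is asymptotically sharp.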
📉 Advanced Application: Sudakov Minoration
While Dudley’s Theorem provides an upper bound via metric entropy, Slepian’s logic allows us to establish a lower bound.
Sudakov Minoration
If $T$ contains $N$ points that are at least $\varepsilon$-separated in the Gaussian metric $d(s,t) = \big(\mathbb{E}(X_s - X_t)^2\big)^{1/2}$, we can compare the process to $N$ independent Gaussians with variance $\varepsilon^2/2$, which yields
$$\mathbb{E}\sup_{t \in T} X_t \ \ge\ c\,\varepsilon\sqrt{\log N}$$
for a universal constant $c > 0$.
Significance: This result (Sudakov, 1971) is the dual to Dudley’s integral. It proves that if a set has high metric entropy at a single scale, the process MUST fluctuate significantly. This is refined by Talagrand’s $\gamma_2$ functional, which “interpolates” between Sudakov and Dudley to achieve sharpness.
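The $\sqrt{\log N}$ growth is visible already in the i.i.d. case, where any two of the $N$ points are $\sqrt{2}$-separated in the Gaussian metric. The Monte Carlo helper `emax_iid` below is my own; it only checks the growth rate, not the constant:

```python
import numpy as np

rng = np.random.default_rng(3)
trials = 2000

def emax_iid(N):
    """Monte Carlo estimate of E[max of N i.i.d. standard Gaussians].
    Sudakov minoration predicts growth on the order of sqrt(log N);
    the upper bound sqrt(2 ln N) shows the rate is sharp here."""
    return rng.standard_normal((trials, N)).max(axis=1).mean()

for N in [10, 100, 1000]:
    print(f"N = {N:4d}:  E[max] ~ {emax_iid(N):.3f},"
          f"  sqrt(2 ln N) = {np.sqrt(2 * np.log(N)):.3f}")
```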
📝 Notes
- Gordon’s Inequality: A further refinement by Yehoram Gordon (Gordon, 1985) compares the expected value of min-max functionals $\mathbb{E}\big[\min_i \max_j X_{ij}\big]$, which is essential for studying the smallest singular values of random matrices.
- The Max Function: The key to Slepian’s Lemma is that the “max” function is sub-modular (the off-diagonal second derivatives satisfy $\partial_i\partial_j f \le 0$ for $i \neq j$). This ensures that increasing correlations (moving variables closer together) always reduces the expected maximum.
🔗 See Also
- on Dudley’s Theorem --- Slepian/Sudakov provides the necessary lower bounds that complement Dudley’s chaining upper bound.
- on eigenvalue estimate of kernel --- Comparison inequalities are the primary tool for bounding the spectra of random operators.