next up previous
Next: Other topics Up: Applications of IT in Previous: Large deviations: Sanov's theorem

Measure concentration

Measure concentration is currently a hot topic in probability theory (cf. Talagrand 1995, 1996). A general description of problems pertinent to that topic would lead too far, but one result now considered as an early example of a measure concentration theorem is well known to information theorists. It is the blowing up lemma (Margulis 1974, Ahlswede, Gács and Körner 1976) that says, intuitively, that by slightly ``blowing up'' any set tex2html_wrap_inline814 of not exponentially small probability, one gets a set of probability close to 1.

In IT, the blowing up lemma was originally just a mathematical tool, particularly useful in multiuser Shannon theory (cf. Csiszár and Körner 1981). Today it is an integral part of IT due to its information theoretic proof (Marton 1986) that was, actually, its first simple proof. Recently, Marton extended her approach to prove also other measure concentration results, providing beautiful examples of applications of IT in probability theory. Here we sketch some main ideas of that approach, restricting attention to the simplest case.

Let tex2html_wrap_inline816 be PD's on a finite set tex2html_wrap_inline510 , let tex2html_wrap_inline820 , and let

  equation145

denote the tex2html_wrap_inline822 -blow up of a set tex2html_wrap_inline814 (where d denotes normalized Hamming distance).

Marton (1996) proved that the distance

  equation149

of two subsets of tex2html_wrap_inline752 can be bounded in terms of their probabilities as

  equation152

With the choice tex2html_wrap_inline830 , when tex2html_wrap_inline832 , (2.15) gives the following strong version of the blowing up lemma:

  equation163

Sketch of proof of (2.15):

(i) The key idea is to show that given tex2html_wrap_inline820 and any PD tex2html_wrap_inline836 on tex2html_wrap_inline752 not necessarily of product form, there exist random variables tex2html_wrap_inline650 with distribution tex2html_wrap_inline842 and tex2html_wrap_inline844 with distribution tex2html_wrap_inline836 such that

  equation175

For n=1, this is obvious by Pinsker's inequality (1.4), since the minimum of tex2html_wrap_inline850 subject to tex2html_wrap_inline852 , tex2html_wrap_inline854 equals tex2html_wrap_inline856 .

For n>1, the proof of (2.17) goes by induction, using a coupling argument to extend tex2html_wrap_inline764 and tex2html_wrap_inline862 satisfying the induction hypothesis by suitable new components tex2html_wrap_inline864 .

(ii) The result (i) implies, by the triangle inequality, that given arbitrary PD's tex2html_wrap_inline836 and tex2html_wrap_inline868 on tex2html_wrap_inline752 , there exist tex2html_wrap_inline862 and tex2html_wrap_inline874 with distributions tex2html_wrap_inline836 and tex2html_wrap_inline868 such that

  equation189

(iii) Finally, (2.15) follows from (2.18), letting tex2html_wrap_inline836 and tex2html_wrap_inline868 be the conditional distributions obtained from tex2html_wrap_inline842 by conditioning on A and B, respectively. Indeed, with that choice, the left hand side of (2.18) is lower bounded by d(A,B) since tex2html_wrap_inline892 with probability 1.

This approach to derive bounds like (2.15) has been extended to distributions of certain processes with memory in the role of tex2html_wrap_inline842 , including mixing Markov chains (Marton 1996). So far, other approaches to measure concentration could not be so extended. It should be noted that a weaker asymptotic form of the blowing up lemma has been established for a rather broad class of processes, again with substantial use of IT ideas (Marton and Shields 1994).


next up previous
Next: Other topics Up: Applications of IT in Previous: Large deviations: Sanov's theorem

Ramesh Rao
Mon Apr 6 16:41:42 PDT 1998