logo de MathNerds
Best
Texans
Volunteer
Logout
cartoon of two MathNerds
Archive
Ask a Question!
Contact Us
FAQ
Legal
Links
My Home
Networks
Sponsors
Volunteer
frame lower left corner
Haga clic aquí para ver esta página en español.

Distribution of \{ n \alpha \}

Prepared by Valerio De Angelis

We prove the following theorem:

Theorem A

Let \alpha be a real number, and let x_n = \{ n \alpha \} be the fractional part of n\alpha.

i) If \alpha is rational, then the sequence x_n is periodic, and so \{ x_n:n \ge 1 \} is a finite set

ii) If \alpha is irrational, then x_n is uniformly distributed over the interval [0,1), in the sense that for every interval I \subseteq [0,1),

\lim_{n \to \infty} \frac{1}{n} | \{ k: x_k \in I, 1 \le k \le n \} | = |I| \quad \quad \mathrm{(1)}.

Proof of part i)

If \alpha = p/q, where p and q are integers, then

x_{n+q} = \left\{ (n+q)\frac{p}{q} \right\}= \left\{ n \frac{p}{q} + p \right\} = \left\{n \frac{p}{q} \right\} = x_n.

So x_n has period q, and \{ x_n : n \ge 1 \} = \{ x_1, x_2, \dots, x_q \}.

Proof of part ii)

We will make use of the following basic real analysis result:

Theorem B Any continuous function f(x) on [0,1] can be expressed as an infinite sum

f(x) = \sum_{j=0}^\infty \left( a_j \sin(2 \pi j x) + b_j \cos (2 \pi j x) \right).

This infinite sum is called the Fourier series of the function f(x). In addition, if we write

S_m(x) = \sum_{j=0}^m \left( a_j \sin(2 \pi j x) + b_j \cos (2 \pi j x) \right),

we have

\lim_{m \to \infty} \max_{0 \le x \le 1}|S_m(x) - f(x)| = 0.

Most textbooks of real analysis discuss Fourier series. We will not provide a proof of Theorem B in these notes. For a good online reference, click here.

Note: The meaning of the first statement in the theorem is that

\lim_{m \to \infty} |S_m(x) - f(x)| = 0

for every x. In other words, given any x and any \epsilon > 0 (which we can think of as a tolerance level for the error), we can find some N such that |S_m(x) - f(x)| \le \epsilon as soon as m is larger than N, but this number N may well be different for different x. The second statement is stronger: it tells us that given a tolerance number \epsilon > 0 for the difference |S_m(x) - f(x)|, we can find a single N such that when m is larger than N, |S_m(x) - f(x)| \le \epsilon is true for every x. This is expressed by saying that the infinite series converges uniformly to f(x) on [0,1].

We now proceed to the proof of (1), assuming that \alpha is irrational. For any interval I \subseteq [0,1), let's denote by \chi_I(x) the indicator function of I, that is,

\chi_I(x) = \begin{cases} 1 & \text{if $x \in I$} \\ 0 & \text{if $x \not \in I$} \\ \end{cases}

So (1) is equivalent to:

\lim_{n \to \infty} \frac{1}{n} \sum_{k=1}^n \chi_I(x_k) = |I| \qquad \mathrm{(2)}

Note that \int_0^1 \chi_I(x) \, dx = |I|, so that (2) is the same as

\lim_{n \to \infty} \frac{1}{n} \sum_{k=1}^n \chi_I(x_k) = \int_0^1 \chi_I(x) \, dx \qquad \mathrm{(3)}

We will first prove that (3) holds for any continuous function f(x) on [0,1] (of course, \chi_I(x) is not continuous). So we will prove that for any continuous function f(x) on [0,1]

\lim_{n \to \infty} \sum_{k=1}^n f(x_k) = \int_0^1 f(x) \, dx\qquad \mathrm{(4)}

First we fix an \epsilon > 0 and use Theorem B to find some m such that the partial sum S_m(x) = \sum_{j=0}^m ( a_j \sin(2 \pi j x) + b_j \cos (2 \pi j x)) is uniformly within \epsilon of f(x), that is S_m(x) - \epsilon \le f(x) \le S_m(x) + \epsilon for all x in [0,1]. Then we calculate

\begin{align} \frac{1}{n} \sum_{k=1}^n f(x_k) &\le \frac{1}{n} \sum_{k=1}^n (S_m(x_k) + \epsilon) = \frac{1}{n} \sum_{k=1}^n S_m(x_k) + \frac{1}{n} \sum_{k=1}^n \epsilon = \frac{1}{n} \sum_{k=1}^n S_m(x_k) + \epsilon \\ &= \frac{1}{n} \sum_{k=1}^n \sum_{j=0}^m \left( a_j \sin(2 \pi j x_k) + b_j \cos (2 \pi j x_k) \right) + \epsilon) \\ &= \frac{1}{n} \sum_{j=0}^m a_j \sum_{k=1}^n \sin(2 \pi j x_k) + \frac{1}{n} \sum_{j=0}^m b_j \sum_{k=1}^n \cos (2 \pi j x_k) + \epsilon. \qquad \mathrm{(5)} \end{align}

So we are led to consider the sums \sum_{k=1}^n \sin(2 \pi j x_k) and \sum_{k=1}^n \cos(2 \pi j x_k). To compute these sums, we note that they are the imaginary and real part, respectively, of the sum \sum_{k=1}^n \exp(2 \pi i j x_k), which is easily computed for j > 0 :

\begin{align} \sum_{k=1}^n \exp( 2 \pi i j x_k) &= \sum_{k=1}^n \exp(2 \pi i j \{\alpha k\}) = \sum_{k=1}^n \exp(2 \pi i j (\alpha k - \lfloor \alpha k \rfloor)) = \sum_{k=1}^n \exp(2 \pi i j \alpha k) \exp( - 2 \pi i j \lfloor \alpha k \rfloor) \\ &= \sum_{k=1}^n ( \exp(2 \pi i j \alpha) ) ^k = \exp(2 \pi i j \alpha) \, \frac{\exp(2 \pi i j n \alpha) - 1}{\exp(2 \pi i j \alpha) - 1} \\ &= \exp(2 \pi i j \alpha) \, \frac{\exp(\pi i j n \alpha)}{\exp(\pi i j \alpha)} \; \frac{\exp(\pi i j n \alpha) - \exp(-\pi i j n \alpha)}{\exp(\pi i j \alpha) - \exp(-\pi i j \alpha)} \\ &= \exp(\pi i j (n+1) \alpha) \, \frac{\sin(\pi j n \alpha)}{\sin(\pi j \alpha)} \\ &= \cos(\pi j (n+1) \alpha) \, \frac{\sin(\pi j n \alpha)}{\sin(\pi j \alpha)} + i \sin(\pi j (n+1) \alpha) \, \frac{\sin(\pi j n \alpha)}{\sin(\pi j \alpha)} \end{align}

The crucial ingredient for the legitimacy of this calculation is the fact that \alpha is irrational, because it ensures that for every non-zero integer j, \sin(\pi j \alpha) \ne 0.

So we conclude that for each fixed non-zero integer j,

\left| \sum_{k=1}^n \sin(2 \pi j x_k) \right| = \left| \sin(\pi j (n+1) \alpha) \, \frac{\sin(\pi j n \alpha)}{\sin(\pi j \alpha)} \right| \le \frac{1}{| \sin(\pi j \alpha) |}
and
\left| \sum_{k=1}^n \cos(2 \pi j x_k) \right| = \left| \cos(\pi j (n+1) \alpha) \,\frac{\sin(\pi j n \alpha)}{\sin(\pi j \alpha)} \right| \le \frac{1}{| \sin(\pi j \alpha) |}

This means that all the terms in the sums over j in (5) will tend to zero as n gets large, except the term in the second sum with j = 0.

Since the sums in (5) are finite, we can find some N such that

\left| \sum_{j=1}^m a_j \frac{1}{n} \sum_{k=1}^n \sin(2 \pi j x_k) \right| \le \epsilon
and
\left| \sum_{j=1}^m b_j \frac{1}{n} \sum_{k=1}^n \cos(2 \pi j x_k) \right| \le \epsilon

whenever n \ge N. Then we conclude that if n \ge N,

\frac{1}{n} \sum_{k=1}^n f(x_k) \le \epsilon + \epsilon + \frac{1}{n} \sum_{k=1}^n b_0 = b_0 + 3 \epsilon.

Now note that b_0 = \int_0^1 f(x) \, dx, as it follows by integrating the Fourier series for f(x) over [0,1].

So we have proved that given any \epsilon > 0, we can find some N such that

\frac{1}{n} \sum_{k=1}^n f(x_k) \le \int_0^1 f(x) \, dx + 3 \epsilon

whenever n \ge N. In just the same way we can prove that

\frac{1}{n} \sum_{k=1}^n f(x_k) \ge \int_0^1 f(x) \, dx - 3 \epsilon

and we conclude that \left| \frac{1}{n} \sum_{k=1}^n f(x_k) - \int_0^1 f(x) \, dx \right| \le 3 \epsilon for all large N. This means, by definition, that \lim_{n \to \infty} \sum_{k=1}^n f(x_k) = \int_0^1 f(x) \, dx, and (4) is proved.

To finish our proof, we now need to show that (3) follows from (4). The idea is simple: even though \chi_I(x) is not continuous, we can approximate it with continuous functions. To be precise, for each \epsilon > 0 we can find a continuous function f(x) such that \chi_I(x) \le f(x) and \int_0^1 f(x) \, dx \le |I| + \epsilon.

For example, if I=[a,b], we can take \delta \le \frac{\epsilon}{2} in the picture below to make the shaded area less than \epsilon, with some obvious modifications in case one of the end points of I is an end point of [0,1].

graph of f and chi

Then we find, for large enough n,

\frac{1}{n} \sum_{k=1}^n \chi_I(x_k) \le \frac{1}{n} \sum_{k=1}^n f(x_k) \le \int_0^1 f(x) \, dx + \epsilon \le |I| + 2 \epsilon.

In a similar way, we can find a continuous function g(x) such that g(x) \le \chi_I(x) and \int_0^1 g(x) \, dx \ge |I| - \epsilon (take the base of the trapezoid in the figure to be inside I). Then for large enough n we have

\frac{1}{n} \sum_{k=1}^n \chi_I(x_k) \ge \frac{1}{n} \sum_{k=1}^n g(x_k) \ge \int_0^1 g(x) \, dx - \epsilon \ge |I| - 2 \epsilon.

In conclusion, we have proved that given any \epsilon > 0, we have | \frac{1}{n} \sum_{k=1}^n \chi_I(x_k) - |I| \, | \le 2 \epsilon for all large enough n, which means that \lim_{n \to \infty} \sum_{k=1}^n \chi_I(x_k) = |I|, and (3) is proved. This concludes the proof of Theorem A.


Notes

The problem of determining the asymptotic distribution of sequences determined by simple transformations of the unit interval has been much investigated. The sequence we considered in this problem arises from the transformation T(x)=x+\alpha \pmod{1} (a translation by \alpha), because we have x_n = T^n(0). A much more difficult problem is to understand the distribution of sequences arising from multiplication by a number, for example T(x) = \alpha x. Iteration of this transformation leads to sequences such as x_n = (3/2)^n \pmod{1}, for which it is unknown whether the asymptotic distribution is uniform.

A combinatorial and fairly elementary proof of Theorem A is found in An Introduction to the Theory of Numbers, 6th edition (2008) by Hardy and Wright, Theorem 445, pp. 520-521. (This reference provided by Allen Stenger). A succinct proof along the lines of our exposition is in An Introduction to Ergodic Theory by Peter Walters, Springer-Verlag Graduate Texts in Mathematics, 2000, Theorem 1.8.

Click here to return to the article "The first digit of 2^n".

© 1999 - 2012 MathNerds TM. All Rights Reserved.
Email the Webmaster