Zeta distribution
TemplateStyles' src attribute must not be empty.
This article needs additional citations for verification. (August 2011) |
Template:Probability distribution
In probability theory and statistics, the zeta distribution is a discrete probability distribution. If X is a zeta-distributed random variable with parameter s, then the probability that X takes the positive integer value k is given by the probability mass function
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle f_s(k) = \frac{k^{-s}}{\zeta(s)} }
where ζ(s) is the Riemann zeta function (which is undefined for s = 1).
The multiplicities of distinct prime factors of X are independent random variables.
The Riemann zeta function being the sum of all terms Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle k^{-s}} for positive integer k, it appears thus as the normalization of the Zipf distribution. The terms "Zipf distribution" and "zeta distribution" are often used interchangeably. But while the zeta distribution is a probability distribution by itself, it is not associated with Zipf's law with the same exponent.
Definition
The zeta distribution is defined for positive integers Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle k \geq 1} , and its probability mass function is given by Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle P(x=k) = \frac 1 {\zeta(s)} k^{-s}, } where Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle s>1} is the parameter, and is the Riemann zeta function.
The cumulative distribution function is given by Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle P(x \leq k) = \frac{H_{k,s}}{\zeta(s)},} where Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle H_{k,s}} is the generalized harmonic number Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle H_{k,s} = \sum_{i=1}^k \frac 1 {i^s}.}
Moments
The nth raw moment is defined as the expected value of Xn:
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle m_n = E(X^n) = \frac{1}{\zeta(s)}\sum_{k=1}^\infty \frac{1}{k^{s-n}}}
The series on the right is just a series representation of the Riemann zeta function, but it only converges for values of Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle s-n} that are greater than unity. Thus:
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle m_n = \begin{cases} \zeta(s-n)/\zeta(s) & \text{for } n < s-1 \\ \infty & \text{for } n \ge s-1 \end{cases} }
The ratio of the zeta functions is well-defined, even for n > s − 1 because the series representation of the zeta function can be analytically continued. This does not change the fact that the moments are specified by the series itself, and are therefore undefined for large n.
Moment generating function
The moment generating function is defined as
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle M(t;s) = E(e^{tX}) = \frac{1}{\zeta(s)} \sum_{k=1}^\infty \frac{e^{tk}}{k^s}.}
The series is just the definition of the polylogarithm, valid for Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle e^t<1} so that
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle M(t;s) = \frac{\operatorname{Li}_s(e^t)}{\zeta(s)}\text{ for }t<0.}
Since this does not converge on an open interval containing Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle t=0} , the moment generating function does not exist.
The case s = 1
ζ(1) is infinite as the harmonic series, and so the case when s = 1 is not meaningful. However, if A is any set of positive integers that has a density, i.e. if
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \lim_{n\to\infty}\frac{N(A,n)}{n}}
exists where N(A, n) is the number of members of A less than or equal to n, then
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \lim_{s\to 1^+}P(X\in A)\,}
is equal to that density.
The latter limit can also exist in some cases in which A does not have a density. For example, if A is the set of all positive integers whose first digit is d, then A has no density, but nonetheless, the second limit given above exists and is proportional to
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \log(d+1) - \log(d) = \log\left(1+\frac{1}{d}\right),\,}
which is Benford's law.
Infinite divisibility
The zeta distribution can be constructed with a sequence of independent random variables with a geometric distribution. Let Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle p} be a prime number and Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle X(p^{-s})} be a random variable with a geometric distribution of parameter Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle p^{-s}} , namely
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \mathbb{P}\left( X(p^{-s}) = k \right) = p^{-ks } (1 - p^{-s} )}
If the random variables Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle ( X(p^{-s}) )_{p \in \mathcal{P} }} are independent, then, the random variable Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle Z_s} defined by
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle Z_s = \prod_{p \in \mathcal{P} } p^{ X(p^{-s}) }}
has the zeta distribution: Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \mathbb{P}\left( Z_s = n \right) = \frac{1}{ n^s \zeta(s) }} .
Stated differently, the random variable Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \log(Z_s) = \sum_{p \in \mathcal{P} } X(p^{-s}) \, \log(p)} is infinitely divisible with Lévy measure given by the following sum of Dirac masses:
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \Pi_s(dx) = \sum_{p \in \mathcal{P} } \sum_{k \geqslant 1 } \frac{p^{-k s}}{k} \delta_{k \log(p) }(dx)}
See also
Other "power-law" distributions
External links
- Template:Cite CiteSeerX What Gut calls the "Riemann zeta distribution" is actually the probability distribution of −log X, where X is a random variable with what this article calls the zeta distribution.
- Weisstein, Eric W. "Zipf Distribution". MathWorld.