Zipf distribution: Difference between revisions

From Citizendium
imported>Richard Pinch
(New entry, just a stub)
 
imported>Richard Pinch
(supplied Reference Woodroofe)
Line 1: Line 1:
In [[probability theory]] and [[statistics]], the '''Zipf distribution''' and '''zeta distribution''' refer to a class of [[discrete probability distribution]]s.  They have been used to model the distribution of text strings and keys in databases.
In [[probability theory]] and [[statistics]], the '''Zipf distribution''' and '''zeta distribution''' refer to a class of [[discrete probability distribution]]s.  They have been used to model the distribution of words in a text, of text strings and keys in databases, and of the sizes of businesses and towns.


The Zipf distribution with parameter ''n'' assigns probability proportional to 1/''r'' to an integer ''r'' &le; ''n'' and zero otherwise, with [[normalization]] factor ''H''<sub>''n''</sub>, the ''n''-th [[harmonic number]].
Line 6: Line 6:


The zeta distribution with parameter ''s'' assigns probability proportional to 1/''r''<sup>''s''</sup> to all integers ''r'' with normalization factor given by the [[Riemann zeta function]] 1/ζ(''s'').
==References==
* {{cite book | author=Michael Woodroofe | coauthors=Bruce Hill | title=On Zipf's law | journal=J. Appl. Probab. | volume=12 | pages=425-434 | year=1975 | id=Zbl 0343.60012 }}

Revision as of 14:54, 21 December 2008

In probability theory and statistics, the Zipf distribution and zeta distribution refer to a class of discrete probability distributions. They have been used to model the distribution of words in a text, of text strings and keys in databases, and of the sizes of businesses and towns.

The Zipf distribution with parameter n assigns probability proportional to 1/r to an integer r ≤ n and zero otherwise, with normalization factor H_n, the n-th harmonic number.

A Zipf-like distribution with parameters n and s assigns probability proportional to 1/r^s to an integer r ≤ n and zero otherwise, with normalization factor H_{n,s} = 1 + 1/2^s + ... + 1/n^s, the n-th generalized harmonic number of order s.
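
As an illustration of the two finite definitions above, the following Python sketch computes the probabilities directly from the normalizing sum. The function name `zipf_pmf` and its arguments are hypothetical, chosen only for this example; it assumes s is a non-negative integer so that exact fractions can be used, and with s = 1 it reduces to the Zipf distribution with parameter n.

```python
from fractions import Fraction


def zipf_pmf(n, s=1):
    """Return [P(1), ..., P(n)] with P(r) proportional to 1/r**s.

    Hypothetical helper for illustration. s is assumed to be a
    non-negative integer so the arithmetic stays exact.
    """
    if n < 1:
        raise ValueError("n must be a positive integer")
    # Unnormalized weights 1/r**s for r = 1..n.
    weights = [Fraction(1, r ** s) for r in range(1, n + 1)]
    # Normalizing sum; this is the harmonic number H_n when s == 1.
    total = sum(weights)
    return [w / total for w in weights]


# Zipf distribution with n = 3: H_3 = 1 + 1/2 + 1/3 = 11/6,
# so the probabilities are 6/11, 3/11 and 2/11.
print(zipf_pmf(3))
```

Dividing each weight by the sum guarantees that the probabilities add up to exactly 1 for any admissible n and s.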

The zeta distribution with parameter s > 1 assigns probability proportional to 1/r^s to all positive integers r, with normalization factor 1/ζ(s) given by the Riemann zeta function.
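
The zeta distribution can be sketched in the same way. The example below is a rough illustration only: the names `zeta_partial` and `zeta_pmf` are hypothetical, and ζ(s) is approximated by a truncated sum, which is an assumption made for simplicity and not a recommended numerical method. For s = 2 the result can be compared with the known value ζ(2) = π²/6.

```python
import math


def zeta_partial(s, terms=100_000):
    """Crude approximation of zeta(s) by the first `terms` terms of the series."""
    if s <= 1:
        raise ValueError("the series converges only for s > 1")
    return sum(1.0 / r ** s for r in range(1, terms + 1))


def zeta_pmf(r, s, terms=100_000):
    """Approximate P(r) = r**(-s) / zeta(s) for a positive integer r."""
    if r < 1:
        raise ValueError("r must be a positive integer")
    return r ** (-s) / zeta_partial(s, terms)


# For s = 2, zeta(2) = pi**2 / 6, so P(1) should be close to 6 / pi**2.
print(zeta_pmf(1, 2), 6 / math.pi ** 2)
```

The truncated sum slightly underestimates ζ(s), so the computed probabilities are slightly too large; the error shrinks as `terms` grows.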

References

  • Michael Woodroofe; Bruce Hill (1975). "On Zipf's law". J. Appl. Probab. 12: 425-434. Zbl 0343.60012.