Zipf distribution: Difference between revisions
imported>Richard Pinch (New entry, just a stub) |
mNo edit summary |
||
(2 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
In [[probability theory]] and [[statistics]], the '''Zipf distribution''' and '''zeta distribution''' refer to a class of [[discrete probability distribution]]s. They have been used to model the distribution of text strings and keys in databases. | {{subpages}} | ||
In [[probability theory]] and [[statistics]], the '''Zipf distribution''' and '''zeta distribution''' refer to a class of [[discrete probability distribution]]s. They have been used to model the distribution of words in words in a text , of text strings and keys in databases, and of the sizes of businesses and towns. | |||
The Zipf distribution with parameter ''n'' assigns probability proportional to 1/''r'' to an integer ''r'' ≤ ''n'' and zero otherwise, with [[normalization]] factor ''H''<sub>''n''</sub>, the ''n''-th [[harmonic number]]. | The Zipf distribution with parameter ''n'' assigns probability proportional to 1/''r'' to an integer ''r'' ≤ ''n'' and zero otherwise, with [[normalization]] factor ''H''<sub>''n''</sub>, the ''n''-th [[harmonic number]]. | ||
Line 6: | Line 7: | ||
The zeta distribution with parameter ''s'' assigns probability proportional to 1/''r''<sup>''s''</sup> to all integers ''r'' with normalization factor given by the [[Riemann zeta function]] 1/ζ(''s''). | The zeta distribution with parameter ''s'' assigns probability proportional to 1/''r''<sup>''s''</sup> to all integers ''r'' with normalization factor given by the [[Riemann zeta function]] 1/ζ(''s''). | ||
==References== | |||
* {{cite book | author=Michael Woodroofe | coauthors=Bruce Hill | title=On Zipf's law | journal=J. Appl. Probab. | volume=12 | pages=425-434 | year=1975 | id=Zbl 0343.60012 }} | |||
[[Category:Suggestion Bot Tag]] |
Latest revision as of 12:00, 10 November 2024
In probability theory and statistics, the Zipf distribution and zeta distribution refer to a class of discrete probability distributions. They have been used to model the distribution of words in words in a text , of text strings and keys in databases, and of the sizes of businesses and towns.
The Zipf distribution with parameter n assigns probability proportional to 1/r to an integer r ≤ n and zero otherwise, with normalization factor Hn, the n-th harmonic number.
A Zipf-like distribution with parameters n and s assigns probability proportional to 1/rs to an integer r ≤ n and zero otherwise, with normalization factor .
The zeta distribution with parameter s assigns probability proportional to 1/rs to all integers r with normalization factor given by the Riemann zeta function 1/ζ(s).
References
- Michael Woodroofe; Bruce Hill (1975). On Zipf's law, 425-434. Zbl 0343.60012.