On the Pareto Type III Distribution

Bottazzi, Giulio

doi:10.3389/fphy.2022.807199

BRIEF RESEARCH REPORT article

Front. Phys., 14 April 2022

Sec. Interdisciplinary Physics

Volume 10 - 2022 | https://doi.org/10.3389/fphy.2022.807199

On the Pareto Type III Distribution

GB
Giulio Bottazzi ^*

Institute of Economics, Scuola Superiore Sant’Anna, Pisa, Italy

Abstract

This note analyzes some properties of the Pareto Type III distribution. A three parameter version of the original two parameter distribution proposed by Pareto is introduced and both its density and characteristic function are derived. The analytic expression of the inverse distribution function is also obtained, together with an explicit expression of its moments of any order. Finally, a simple statistical exercise is proposed, designed to show the reliability of the Pareto Type III distribution in describing asymptotically dumped power-like behaviors.

1 Introduction

Zipf’s Law [1], and Pareto Laws in general [2], have been assumed, since a relative long time, as reliable descriptions of the distributional properties of variables characterizing different social and natural phenomena (see Newman [3], the critical review in Kleiber and Kotz [4] and references therein). The application is the domain of economics are many, ranging from portfolio theory to economic growth [5,6]. These Laws are usually assumed valid when sufficiently large realizations of the variable of interest are considered, that is for the upper tail of the associated distributions. If a random variable X follows the Pareto Law with tail index a, the associated survival function possesses a power-like asymptotic behavior, 1 − F(x) ∼ x^−a, so that its n-th moment E[Xⁿ], for sufficiently large values of n, it is not defined. This property is particularly relevant when small values of the tail parameter are considered. For instance, when a ≤ 2, the variance (n = 2) is already absent. Contrary to the Pareto hypothesis, however, the empirical evidence often suggests finite values for the central moments, also when heavy-tail distributed variables are considered. Sometimes this can be understood with the presence of a natural upper bound which limits the largest attainable value of a given variable. Indeed in several cases it has been reported that a truncated Pareto provides a better fit to the data (Burroughs and Tebbens, 2001) than a pure power Law. In this paper I introduce a new three parameter version of the two parameter family of distributions originally proposed in Pareto [7]. The added parameter is a “dispersion” parameter which explicitly accounts for the typical scale of the underlying random variable, thus reducing the negative effect of the latter on the estimation of the power and exponential coefficients reported in Creedy [8]. Since the shape of the distribution is essentially the same of the orignal two parameter family proposed by Pareto, following Kleiber and Kotz [4] I retain the name of Pareto Type III distribution. The exponential dumping in the upper tail makes this distribution particularly suitable in describing all those samples which display power behavior for intermediate values and a more than power-like decrease in probability above a certain threshold or for particularly large observations (c.f. the examples in Burroughs and Tebbens [9]).

2 The Pareto Type III Distribution

Consider the three parameter family of distributionswhere s > 0 is a scale parameter, a > 0 the power coefficient and b ≥ 0 the exponential coefficient. The upper-tail of the distribution is characterized by a power-like behavior dumped by an exponential factor. When the exponential coefficient is set to zero, b = 0, the previous distribution reduces to a Pareto Type II distribution [4], which is asymptotically equivalent to the original Pareto distribution. The effect of the values of the exponential coefficient b on the overall shape of the distribution can be see in Figure 1, where the logarithm of the survival function log(1 − F(x)) is reported. For x/s ≪ 1 (left panel), when b is large the exponential factor becomes leading and the survival function decreases linearly. At the same time, when x/s ≫ 1 (right panel) an increase in b introduces a deviation (on the log-log graph) from the straight line characteristic of the Pareto power like behavior. With respect to the original formulation of Pareto and some later works, the functional expression proposed in Eq. 1 contains the additional scale parameter s. The addition of the scale parameter does not change the general properties of the distribution. In fact, it can be simplified away by a simple redefinition of the random variable. However, it might be useful in application, when the typical scale of the variable under study is not clear and has to be derived from data. The density associated to Eq. 1 reads

FIGURE 1

The density f(x) is strictly decreasing, thus the Pareto III distribution is unimodal with mode in x = 0.

2.1 Characteristic Function and Moments

Following Abramowitz and Stegun [10] (equation 6.5.20, p. 262)) definewhere Γ(n, x) stands for the incomplete gamma function

Using the properties of Γ(n, x), it is immediate to obtain the following recurrence relation

Using this relation it is easy to show the following

Theorem 2.1. If the random variableXis distributed according toEq. 1, its characteristic functionϕ(k) = E[e^ikX] reads

ProofBy definition one haswhich, after the change of variable z = x/s + 1, reduces toUsing Eq. 3 this can be rewritten aswhich using (Eq. 4) reduces to (Eq. 5).In principle, all the moments of Eq. 1 can be obtained from the characteristic function (Eq. 5). For instance, one immediately has that the mean m₁ isA more practical expression can be obtained for higher moments.

Theorem 2.2. The momentsm_n = E[Xⁿ], whereXis distributed according toEq. 1, admit the following representation:

ProofBy definition of the n-th moment and using z = x/s + 1 one hasTaking the binomial expansion of (z − 1)ⁿ and using Eq. 3, it becomeswhich given Eq. 4 and the definition of the α function in terms of incomplete gamma function, reduces to Eq. 7.

2.2 Inverse Distribution Function

The distribution function in Eq. 1 is defined over [0, + ∞) and has image in [0, 1). In this Section 1 derive its inverse, the quantile function Q(q). Consider the real function g(x) = x e^x. Since it is continuous and strictly increasing for x ≥−1, it admits a continuous inverse W(y) defined for y ≥−1/e, that satisfies the equationsfor any y ≥−1/e and x ≥−1. The function W(y) corresponds to the real branch of the Lambert function (c.f. Jeffrey et al. [11]).

Theorem 2.3. The inverse distribution functionQofEq. 1readswhereW(x) is the real branch of the Lambert function.

ProofLet q ∈ [0, + ∞). We are interested in the value of x that satisfy F(x) = q. Considering the expression in Eq. 1 this reduces to the equationThe expression can be rewritten in terms of z = (x/s + 1) b/a. After some manipulations one getsThe right hand side is non-negative, thus from Eq. 8 it iswhence the assertion.When q ∈ [0, + ∞), the argument of the function W in Eq. 9 is positive, so that the above expression is well defined. From Eq. 8, it follows that Q(0) = s a/b W(b/a exp(b/a)) − s = 0, as expected. The quantile function Eq. 9 can be used to generate i.i.d. pseudo-random variables extracted from a Pareto Type III distribution: generate a set of independent realizations {q_i} uniformly distributed in [0, 1) and apply the inverse distribution function Q to each realization to obtain a set {Q(q_i)} of independent variates distributed according to Eq. 1. Another application of the quantile function, common in exploratory data analysis, is the construction of q-q plots to investigate the agreement of data to the theoretical model.

3 Firms Size Distribution

The upper tail exponential cut-off which characterizes the Pareto Type III distribution can prove extremely useful in obtaining better and more reliable descriptions of empirical data. I illustrate this claim with an example based on the size distribution of business firms. The Pareto III distribution will be used to describe the upper tail of the Empirical Distribution Function (EDF) of annual revenues of the largest worldwide companies. This choice seems particularly fit, as it was precisely the problem of finding a reliable statistical description of the distribution of wealth which originally prompted the work of Pareto. I consider the Fortune 500 database, published each year by the Fortune magazine, which collects the revenues of the largest 500 firms in the world. Using Maximum Likelihood (ML) estimation (c.f. Hill [12] and for details Bottazzi et al. [13]) I fit the Pareto Type III distribution defined in Eq. 1 to the EDF upper tail, using subpopulations of different sizes. The analytic nature of the distribution and density functions, Eqs 1, 2, guarantee an asymptotic optimal performance of the ML approach. For comparison, I also fit a Pareto Type I distribution defined as

The Pareto Type I is the distribution traditionally applied to the description of the upper tail of wealth or income distribution (see for instance Castaldi and Milakovic [14] and references therein). Results are reported in Figure 2: the power-like decay of the Pareto I distribution shows up as a straight line in the log-log plot (left panel) while the exponential cut-off of the Pareto III distribution appears as a convex shape. As can be seen, the latter is remarkably less sensitive to the size of the considered sample and much more adapted to empirical observations. The estimate values for the tail index and the exponential correction are reported in Table 1.

FIGURE 2

TABLE 1

	Pareto I			Pareto III
sample size	5%	10%	30%	5%	10%	30%
	1.74	1.39	1.13	1.74	1.31	1.48
	-	-	-	0.024	0.026	0.023

Estimated coefficients for Pareto Type I and III on Fortune 500 data using different upper tail fractions of observations.

4 Conclusion

The Pareto Type III distribution, originally proposed by Pareto in 1896 as one of the possible statistical characterization of the distributional properties of the wealth in the Grand Duchy of Oldenburg, has never received much attention. However, its exponential asymptotic shape assures the existence of all central moments. Due to its relative simplicity and analytical tractability, it can be successfully used as an alternative candidate to probe for the presence of “deviation” from the power-like behavior of the upper tail of empirical distributions. The use of an exponentially dumped power behaviour could help in shading some new light on the difficulty of statistically differentiate between a power-like or a stretched exponential behavior [15]. As shown in Gabaix [16], the assumption of a power-like behavior of the size distribution of individual firms has important implications for the observed dynamics at the aggregate, macroeconomic, level. Thus, investigating the validity and extent of the Paretian assumption acquires a new relevance and the particular distribution analyzed in these notes might help in this investigation.

Statements

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: http://cafim.santannapisa.it/∼giulio/other/Fortune500/index.html.

Author contributions

The author confirms being the sole contributor of this work and has approved it for publication.

Funding

Support from the Scuola Superiore Sant’Anna (grant E6006GB) and from the EU (Contract No 12410 (NEST)) is gratefully acknowledged.

Acknowledgments

I would like to thank Herman Rubin for his suggestion of using Lambert function to express the Pareto Type III inverse distribution function. I also thank Federico Tamagni and Angelo Secchi for a number of helpful comments.

Conflict of interest

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1.
ZipfGK. Human Behaviour and the Principle of Least-Effort. Cambridge MA: Addison-Wesley (1949).
- Google Scholar
2.
ParetoV. Cours D’économie Politique. Lousanne: Ed. Rouge (1987).
- Google Scholar
3.
NewmanM. Power Laws, Pareto Distributions and Zipf's Law. Contemp Phys (2005) 46:323–51. 10.1080/00107510500052444
- CrossRef
- Google Scholar
4.
KleiberCKotzS. Statistical Size Distributions in Economics and Actuarial Sciences. New York: Wiley and Sons Statistics (2003).
- Google Scholar
5.
de WitG. Firm Size Distributions. Int J Ind Organ (2005) 23:423–50. 10.1016/j.ijindorg.2005.01.012
- CrossRef
- Google Scholar
6.
GabaixX. Power Laws in Economics and Finance. Annu Rev Econ (2009) 1:255–94. 10.1146/annurev.economics.050708.142940
- CrossRef
- Google Scholar
7.
ParetoV. Sur la Courve de la rèpartition de la richesse, Universitè de Lousanne. English traslation. Revista di Politica Economica (1986) 87:647–700.
- Google Scholar
8.
CreedyJ. Notes and Memoranda Pareto and the Distribution of Income*. Rev Income Wealth (1977) 23:405–11. 10.1111/j.1475-4991.1977.tb00026.x
- CrossRef
- Google Scholar
9.
BurroughsSMTebbensSF. Upper-truncated Power Law Distributions. Fractals (2001) 09:209–22. 10.1142/s0218348x01000658
- CrossRef
- Google Scholar
10.
AbramowitzMStegunIA. Handbook of Mathematical Functions. New York: Dover (1964).
- Google Scholar
11.
JeffreyDJHareDEGCorlessRM. Unwinding the Branches of the Lambert W Function. Math Scientist (1996) 21:1–7.
- Google Scholar
12.
HillBM. A Simple General Approach to Inference about the Tail of a Distribution. Ann Stat (1975) 3:1163–74. 10.1214/aos/1176343247
- CrossRef
- Google Scholar
13.
BottazziGPirinoDTamagniF. Zipf Law and the Firm Size Distribution: a Critical Discussion of Popular Estimators. J Evol Econ (2015) 25:585–610. 10.1007/s00191-015-0395-7
- CrossRef
- Google Scholar
14.
CastaldiCMilakovićM. Turnover Activity in Wealth Portfolios. J Econ Behav Organ (2007) 63(3):537–52. 10.1016/j.jebo.2005.06.008
- CrossRef
- Google Scholar
15.
ClausetAShaliziCRNewmanMEJ. Power-Law Distributions in Empirical Data. SIAM Rev (2009) 51(4):661–703. 10.1137/070710111
- CrossRef
- Google Scholar
16.
Gabaixx. The Granular Origins of Aggregate Fluctuations. Econometrica (2011) 79:733–72.
- Google Scholar

Summary

Keywords

pareto distribution, fat tails, power law, Zipf’s law, firm size distribution

Citation

Bottazzi G (2022) On the Pareto Type III Distribution. Front. Phys. 10:807199. doi: 10.3389/fphy.2022.807199

Received

01 November 2021

Accepted

16 March 2022

Published

14 April 2022

Volume

10 - 2022

Edited by

Sergio Da Silva, Federal University of Santa Catarina, Brazil

Reviewed by

Pushpa Rathie, University of Brasilia, Brazil

Roberto Vila Gabriel, University of Brasilia, Brazil

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Giulio Bottazzi, g.bottazzi@santannapisa.it

This article was submitted to Interdisciplinary Physics, a section of the journal Frontiers in Physics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Interdisciplinary Physics

BRIEF RESEARCH REPORT article

On the Pareto Type III Distribution

Abstract

1 Introduction