7.2
CiteScore
3.7
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
ABUNDANCE ESTIMATION IN AN ARID ENVIRONMENT
Case Study
Correspondence
Corrigendum
Editorial
Full Length Article
Invited review
Letter to the Editor
Original Article
Retraction notice
REVIEW
Review Article
SHORT COMMUNICATION
Short review
7.2
CiteScore
3.7
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
ABUNDANCE ESTIMATION IN AN ARID ENVIRONMENT
Case Study
Correspondence
Corrigendum
Editorial
Full Length Article
Invited review
Letter to the Editor
Original Article
Retraction notice
REVIEW
Review Article
SHORT COMMUNICATION
Short review
View/Download PDF

Translate this page into:

Original article
32 (
1
); 356-365
doi:
10.1016/j.jksus.2018.05.021

The Marshall-Olkin extended inverted Kumaraswamy distribution: Theory and applications

College of Statistical and Actuarial Science, University of the Punjab, Lahore, Pakistan
Quality Enhancement Cell (QEC), National College of Arts, Lahore, Pakistan

⁎Corresponding author. usmanrana0331@gmail.com (Rana Muhammad Usman)

Disclaimer:
This article was originally published by Elsevier and was migrated to Scientific Scholar after the change of Publisher.

Peer review under responsibility of King Saud University.

Abstract

A new three parametric distribution is proposed and analyzed, termed as the Marshall-Olkin extended inverted Kumaraswamy (MOEIK) distribution. This generalization has some renowned sub models such as the Beta type II, the Lomax and the Fisk distribution, stated in literature. Study includes the basic properties of the observed probabilistic model. Explicit expressions for major mathematical properties of this distribution such as quantile function, complete and incomplete moments, entropies and moments of order statistic are derived. Maximum likelihood estimation method is used to estimate the parameters. For different parameter values, a number of simulation studies are conducted for different sample sizes and compare the performance of the MOEIK distribution. Three real life applications are provided to explain the potentiality and reliability of the extended distribution with confidence that the generalized model have wider applications in hydrology and the associated fields.

Keywords

Inverted Kumaraswamy distribution
Quantile function
Moment generating function
Order statistic
Maximum likelihood
1

1 Introduction

From last few decades researchers have greater intention toward the inversion of univariate probability models and their applicability under inverse transformation i.e. inverted beta (Dubey, 1970), inverse Rayleigh (Voda, 1972), inverse Gaussian (Folks and Chhikara, 1978), inverse Weibull (Calabria and Pulcini, 1990), inverted Burr type XII also called Burr type III (Al-Dayian, 1999), inverse Rayleigh (Rosaiah and Kantam, 2005), inverted exponential (Dey, 2007), inverse Lindley (Sharma et al., 2015) and many other distributions.

Since two-parameter Kumaraswamy distribution has much importance as a substitute of the beta distribution introduced by Kumaraswamy (1980) during the explanation of hydrological applications. He had found that numerous important distributions such as log normal distribution, normal distribution, beta distribution and many more do not have better fit on hydrological data i.e. daily stream flow data and rain fall data. Moreover, the model endorse wide range of applications including test scores, atmospheric temperature, height of individuals and many others (Jones, 2009; Sindhu et al., 2013; El-Deen et al., 2014). The distribution has two shape parameters α > 0 and β > 0 while interval of random variable belong to distribution is [0, 1]. Many researchers such as Seifi et al. (2000), Ganji et al. (2006) and Nadarajah (2008) had presented the betterment of Kumaraswamy distribution over beta distribution. These models have much resemblance in their basic properties (Lemonte et al., 2013).

In literature, it had seen that IK distribution has long right tail as compared to other existing distributions. Therefore, it will have long term affect in passionate predictions of rare events and reliability predictions (Iqbal et al., 2017). To follow the concept of the inverted distribution, Al-Fattah et al. (2017) developed the inverted Kumaraswamy distribution by using the inverse transformation T = 1 Y - 1 , where Y follows the Kumaraswamy distribution with parameters α and β . The probability density function (pdf) and its corresponding cumulative distribution function (cdf) of the Inverted Kumaraswamy (IK) distribution with two shape parameters α > 0 , β > 0 are

(1)
g ( t ) = α β ( 1 + t ) - α - 1 ( 1 - ( 1 + t ) - α ) β - 1 ; t 0 , α , β > 0
(2)
G ( t ) = ( 1 - ( 1 + t ) - α ) β

The main purpose of the study is to develop the Marshall-Olkin extended inverted Kumaraswamy distribution to increase the flexibility of the parent model and study its behavior and basic properties. Recently Haq et al. (2017) used this transformation to develop Marshall-Olkin length biased moment exponential distribution. Marshall and Olkin (1997) introduced an innovative technique to add an additional shape parameter to the existing distribution and distribution resulting in the Marshall-Olkin distribution. If G ( x ) is the cdf and G ¯ ( x ) = 1 - G ( x ) is the survival rate function (srf) then the srf of Marshall-Olkin family takes the form

(3)
F ¯ ( x ; φ ) = λ G ¯ ( x ; φ ) 1 - λ ¯ G ¯ ( x ; φ ) where λ ¯ = 1 - λ and φ represents the parameters of the parent distribution. It is clear that we can obtain the base distribution for λ = 1 . The parameter λ is generally called “tilt parameter”. The probability density function of the Marshall Olkin generated family is
(4)
f ( x ; φ ) = λ g ( x ; φ ) [ 1 - λ ¯ G ¯ ( x ; φ ) ] 2

The major motivation of the MO family is given as follows: Let Z N = min { T 1 , , T N } , where T 1 , , T N are a sequence of IID random variables from G ( x ) and N is a random variable with probability mass function γ ( 1 - γ n - 1 ) , n = 1 , 2 , . Then, the CDF of Z N is defined by; P ( Z N x ) = 1 - n = 1 P ( Z N x | N = n ) P ( N = n ) = G ( x ) G ( x ) + γ G ¯ ( x ) , which is equivalent to (3).

Some explicit expressions of the moments, incomplete moments, mode, random number generator, order statistics and its moments are provided in Section 3. Section 4 explains the simulation study of true parameters estimated from maximum likelihood estimation. Three real life applications are also provided in Section 5 to explain the flexibility of the observed model as compared to some existing models. The Section 6 concludes the study.

2

2 The MOEIK distribution

By using the Marshall-Olkin generator, the pdf and cdf of the MOEIK distribution with a new parameter λ > 0 are given respectively.

(5)
f ( x ; λ , α , β ) = λ α β ( 1 + x ) - α - 1 ( 1 - ( 1 + x ) - α ) β - 1 [ 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ] 2 for α > 0 , β > 0 and x > 0
(6)
F ( x ; λ , α , β ) = ( 1 - ( 1 + x ) - α ) β 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β )

Since the distribution function is complex therefore, to get the explicit expression of the distribution. We use some expansion functions follow the generalized binomial theorem (for 0 < a < 1), ( 1 - a ) - n = k = 0 Γ ( n + k ) Γ ( n ) Γ ( k + 1 ) a k and ( 1 + z ) b = j = 0 b j z j , | z | > 0

Under these expansions, the pdf of the MOEIK model becomes

(7)
f ( x ; λ , α , β ) = λ α β ( 1 + x ) - α - 1 i , j = 0 w i , j ( 1 - ( 1 + x ) - α ) β j + β - 1 where w i , j = Γ ( 2 + i ) ( 1 - λ ) i ( - 1 ) j Γ ( 2 ) Γ ( i + 1 ) i j

The survival rate function and the hazard rate function (hrf) of the observed models are

(8)
R ( x ; λ , α , β ) = λ ( 1 - ( 1 - ( 1 + x ) - α ) β ) 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β )
(9)
h ( x ; λ , α , β ) = α β ( 1 + x ) - α - 1 ( 1 - ( 1 + x ) - α ) β - 1 ( 1 - ( 1 - ( 1 + x ) - α ) β ) [ 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ]
respectively.

A simple motivation of the MOEIK distribution follows by considering a parallel system with Z independent components and suppose that a random variable Z has the geometric distribution with the probability mass function P ( Z = z ) = γ - 1 ( 1 - γ - 1 ) z - 1 , z = 1 , 2 , and γ > 1 . Let T 1 , T 2 , represent the lifetimes of each component and suppose that they have the IK distribution given in (1). Then a random variable X = max ( T 1 , , T z ) represents the lifetime of the system. Therefore, the random variable X follows the MOEIK in (5).

2.1

2.1 Limiting behavior for the MOEIK density function

Lemma 1

For x approaches to origin, limits of the MOEIK density function is as follows

(10)
lim x 0 f ( x ) = for β < 1 α β β λ for β = 1 0 for β > 1

Proof:

As the pdf of the MOEIK distribution is lim x 0 f ( x ) = lim x 0 λ α β ( 1 + x ) - α - 1 ( 1 - ( 1 + x ) - α ) β - 1 [ 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ] 2

The quantity lim x 0 ( 1 + x ) - α - 1 1 and lim x 0 [ 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ] 2 λ 2

The above expression becomes lim x 0 f ( x ) = lim x 0 α β λ ( 1 - ( 1 + x ) - α ) β - 1 lim x 0 f ( x ) = lim x 0 α β λ 1 - 1 - α x + ( α x ) 2 2 ! - ( α x ) 3 3 ! + β - 1 lim x 0 f ( x ) = lim x 0 α β β λ x β - 1 1 - α x 2 ! + ( α x ) 2 3 ! + β - 1

Now the results can be formed easily. □

2.2

2.2 Shapes of the density and hazard rate function

The Fig. 1(a) and (b) represent the behavior of the density function and explains the tractability and flexibility of the model graphically with its sub-families. The pdf plot shows that for β < 1 , the newly developed model has exponentially decreasing behavior. For β = 1 , the curves also have exponentially decreasing behavior but starts from y-axis while for β > 1 , the distribution has unimodal positively skewed behavior. Fig. 1(c) and (d) represents behavior of hazard rate function (hrf) which is unimodal at various parameter combinations and also have bathtub and decreasing behavior for small α and β , while λ approaches to zero. Moreover, the observed model can have upside down bathtub behavior as the x-axis increases.

Density function 1 (a) and 1 (b) and hazard rate function 1 (c) and 1 (d) at different parameter values.
Fig. 1
Density function 1 (a) and 1 (b) and hazard rate function 1 (c) and 1 (d) at different parameter values.

Table: Some special models of MOEIK distribution.

Special Cases Resultant model Pdf for x > 0
β = λ = 1 Lomax distribution ( α ) α ( 1 + x ) α + 1 for α > 0
β = 1 Marshall-Olkin Lomax ( α , λ ) λ α ( 1 + x ) - α - 1 [ 1 - ( 1 - λ ) ( 1 + x ) - α ] 2 for λ > 0 , α > 0
α = λ = 1 Beta type II ( β ) 1 B ( β , 1 ) x β - 1 ( 1 + x ) - ( β + 1 ) for β > 0
α = 1 Marshall-Olkin Beta type II ( β , λ ) λ x β - 1 ( 1 + x ) - ( β + 1 ) B ( β , 1 ) 1 - ( 1 - λ ) 1 - x 1 + x β - 2 β , λ > 0
α = β = λ = 1 The log-logistic (Fisk) distribution 1 ( 1 + x ) 2
α = β = 1 Marshall-Olkin log-logistic ( λ ) λ ( 1 + x ) 2 1 - ( 1 - λ ) 1 + x - 2 for λ > 0

2.3

2.3 Quantile function

Let random variable X MOEIK ( λ , α , β ) having pdf in Eq. (5). The computation of the quantile function, say x = Q ( u ) of the MOEIK distribution can be attained by inverting Eq. (6)

(11)
x = Q ( u ) = F - 1 ( u ) = 1 - u λ 1 - u ( 1 - λ ) 1 β - 1 α - 1 where U follows the uniform distribution under limits [0, 1]. We can compute the first quartile, median and the third quartile of the observed distribution by inserting u = 0.25 , 0.5 and 0.75 respectively.

3

3 Properties of MOEIK distribution

This section provides unambiguous expressions for some major properties of the observed distribution starting from the ordinary moments.

3.1

3.1 Moments

This subsection provides the unambiguous expressions of the rth non-central moment and cumulants. Numerous important features and characteristics such as tendencies, coefficient of skewness, coefficient of kurtosis and dispersions can be studied from moments. The given theorem provide moments.

The rth raw moments of the MOEIK distribution can be obtained by following the definition of the moments as μ r = 0 x r f ( x ; α , β , λ ) dx = λ α β i , j = 0 w i , j 0 x r ( 1 + x ) - α - 1 ( 1 - ( 1 + x ) - α ) β j + β - 1 dx

(12)
μ r = λ β i , j , k = 0 π i , j , k B 1 - r α , β ( j + 1 ) for α > r where π i , j , k = Γ ( 2 + i ) ( 1 - λ ) i ( - 1 ) j + k + r Γ ( 2 ) Γ ( i + 1 ) i j r k under the conditions i j and r k .

The cumulants ( k s ) of the MOEIK distribution are attained from above expression as k s = μ s - k = 1 s - 1 s - 1 k - 1 k k μ s - k , μ s = k = 0 s s k ( - 1 ) k μ 1 k μ s - k where μ 1 = k . Thus, k 2 = μ 2 - ( μ 1 ) 2 , k 3 = μ 3 - 3 μ 2 μ 1 + 2 ( μ 1 ) 3 , k 4 = μ 4 - 4 μ 3 μ 1 - 3 ( μ 2 ) 2 + 12 μ 2 ( μ 1 ) 2 - 6 ( μ 1 ) 4

The coefficient of skewness and the coefficient of kurtosis for the MOEIK distribution can also obtained from the third and the fourth standardized cumulants by using formulae γ 1 = k 3 k 4 3 / 2 and γ 2 = k 4 k 2 2 respectively. Table 1 represents the behavior of raw moments, skewness and kurtosis for MOEIK distribution.

Table 1 Moments, skewness and Kurtosis for X at different parameter combination.
Moments→ μ 1 μ 2 μ 3 μ 4 Skewness Kurtosis
λ α = 5 β = 1.5
0.25 0.1567 0.0755 0.0985 0.3760 0.4097 65.961
0.75 0.2826 0.1896 0.2825 1.1204 0.7920 31.164
1.50 0.3974 0.3274 0.5398 2.2225 1.2115 20.731
2.50 0.5026 0.4808 0.8609 3.6718 1.6616 15.885
4.50 0.6478 0.7334 1.4549 6.5181 2.3928 12.118
7.50 0.7962 1.0410 2.2696 10.692 3.2844 9.8651
β α = 5 λ = 2.5
0.25 0.1643 0.1050 0.1564 0.6219 0.5890 56.385
0.75 0.3418 0.2746 0.4504 1.8523 1.1005 24.559
1.50 0.5026 0.4807 0.8609 3.6718 1.6616 15.886
2.50 0.6459 0.7087 1.3711 6.0602 2.2654 12.066
4.50 0.8370 1.0801 2.3096 10.741 3.2496 9.2058
7.50 1.0266 1.5265 3.5879 17.585 4.4521 7.5467
α β = 0.5 λ = 2.5
4.25 0.3281 0.3213 0.8082 9.3416 2.2608 90.457
4.75 0.2836 0.2267 0.4076 2.1659 1.1475 42.108
5.50 0.2356 0.1477 0.1859 0.5050 0.5427 23.146
6.50 0.1922 0.0935 0.0841 0.1374 0.2608 15.702
7.50 0.1623 0.0645 0.0451 0.0527 0.1486 12.661
9.50 0.1237 0.0360 0.0174 0.0129 0.0639 10.005

3.2

3.2 Incomplete moments

Researchers have general interest to compute the rth incomplete moments of life-time models. The incomplete moments are mostly utilized to explain the Bonferroni and Lorenz curves while it also helps to evaluate the mean deviation. Let X follows the MOEIK distribution and its rth incomplete moment could be evaluated from φ ( x ) = 0 x v r f ( v ) dv = λ α β i , j = 0 w i , j 0 x v r ( 1 + v ) - α - 1 ( 1 - ( 1 + v ) - α ) β j + β - 1 dv Let ( 1 + v ) - α = z , v = z - 1 α - 1 and dv = - 1 α z - 1 α - 1 dz = ck β r i , j , k = 0 w i , j ( - 1 ) r + k r k ( 1 + x ) - α 1 z 1 - r α - 1 ( 1 - z ) β j + β - 1 dz

(13)
φ ( x ) = ck β r i , j = 0 w i , j , k B 1 - r α , β ( j + 1 ) , ( 1 + x ) - α for α > r where w i , j , k = i , j , k = 0 w i , j ( - 1 ) r + k r k and r k

3.3

3.3 Mean deviation

A most appropriate tool used to evaluate the average absolute deviation of the observation from their mean is mean deviation. We can find the mean deviation about the mean and median from the following expressions given respectively.

(14)
δ μ ( y ) = E ( | Y - μ 1 | ) = 0 β | y - μ 1 | f ( y ) dy = 2 μ 1 F ( μ 1 ) - 2 φ 1 ( μ 1 )
(15)
δ M ( y ) = E ( | y - M | ) = 0 β | y - M | f ( y ) dy = μ 1 - 2 φ 1 ( M )
where μ 1 = E ( Y ) can be obtained from Eq. (11), F ( μ 1 ) can be calculated from Eq. (6) and φ 1 ( μ 1 ) is the first incomplete moment computed from Eq. (13).

Mean deviation is used in numerous important fields such as economics, demography, insurance, medicines and reliability.

3.4

3.4 Mode

For symmetrical distributions such as the standard normal, t-distribution and Cauchy distribution, the mean, median and mode are same say 0. But for the asymmetrical distribution we have to follow the general method with necessary and sufficient conditions to find the mode. The mode of the MOEIK distribution can be obtained by taking the first derivative and equate it to zero.

f ( x ) = 0 .

The density function of the MOEIK in Eq. (6) is given as f ( x ) = λ α β ( 1 + x ) - α - 1 ( 1 - ( 1 + x ) - α ) β - 1 [ 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ] 2

Following the definition of mode f ( x ) = λ α 2 β ( 1 + x ) - 2 α - 2 ( 1 - ( 1 + x ) - α ) β - 2 ( β - 1 ) ( 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ) + 2 β ( 1 - ( 1 + x ) - α ) β { 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) } 3 = 0

Since ( 1 + x ) - 2 α - 2 > 0 , ( 1 - ( 1 + x ) - α ) β - 2 > 0 and [ 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ] 3 > 0 , α , β , θ > 0 for all x > 0 . Therefore, ( β - 1 ) ( 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ) + 2 β ( 1 - ( 1 + x ) - α ) β = 0

(16)
x = 1 - λ ( β + 1 ) ( β - 1 ) ( 1 - λ ) 1 β - 1 α - 1 is the mode of the MOEIK distribution.

3.5

3.5 Rényi entropy

Entropy is generally belongs to statistical mechanics, more specifically microscopic variables. Therefore, more precisely we can say that Entropy is the measure of disorder in a microscopic system. A common measure of entropy is the Rényi entropy and has much importance in many fields such as statistical inference, classification, problems identification in statistics, econometrics and pattern recognition in computer sciences. The given theorem provides expression for Rényi entropy.

Theorem 3

If the random variable X is defined as Eq. (6), then the Rényi entropy is given under the necessary condition that δ > 0 and δ 1

(17)
I R ( δ ) = log λ 1 - δ - log α + log β 1 - δ + 1 1 - δ log i , j = 0 φ i , j B δ + δ α + 1 α , β ( δ + j ) - δ + 1

Proof:

If r.v X follows MOEIK distribution then Rényi entropy is defined as I R ( δ ) = 1 1 - δ log [ I ( δ ) ] where I ( δ ) = 0 f δ ( x ) dx I ( δ ) = 0 λ δ α δ β δ ( 1 + x ) - α δ - δ ( 1 - ( 1 + x ) - α ) β δ - δ [ 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ] - 2 δ dx

After simplification, the expression becomes I ( δ ) = λ δ α δ β δ i , j = 0 φ i , j 0 ( 1 + x ) - α δ - δ ( 1 - ( 1 + x ) - α ) β j + β δ - δ dx where φ i , j = Γ ( i + 2 δ ) Γ ( 2 δ ) Γ ( i + 1 ) ( 1 - λ ) i ( - 1 ) j i j I ( δ ) = λ δ α δ - 1 β δ i , j = 0 φ i , j 0 1 z δ + δ α + 1 α - 1 ( 1 + z ) β ( δ + j ) - δ + 1 - 1 dy I ( δ ) = λ δ α δ - 1 β δ i , j = 0 φ i , j B δ + δ α + 1 α , β ( δ + j ) - δ + 1

Substitution completes the proof. □

3.6

3.6 Order statistics

Let the random variables and its ordered values are denoted as X ( 1 ) , X ( 2 ) , X ( 3 ) , , X ( n ) . The probability density function (pdf) of order statistics is obtained using the function f s : n ( x ) = n ! ( s - 1 ) ! ( n - s ) ! [ F ( x ) ] s - 1 [ 1 - F ( x ) ] n - s f ( x )

The density of the nth ordered statistics follows the MOEIK distribution is f s : n ( x ) = n ! ( s - 1 ) ! ( n - s ) ! ( 1 - ( 1 + x ) - α ) β 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) s - 1 1 - ( 1 - ( 1 + x ) - α ) β 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) n - s λ α β ( 1 + x ) - α - 1 ( 1 - ( 1 + x ) - α ) β - 1 [ 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ] 2

(18)
f s : n ( x ) = n ! λ α β ( 1 + x ) - α - 1 ( s - 1 ) ! ( n - s ) ! i , j , k = 0 ϕ i , j , k ( 1 - ( 1 + x ) - α ) β ( s + i + k ) - 1 where ϕ i , j , k = n - s i Γ ( s + i + j + 1 ) Γ ( s + i + 1 ) Γ ( j + 1 ) ( 1 - λ ) j ( - 1 ) i + k j k

The rth moment of order statistic is obtained as E ( X s : n r ) = n ! λ α β ( s - 1 ) ! ( n - s ) ! i , j , k = 0 ϕ i , j , k 0 x r ( 1 + x ) - α - 1 ( 1 - ( 1 + x ) - α ) β ( s + i + k ) - 1 dx

On simplifications, we get = n ! λ β ( s - 1 ) ! ( n - s ) ! i , j , k , m = 0 ψ i , j , k , m 0 1 z 1 - m α - 1 ( 1 - z ) β ( s + i + k ) - 1 dz where ψ i , j , k , m = ϕ i , j , k m = 0 ( - 1 ) m + r r m

The expressions for rth moment of order statistic becomes

(19)
E ( X s : n r ) = n ! λ β ( s - 1 ) ! ( n - s ) ! i , j , k , m = 0 ψ i , j , k , m B 1 - m α , β ( s + i + k ) for α > m

3.7

3.7 Maximum likelihood estimation

This study adopts the maximum likelihood estimation method so that it is mostly used and provides the maximum information about the properties of the estimated parameters. Moreover, the normal approximation of these estimators can frankly be managed systematically and mathematically for large sample theory. Consequently, the ML estimation has adopted to estimate the unknown parameters ( α , β and λ ) of the MOEIK distribution.

Let random variable X belong to the observed distribution and have vector of the parameters ( α , β , λ ) T with size n. The sample likelihood function is achieved as i = 1 n f ( x ; α , β , λ ) = α n β n λ n i = 1 n ( 1 + x ) - α - 1 ( 1 - ( 1 + x ) - α ) β - 1 [ 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ] 2

The log-likelihood function is

(20)
L = n log α + n log β + n log λ - ( α + 1 ) log ( 1 + x ) + ( β - 1 ) log [ 1 - ( 1 + x ) - α ] - 2 log [ 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ]

Now we have to maximize the log-likelihood function given in Eq. (20) to get the ML estimates of unknown parameters of the Marshall-Olkin extended Kumaraswamy distribution. For this purpose, we take the first derivative of the log-likelihood equation with respect to parameters and equate to zero respectively.

(21)
L α = n α - log ( 1 + x ) + ( β - 1 ) log ( 1 + x ) ( 1 + x ) - α 1 - ( 1 + x ) - α - 2 β ( 1 - λ ) log ( 1 + x ) ( 1 + x ) - α ( 1 - ( 1 + x ) - α ) β - 1 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β )
(22)
L β = n β + log ( 1 - ( 1 + x ) - α ) - 2 ( 1 - λ ) log ( 1 - ( 1 + x ) - α ) ( 1 - ( 1 + x ) - α ) β 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β )
(23)
L λ = n λ - 2 1 - ( 1 - ( 1 + x ) - α ) β 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β )

The exact solution of the derived ML estimator for unknown parameters in Eqs. (21)–(23) is genuinely not possible. Therefore, it is more appropriate to use the nonlinear optimization algorithms such as a Newton-Raphson algorithm for maximizing the likelihood function numerically. We can use R (optimal function or maxBFGS function), or MATHEMATICA (Maximize function). After application of large sample property of the ML Estimates, MLE θ ̂ can be treated as being approximately normal with the mean θ and the variance-covariance matrix equal to the inverse of the expected information matrix, i.e. n ( θ ̂ - θ ) N ( 0 , nI - 1 ( θ ) ) , I ( θ ) is the information matrix then its inverse of matrix is I - 1 ( θ ) provide the variances and covariance’s. The I ( θ ̂ ) variance-covariance matrix is actually equal to the inverse of the expected information matrix I - 1 ( θ ̂ ) is given as

(24)
I ( θ ̂ ) = J α α J α β J α λ J β α J β β J β λ J λ α J λ β J λ λ

The elements of the variance-covariance information matrix are given in Appendix.

4

4 Simulation

To inspect the performance of the Marshall-Olkin extended inverted Kumaraswamy distribution, we conduct a simulation study by using the Monte Carlos simulation method with 10,000 repetitions on the basis of the bias and the mean square error of the estimated parameters from the maximum likelihood estimation method. The simulation is done as follows:

  • Data is generated from F ( x ) = u , where u is uniformly distributed (0, 1).

  • ( 1.25 , 2.50 , 0.75 and ( 2.5 , 3 , 1.5 ) are taken as the true parameter values ( α , β and lambda ) .

  • Simulation is conducted for the sample sizes n = 10 , 50 , 100 and 200 .

  • The repetition of the experiment is 10,000 times for each sample size.

Table 2 represents the outcomes of the Monte Carlos simulation study. We evaluate the mean of estimated parameters, mean square errors (MSE) and biases. These findings based on the expected first order asymptotic theory as the bias and the MSE’s decreases toward zero with the increase in sample size.

Table 2 Summaries of the estimates for the MOEIK distribution.
True parameters Sample size Parameters Summaries of Parameters
α β λ Mean Bias MSE
1.25 2.50 0.75 10 α 1.625 0.375 111.6
β 2.828 0.328 1.330
λ 0.777 0.027 0.026
50 α 1.305 0.055 0.085
β 2.556 0.056 0.162
λ 0.755 0.005 0.004
100 α 1.276 0.026 0.036
β 2.532 0.032 0.078
λ 0.753 0.003 0.002
200 α 1.262 0.012 0.017
β 2.515 0.015 0.037
λ 0.751 0.001 0.001
2.5 3 1.5 10 α 6.065 3.565 709.1
β 3.264 0.264 1.131
λ 1.556 0.056 0.108
50 α 2.665 0.165 0.601
β 3.047 0.047 0.157
λ 1.510 0.010 0.016
100 α 2.582 0.082 0.236
β 3.025 0.025 0.079
λ 1.505 0.005 0.008
200 α 2.536 0.036 0.098
β 3.013 0.013 0.038
λ 1.503 0.003 0.004

The observations in Table 2 direct the MSE of the ML estimators of α , β and λ decreases and their biases decay towards 0 as sample size increases. While the increase in shape parameters, MSE of estimated parameters increases.

5

5 Applications

This section represents the potentiality and flexibility of the new model as compared to some other existing life-time models by using some real-life examples.

  1. The first real-life data was originally reported by Hinkley (1977). Data consists of 30 observations of the March precipitation (in inches) in Minneapolis/St Paul. The observed values are

0.77 1.74 0.81 1.20 1.95 1.20 0.47 1.43 3.37 2.20 3.00 3.09
1.51 2.10 0.52 1.62 1.31 0.32 0.59 0.81 2.81 1.87 1.18 1.35
4.75 2.48 0.96 1.89 0.90 2.05
  1. The second data also consists of 30 values for the failure time of repairable objects used by Murthy et al. (2004). The observed values are

1.43 0.11 0.71 0.77 2.63 1.49 3.46 2.46 0.59 0.74 1.23 0.94
4.36 0.40 1.74 4.73 2.23 0.45 0.70 1.06 1.46 0.30 1.82 2.37
0.63 1.23 1.24 1.97 1.86 1.17
  1. The third application is initially used by Bhaumik et al. (2009). The observed values

5.1 1.2 1.3 0.6 0.5 2.4 0.5 1.1 8.0 0.8 0.4 0.6
0.9 0.4 2.0 0.5 5.3 3.2 2.7 2.9 2.5 2.3 1.0 0.2
0.1 0.1 1.8 0.9 2.0 4.0 6.8 1.2 0.4 0.2
represents vinyl chloride used for monitoring wells in mg/L. Table 3 provides the discriptive statistics for all given data sets.
Table 3 Descriptive statistics for Data 1, Data 2 and Data 3.
Min. Q1 Median Q3 Mean Max.
Data 1 0.320 0.915 1.470 2.088 1.675 4.750
Data 2 0.110 0.717 1.235 1.943 1.543 4.730
Data 3 0.100 0.500 1.150 2.475 1.879 8.000

By using these data sets, we have made comparison for the new generated Marshall-Olkin extended inverted Kumaraswamy (MOEIK) distribution with the Generalized Inverted Kumaraswamy (GIK), Transmuted Exponentiated Inverse Rayleigh (TEIR), Logistic Weibull (LW), Transmuted Power Lindley (TPL), Marshall Olkin Frechet (MOFr) and Inverted Kumaraswamy (IK) distribution. We use numerous goodness of fit measures to compare the new developed model with other existing models such as the log likelihood function (−2l), Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), Consistent Akaike Information Criterion (CAIC), Hanna-Quinn Information Criterion (HQIC), Kolmogrov-Smirnov (K-S), Anderson Darling (A) and Cramer-von Mises (W). The pdf of other existing models are stated below.

  • Generalized Inverted Kumaraswamy Distribution (Iqbal et al., 2017) f ( x ) = α β γ x γ - 1 ( 1 + x γ ) - ( α + 1 ) [ 1 - ( 1 + x γ ) - α ] β - 1 where α > 0 , β > 0 , γ > 0 are shape parameters.

  • Transmuted Exponentiated Inverse Rayleigh Distribution (Haq, 2016) f ( x ) = 2 α θ x 3 e - α θ x 2 1 + λ - λ e α θ x 2 where α > 0 is shape parameter, β > 0 is scale parameter and λ 1 is transmuted parameter.

  • Logistic Weibull Distribution (Tahir et al., 2016) f ( x ) = λ α β λ x - ( λ α + 1 ) ( 1 + ( β x α ) - λ ) - 2 where α > 0 , β > 0 , λ > 0 are shape parameters.

  • Transmuted Power Lindley Distribution (Granzotto et al., 2016) f ( x ) = α θ 2 ( 1 + θ ) ( 1 + x α ) x α - 1 e - θ x α 1 + λ - 2 λ 1 - 1 + θ x α 1 + θ e - θ x α where α > 0 , β > 0 and λ 1 .

  • Marshall Olkin Fréchet Distribution (Krishna et al., 2013) f ( x ) = α β σ β x - ( β + 1 ) e - ( σ x ) β α + ( 1 - α ) e - ( σ x ) β 2 where α > 0 , β > 0 are positive shape parameters while σ > 0 is scale parameter.

Tables 4–6 provide the estimated parameters and goodness of fit measures of three real life data sets. It is obvious from the tables that the goodness of fit measures such as AIC, BIC, CAIC, HQIC, K-S, A and W of the new developed MOEIK distribution are less than that of the GIK, TEIR, LW and IK distribution, therefore, considered as a best fitted model. Figs. 2, 3 and 4 show the histogram of data with estimated pdf curves (left side) and also explain the estimated and the empirical cdf curves (right side). It is clear from the above tables and figures that the MOEIK distribution provides better fit as compared to other existing models.

Table 4 ML estimates for parameters with the goodness of fit measures (March precipitation).
Models Estimates (Std. Error) −2l AIC BIC CAIC HQIC K-S A* W*
MOEIK ( α , β , λ ) 4.3228 (0.9387) 6.5798 (5.0017) 6.9226 (9.6043) 76.69 82.69 86.89 84.79 84.03 0.068 0.137 0.019
GIK ( α , β , γ ) 1.9552 (1.8117) 3.9501 (5.6951) 1.4202 (1.0211) 78.63 84.63 88.84 86.77 85.98 0.110 0.318 0.052
TEIR ( α , θ , λ ) 6.5630 (80.152) 0.0958 (1.1693) −0.6700 (0.2661) 84.20 90.20 94.41 92.30 91.55 0.182 1.139 0.212
LW ( α , β , λ ) 2.7709 (243.47) 0.3635 (32.322) 1.0061 (88.405) 77.86 83.86 88.06 85.96 85.21 0.074 0.191 0.025
TPL ( α , θ , λ ) 1.5965 (0.2054) 0.4801 (0.1643) 0.5812 (0.6548) 77.30 83.31 89.05 86.18 86.19 0.091 0.146 0.029
MOFr ( α , β , σ ) 42.598 (2.8490) 2.6975 (0.4482) 0.3548 (0.2109) 77.60 83.60 87.80 85.70 84.94 0.095 0.224 0.024
IK ( α , β ) 2.9872 (0.4730) 8.5899 (3.1222) 78.85 83.85 87.65 85.25 84.74 0.114 0.338 0.055

These bold values highlight the model which have better fit as compared to competitor models.

Table 5 ML estimates of the parameters with the goodness of fit measures (failure time data).
Models Estimates (Std. Error) −2l AIC BIC CAIC HQIC K-S A* W*
MOEIK ( α , β , λ ) 3.8031 (0.8166) 2.1861 (2.0393) 9.6185 (14.059) 79.60 85.60 89.80 87.70 86.94 0.078 0.119 0.017
GIK ( α , β , γ ) 1.1623 (1.0228) 1.4462 (1.3918) 1.8481 (1.1813) 81.64 87.64 91.84 89.74 88.98 0.109 0.284 0.048
TEIR ( α , θ , λ ) 7.2940 (25.055) 0.0254 (0.0871) −0.8803 (0.1142) 117.0 123.0 127.2 125.1 124.3 0.376 1.366 1.471
LW ( α , β , λ ) 2.8927 (293.52) 0.5531 (33.237) 0.7775 (78.895) 80.83 86.83 91.03 88.93 88.17 0.083 0.177 0.023
TPL ( α , θ , λ ) 1.3380 (0.1743) 0.6301 (0.1927) 0.5356 (0.5587) 80.73 86.73 90.93 88.83 88.07 0.089 0.180 0.025
MOFr ( α , β , σ ) 63.165 (6.7032) 2.1070 (0.3087) 0.1669 (0.0608) 81.53 87.53 91.73 89.63 88.87 0.104 0.283 0.043
IK ( α , β ) 2.4609 (0.4214) 4.1716 (1.2783) 82.48 86.48 90.28 88.88 87.37 0.111 0.340 0.052

These bold values highlight the model which have better fit as compared to competitor models.

Table 6 ML estimates of the parameters with the goodness of fit measures (vinyl chloride data).
Models Estimates (Std. Error) −2l AIC BIC CAIC HQIC K-S A* W*
MOEIK ( α , β , λ ) 2.1193 (0.5717) 1.7026 (0.8020) 2.2471 (2.3570) 110.9 116.9 121.5 119.2 118.5 0.082 0.233 0.035
GIK ( α , β , γ ) 2.0236 (1.8839) 2.6369 (3.8684) 0.8858 (0.6642) 111.5 117.5 122.1 119.8 119.1 0.095 0.281 0.039
TEIR ( α , θ , λ ) 6.9784 (12.604) 0.0138 (0.0247) −0.7798 (0.1350) 170.8 176.8 181.4 179.1 178.3 0.429 18.01 2.294
LW ( α , β , λ ) 2.7709 (193.53) 0.7056 (17.189) 0.5526 (38.601) 111.9 117.9 122.5 120.2 119.5 0.085 6.327 0.633
TPL ( α , θ , λ ) 0.9265 (0.1156) 0.7453 (0.2414) 0.4094 (0.5958) 111.2 117.2 122.5 120.5 119.7 0.091 0.280 0.048
MOFr ( α , β , σ ) 29.053 (2.3853) 1.4730 (0.2397) 0.1124 (0.1098) 111.4 117.4 121.9 119.7 119.3 0.089 0.289 0.043
IK ( α , β ) 1.7409 (0.3238) 2.1058 (0.5373) 112.5 117.5 122.6 120.1 119.6 0.096 0.279 0.039

These bold values highlight the model which have better fit as compared to competitor models.

The fitted densities (left) and estimated cdf (right) on Data set 1.
Fig. 2
The fitted densities (left) and estimated cdf (right) on Data set 1.
The fitted densities (left) and estimated cdf (right) on Data set 2.
Fig. 3
The fitted densities (left) and estimated cdf (right) on Data set 2.
The fitted densities (left) and estimated cdf (right) on Data set 3.
Fig. 4
The fitted densities (left) and estimated cdf (right) on Data set 3.

6

6 Conclusion

The study reveals a generalization of the Inverted Kumaraswamy distribution and elaborates explicit expression for some of its major properties. The study also explains the behavior of the estimated parameters by using the Monte Carlos simulation approach. Three real-life applications have also presented for the demonstration of enhanced flexibility and better fit of the observed model as compared to the other considered existing models.

References

  1. , . Burr type III distribution: properties and estimation. Egypt. Stat. J.. 1999;43:102-116.
    [Google Scholar]
  2. , , , . Inverted Kumaraswamy distribution: properties and estimation. Pak. J. Stat.. 2017;33(1)
    [Google Scholar]
  3. , , , . Testing parameters of a gamma distribution for small samples. Technometrics. 2009;51(3):326-334.
    [Google Scholar]
  4. , , . On the maximum likelihood and least-squares estimation in the Inverse Weibull distributions. Stat. Appl.. 1990;2(1):53-66.
    [Google Scholar]
  5. , . Inverted exponential distribution as a life distribution model from a Bayesian viewpoint. Data Sci. J.. 2007;6:107-113.
    [Google Scholar]
  6. , . Compound gamma, beta and F distributions. Metrika. 1970;16(1):27-31.
    [Google Scholar]
  7. , , , . Statistical inference for Kumaraswamy distribution based on generalized order statistics with applications. Br. J. Math. Comput. Sci.. 2014;4(12):1710.
    [Google Scholar]
  8. , , . The inverse Gaussian distribution and its statistical application – a review. J. R. Stat. Soc. Ser. B (Methodological) 1978:263-289.
    [Google Scholar]
  9. , , , , . Grain yield reliability analysis with crop water demand uncertainty. Stochastic Environ. Res. Risk Assess.. 2006;20(4):259-277.
    [Google Scholar]
  10. , , , . Statistical study of monthly rainfall trends by using the transmuted power Lindley distribution. Int. J. Stat. Prob.. 2016;6(1):111.
    [Google Scholar]
  11. , . Transmuted exponentiated inverse Rayleigh distribution. J. Stat. Appl. Prob.. 2016;5(2):337-343.
    [Google Scholar]
  12. , , , , . The Marshall-Olkin length-biased exponential distribution and its applications. J. King Saud Univ. Sci. 2017 (Published online)
    [Google Scholar]
  13. , . On quick choice of power transformation. Appl. Stat. 1977:67-69.
    [Google Scholar]
  14. , , , , , . Generalized Inverted Kumaraswamy distribution: properties and application. Open J. Stat.. 2017;7(04):645.
    [Google Scholar]
  15. , . Kumaraswamy’s distribution: a beta-type distribution with some tractability advantages. Stat. Methodol.. 2009;6(1):70-81.
    [Google Scholar]
  16. , , , . Applications of Marshall-Olkin Fréchet distribution. Commun. Stat. Simul. Comput.. 2013;42(1):76-89.
    [Google Scholar]
  17. , . A generalized probability density function for double-bounded random processes. J. Hydrol.. 1980;46(1–2):79-88.
    [Google Scholar]
  18. , , , . The exponentiated Kumaraswamy distribution and its log-transform. Braz. J. Prob. Stat.. 2013;27(1):31-53.
    [Google Scholar]
  19. , , . A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika. 1997;84(3):641-652.
    [Google Scholar]
  20. , , , . Weibull Models. Vol vol. 505. John Wiley & Sons; .
  21. , . On the distribution of Kumaraswamy. J. Hydrol.. 2008;348(3):568-569.
    [Google Scholar]
  22. , , . Acceptance sampling based on the inverse Rayleigh distribution. Econ. Qual. Control. 2005;20(2):277-286.
    [Google Scholar]
  23. , , , . Maximization of manufacturing yield of systems with arbitrary distributions of component values. Ann. Oper. Res.. 2000;99(1):373-383.
    [Google Scholar]
  24. , , , , . The inverse Lindley distribution: a stress-strength reliability model with application to head and neck cancer data. J. Ind. Prod. Eng.. 2015;32(3):162-173.
    [Google Scholar]
  25. , , , . Bayesian analysis of the Kumaraswamy distribution under failure censoring sampling scheme. Int. J. Adv. Sci. Technol.. 2013;51:39-58.
    [Google Scholar]
  26. , , , , , . The Logistic-X family of distributions and its applications. Commun. Stat. Theory Methods. 2016;45(24):7326-7349.
    [Google Scholar]
  27. , . On the inverse Rayleigh distributed random variable. Rep. Stat. Appl. Res. JUSE. 1972;19:13-21.
    [Google Scholar]

Appendix

J α α = - n α 2 - ( β - 1 ) Log [ 1 + x ] 2 ( 1 + x ) - 2 α ( 1 - ( 1 + x ) - α ) 2 + Log [ 1 + x ] 2 ( 1 + x ) - α 1 - ( 1 + x ) - α + 2 β 2 ( 1 - λ ) 2 Log [ 1 + x ] 2 ( 1 + x ) - 2 α ( 1 - ( 1 + x ) - α ) 2 β - 2 ( 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ) 2 - β ( β - 1 ) ( 1 - λ ) Log [ 1 + x ] 2 ( 1 + x ) - 2 α ( 1 - ( 1 + x ) - α ) β - 2 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) + β ( 1 - λ ) Log [ 1 + x ] 2 ( 1 + x ) - α ( 1 - ( 1 + x ) - α ) β - 1 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) J α β = Log [ 1 + x ] ( 1 + x ) - α 1 - ( 1 + x ) - α - 2 - β ( 1 - λ ) 2 Log [ 1 + x ] Log [ 1 - ( 1 + x ) - α ] ( 1 + x ) - α ( 1 - ( 1 + x ) - α ) 2 β - 1 ( 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ) 2 + ( 1 - λ ) Log [ 1 + x ] ( 1 + x ) - α ( 1 - ( 1 + x ) - α ) β - 1 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) + β ( 1 - λ ) Log [ 1 + x ] Log [ 1 - ( 1 + x ) - α ] ( 1 + x ) - α ( 1 - ( 1 + x ) - α ) - 1 + β 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) J α λ = 2 β ( 1 - λ ) Log [ 1 + x ] ( 1 + x ) - α ( 1 - ( 1 + x ) - α ) β - 1 ( 1 - ( 1 - ( 1 + x ) - α ) β ) ( 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ) 2 + β Log [ 1 + x ] ( 1 + x ) - α ( 1 - ( 1 + x ) - α ) β - 1 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) J β β = - n β 2 + 2 ( 1 - λ ) 2 Log [ 1 - ( 1 + x ) - α ] 2 ( 1 - ( 1 + x ) - α ) 2 β ( 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ) 2 - ( 1 - λ ) Log [ 1 - ( 1 + x ) - α ] 2 ( 1 - ( 1 + x ) - α ) β 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) J β λ = 2 ( 1 - λ ) Log [ 1 - ( 1 + x ) - α ] ( 1 - ( 1 + x ) - α ) β ( 1 - ( 1 - ( 1 + x ) - α ) β ) ( 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ) 2 + Log [ 1 - ( 1 + x ) - α ] ( 1 - ( 1 + x ) - α ) β 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) J λ λ = - n λ 2 + 2 ( 1 - ( 1 - ( 1 + x ) - α ) β ) 2 ( 1 - ( 1 - λ ) ( 1 - ( 1 - ( 1 + x ) - α ) β ) ) 2

Show Sections