7.2
CiteScore
3.7
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
ABUNDANCE ESTIMATION IN AN ARID ENVIRONMENT
Case Study
Correspondence
Corrigendum
Editorial
Full Length Article
Invited review
Letter to the Editor
Original Article
Retraction notice
REVIEW
Review Article
SHORT COMMUNICATION
Short review
7.2
CiteScore
3.7
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
ABUNDANCE ESTIMATION IN AN ARID ENVIRONMENT
Case Study
Correspondence
Corrigendum
Editorial
Full Length Article
Invited review
Letter to the Editor
Original Article
Retraction notice
REVIEW
Review Article
SHORT COMMUNICATION
Short review
View/Download PDF

Translate this page into:

Original article
01 2021
:34;
101714
doi:
10.1016/j.jksus.2021.101714

A new flexible extension of the Lindley distribution with applications

Department of Statistics and Operations Research, College of Science, King Saud University, Riyadh, Saudi Arabia

⁎Corresponding author. drkayid@ksu.edu.sa (Mohamed Kayid)

Disclaimer:
This article was originally published by Elsevier and was migrated to Scientific Scholar after the change of Publisher.

Abstract

A new general family of distributions based on the Lindley model was introduced. Some properties of the proposed model, including moments, quantiles, and order statistics are presented. In addition, some reliability measures of this model were investigated. The parameters were estimated using the moments and the maximum likelihood methods for complete and right-censored data. Then, the behavior of the maximum likelihood estimator was investigated in a simulation study. Finally, the model was applied to the analysis of two data sets to demonstrate its usefulness.

Keywords

62N01
62N05
Mixture models
Gamma distribution
Failure rate
Mean residual life
Mean inactivity time
PubMed
1

1 Introduction

In recent years, there have been many papers dealing with the Lindley distribution and its applications; see Lindley (1958), Ghitany et al. (2008), Bakouch et al. (2012), Al-Mutairi et al. (2013) and Cakmakyapan and Kadilar (2017). Many authors have introduced some generalizations and/or extensions to the Lindley model by increasing either the number of underlying parameters or the number of mixed density functions, see for example (Ghitany et al., 2011, Al-Babtain et al., 2014, Abouammoh et al., 2015, Cordeiro et al., 2018, Abouammohm et al., 2020). The main objective of these works is to introduce more flexible probability distributions to model different types of lifetime variables in real applications. The Lindley model is defined by its probability density function (PDF) as f ( x ) = θ 2 1 + θ ( 1 + x ) e - θ x , θ > 0 , x 0 , which is a mixture of the gamma distribution with shape parameter 1 and scale parameter θ , G ( 1 , θ ) , and G ( 2 , θ ) with weights p = θ θ + 1 and 1 - p , respectively. The cumulative distribution function (CDF) of the Lindley model is: F ( x ) = 1 - 1 + θ + θ x 1 + θ e - θ x , θ > 0 , x 0 .

In connection with the generalization of the Lindley distribution, Shanker (2016a,b) introduced the Aradhana distribution with the following PDF and CDF, respectively: f ( x ) = θ 3 θ 2 + 2 θ + 2 ( 1 + x ) 2 e - θ x , θ > 0 , x 0 , and F ( x ) = 1 - 2 + 2 θ + 2 θ x + θ 2 + 2 θ 2 x + θ 2 x 2 θ 2 + 2 θ + 2 e - θ x , θ > 0 , x 0 .

This version of the Aradhana model was used to fit the lifetime data of 20 patients receiving an analgesic reported in Gross and Clark (1976) to relief times (in minutes) and a dataset representing aircraft window thickness cited in Fuller et al. (1994). In addition, Welday and Shanker (2018) have provided a generalization of this model to the two-parameter Aradhana distribution with the PDF:

(1)
f ( x ) = θ 3 θ 2 + 2 α θ + 2 α 2 ( 1 + α x ) 2 e - θ x , α 0 , θ > 0 , x 0 , and the corresponding CDF F ( x ) = 1 - 2 α 2 + 2 α θ + 2 α 2 θ x + θ 2 + 2 α θ 2 x + α 2 θ 2 x 2 θ 2 + 2 α θ + 2 α 2 e - θ x , α 0 , θ > 0 , x 0 .

Moreover, Shanker and Shukla (2018) introduced the power Aradhana model whose PDF is given by the PDF of X 1 α , namely f ( x ) = α θ 3 θ 2 + 2 θ + 2 x α - 1 ( 1 + 2 x α + x 2 α ) e - θ x α , α 0 , θ > 0 , x 0 , and the CDF F ( x ) = 1 - 1 + θ x α ( θ x α + 2 θ + 2 ) θ 2 + 2 θ + 2 e - θ x α , α 0 , θ > 0 , x 0 .

They used this model to analyze the tensile strength measured in GPA of 69 carbon fibers tested in tension at a length of 20 mm (Bader and Priest, 1982).

Most data sets are mixtures of multiple populations, and usually no information is available to determine the associated subpopulation of each data point. For example, the lifetime of a device may be recorded without regard to manufacturer or date of production, or some measurements of humans may be reported without regard to geographic location or blood type. When the measured characteristics depend on data that are not available (manufacturer, production date, geographic location, or blood type), the data are said to be mixed. It is not easy to find data sets that are not mixed in some way, since in almost every case some relative covariates are not observed. There are many applications and statistical frameworks in which mixture models occur. For detailed discussions, see Titterington et al. (1985), Lindsay (1995) and Ord (1972).

The above models are mixtures of two or three gamma distributions with different shape parameters. However, in many situations, the data may come from more than two or three sub-populations. For example, a device may be manufactured by more than three factories in a company, etc. Therefore, it is better to use a mixture model with an adjustable number of underlying sub-models. The aim of this paper is to investigate such a flexible model, which also generalizes the previous models.

In this paper, we present in Section 2 a new generalized model that incorporates the above and many other models. Section 3 presents the main statistical and reliability properties of this new model. Parameter estimation for complete and right-censored data is discussed in Section 4. In Section 5, a simulation study was conducted to investigate the behavior of the MLE. Finally, the proposed model and some competing models were fitted to two datasets to show their usefulness.

2

2 The Abouammoh-Alrasheedi model

One can include most of the previously mentioned models in a general family from which many more special cases can be derived and studied. Statistical and reliability properties, estimation of the underlying parameters, and fitting the derived model to real data will make the models available in the literature even richer with more flexible manipulations. We now give the following definition.

Definition 1

The random variable X is said to have Abouammoh-Alrasheedi distribution with parameters ( m , α , θ ) , denoted by AA ( m , α , θ ) if its PDF be of the form

(2)
f ( x ) = θ m i = 1 m Γ ( m ) Γ ( i ) α m - i θ i - 1 ( 1 + θ x ) m - 1 e - θ x , m > 0 , α > 0 , θ > 0 , x 0 .

One can, without difficulty, verify that this is a PDF, i.e., 0 f ( x ) dx = 1 . For integer m, it can be written as a mixture of gamma distributions, so by Theorem 3 of Atienza et al. (2006), it is identifiable. More precisely f ( x ) = i = 1 m w i f i ( x ) , x 0 , where

(3)
w i = 1 Γ ( i ) ( θ α ) i j = 1 m 1 Γ ( j ) ( θ α ) j , and f i ( x ) = θ m - i + 1 Γ ( m - i + 1 ) x m - i e - θ x , x > 0 , is the PDF of the gamma distribution G ( m - i + 1 , θ ) .
Lemma 1

The constant coefficient of the PDF (2) can be written as the following form: c = θ m i = 1 m Γ ( m ) Γ ( i ) α m - i θ i - 1 = e - θ α θ m α m - 1 Γ ( θ α , m ) .

Proof

Since f ( x ) is a PDF, we have 0 ( 1 + α x ) m - 1 e - θ x dx = 1 c .

On the other hand, by straightforward algebra we have 0 ( 1 + α x ) m - 1 e - θ x dx = e θ α θ m α m - 1 Γ ( θ α , m ) , which shows the result immediately. □

Some of special cases of the AA ( m , α , θ ) are listed in the following.

  • For m = 1 and α = 1 , it reduces to the exponential distribution.

  • For m = 2 and α = 1 it gives the Lindley distribution.

  • For m = 3 and α = 1 , AA ( m , α , θ ) reduces to Aradhana distribution which is a mixture of G ( 1 , θ ) , G ( 2 , θ ) and G ( 3 , θ ) with weights θ 2 θ 2 + 2 θ + 2 , 2 θ θ 2 + 2 θ + 2 and 2 θ 2 + 2 θ + 2 , see Shanker (2016a,b).

  • For α = 1 , and m 3 , AA ( m , α , θ ) becomes a generalized Aradhana distribution and is also a generalization of the Lindley distribution.

3

3 Statistical and reliability properties

Now, we derive the main statistical and reliability properties of the proposed distribution. The cumulative distribution function of AA ( m , α , θ ) is

(4)
F ( x ) = 1 - Γ ( θ α + θ x , m ) Γ ( θ α , m ) , where Γ ( t , s ) = t y s - 1 e - y dy is the incomplete upper gamma function.
Proposition 1

The k th moments of AA ( m , α , θ ) model is

(5)
E ( X k ) = 1 Γ ( θ α , m ) j = 0 k ( 1 θ ) j ( - 1 α ) k - j k j Γ ( θ α , m + j ) .

Proof

The k th moments of a distribution with the reliability function R equals to

(6)
E ( X k ) = 0 kt k - 1 R ( t ) dt .

Thus, for AA ( m , α , θ ) we have

(7)
E ( X k ) = 1 Γ θ α , m 0 kt k - 1 Γ ( θ α + θ t , m ) dt .

Now, we have

(8)
0 kt k - 1 Γ ( θ α + θ t , m ) dt = 0 θ α + θ t kt k - 1 y m - 1 e - y dydt = θ α 0 y θ - 1 α kt k - 1 y m - 1 e - y dtdy = θ α y m - 1 e - y ( y θ - 1 α ) k dy = j = 0 k ( 1 θ ) j ( - 1 α ) k - j k j Γ ( θ α , m + j ) , and the result follows by (7) and (8). □

By the fact that AA ( m , α , θ ) model can be written as a mixture of the corresponding moments of the underlying gamma distributions, we have another representation of the k th moments

(9)
E ( X k ) = i = 1 m w i Γ ( k + m - i + 1 ) θ k Γ ( m - i + 1 ) , where w i is defined in (3).

By straightforward algebra the moment generating function of AA ( m , α , θ ) can be simplified as the following form.

(10)
M ( t ) = E ( e tX ) = Γ ( θ - t α , m ) Γ ( θ α , m ) ( θ θ - t ) m e - t α .

The quantile function is of the form

(11)
q ( p ) = 1 θ ( Γ - 1 ( p ¯ Γ ( θ α , m ) , m ) - θ α ) , 0 < p < 1 , where p ¯ = 1 - p and Γ - 1 ( a , m ) shows the inverse of the incomplete upper gamma function at a.

The Lorenz curve provides a graphical representation of the wealth distribution and is defined to be L ( p ) = 0 q ( p ) xdF ( x ) 0 q ( 1 ) xdF ( x ) .

It can be checked easily that for AA ( m , α , θ ) it can be simplified as follows. L ( p ) = 1 θ Γ ( θ α , m + 1 ) - Γ ( θ α + θ q ( p ) , m + 1 ) - 1 α Γ ( θ α , m ) - Γ ( θ α + θ q ( p ) , m ) 1 θ Γ ( θ α , m + 1 ) - 1 α Γ ( θ α , m ) = p α ( m Γ ( θ α , m ) + ( θ α ) m e - θ α - u m e - u ) - θ Γ ( θ α , m ) α ( m Γ ( θ α , m ) + ( θ α ) m e - θ α ) - θ Γ ( θ α , m ) , where u = Γ - 1 ( p ¯ Γ ( θ α , m ) , m ) .

Let X 1 , X 2 , , X n represents a sample of AA ( m , α , θ ) . The PDF of k th order statistics, X ( k ) , is f k : n ( x ) = k n k e - θ α θ m α m - 1 Γ ( θ α , m ) 1 - Γ ( θ α + θ x , m ) Γ ( θ α , m ) k - 1 Γ ( θ α + θ x , m ) Γ ( θ α , m ) n - k ( 1 + θ x ) m - 1 e - θ x .

Thus the PDF of series and parallel systems with such identical components reduces to f 1 : n ( x ) = n e - θ α θ m α m - 1 Γ ( θ α , m ) Γ ( θ α + θ x , m ) Γ ( θ α , m ) n - 1 ( 1 + θ x ) m - 1 e - θ x , and f n : n ( x ) = n e - θ α θ m α m - 1 Γ ( θ α , m ) 1 - Γ ( θ α + θ x , m ) Γ ( θ α , m ) n - 1 ( 1 + θ x ) m - 1 e - θ x . respectively.

3.1

3.1 Reliability measures

The failure rate function is an important measure in reliability theory and survival analysis. Assuming that an event has not yet occurred by time x, it represents the instantaneous risk of occurrence at time x. For AA ( m , α , θ ) , the failure rate function is:

(12)
h ( x ) = f ( x ) R ( x ) = e - θ α θ m ( 1 + α x ) m - 1 e - θ x α m - 1 Γ ( θ α + θ x , m ) , where R ( x ) = 1 - F ( x ) shows the reliability function. For a lifetime random variable X, the mean residual life (MRL) function is defined to be
(13)
m ( x ) = E ( X - x | X x ) = x R ( t ) dt R ( x ) ,
and measures the mean of the remaining lifetime given survival up to time x.
Proposition 2

The MRL function of AA ( m , α , θ ) distribution is of the form m ( x ) = 1 θ m + ( θ α + θ x ) m e - ( θ α + θ x ) Γ ( θ α + θ x , m ) - 1 α - x , x 0 .

Proof

We can write

(14)
x R ( t ) dt = 1 Γ ( θ α , m ) x Γ ( θ α + θ t , m ) dt .

The integral in the right side of (14) can be simplified as the following.

(15)
x Γ ( θ α + θ t , m ) dt = x θ α + θ t y m - 1 e - y dydt = θ α + θ x x y θ - 1 α y m - 1 e - y dtdy = θ α + θ x y θ - 1 α - x y m - 1 e - y dy = 1 θ Γ ( θ α + θ x , m + 1 ) - ( 1 α + x ) Γ ( θ α + θ x , m ) .

Applying ()()()(13)–(15) the result follows. □

The p-QRL function, denoted by q p ( x ) , is the conditional pth quantile of the remaining life of an object provided that it is still alive at x, precisely q p ( x ) = R - 1 ( p ¯ R ( x ) ) - x = F - 1 ( 1 - p ¯ R ( x ) ) - x , x > 0 , where p ¯ = 1 - p . For AA ( m , α , θ ) , by (11) we have q p ( x ) = 1 θ Γ - 1 p ¯ Γ ( θ α + θ x , m ) , m - 1 α - x , x > 0 .

Figs. 1–4 show the PDF, failure rate function, MRL function, and median residual life function, respectively. The failure rate shows an increasing shape and the MRL and median residual life functions show a decreasing shape. The MRL shows larger values than the median residual lifetime, indicating that the conditional residual lifetime is skewed to the right.

The PDF of AA ( m , α , θ ) for some values of parameters.
Fig. 1
The PDF of AA ( m , α , θ ) for some values of parameters.
The failure rate function of AA ( m , α , θ ) for some values of parameters.
Fig. 2
The failure rate function of AA ( m , α , θ ) for some values of parameters.
The mean residual life of AA ( m , α , θ ) for some values of parameters.
Fig. 3
The mean residual life of AA ( m , α , θ ) for some values of parameters.
The median residual life of AA ( m , α , θ ) for some values of parameters.
Fig. 4
The median residual life of AA ( m , α , θ ) for some values of parameters.

The mean inactivity time (MIT) at time x , m ( x ) , represents the mean of elapsed time at x given the event has been occurred before x, mathematically, m ( x ) = E ( x - X | X < x ) = 1 F ( x ) 0 x F ( z ) dz , x 0 .

Proposition 3

The MIT function of AA ( m , α , θ ) distribution is of the form

(16)
m ( x ) = ( x + 1 α - 1 θ ) - 1 θ e - θ α ( θ α ) m - ( θ α + θ x ) m e - θ x Γ ( θ α , m ) - Γ ( θ α + θ x , m ) , x 0 .

Proof

We have

(17)
0 x F ( z ) dz = 1 Γ ( θ α , m ) 0 x Γ ( θ α , m ) - Γ ( θ α + θ z , m ) dz = 1 Γ ( θ α , m ) x Γ ( θ α , m ) - 0 x Γ ( θ α + θ z , m ) dz .

Now, we can simplify the integral in the last expression as follows.

(18)
0 x Γ ( θ α + θ z , m ) dz = 0 x θ α + θ z t m - 1 e - t dtdz = θ α θ α + θ x 0 t θ - 1 α t m - 1 e - t dzdt + θ α + θ x 0 x t m - 1 e - t dzdt = 1 θ Γ ( θ α , m + 1 ) - Γ ( θ α + θ x , m + 1 ) - 1 α Γ ( θ α , m ) - Γ ( θ α + θ x , m ) + x Γ ( θ α + θ x , m ) .

On the other hand

(19)
Γ ( θ α , m + 1 ) - Γ ( θ α + θ x , m + 1 ) = m Γ ( θ α , m ) + ( θ α ) m e - θ α - m Γ ( θ α + θ x , m ) - ( θ α + θ x ) m e - ( θ α + θ x ) .

Then, the results follows by (17), (18) and (19). □

The quantile inactivity time is an alternative for MIT and represents, at time x, the quantile of the elapsed time given the event occurred before x. For a distribution F, it is defined by:

(20)
q p ( x ) = x - F - 1 ( p ¯ F ( x ) ) , x 0 . and for AA ( m , α , θ ) can be written as
(21)
q p ( x ) = x - 1 θ Γ - 1 p Γ θ α , m + p ¯ Γ θ α + θ x , m , m + 1 α , x 0 .

Fig. 5 shows the MIT and the median inactivity time ( q 0.5 ( x ) ) of AA ( m , α , θ ) for some parameters and shows increasing and convex forms. The greater values of the MIT indicate that the conditional distribution of the elapsed time is skewed to right.

Left: The MIT of AA ( m , α , θ ) for some values of the parameters. Right: The median inactivity time of this distribution for the selected values of the parameters.
Fig. 5
Left: The MIT of AA ( m , α , θ ) for some values of the parameters. Right: The median inactivity time of this distribution for the selected values of the parameters.

4

4 Estimation of the parameters

Let m be known. By the moments method and applying (9), we can estimate ( α , θ ) by minimizing the following expression. Δ ( α , θ ) = ( X - E ( X ) ) 2 + ( X 2 ¯ - E ( X 2 ) ) 2 .

So, the moments estimator is ( α ̂ , θ ̂ ) = arg min α , θ Δ ( α , θ ) .

The moment estimates can be used as initial values for maximizing the log-likelihood function and finding the maximum likelihood estimator (MLE).

Let m be known and x 1 , x 2 , , x n represents a realization from AA ( m , α , θ ) , the log-likelihood function is l ( α , θ ; x ) = - n θ α + nm ln θ - n ( m - 1 ) ln α - n ln Γ ( θ α , m ) + i = 1 n ( m - 1 ) ln ( 1 + α x i ) - θ i = 1 n x i .

Also, the likelihood equations are as follows: l α = n θ α 2 - n m - 1 α - n ( θ α ) m e - θ α α Γ ( θ α , m ) + ( m - 1 ) i = 1 n x i 1 + α x i = 0 , and l θ = - n α + nm θ + n ( θ α ) m - 1 e - θ α α Γ ( θ α , m ) - i = 1 n x i = 0 .

The MLE can be calculated by maximizing the log-likelihood function or by solving the likelihood equations. If m is not known, which is usually the case, we can estimate the parameters for a range of m values and then choose the best model based on the Kolmogorov–Smirnov (K-S) statistic or other criteria.

The Fisher information matrix for the AA ( m , α , θ ) is of the form K = E - 2 l α 2 E - 2 l α θ E - 2 l θ α E - 2 l θ 2 . where Let l = ln f ( X ) . Let X i , i = 1 , 2 , , n stands for a sample from AA ( m , α 0 , θ 0 ) . Then, the MLE, ( α ̂ , θ ̂ ) , converges in distribution to bivariate normal N ( ( α 0 , θ 0 ) , n - 1 K - 1 ) in which K - 1 is the inverse of the information matrix.

Suppose that events are subject to random right censorship. A random event X i is said to be right-censored if the only information about the event is that it is greater than the random censoring variable C i , i.e. X i > C i . So, the observations of a right censored random sample consist of T i = min ( X i , C i ) and d i , where d i = 1 if the event is not censored, X i C i , and d i = 0 if the event is censored, X i > C i . Let we have one right censored sample ( t i , d i ) , i = 1 , 2 , , n . Then, the log-likelihood function is l ( α , θ ; t , d ) = i = 1 n d i ln f ( t i ) + i = 1 n ( 1 - d i ) ln R ( t i ) . in which f and R are the PDF and the reliability functions of the AA ( m , α , θ ) respectively. It is easy to check that the log-likelihood function simplifies to l ( α , θ ; t , d ) = n ln Γ θ α , m + i = 1 n d i - θ α + m ln θ - ( m - 1 ) ln α + ( m - 1 ) ln ( 1 + α t i ) - θ t i + i = 1 n ( 1 - d i ) ln Γ θ α + θ t i , m .

5

5 Simulation

Given that AA ( m , α , θ ) is a mixture of gamma distributions, we can extract a random sample of size n from this model as described in the following steps:

  1. Simulate one random instance of multinomial distribution with parameters n , w 1 , w 2 ,…, w m , where w i is defined by (3). Let the simulated instance be ( n 1 , n 2 , , n m ) which will satisfy n 1 + n 2 + + n m = n .

  2. For every n i , simulate one random sample of the gamma distribution G ( m - i + 1 , θ ) of size n i . Then combine these samples to provide one sample of size n from AA ( m , α , θ ) .

  3. Here, the degenerate random variable with mean t has been used as the random censorship variable C i . Given the censorship percentage p, we have t = q ( p ¯ ) where q ( ) is defined in (11).

The simulation results have been abstracted in Table 1. We have considered two values p = 0 and 0.2 for censorship rate. Every cell of the table shows the results for one run. In every run, we provide r = 1000 replicates of samples of size n = 80 , 100 . For every sample the MLE, ( α ̂ , θ ̂ ) has been computed. Then the bias (B), absolute bias (AB) and the mean squared error (MSE) for both α and θ have been computed. These measures are defined in the following relations for α . They are defined similarly for θ . B α = 1 r i = 1 r ( α ̂ i - α ) , AB α = 1 r i = 1 r | α ̂ i - α | , and MSE α = 1 r i = 1 r ( α ̂ i - α ) 2 .

Table 1 Simulation results for MLE of the parameters of AA ( m , α , θ ) . Every cell consists of triples B α , AB α , MSE α in the first line and B θ , AB θ , MSE θ in the second line.
p = 0 p = 0.2
n m α , θ B AB MSE B AB MSE
80 3 0.02, 0.05 0.004988 0.013574 0.000373 0.006054 0.018217 0.000646
0.000600 0.008016 0.000104 -0.000659 0.011561 0.000194
0.3, 5 0.983335 1.212043 3.712165 1.175853 1.455283 5.641408
0.848339 1.278958 2.889727 0.980640 1.536448 4.373064
0.05, 2 0.461190 0.499459 0.682545 0.616871 0.658195 1.280510
0.470890 0.579849 0.623839 0.561123 0.705742 0.986987
5 0.02, 0.05 0.001707 0.006268 0.000066 0.001591 0.007357 0.000098
0.000680 0.005271 0.000043 -0.000078 0.006786 0.000075
0.3, 5 0.209712 0.543812 0.572916 0.493983 0.762481 1.184417
0.337317 1.417153 3.323895 0.958836 1.892733 5.972325
0.05, 2 0.206730 0.251102 0.156499 0.320155 0.356562 0.356936
0.501932 0.695099 0.956195 0.737981 0.904887 1.761379
100 3 0.02, 0.05 0.004989 0.012495 0.000293 0.004451 0.016137 0.000483
0.000897 0.007160 0.000081 -0.001198 0.010670 0.000173
0.3, 5 0.867955 1.100889 2.932451 1.059814 1.356333 4.983397
0.770588 1.216026 2.658104 0.877139 1.441000 3.933093
0.05, 2 0.431200 0.466580 0.531628 0.539306 0.582099 1.067997
0.456392 0.558029 0.568953 0.523228 0.652936 0.872355
5 0.02, 0.05 0.001778 0.005664 0.000055 0.001718 0.006716 0.000082
0.000839 0.004859 0.000038 0.000443 0.006135 0.000060
0.3, 5 0.218949 0.535202 0.552218 0.420245 0.700898 1.018933
0.349058 1.399999 3.103568 0.795703 1.808621 5.484757
0.05, 2 0.168868 0.215696 0.126635 0.265858 0.302138 0.246075
0.411721 0.617124 0.793489 0.645761 0.806164 1.394646

Some of the main observations from simulation results are listed in the following.

  • The MLE of the parameters are consistent, i.e., AB and MSE decrease with n.

  • The AB and MSE have larger values for censored data ( p = 0.2 ) rather than uncensored data.

6

6 Applications

Shanker (2016a,b) analyzed a data set consisting of the number of cycles to failure for 25 yarn samples. The data are: 15, 20, 38, 42, 61, 76, 86, 98, 121, 146, 149, 157, 175, 176, 180, 180, 198, 220, 224, 251, 264, 282, 321, 325, 653. The MLE of the parameters of AA ( m , α , θ ) was calculated from m = 1 , 2 , , 20 . For m = 10 , we have the smallest value of the K-S statistic. Thus, the estimated model is AA ( 10 , 0.003368 , 0.022551 ) . The K-S statistic and corresponding p-value are 0.105787 and 0.942375 , respectively. The AIC value, which corresponds to the MLE, is 306.2316 . Fig. 6, left panel, shows the empirical CDF and the CDF of the fitted model, and graphically confirms that the model describes the data well. Also, Fig. 8, left panel, draws the histogram of data and the estimated PDFs.

The empirical distribution function along with fitted AA ( m , α , θ ) .
Fig. 6
The empirical distribution function along with fitted AA ( m , α , θ ) .
The histogram along with some fitted models for the first data set (left) and the second data set (right).
Fig. 8
The histogram along with some fitted models for the first data set (left) and the second data set (right).

The second data set consist of 100 waiting times (in minutes) of customers to be served in a bank: 0.8, 0.8, 1.3, 1.5, 1.8, 1.9, 1.9, 2.1, 2.6, 2.7, 2.9, 3.1, 3.2, 3.3, 3.5, 3.6, 4.0, 4.1, 4.2, 4.2, 4.3, 4.3, 4.4, 4.4, 4.6, 4.7, 4.7, 4.8, 4.9, 4.9, 5.0, 5.3, 5.5, 5.7, 5.7, 6.1, 6.2, 6.2, 6.2, 6.3, 6.7, 6.9, 7.1, 7.1, 7.1, 7.1, 7.4, 7.6, 7.7, 8.0, 8.2, 8.6, 8.6, 8.6, 8.8, 8.8, 8.9, 8.9, 9.5, 9.6, 9.7, 9.8, 10.7, 10.9, 11.0, 11.0, 11.1, 11.2, 11.2, 11.5, 11.9, 12.4, 12.5, 12.9, 13.0, 13.1, 13.3, 13.6, 13.7, 13.9, 14.1, 15.4, 15.4, 17.3, 17.3, 18.1, 18.2, 18.4, 18.9, 19.0, 19.9, 20.6, 21.3, 21.4, 21.9, 23.0, 27.0, 31.6, 33.1, 38.5, see Ghitany et al. (2008) and Shanker (2015).

For m = 1 , 2 , , 20 , the MLEs of the parameters of AA ( m , α , θ ) were calculated. Based on the K-S statistics, the best model among them is AA ( 2 , 1545.077 , 0.202477 ) . The corresponding AIC value is 638.6034 , and the K-S statistics and p-value are 0.042187 and 0.994198 , respectively, indicating a good fit. Fig. 6, right panel, shows the empirical and fitted CDFs and confirms a good fit.

In a comparative analysis, we fitted the Lindley distribution, the Aradhana distribution, and the generalized Aradhana distribution defined by (1) to these data sets. The MLEs of the parameters of the models were estimated and the results, summarized in Table 2, show that the proposed model AA ( m , α , θ ) performs better than the others in both examples. Figs. 7 and 8 show the fitted distributions and PDFs respectively.

Table 2 The results of fitting the Lindley distribution, the Aradhana distribution, the generalized Aradhana distribution and the AA ( m , α , θ ) to data sets.
Data Model θ ̂ α ̂ K-S statistics K-S p-value AIC
The first data set Lindley 0.01115 0.127756 0.762682 307.019
Aradhana 0.016728 0.12346 0.7967 311.0772
Generalized Aradhana 0.013388 0.019097 0.11755 0.8407 308.864
AA ( 10 , α , θ ) 0.022551 0.003368 0.105787 0.942375 306.2316
The second data set Lindley 0.1866 0.067677 0.7495 640.0784
Aradhana 0.27655 0.080136 0.5419 642.343
Generalized Aradhana 0.2597 0.5556 0.060349 0.8596 641.5076
AA ( 2 , α , θ ) 0.202477 1545.077 0.042187 0.991498 638.6034
The empirical distribution function along with some fitted models for the first data set (left) and the second data set (right).
Fig. 7
The empirical distribution function along with some fitted models for the first data set (left) and the second data set (right).

7

7 Conclusion

Because of its applicability and usefulness, the Lindley model and its generalizations have been considered by many authors. Here, a new generalization of the Lindley distribution has been introduced to extend this collection. Some properties of this distribution have been studied. Parameter estimation was discussed using moments and maximum likelihood methods. Simulation studies show that the MLE is efficient and consistent for both complete and right-censored data. The results of fitting the presented model to two real data sets show that it is useful in analyzing data.

Acknowledgments

The authors would like to thank the two anonymous reviewers for their suggestions and very constructive comments that improved the presentation and readability of the paper. This work is supported by Researchers Supporting Project number (RSP-2021/323), King Saud University, Riyadh, Saudi Arabia.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. , , , . A new generalized Lindley distribution. J. Stat. Comput. Simul.. 2015;85:3662-3678.
    [Google Scholar]
  2. , , . A New Flexible Generalized Lindley Model: Properties, Estimation and Applications. Symmetry. 2020;12:1678. URL:  https://www.mdpi.com/2073-8994/12/10/1678
    [Google Scholar]
  3. Al-Babtain A.A., Eid, H.A., A-Hadi N.A., Merovci, F., 2014. The five parameter Lindley distribution. Pakistan J. Stat. 31, 363–384.
  4. , , , . Inferences on stress-strength reliability from Lindley distribution. Commun. Stat. Theory Methods. 2013;42:1443-1463.
    [Google Scholar]
  5. , , , . A new condition for identifiability of finite mixture distributions. Metrika. 2006;63:215-221.
    [CrossRef] [Google Scholar]
  6. Bader, M.G., Priest, A.M., 1982. Statistical aspects of fiber and bundle strength in hybrid composites. In hayashi, T., Kawata, K. Umekawa, S. (Eds), Progress in Science in engineering Composites, ICCM-IV, Tokyo. 1129–1136.
  7. , , , , , . An extended Lindley distribution. J. Korean Stat. Soc.. 2012;41:75-85.
    [Google Scholar]
  8. , , . The Lindley family of distributions: properties an applications. Hacettepe J. Math. Stat.. 2017;46:1113-1137.
    [Google Scholar]
  9. , , , , , . The Lindley Weibull Distribution: properties and applications. Acad. Bras. Cienc.. 2018;90(3):2579-2598. PMID:30304208
    [CrossRef] [Google Scholar]
  10. , , , , , . Fracture mechanics approach to the design of glass aircraft windows: A case study. SPIE Proc.. 1994;2286:419-430.
    [Google Scholar]
  11. , , , . Lidley distribution and its application. Math. Comput. Simul.. 2008;78:493-506.
    [Google Scholar]
  12. , , , , . A two-parameter Lindley distribution and its applications to survival data. Math. Comput. Simul.. 2011;81:1190-1201.
    [Google Scholar]
  13. , , . Survival Distributions: Reliability Applications in the Biometrical Sciences. New York: John Wiley; .
  14. , . Fiducial distributions and Bayes’ theorem. J. R. Stat. Soc. B. 1958;20:102-107.
    [Google Scholar]
  15. , . Mixture models: theory, geometry and applications. In: NSF-CBMS Regional Conference Series in Probability and Statistics. . p. :5.
    [Google Scholar]
  16. , . Families of frequency distributions. London, U.K.: Charles Griffin; .
  17. , . Shanker distribution and its applications. Int. J. Stat. Appl.. 2015;5:338-348.
    [Google Scholar]
  18. , . Aradhana distribution and its applications. Int. J. Stat. Appl.. 2016;6:23-34.
    [Google Scholar]
  19. , . On Generalized Lindley Distribution and Its Applications to Model Lifetime Data from Biomedical Science and Engineering. Insights Biomed.. 2016;1(2)
    [Google Scholar]
  20. , , . A two-parameter power Aradhana distribution with properties and application. Indian J. Ind. Appl. Math.. 2018;9:210-220.
    [Google Scholar]
  21. , , , . Statistical analysis of finite mixture distributions. Chichester, U.K.: John Wiley and Sons; .
  22. , , . A generalized Aradhana distribution with properties and applications. Biometrics Biostat. Int. J.. 2018;7(4):374-385.
    [Google Scholar]
Show Sections