Translate this page into:
Enhanced estimation of population mean in the presence of auxiliary information
⁎Corresponding author. mirfan@zju.edu.cn (Muhammad Irfan)
-
Received: ,
Accepted: ,
This article was originally published by Elsevier and was migrated to Scientific Scholar after the change of Publisher.
Peer review under responsibility of King Saud University.
Abstract
In this article, we propose a general family of exponential-type estimators for enhanced estimation of population mean in simple random sampling. These estimators are based on the available parameters of the auxiliary variable such as coefficient of skewness, coefficient of kurtosis, standard deviation and coefficient of variation etc. Expressions for bias, mean squared error and minimum mean squared error of the proposed family are derived up to first degree of approximation. Five natural populations are considered to assess the performance of the proposed estimators. Numerical findings confirm that the proposed estimators dominate over the existing estimators such as sample mean, ratio, regression, Singh et al. (2008, 2009), Upadhyaya et al. (2011), Yadav and Kadilar (2013) and Kadilar (2016) in terms of mean squared error.
Keywords
Auxiliary variable
Bias
Efficiency
Exponential type estimators
Mean squared error
Simple random sampling
1 Introduction
Survey sampling is broadly used in agriculture, business management, demography, economics, education, engineering, industry, medical sciences, political science, social sciences and many others. The main objective of sample survey theory is to make inferences about the unknown population parameters like population total, population proportion, population mean or population variance etc. One of the hottest issues in survey sampling is to enhanced the efficiency of ratio, product and regression type estimators in the presence of known auxiliary information for estimating the unknown population parameters of the study variable under different sampling techniques. Ratio method of estimation proposed by Cochran (1940) is at its best than the usual estimators of mean/total when the correlation between study variable and auxiliary variable is positively high and regression line of study variable on auxiliary variable is linear and passes through or nearly through the origin. In many real situations, auxiliary variables are highly correlated with the study variable for instance, person’s age and duration of sleep, income and expenditure, person’s age and blood pressure etc. Product method of estimation was first suggested by Robson (1957) and rediscovered by Murthy (1964). It is preferably used when the correlation between study variable and auxiliary variable is negatively high. Regression method of estimation proposed by Watson (1937) is the appropriate choice when the regression line is linear and passes through a point away from the origin.
Many authors have suggested estimators for the estimation of population mean based on auxiliary information in simple random sampling without replacement (SRSWOR). Readers may refer to Singh and Tailor (2003), Kadilar and Cingi (2004, 2006a, 2006b, 2006c), Koyuncu and Kadilar (2009), Singh et al. (2009), Yan and Tian (2010), Upadhyaya et al. (2011), Yadav and Kadilar (2013), Khan et al. (2015), Kadilar (2016) and the references cited therein.
The enthusiasm behind this article is to propose an improved general family of exponential-type estimators for the estimation of finite population mean under SRSWOR. A brief introduction of some conventional and exponential-type estimators for population mean is provided in section 2. In section 3, we proposed a general exponential-type family of estimators for estimating finite population mean under SRSWOR scheme and derived their properties up to first order of approximation. An empirical study using five real data sets is performed to compare the proposed and existing estimators in terms of MSE in section 4. Finally, concluding remarks are addressed in the last section.
2 Existing estimators
Consider a sample of size is selected from a population of size subject to the constraint under SRSWOR. Let pair of observations ( for the study and auxiliary variable respectively. Let and be the population means of the study and auxiliary variables respectively. Similarly, and be the respective sample means of the study and auxiliary variables.
To drive the expressions for bias and mean squared error (MSE) of the existing and proposed estimators, we consider such that where
The conventional unbiased estimator without utilizing auxiliary information is defined by
Variance/MSE of
is given by
The conventional ratio estimator proposed by Cochran (1940) is defined as
Expressions for the bias and MSE of the
estimator, up to first degree of approximation, are respectively given by
Exponential ratio type estimator initiated by Bahl and Tuteja (1991) of population mean is as below
Bias and MSE of the
estimator, up to first order of approximation, are respectively given as
Singh et al. (2008) suggested a ratio-product exponential type estimator for population mean given as
The optimal value of
Minimum MSE of
estimator, up to first degree of approximation, as follows
Motivated by Khoshnevisan et al. (2007), Singh et al. (2009) proposed an exponential family of estimators for
as
Following are the bias and MSE up to first degree of approximation
Some members of the η- family of estimators are given in Table 1. Furthermore, Singh et al. (2009) proposed a generalized estimator by combining the estimator η1 and
as follows
Estimators
Sample Mean
Bahl and Tuteja (1991)
1
Singh et al. (2009)
1
1
1
The optimal value of
Minimum MSE of
estimator is equal to the MSE of the regression estimator
Upadhyaya et al. (2011) proposed ratio exponential type estimator as
MSE of
estimator, up to first degree of approximation, as follows
Yadav and Kadilar (2013) suggested an improved exponential family of estimators for population mean
by utilizing Singh et al. (2009) as
Estimators
k
Bahl and Tuteja (1991)
Yadav and Kadilar (2013)
1
1
k
Following are the expressions of bias and MSE of the
- family of estimators
Partially differentiating Eq. (20) w.r.t.
and setting
, the optimal value of
is given by
Substituting Eq. (21) in Eq. (20), we get minimum MSE of the
-family of estimators as
Kadilar (2016) suggested a modified exponential type estimator for population mean as
The optimal value of
Minimum MSE of
estimator, up to first degree of approximation, as follows
Minimum MSE of
estimators, up to first order of approximation is exactly equal to the variance of the usual regression estimator
Regression estimator suggested by Watson (1937) is given by
3 The suggested family of estimators
In this section, general exponential-type estimators for estimating finite population mean
under SRSWOR is proposed. Some members of proposed family are given in Table 3. Expressions for the bias, MSE and minimum MSE are obtained to the first degree of approximation.
Estimators
where are the appropriate constants to be determined such the MSE of is minimum, and are either real numbers or functions of the known parameters such as coefficient of variation , standard deviation , coefficient of skewness coefficient of kurtosis and coefficient of correlation of the population.
3.1 Bias, MSE and minimum MSE of
Eq. (26) can be transformed in terms of
as
Subtracting
on both sides of the Eq. (27) and expanding up to first degree of approximation, we have
Taking expectation on both sides of Eq. (28), we obtained bias of the
-family of estimators to the first degree of approximation, given as
Squaring and taking expectation on both sides of Eq. (28), we get MSE of the proposed
-family of estimators to the first degree of approximation as:
To obtained minimum MSE of the proposed estimators , Eq. (30) is differentiated with respect to the unknown parameters and setting , we obtained the optimal values of as given below and
Minimum MSE of
-family up to first degree of approximation is obtained by substituting the optimal values of
in Eq. (30) and simplifying as
We can also obtain many more exponential-type estimators by putting different choices of in the estimator given in Eq. (26).
4 Empirical study
In this section, we demonstrate the performance of the proposed estimators over simple mean, ratio, regression, Bahl and Tuteja (1991), Singh et al. (2008, 2009), Upadhyaya et al. (2011), Yadav and Kadilar (2013) and Kadilar (2016) estimators in terms of MSE. Five natural populations are considered to access the performance of the proposed estimators. Parameters of all the populations are detailed below in this section.
Population 1 Source: Koyuncu and Kadilar, 2009
Let be the number of teachers and be the number of students in both primary and secondary schools for 923 districts of Turkey in 2007. The summary statistics are shown as follows.
Population 2 Source: Srisodaphol et al., 2015
Let be the entire height in feet and be the diameter in centimeters of a breast height of conifer (Pinus palustric) trees. The statistic values are shown as follows.
Population 3 Source: Cochran, 1977
Let be the population size in 1930 and be the population size in 1920. The statistic values are shown as follows.
Population 4 Source: Kadilar and Cingi, 2004
Let be the number of teachers and be the number of students in both primary and secondary schools of Turkey in 2007. The summary statistics are shown as follows.
Population 5 Source: Kadilar and Cingi, 2003
Let be the apple production quantity and be the count of apple trees in 854 villages in Turkey. The summary statistics are shown as follows.
We computed MSE of all the proposed and competing estimators including simple mean , ratio , Bahl and Tuteja (1991) , regression Singh et al. (2008), Singh et al. (2009) Upadhyaya et al. (2011) , Yadav and Kadilar (2013) , and Kadilar (2016) . Findings are presented in Tables 4–9. It is clearly observed from Tables 4–9 that
-
our proposed estimators have minimum MSEs as compared to all other estimators for all data sets.
-
in each data set, MSEs of all proposed estimators are almost equal which indicates that all proposed estimators are equally efficient. So, researcher may be used any one of them depending upon the availability of the values of and .
Population | ||||
---|---|---|---|---|
1 | 2515.074 | 267.644 | 651.042 | 224.625 |
2 | 9.562 | 3.093 | 2.348 | 1.691 |
3 | 448.563 | 18.362 | 109.665 | 16.230 |
4 | 5,411,348 | 2,542,740 | 3,758,095 | 1,409,115 |
5 | 35,205,782 | 8,096,858 | 17,380,652 | 6,625,693 |
- family | MSE | - family | MSE | - family | MSE |
---|---|---|---|---|---|
2515.0740 | 651.0423 | 223.7713 | |||
651.0423 | 649.9361 | 223.7714 | |||
652.8800 | 651.7722 | 223.7713 | |||
651.2254 | 650.1190 | 223.7726 | |||
651.1360 | 650.0297 | 223.7737 | |||
651.0520 | 650.0342 | 223.7833 | |||
652.0282 | 650.9211 | 223.7767 | |||
651.0925 | 649.9863 | 223.7713 | |||
651.2341 | 650.1278 | 223.7716 | |||
651.0473 | 649.9411 | 223.7714 | |||
652.9680 | 651.8601 | 223.7719 |
- family | MSE | - family | MSE | - family | MSE |
---|---|---|---|---|---|
9.5618 | 2.3479 | 1.6534 | |||
2.3479 | 2.3312 | 1.6565 | |||
2.4578 | 2.4422 | 1.6535 | |||
2.4148 | 2.3987 | 1.6583 | |||
2.4038 | 2.3877 | 1.6586 | |||
2.3854 | 2.3935 | 1.6597 | |||
2.4492 | 2.4335 | 1.6574 | |||
2.3995 | 2.3833 | 1.6535 | |||
2.4216 | 2.4057 | 1.6562 | |||
2.3793 | 2.3629 | 1.6551 | |||
2.4691 | 2.4535 | 1.6565 |
- family | MSE | - family | MSE | - family | MSE |
---|---|---|---|---|---|
448.5631 | 109.6657 | 16.1653 | |||
109.6657 | 109.4151 | 16.1696 | |||
120.1576 | 119.8725 | 16.1658 | |||
111.7679 | 111.5113 | 16.1853 | |||
111.7048 | 111.4484 | 16.1855 | |||
110.0760 | 111.4860 | 16.2048 | |||
120.0335 | 119.7489 | 16.1859 | |||
111.6803 | 111.4239 | 16.1655 | |||
111.8068 | 111.5501 | 16.1746 | |||
110.0636 | 109.8119 | 16.1672 | |||
120.3483 | 120.0624 | 16.1747 |
- family | MSE | - family | MSE | - family | MSE |
---|---|---|---|---|---|
5,411,348 | 3,758,095 | 1,246,075 | |||
3,758,095 | 2,423,766 | 1,246,075 | |||
3,759,902 | 2,424,194 | 1,246,075 | |||
3,758,205 | 2,423,792 | 1,246,075 | |||
3,758,140 | 2,423,777 | 1,246,075 | |||
3,758,098 | 2,423,779 | 1,246,070 | |||
3,758,956 | 2,423,970 | 1,246,073 | |||
3,758,117 | 2,423,771 | 1,246,075 | |||
3,758,223 | 2,423,797 | 1,246,075 | |||
3,758,097 | 2,423,767 | 1,246,075 | |||
3,760,195 | 2,424,264 | 1,246,075 |
- family | MSE | - family | MSE | - family | MSE |
---|---|---|---|---|---|
35,205,782 | 17,380,652 | 6,427,017 | |||
17,380,652 | 15,693,026 | 6,427,018 | |||
17,385,543 | 15,696,853 | 6,427,017 | |||
17,381,067 | 15,693,351 | 6,427,033 | |||
17,380,820 | 15,693,158 | 6,427,053 | |||
17,380,668 | 15,693,173 | 6,427,246 | |||
17,382,855 | 15,694,750 | 6,427,116 | |||
17,380,728 | 15,693,085 | 6,427,017 | |||
17,381,113 | 15,693,387 | 6,427,020 | |||
17,380,658 | 15,693,031 | 6,427,017 | |||
17,386,080 | 15,697,273 | 6,427,024 |
5 Concluding remarks
In this article, we proposed a general family of exponential-type estimators for finite population mean under simple random sampling. Known information of auxiliary variable including coefficient of skewness, coefficient of kurtosis, standard deviation and coefficient of variation etc. is utilized to enhance the estimation. We derived the expressions for bias, mean squared error (MSE) and minimum MSE of the proposed estimators up to first degree of approximation. A numerical study is conducted through five real data sets to highlight the applicability of proposed estimators. This study concludes that all the proposed estimators are i) more efficient over the competing estimators and ii) are equally efficient, so anyone of them can be used depending upon the availability of the auxiliary information ( and ). Therefore, the practitioners are suggested to use the proposed estimators to get more efficient results in future applications.
References
- Ratio and product type exponential estimator. Inf. Optim. Sci.. 1991;XII(1):159-163.
- [Google Scholar]
- The estimation of the yields of cereal experiments by sampling for the ratio gain to total produce. J. Agric. Sci.. 1940;30:262-275.
- [Google Scholar]
- Sampling Techniques (3 edn). New York: John Wiley and Sons; 1977.
- A new exponential type estimator for the population mean in simple random sampling. J. Mod. Appl. Stat. Methods. 2016;15(2):207-214.
- [Google Scholar]
- Ratio estimators in stratified random sampling. Biometrical J.. 2003;45(2):218-225.
- [Google Scholar]
- An improvement in estimating the population mean by using the correlation coefficient. Hacettepe J. Math. Stat.. 2006;35(1):103-109.
- [Google Scholar]
- Improvement in estimating the population mean in simple random sampling. Appl. Math. Lett.. 2006;19:75-79.
- [Google Scholar]
- A class of transformed efficient ratio estimators of finite population mean. Pakistan J. Statist.. 2015;31(4):353-362.
- [Google Scholar]
- A general family of estimators for estimating population mean using known value of some population parameter(s) Far East J. Theor. Stat.. 2007;22(2):181-191.
- [Google Scholar]
- Efficient estimators for the population mean. Hacettepe J. Math. Statist.. 2009;38(2):217-225.
- [Google Scholar]
- Application of multivariate polykays to the theory of unbiased ratio type estimation. J. Am. Stat. Assoc.. 1957;52:411-422.
- [Google Scholar]
- On linear combination of ratio and product type exponential estimator for estimating the finite population mean. Statist. Transition. 2008;9(1):105-115.
- [Google Scholar]
- Improvement in estimating the population mean using exponential estimator in simple random sampling. Bull. Statist. Econ.. 2009;3:13-18.
- [Google Scholar]
- Use of known correlation coefficient in estimating the finite population means. Statist. Transition. 2003;6(4):555-560.
- [Google Scholar]
- New ratio estimators of a population mean using one auxiliary variable in simple random sampling. Chiang Mai J. Sci.. 2015;42(2):523-527.
- [Google Scholar]
- Improved ratio and product exponential type estimators. J. Statist. Theory Pract.. 2011;5(2):285-302.
- [Google Scholar]
- Ratio method to the mean estimation using coefficient of skewness of auxiliary variable. ICICA. Part II, CCIS. 2010;106:103-110.
- [Google Scholar]
- Efficient family of exponential estimators for the population mean. Hacettepe J. Math. Statist.. 2013;42(6):671-677.
- [Google Scholar]