A generalized class of estimators for sensitive variable in the presence of measurement error and non-response under stratified random sampling
⁎Corresponding author. erumzahid22@gmail.com (Erum Zahid)
-
Received: ,
Accepted: ,
This article was originally published by Elsevier and was migrated to Scientific Scholar after the change of Publisher.
Peer review under responsibility of King Saud University.
Abstract
In survey sampling an investigator may be unable to get the complete and correct information at the same time. So non-response and measurement error occur simultaneously and consequently may effect the estimator. Considering this problem, a generalized class of estimators is proposed for estimating the finite population mean for sensitive variable in the presence of measurement error and non-response under stratified random sampling. We conducted a study based on real data set at Quaid-i-Azam University, Islamabad. Simulation and real life data sets are used to observe the performances of the estimators. Bias and MSE values are given for the comparison of estimators.
Keywords
Auxiliary variable
Measurement error
Non-response
Randomized response
Stratified random sampling
1 Introduction
In survey sampling, certain surveys cause some problems for the researchers due to the fact that the respondents are reluctant to discuss sensitive topics such as drug use, abortion, sexually transmitted diseases etc. When surveying on those topics, measurement error and non-response can occur since the respondent may choose, not to respond some specific questions, not to give the accurate answers, or not to take part in the survey. The problem of measurement error is usually ignored during the sensitive surveys and the assumption is made that the information obtained through surveys is free from error. Another important factor in surveys is non-response, which may arises due to refusal of respondents to give the information or not at home or lack of interest due to some sensitive issues. Usually measurement error and non-response are studied separately for the sensitive variable using the known auxiliary or additional information. In reality, when the variable of interest is sensitive, the respondents hesitate to provide the personal information, which give rise to measurement error. In most of the cases, the information is not obtained from all units in surveys, specially when the variable of interest is stigmatizing in nature. Many researchers studied the problem of non-response, including (Hansen and Hurwitz, 1946; Cochran, 1977; Rao, 1986; Khare and Srivastava, 2010; Andridge and Little, 2010; Singh et al., 2011; Khare et al., 2013; Shabbir and Khan, 2013; Shabbir et al., 2018 and Singh and Khalid, 2020). In survey sampling, when the variable under study contains social stigma, then the respondents are not comfortable to provide their personal information. Direct survey on sensitive question increases the relative bias. Warner (1965) introduced the randomized response technique (RRT), which reduces the possible bias and is used to obtain the true information while insuring the privacy of the respondents. For estimation of mean of a sensitive quantitative variable the Randomized Response model (RRM) is extended by Greenberg et al. (1971). Further work is done by Eichhorn and Hayre (1983); Gupta and Shabbir (2004), Kim and Warde (2004); Singh and Mathur (2005), Gjestvang and Singh (2006); Diana and Perri (2010), Gupta et al. (2010); Chaudhuri and Pal (2015), Gupta et al. (2016) and Bouza et al. (2018).
The researchers dealt with the problem of measurement error for estimating the population mean. For more details, see Cochran (1968); Fuller (1995); Shalabh (1997); Biemer et al. (2011); Shukla et al. (2012), etc. Recently few researchers studied the problem of measurement error and non-response together like Kumar et al. (2015); Singh and Sharma (2015); Azeem and Hanif (2017) and Kumar (2016). Zahid and Shabbir (2018); Khalil et al. (2018) and Zahid and Shabbir (2019) have discussed the problem of measurement error and non-response under stratified random sampling.
In practice, the researchers who have studied measurement error, have ignored the presence of non-response and randomized response at the same time. In this study, we have proposed a class of estimators for estimating the population mean of a sensitive variable in the presence of measurement error and non-response simultaneously under stratified random sampling. The efficiency of the suggested class of estimators over the existing estimators is shown through simulation study and real data sets.
Consider a finite population of N identifiable units which are partitioned into L homogeneous subgroups called strata, such that the
Let
Let
In Section 2, some existing estimators of the finite population mean are given. In Section 3, a generalized class of estimators is suggested for estimating the finite population mean by incorporating both measurement error and non-response information simultaneously. Numerical results and simulation study are presented in Section 4. Conclusion is given in Section 5.
2 Existing Estimators in Literature
In this section we consider the following existing estimators.
2.1 Hansen and Hurwitz (1946) Estimator
In stratified random sampling, the Hansen and Hurwitz (1946) estimator for population mean
Here
The variance of
2.2 Ratio Estimator
The usual ratio estimator under stratified random sampling, is given by
The bias and MSE of
2.3 Product Estimator
The product estimator under stratified random sampling, is given by
The bias and MSE of
2.4 Bahl and Tuteja, 1991 Estimator
Bahl and Tuteja, 1991 estimator under stratified random sampling, is given by
The bias and MSE of
2.5 Singh and Kumar, 2010 Estimator
Singh and Kumar, 2010 estimator under stratified random sampling, is given by
The bias and MSE of
2.6 Difference Estimator
The usual difference estimator under stratified random sampling, is given by
The minimum variance of
The optimum value of
2.7 Azeem and Hanif (2017) Estimator
Azeem and Hanif (2016) estimator under stratified random sampling, is given by
The bias and MSE of
3 Proposed Generalized Class of Estimators
We suggest a generalized class of estimators for estimating the finite population mean for a sensitive variable, considering the problem of measurement error and non-response simultaneously in stratified random sampling. Measurement error and non-response are present on both the study variable and the auxiliary variable. The suggested estimator, is given by
Adding
Dividing both sides by
Similarly, we can get
Further
On simplifying, we get
Further simplifying, and ignoring error terms greater than two, we have
Using Eq. (25), the bias and MSE of
For finding the optimal values of
Substituting these optimum values in Eq. (27), we get the minimum MSE of
4 Numerical Results
In this section simulated data and two real data sets are used to show the performance of the generalized class of proposed estimator. The results are given in Tables 1, 2 (simulation) and 5, 6 (real data).
Estimators with Measurement Error | 10% non-response | 20% non-response | ||||
---|---|---|---|---|---|---|
|
|
|||||
2 | 4 | 8 | 2 | 4 | 8 | |
|
0.107125 | 0.128072 | 0.169966 | 0.117775 | 0.160022 | 0.244517 |
|
0.030971 | 0.037839 | 0.051576 | 0.036732 | 0.055122 | 0.091904 |
(0.026764) | (0.034223) | (0.049143) | (0.034202) | 0.056537) | (0.101206) | |
|
0.458469 | 0.555397 | 0.749251 | 0.517367 | 0.732089 | 1.161533 |
(0.087794) | (0.106418) | (0.143665) | (0.098841) | (0.139558) | (0.220992) | |
|
0.034649 | 0.040819 | 0.053159 | 0.037435 | 0.049176 | 0.072660 |
(0.065539) | (0.069173) | (0.076441) | (0.053816) | (0.034004) | (0.005620) | |
|
1.085004 | 1.319813 | 1.789432 | 1.235507 | 1.771322 | 2.842953 |
(0.157364) | (0.193539) | (0.265890) | (0.183396) | (0.271635) | (0.448113) | |
|
0.022166 | 0.026426 | 0.034858 | 0.024615 | 0.033426 | 0.050598 |
|
0.096090 | 0.119132 | 0.165217 | 0.115666 | 0.177860 | 0.302249 |
(0.103052) | (0.124466) | (0.167295) | (0.115001) | (0.160313) | (0.250939) | |
|
0.022067 | 0.026286 | 0.034616 | 0.024492 | 0.033201 | 0.050087 |
(0.018672) | (0.022202) | (0.029160) | (0.020721) | (0.028004) | (0.042072) | |
|
0.022075 | 0.026298 | 0.034638 | 0.024503 | 0.033223 | 0.050141 |
(0.018679) | (0.022213) | (0.029180) | (0.020731) | (0.028023) | (0.042119) | |
|
0.022076 | 0.026299 | 0.034638 | 0.024503 | 0.033223 | 0.050143 |
(0.018679) | (0.022213) | (0.029180) | (0.020731) | (0.028024) | (0.042121) | |
|
0.021993 | 0.026179 | 0.034423 | 0.024394 | 0.033006 | 0.049605 |
(0.018608) | (0.022109) | (0.028993) | (0.020636) | (0.027834) | (0.041651) | |
|
0.022057 | 0.026230 | 0.034607 | 0.024428 | 0.033176 | 0.050079 |
(0.018698) | (0.022240) | (0.029231) | (0.020752) | (0.028070) | (0.042240) | |
Estimators without Measurement Error | 10% non-response | 20% non-response | ||||
|
|
|||||
2 | 4 | 8 | 2 | 4 | 8 | |
|
0.096942 | 0.115829 | 0.153602 | 0.106425 | 0.144277 | 0.219980 |
|
0.004814 | 0.006475 | 0.009796 | 0.007385 | 0.014187 | 0.027792 |
(0.013029) | (0.017786) | (0.027300) | (0.018722) | (0.034866) | (0.067153) | |
|
0.432313 | 0.524032 | 0.707471 | 0.488020 | 0.691154 | 1.097421 |
(0.087794) | (0.106418) | (0.143665) | (0.098841) | (0.139558) | (0.220992) | |
|
0.020473 | 0.023796 | 0.030441 | 0.021586 | 0.027134 | 0.038230 |
(0.141554) | (0.160285) | (0.197747) | (0.139441) | (0.153945) | (0.182954) | |
|
1.010927 | 1.231085 | 1.671403 | 1.152171 | 1.654819 | 2.660115 |
(0.138073) | (0.170463) | (0.235244) | (0.161654) | (0.241207) | (0.400314) | |
|
0.001424 | 0.001860 | 0.002574 | 0.002043 | 0.003129 | 0.004569 |
|
0.049966 | 0.063866 | 0.091666 | 0.063824 | 0.105438 | 0.188667 |
(0.106485) | (0.128576) | (0.172756) | (0.118871) | (0.165731) | (0.259452) | |
|
0.001423 | 0.001858 | 0.002570 | 0.002040 | 0.003124 | 0.004558 |
(0.001219) | (0.001607) | (0.002228) | (0.001801) | (0.002785) | (0.004044) | |
|
0.001424 | 0.001859 | 0.002572 | 0.002042 | 0.003127 | 0.004564 |
(0.001220) | (0.001608) | (0.002230) | (0.001802) | (0.002787) | (0.004050) | |
|
0.001424 | 0.001859 | 0.002572 | 0.002042 | 0.003127 | 0.004564 |
(0.001220) | (0.001608) | (0.00223) | (0.001802) | (0.002787) | (0.004050) | |
|
0.001409 | 0.001837 | 0.002530 | 0.002019 | 0.003079 | 0.004440 |
(0.001207) | (0.001589) | (0.002193) | (0.001783) | (0.002745) | (0.003941) | |
|
0.001422 | 0.001858 | 0.002571 | 0.002040 | 0.003126 | 0.004564 |
(0.001219) | (0.001607) | (0.002229) | (0.001801) | (0.002787) | (0.004051) |
Estimators | 10% non-response | 20% non-response | ||||
---|---|---|---|---|---|---|
|
|
|||||
2 | 4 | 8 | 2 | 4 | 8 | |
|
0.089797 | 0.108185 | 0.144961 | 0.098465 | 0.135429 | 0.209356 |
|
0.013224 | 0.016604 | 0.023364 | 0.014699 | 0.021331 | 0.034594 |
(0.000313) | (0.000378) | (0.001761) | (0.003640) | (0.004433) | (0.006020) | |
|
0.320038 | 0.387672 | 0.522939 | 0.334810 | 0.460273 | 0.711199 |
(0.074086) | (0.089385) | (0.119983) | (0.081194) | (0.111309) | (0.171537) | |
|
0.032302 | 0.038906 | 0.052115 | 0.037509 | 0.052036 | 0.081090 |
(0.180648) | (0.209399) | (0.266902) | (0.235166) | (0.319488) | (0.488131) | |
|
0.703946 | 0.855063 | 1.157297 | 0.723734 | 0.995864 | 1.540122 |
(0.067164) | (0.081340) | (0.109694) | (0.063355) | (0.087213) | (0.134930) | |
|
0.012522 | 0.015889 | 0.022418 | 0.014072 | 0.020597 | 0.033554 |
|
0.027173 | 0.034466 | 0.049053 | 0.024482 | 0.035358 | 0.057109 |
(0.089356) | (0.107659) | (0.144267) | (0.098718) | (0.135291) | (0.208437) | |
|
0.012476 | 0.015816 | 0.022274 | 0.014014 | 0.020474 | 0.033231 |
(0.012576) | (0.015866) | (0.022247) | (0.014418) | (0.020942) | (0.033839) | |
|
0.012477 | 0.015820 | 0.022281 | 0.014016 | 0.020480 | 0.033245 |
(0.012578) | (0.015868) | (0.022253) | (0.014420) | (0.020948) | (0.033853) | |
|
0.012478 | 0.015821 | 0.022282 | 0.014017 | 0.020481 | 0.033246 |
(0.012579) | (0.015869) | (0.022254) | (0.014421) | (0.020949) | (0.033854) | |
|
0.012455 | 0.015786 | 0.022218 | 0.013989 | 0.020426 | 0.033115 |
(0.012556) | (0.015835) | (0.022191) | (0.014392) | (0.020892) | (0.033718) | |
|
0.012474 | 0.015809 | 0.022230 | 0.014013 | 0.020464 | 0.033224 |
(0.012585) | (0.015879) | (0.022273) | (0.014429) | (0.020964) | (0.033895) | |
Estimators without Measurement Error | 10% non-response | 20% non-response | ||||
|
|
|||||
2 | 4 | 8 | 2 | 4 | 8 | |
|
0.079750 | 0.095863 | 0.128091 | 0.087280 | 0.119695 | 0.184526 |
|
0.001954 | 0.002369 | 0.003199 | 0.002038 | 0.002794 | 0.004305 |
(0.001402) | (0.001226) | (0.005081) | (0.007060) | (0.011019) | ||
|
0.308768 | 0.373437 | 0.502774 | 0.322149 | 0.441736 | 0.680909 |
(0.089385) | (0.119983) | (0.081194) | (0.111309) | (0.171537) | ||
|
0.021949 | 0.026106 | 0.034421 | 0.025956 | 0.035602 | 0.054895 |
(0.222047) | (0.288625) | (0.245101) | (0.338267) | (0.524598) | ||
|
0.689010 | 0.835090 | 1.127248 | 0.706644 | 0.968914 | 1.493455 |
(0.066129) | (0.079822) | (0.107208) | (0.062190) | (0.085148) | (0.131064) | |
0.001194 | 0.001576 | 0.002205 | 0.001210 | 0.001686 | 0.002635 | |
|
0.014726 | 0.018444 | 0.025880 | 0.010445 | 0.014281 | 0.021953 |
(0.108009) | (0.144843) | (0.099002) | (0.135796) | (0.209385) | ||
|
0.001190 | 0.001572 | 0.002203 | 0.001205 | 0.001682 | 0.002625 |
(0.001574) | (0.002202) | (0.001256) | (0.001760) | (0.002759) | ||
|
0.001191 | 0.001573 | 0.002204 | 0.001206 | 0.001683 | 0.002626 |
(0.001575) | (0.002203) | (0.001257) | (0.001761) | (0.002760) | ||
|
0.001192 | 0.001574 | 0.002205 | 0.001207 | 0.001684 | 0.002627 |
(0.001576) | (0.002204) | (0.001258) | (0.001762) | (0.002760) | ||
|
0.001184 | 0.001564 | 0.002189 | 0.001200 | 0.001673 | 0.002604 |
(0.001566) | (0.002189) | (0.001251) | (0.001751) | (0.002737) | ||
|
0.001189 | 0.001570 | 0.002202 | 0.001204 | 0.001680 | 0.002622 |
(0.001573) | (0.002200) | (0.001254) | (0.001759) | (0.002760) |
4.1 Simulation Study
We have generated two populations (Population I and II) from normal distribution by using R language program, which are given in Appendix A. The results based on these population are given in Tables 1 and 2.
Tables 1 and 2 show that the generalized class of proposed estimators
In Table 2 the MSE for the generalized proposed estimator, when
Through the simulation study it is concluded that the generalized proposed class of estimators perform better as compared to the all other existing estimators. For 10% non-response rate, the MSE is minimum as compared to 20% of the non-response rate. The MSE also increases as the value of constant
4.2 Application to Real Data Set
In this section we consider two real life data sets for numerical comparisons, Population III is taken from Rosner, 2015, Population IV is obtained by conducting a survey at Quaid-i-Azam University, Islamabad 4.2.1. The results based on these data sets are given in Tables 5 and 6.
Population III. [Source:Rosner, 2015].
Strata I consist of 318 observations and strata II contains 336 observations. The data summary is given in Tables 3 and 4.
Variable | Mean | st.Dev | Min | Med | Max | |
---|---|---|---|---|---|---|
Forced expiratory volume (
|
2.45 | 0.65 | 0.79 | 2.48 | 3.83 | |
Age (
|
9.84 | 2.93 | 3.00 | 10.00 | 19.00 | |
Smoke (
|
0.12 | 0.32 | 0.00 | 0.00 | 1.00 |
Variable | Mean | st.Dev | Min | Med | Max | |
---|---|---|---|---|---|---|
Forced expiratory volume (
|
2.68 | 1.00 | 0.79 | 2.61 | 5.79 | |
Age (
|
10.01 | 2.97 | 3.00 | 10.00 | 19.00 | |
Sex (
|
0.07 | 0.27 | 0.00 | 0.00 | 1.00 |
4.2.1 Data Collection
To see the practical implication of measurement error, we conducted a study based on real data set at Quaid-i-Azam University, Islamabad. We distributed 55 questionnaires to the students of BS Statistics (5th Semester Fall, 2018) and M.Phil Statistics (1st and 2nd Semesters, Fall 2018) of Quaid-i-Azam University, Islamabad. We consider our population of those students who gave the false response, which comes out to be 23. As we already have the true response from their academic record. In question (i) we asked for
Population IV. [Source: Section 4.2.1].
Let, Y: Age of BS
Tables 5 and 6 show that the generalized class of proposed estimators
Estimators | 10% non-response | 20% non-response | ||||
---|---|---|---|---|---|---|
|
|
|||||
2 | 4 | 8 | 2 | 4 | 8 | |
|
0.009864 | 0.011976 | 0.016201 | 0.012941 | 0.017808 | 0.027542 |
|
0.009932 | 0.012202 | 0.016742 | 0.009053 | 0.012444 | 0.019226 |
(0.003997) | (0.004906) | (0.006722) | (0.003688) | (0.005117) | (0.007975) | |
|
0.037387 | 0.045484 | 0.061677 | 0.044100 | 0.060841 | 0.094323 |
(0.004802) | (0.005828) | (0.007881) | (0.006228) | (0.008600) | (0.013344) | |
|
0.006449 | 0.007872 | 0.010719 | 0.007588 | 0.010418 | 0.016076 |
(0.005316) | (0.006551) | (0.009022) | (0.003434) | (0.004720) | (0.007291) | |
|
0.092504 | 0.112726 | 0.153171 | 0.102530 | 0.141543 | 0.219569 |
(0.009534) | (0.011631) | (0.015826) | (0.010642) | (0.014727) | (0.022896) | |
|
0.006262 | 0.007627 | 0.010355 | 0.006917 | 0.009419 | 0.014419 |
|
0.016593 | 0.020397 | 0.028004 | 0.014191 | 0.019546 | 0.030256 |
(0.004941) | (0.005987) | (0.008080) | (0.006670) | (0.009207) | (0.014279) | |
|
0.006252 | 0.007612 | 0.010329 | 0.006904 | 0.009395 | 0.014362 |
(0.004152) | (0.005037) | (0.006804) | (0.004963) | (0.006760) | (0.010341) | |
|
0.006252 | 0.007613 | 0.010329 | 0.006904 | 0.009395 | 0.014362 |
(0.004153) | (0.005038) | (0.006804) | (0.004963) | (0.006760) | (0.010342) | |
|
0.006252 | 0.007613 | 0.010329 | 0.006904 | 0.009396 | 0.014363 |
(0.004153) | (0.005038) | (0.006804) | (0.004963) | (0.006760) | (0.010342) | |
|
0.004734 | 0.005749 | 0.007771 | 0.005848 | 0.007960 | 0.012160 |
(0.003277) | (0.003966) | (0.005342) | (0.004165) | (0.005672) | (0.008669) | |
|
0.004737 | 0.005752 | 0.007777 | 0.005851 | 0.007966 | 0.012176 |
(0.003278) | (0.003969) | (0.005346) | (0.004168) | (0.005677) | (0.008681) | |
Estimators without Measurement Error | 10% non-response | 20% non-response | ||||
|
|
|||||
2 | 4 | 8 | 2 | 4 | 8 | |
|
0.008595 | 0.010446 | 0.014149 | 0.010151 | 0.014012 | 0.021732 |
|
0.001193 | 0.001468 | 0.002018 | 0.000905 | 0.001243 | 0.001920 |
(0.000333) | (0.000407) | (0.000556) | (0.000353) | (0.000485) | (0.000750) | |
|
0.028648 | 0.034750 | 0.046953 | 0.035952 | 0.049640 | 0.077016 |
(0.004802) | (0.005828) | (0.007881) | (0.006228) | (0.008600) | (0.013344) | |
|
0.003313 | 0.004041 | 0.005499 | 0.003459 | 0.004770 | 0.007392 |
(0.004884) | (0.005936) | (0.008042) | (0.006588) | (0.009102) | (0.014129) | |
|
0.061352 | 0.074378 | 0.100428 | 0.078308 | 0.108130 | 0.167773 |
(0.004815) | (0.005841) | (0.007893) | (0.006268) | (0.008659) | (0.013441) | |
|
0.001144 | 0.001407 | 0.001934 | 0.0008737 | 0.001201 | 0.001856 |
|
0.001604 | 0.001962 | 0.002677 | 0.001586 | 0.002183 | 0.003379 |
(0.005709) | (0.006930) | (0.009371) | (0.007385) | (0.010198) | (0.015823) | |
|
0.001143 | 0.001407 | 0.001933 | 0.000873 | 0.001200 | 0.001855 |
(0.000677) | (0.000833) | (0.001145) | (0.000651) | (0.000893) | (0.001378) | |
|
0.001143 | 0.001407 | 0.001933 | 0.000873 | 0.001201 | 0.001855 |
(0.000677) | (0.000833) | (0.001145) | (0.000651) | (0.000893) | (0.001378) | |
|
0.001143 | 0.001407 | 0.001933 | 0.000873 | 0.001201 | 0.001855 |
(0.000677) | (0.000833) | (0.001145) | (0.000651) | (0.000893) | (0.001378) | |
|
0.001060 | 0.001307 | 0.001800 | 0.000822 | 0.001128 | 0.001742 |
(0.000631) | (0.000777) | (0.001069) | (0.000611) | (0.000839) | (0.001293) | |
|
0.001063 | 0.001309 | 0.001801 | 0.000824 | 0.001129 | 0.001743 |
(0.000631) | (0.000777) | (0.001070) | (0.000611) | (0.000839) | (0.001294) |
Estimators | 10% non-response | 20% non-response | ||||
---|---|---|---|---|---|---|
|
|
|||||
2 | 4 | 8 | 2 | 4 | 8 | |
|
0.630021 | 0.837808 | 1.045595 | 0.630686 | 0.839804 | 1.048922 |
|
1.921722 | 2.793240 | 3.464758 | 2.121722 | 3.170594 | 4.119467 |
(0.130760) | (0.157226) | (0.104294) | (0.148446) | (0.192598) | ||
|
2.506471 | 3.404393 | 4.302315 | 2.735328 | 4.090965 | 5.446601 |
(0.013928) | (0.017060) | (0.011940) | (0.017360) | (0.022779) | ||
|
0.906671 | 1.228910 | 1.551149 | 0.945071 | 1.344110 | 1.743148 |
(307.1653) | (395.3818) | (248.3003) | (395.2198) | (542.1394) | ||
|
7.622591 | 10.43603 | 13.24947 | 8.484520 | 13.02182 | 17.55912 |
(0.567979) | (0.727644) | (0.461703) | (0.728147) | (0.994592) | ||
|
0.619716 | 0.824887 | 1.029976 | 0.619579 | 0.824426 | 1.029194 |
4.905149 | 6.754218 | 8.603288 | 5.446953 | 8.379632 | 11.31231 | |
(0.030936) | (0.044995) | (0.059054) | (0.035498) | (0.058680) | (0.081861) | |
|
0.617825 | 0.821560 | 1.024813 | 0.617755 | 0.821341 | 1.024502 |
(0.076580) | (0.095337) | (0.056926) | (0.074051) | (0.091135) | ||
|
0.617948 | 0.821788 | 1.025177 | 0.617894 | 0.821628 | 1.024985 |
(0.076603) | (0.096939) | (0.064079) | (0.091183) | (0.145374) | ||
|
0.617950 | 0.821792 | 1.025185 | 0.617896 | 0.821632 | 1.024994 |
(0.076603) | (0.095375) | (0.056940) | (0.074080) | (0.091184) | ||
|
0.334101 | 0.444233 | 0.552328 | 0.335842 | 0.447262 | 0.554985 |
(0.040907) | (0.050769) | (0.030738) | (0.040354) | (0.049608) | ||
|
0.336942 | 0.449357 | 0.560385 | 0.339090 | 0.453985 | 0.566332 |
(0.041422) | (0.051582) | (0.031060) | (0.041023) | (0.050737) | ||
Estimators without Measurement Error | 10% non-response | 20% non-response | ||||
|
|
|||||
2 | 4 | 8 | 2 | 4 | 8 | |
|
0.531888 | 0.706251 | 0.880614 | 0.534662 | 0.714573 | 0.894483 |
|
0.847220 | 1.129459 | 1.411698 | 0.893275 | 1.267623 | 1.641971 |
(0.043594) | (0.054407) | (0.037339) | (0.057267) | (0.077194) | ||
|
1.360452 | 1.797576 | 2.234700 | 1.458010 | 2.090249 | 2.722487 |
(0.013928) | (0.017060) | (0.011940) | (0.017360) | (0.022779) | ||
|
0.546567 | 0.728539 | 0.910510 | 0.553723 | 0.750007 | 0.946291 |
(70.46203) | (88.11975) | (60.15501) | (92.51417) | (124.8733) | ||
|
3.332912 | 4.403433 | 5.473954 | 3.663318 | 5.394650 | 7.125983 |
(0.172948) | (0.214997) | (0.147613) | (0.223090) | (0.298568) | ||
|
0.499049 | 0.664792 | 0.830500 | 0.499692 | 0.666329 | 0.832772 |
|
1.780605 | 2.369676 | 2.958748 | 1.941566 | 2.852558 | 3.763550 |
(0.003230) | (0.003599) | (0.002798) | (0.003040) | (0.003283) | ||
|
0.497680 | 0.662389 | 0.826776 | 0.498415 | 0.664262 | 0.829706 |
(0.064500) | (0.080376) | (0.047678) | (0.061733) | (0.075745) | ||
|
0.497707 | 0.662436 | 0.826849 | 0.498447 | 0.664327 | 0.829816 |
(0.064504) | (0.080382) | (0.047681) | (0.061739) | (0.075755) | ||
|
0.497708 | 0.662437 | 0.826850 | 0.498447 | 0.664328 | 0.829818 |
(0.064505) | (0.080382) | (0.047681) | (0.061739) | (0.075755) | ||
|
0.222332 | 0.293731 | 0.363450 | 0.223663 | 0.296212 | 0.366048 |
(0.030276) | (0.037404) | (0.022712) | (0.029331) | (0.035698) | ||
|
0.223172 | 0.295152 | 0.365602 | 0.224612 | 0.298063 | 0.369087 |
(0.030408) | (0.037603) | (0.022800) | (0.029499) | (0.035973) |
In Table 6 the MSE for the generalized proposed estimator, when
Through real data sets it is concluded that the generalized proposed estimator performs better as compared to the other existing estimators. For 10% non-response rate the MSE is minimum. The MSE also increases as the value of constant
5 Conclusion
In the present study, we proposed a generalized class of estimators in estimating the finite population mean for the sensitive variable in the presence of measurement error and non-response under stratified random sampling. Through simulation study and real life data sets it is observed that the proposed class of estimators perform better than the existing estimators. The MSE values are generally smaller under 10% of non-response as compared to 20% of non-response, which are expected results. Generally as the non-response rate increases, MSE also increases. Based on numerical findings, it turns out that the generalized proposed class of estimators is more efficient as compared to the other existing estimators, under certain situations.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References
- A review of hot deck imputation for survey non-response. Int. Stat. Rev.. 2010;78(1):40-64.
- [Google Scholar]
- Joint influence of measurement error and non response on estimation of population mean. Commun. Statistics-Theory Methods. 2017;46(4):1679-1693.
- [Google Scholar]
- Ratio and product type exponential estimators. J. Inform. Optim. Sci.. 1991;12(1):159-164.
- [Google Scholar]
- Measurement Errors in Surveys. John Wiley & Sons; 2011.
- Ranked set sampling and optional scrambling randomized response modeling. Investigación Operacional. 2018;39(1):100-107.
- [Google Scholar]
- On efficacy of empirical bayes estimation of a finite population mean of a sensitive variable through randomized responses. Model Assisted Stat. Appl.. 2015;10(4):283-288.
- [Google Scholar]
- Sampling techniques. New York: John Wiley & Sons; 1977.
- New scrambled response models for estimating the mean of a sensitive quantitative character. J. Appl. Stat.. 2010;37(11):1875-1890.
- [Google Scholar]
- Scrambled randomized response methods for obtaining sensitive quantitative data. J. Stat. Planning inference. 1983;7(4):307-316.
- [Google Scholar]
- Estimation in the presence of measurement error. Int. Stat. Rev.. 1995;63(2):121-141.
- [Google Scholar]
- Application of the randomized response technique in obtaining quantitative data. J. Am. Stat. Assoc.. 1971;66(334):243-250.
- [Google Scholar]
- An improved estimator of the finite population mean in simple random sampling. Model Assisted Stat. Appl.. 2011;6(1):47-55.
- [Google Scholar]
- Sensitivity estimation for personal interview survey questions. Statistica. 2004;64(4):643-653.
- [Google Scholar]
- On improvement in estimating the population mean in simple random sampling. J. Appl. Stat.. 2008;35(5):559-566.
- [Google Scholar]
- Mean and sensitivity estimation in optional randomized response models. J. Stat. Planning Inference. 2010;140(10):2870-2874.
- [Google Scholar]
- Improved exponential type estimators of the mean of a sensitive variable in the presence of nonsensitive auxiliary information. Commun. Statistics-Simul. Comput.. 2016;45(9):3317-3328.
- [Google Scholar]
- The problem of non-response in sample surveys. J. Am. Stat. Assoc.. 1946;41(236):517-529.
- [Google Scholar]
- Estimation of finite population mean in stratified sampling using scrambled responses in the presence of measurement errors. Commun. Statistics-Theory Methods 2018:1-9.
- [Google Scholar]
- Estimation of population mean in sample surveys using auxiliary character, method of call backs and subsampling from non-respondents. Proceedings of the National Academy of Sciences, India Section A: Physical Sciences. 2013;83(1):49-54.
- [Google Scholar]
- Generalized two phase sampling estimators for the population mean in the presence of nonresponse. Aligarh Jouranal of Statistics. 2010;30:39-54.
- [Google Scholar]
- A stratified warner’s randomized response model. J. Stat. Planning Inference. 2004;120(1–2):155-165.
- [Google Scholar]
- Improved estimation of population mean in presence of non-response and measurement error. J. Stat. Theory Practice 2016 just-accepted
- [Google Scholar]
- Estimation of population mean in the presence of non-response and measurement error. Revista Colombiana de EstadÝstica. 2015;38(1):145-161.
- [Google Scholar]
- Ratio estimation with subsampling the nonrespondents. Survey Methodology. 1986;12(2):217-230.
- [Google Scholar]
- Fundamentals of biostatistics. Duxbury Press; 2015.
- A generalized class of estimators under two-phase stratified sampling for non response. Commun. Stat.-Theory Methods 2018:1-17.
- [Google Scholar]
- Some modified exponential-ratio type estimators in the presence of non-response under two-phase sampling scheme. Electronic J. Appl. Stati. Anal.. 2013;6(1):1-17.
- [Google Scholar]
- Ratio method of estimation in the presence of measurement errors. J. Indian Society Agric. Stat.. 1997;52:150-155.
- [Google Scholar]
- An estimator for mean estimation in presence of measurement error. Res. Rev.: A J. Stat.. 2012;1(1):1-8.
- [Google Scholar]
- Some imputation methods to compensate with non-response for estimation of population mean in two-occasion successive sampling. Commun. Statistics-Theory Methods. 2020;49(14):3329-3351.
- [Google Scholar]
- Estimation of mean in presence of non-response using two phase sampling scheme. Stat. Pap.. 2010;51(3):559-582.
- [Google Scholar]
- Combination of regression and ratio estimate in presence of nonresponse. Brazilian J. Prob. Stat.. 2011;25(2):205-217.
- [Google Scholar]
- Estimation of population mean when coefficient of variation is known using scrambled response technique. J. Stat. Planning Inference. 2005;131(1):135-144.
- [Google Scholar]
- Method of estimation in the presence of non-response and measurement errors simultaneously. J. Modern Appl. Stat. Methods. 2015;14(1):12.
- [Google Scholar]
- Randomized response: A survey technique for eliminating evasive answer bias. J. Am. Stat. Assoc.. 1965;60(309):63-69.
- [Google Scholar]
- Estimation of population mean in the presence of measurement error and non response under stratified random sampling. PloS one. 2018;13(2):e0191572
- [Google Scholar]
- Estimation of finite population mean for a sensitive variable using dual auxiliary information in the presence of measurement errors. PloS one. 2019;14(2):e0212111
- [Google Scholar]
Appendix A
Simplification of MSE
Squaring both sides of Eq. (25), and keeping the terms up to power two in errors, and then taking expectations, the MSE of
Population
Population I.
Population II.
Members of Generalized Proposed Class of Estimators
1. For
2. For
3. For
4. For
5. For
6. For
7. For
8. For
9. For
10. For