Translate this page into:
A generalized class of estimators for sensitive variable in the presence of measurement error and non-response under stratified random sampling
⁎Corresponding author. erumzahid22@gmail.com (Erum Zahid)
-
Received: ,
Accepted: ,
This article was originally published by Elsevier and was migrated to Scientific Scholar after the change of Publisher.
Peer review under responsibility of King Saud University.
Abstract
In survey sampling an investigator may be unable to get the complete and correct information at the same time. So non-response and measurement error occur simultaneously and consequently may effect the estimator. Considering this problem, a generalized class of estimators is proposed for estimating the finite population mean for sensitive variable in the presence of measurement error and non-response under stratified random sampling. We conducted a study based on real data set at Quaid-i-Azam University, Islamabad. Simulation and real life data sets are used to observe the performances of the estimators. Bias and MSE values are given for the comparison of estimators.
Keywords
Auxiliary variable
Measurement error
Non-response
Randomized response
Stratified random sampling
1 Introduction
In survey sampling, certain surveys cause some problems for the researchers due to the fact that the respondents are reluctant to discuss sensitive topics such as drug use, abortion, sexually transmitted diseases etc. When surveying on those topics, measurement error and non-response can occur since the respondent may choose, not to respond some specific questions, not to give the accurate answers, or not to take part in the survey. The problem of measurement error is usually ignored during the sensitive surveys and the assumption is made that the information obtained through surveys is free from error. Another important factor in surveys is non-response, which may arises due to refusal of respondents to give the information or not at home or lack of interest due to some sensitive issues. Usually measurement error and non-response are studied separately for the sensitive variable using the known auxiliary or additional information. In reality, when the variable of interest is sensitive, the respondents hesitate to provide the personal information, which give rise to measurement error. In most of the cases, the information is not obtained from all units in surveys, specially when the variable of interest is stigmatizing in nature. Many researchers studied the problem of non-response, including (Hansen and Hurwitz, 1946; Cochran, 1977; Rao, 1986; Khare and Srivastava, 2010; Andridge and Little, 2010; Singh et al., 2011; Khare et al., 2013; Shabbir and Khan, 2013; Shabbir et al., 2018 and Singh and Khalid, 2020). In survey sampling, when the variable under study contains social stigma, then the respondents are not comfortable to provide their personal information. Direct survey on sensitive question increases the relative bias. Warner (1965) introduced the randomized response technique (RRT), which reduces the possible bias and is used to obtain the true information while insuring the privacy of the respondents. For estimation of mean of a sensitive quantitative variable the Randomized Response model (RRM) is extended by Greenberg et al. (1971). Further work is done by Eichhorn and Hayre (1983); Gupta and Shabbir (2004), Kim and Warde (2004); Singh and Mathur (2005), Gjestvang and Singh (2006); Diana and Perri (2010), Gupta et al. (2010); Chaudhuri and Pal (2015), Gupta et al. (2016) and Bouza et al. (2018).
The researchers dealt with the problem of measurement error for estimating the population mean. For more details, see Cochran (1968); Fuller (1995); Shalabh (1997); Biemer et al. (2011); Shukla et al. (2012), etc. Recently few researchers studied the problem of measurement error and non-response together like Kumar et al. (2015); Singh and Sharma (2015); Azeem and Hanif (2017) and Kumar (2016). Zahid and Shabbir (2018); Khalil et al. (2018) and Zahid and Shabbir (2019) have discussed the problem of measurement error and non-response under stratified random sampling.
In practice, the researchers who have studied measurement error, have ignored the presence of non-response and randomized response at the same time. In this study, we have proposed a class of estimators for estimating the population mean of a sensitive variable in the presence of measurement error and non-response simultaneously under stratified random sampling. The efficiency of the suggested class of estimators over the existing estimators is shown through simulation study and real data sets.
Consider a finite population of N identifiable units which are partitioned into L homogeneous subgroups called strata, such that the strata consist of units, where and . It is assumed that N consists of two mutually exclusive groups called response and non-response groups. Let and are the responding and non-responding units in the stratum respectively. We select a sample of size from by using simple random sampling without replacement (SRSWOR) and assume that units respond and units do not respond. We select a sub-sample of size from non-responding units in the stratum.
Let be the observed values and be the actual values of the sampled units in the stratum. Let , and be the corresponding ranks of and respectively, then the measurement errors be.
and .
Let and be the population variances for the responding units and and be the population variances for non-responding units. Let and be the population variances associated with the measurement error for responding units. Let and be the population variances associated with measurement error for the non-responding part of the population. Let be the coefficients of correlation, between their subscripts for respondents and be the coefficients of correlation, between their subscripts for non-respondents in the population.
In Section 2, some existing estimators of the finite population mean are given. In Section 3, a generalized class of estimators is suggested for estimating the finite population mean by incorporating both measurement error and non-response information simultaneously. Numerical results and simulation study are presented in Section 4. Conclusion is given in Section 5.
2 Existing Estimators in Literature
In this section we consider the following existing estimators.
2.1 Hansen and Hurwitz (1946) Estimator
In stratified random sampling, the Hansen and Hurwitz (1946) estimator for population mean
, is given by
Here and are the sample means based on of responding and units of sub-samples from non-responding groups, respectively.
The variance of
, is given by
.
2.2 Ratio Estimator
The usual ratio estimator under stratified random sampling, is given by
The bias and MSE of
, are given by
,
,
.
2.3 Product Estimator
The product estimator under stratified random sampling, is given by
The bias and MSE of
, are given by
2.4 Bahl and Tuteja, 1991 Estimator
Bahl and Tuteja, 1991 estimator under stratified random sampling, is given by
The bias and MSE of
, are given by
2.5 Singh and Kumar, 2010 Estimator
Singh and Kumar, 2010 estimator under stratified random sampling, is given by
The bias and MSE of
, are given by
2.6 Difference Estimator
The usual difference estimator under stratified random sampling, is given by
The minimum variance of
, is given by
The optimum value of is , where .
2.7 Azeem and Hanif (2017) Estimator
Azeem and Hanif (2016) estimator under stratified random sampling, is given by
The bias and MSE of
, are given by
3 Proposed Generalized Class of Estimators
We suggest a generalized class of estimators for estimating the finite population mean for a sensitive variable, considering the problem of measurement error and non-response simultaneously in stratified random sampling. Measurement error and non-response are present on both the study variable and the auxiliary variable. The suggested estimator, is given by
, ,
, ,
, .
Adding and , we get.
.
Dividing both sides by
, and then simplifying, we get
Similarly, we can get
Further
,
,
,
,
,
.
On simplifying, we get
,
,
,
, and.
.
and .
Further simplifying, and ignoring error terms greater than two, we have
Using Eq. (25), the bias and MSE of
to first order of approximation, is given by
,
,
,
,
,
,
,
,
.
For finding the optimal values of and , we differentiate Eq. (27) with respect to and respectively. The optimal values, are given by.
,
, and.
.
Substituting these optimum values in Eq. (27), we get the minimum MSE of
as:
and.
.
4 Numerical Results
In this section simulated data and two real data sets are used to show the performance of the generalized class of proposed estimator. The results are given in Tables 1, 2 (simulation) and 5, 6 (real data).
Estimators with Measurement Error
10% non-response
20% non-response
2
4
8
2
4
8
0.107125
0.128072
0.169966
0.117775
0.160022
0.244517
0.030971
0.037839
0.051576
0.036732
0.055122
0.091904
(0.026764)
(0.034223)
(0.049143)
(0.034202)
0.056537)
(0.101206)
0.458469
0.555397
0.749251
0.517367
0.732089
1.161533
(0.087794)
(0.106418)
(0.143665)
(0.098841)
(0.139558)
(0.220992)
0.034649
0.040819
0.053159
0.037435
0.049176
0.072660
(0.065539)
(0.069173)
(0.076441)
(0.053816)
(0.034004)
(0.005620)
1.085004
1.319813
1.789432
1.235507
1.771322
2.842953
(0.157364)
(0.193539)
(0.265890)
(0.183396)
(0.271635)
(0.448113)
0.022166
0.026426
0.034858
0.024615
0.033426
0.050598
0.096090
0.119132
0.165217
0.115666
0.177860
0.302249
(0.103052)
(0.124466)
(0.167295)
(0.115001)
(0.160313)
(0.250939)
0.022067
0.026286
0.034616
0.024492
0.033201
0.050087
(0.018672)
(0.022202)
(0.029160)
(0.020721)
(0.028004)
(0.042072)
0.022075
0.026298
0.034638
0.024503
0.033223
0.050141
(0.018679)
(0.022213)
(0.029180)
(0.020731)
(0.028023)
(0.042119)
0.022076
0.026299
0.034638
0.024503
0.033223
0.050143
(0.018679)
(0.022213)
(0.029180)
(0.020731)
(0.028024)
(0.042121)
0.021993
0.026179
0.034423
0.024394
0.033006
0.049605
(0.018608)
(0.022109)
(0.028993)
(0.020636)
(0.027834)
(0.041651)
0.022057
0.026230
0.034607
0.024428
0.033176
0.050079
(0.018698)
(0.022240)
(0.029231)
(0.020752)
(0.028070)
(0.042240)
Estimators without Measurement Error
10% non-response
20% non-response
2
4
8
2
4
8
0.096942
0.115829
0.153602
0.106425
0.144277
0.219980
0.004814
0.006475
0.009796
0.007385
0.014187
0.027792
(0.013029)
(0.017786)
(0.027300)
(0.018722)
(0.034866)
(0.067153)
0.432313
0.524032
0.707471
0.488020
0.691154
1.097421
(0.087794)
(0.106418)
(0.143665)
(0.098841)
(0.139558)
(0.220992)
0.020473
0.023796
0.030441
0.021586
0.027134
0.038230
(0.141554)
(0.160285)
(0.197747)
(0.139441)
(0.153945)
(0.182954)
1.010927
1.231085
1.671403
1.152171
1.654819
2.660115
(0.138073)
(0.170463)
(0.235244)
(0.161654)
(0.241207)
(0.400314)
0.001424
0.001860
0.002574
0.002043
0.003129
0.004569
0.049966
0.063866
0.091666
0.063824
0.105438
0.188667
(0.106485)
(0.128576)
(0.172756)
(0.118871)
(0.165731)
(0.259452)
0.001423
0.001858
0.002570
0.002040
0.003124
0.004558
(0.001219)
(0.001607)
(0.002228)
(0.001801)
(0.002785)
(0.004044)
0.001424
0.001859
0.002572
0.002042
0.003127
0.004564
(0.001220)
(0.001608)
(0.002230)
(0.001802)
(0.002787)
(0.004050)
0.001424
0.001859
0.002572
0.002042
0.003127
0.004564
(0.001220)
(0.001608)
(0.00223)
(0.001802)
(0.002787)
(0.004050)
0.001409
0.001837
0.002530
0.002019
0.003079
0.004440
(0.001207)
(0.001589)
(0.002193)
(0.001783)
(0.002745)
(0.003941)
0.001422
0.001858
0.002571
0.002040
0.003126
0.004564
(0.001219)
(0.001607)
(0.002229)
(0.001801)
(0.002787)
(0.004051)
Estimators
10% non-response
20% non-response
2
4
8
2
4
8
0.089797
0.108185
0.144961
0.098465
0.135429
0.209356
0.013224
0.016604
0.023364
0.014699
0.021331
0.034594
(0.000313)
(0.000378)
(0.001761)
(0.003640)
(0.004433)
(0.006020)
0.320038
0.387672
0.522939
0.334810
0.460273
0.711199
(0.074086)
(0.089385)
(0.119983)
(0.081194)
(0.111309)
(0.171537)
0.032302
0.038906
0.052115
0.037509
0.052036
0.081090
(0.180648)
(0.209399)
(0.266902)
(0.235166)
(0.319488)
(0.488131)
0.703946
0.855063
1.157297
0.723734
0.995864
1.540122
(0.067164)
(0.081340)
(0.109694)
(0.063355)
(0.087213)
(0.134930)
0.012522
0.015889
0.022418
0.014072
0.020597
0.033554
0.027173
0.034466
0.049053
0.024482
0.035358
0.057109
(0.089356)
(0.107659)
(0.144267)
(0.098718)
(0.135291)
(0.208437)
0.012476
0.015816
0.022274
0.014014
0.020474
0.033231
(0.012576)
(0.015866)
(0.022247)
(0.014418)
(0.020942)
(0.033839)
0.012477
0.015820
0.022281
0.014016
0.020480
0.033245
(0.012578)
(0.015868)
(0.022253)
(0.014420)
(0.020948)
(0.033853)
0.012478
0.015821
0.022282
0.014017
0.020481
0.033246
(0.012579)
(0.015869)
(0.022254)
(0.014421)
(0.020949)
(0.033854)
0.012455
0.015786
0.022218
0.013989
0.020426
0.033115
(0.012556)
(0.015835)
(0.022191)
(0.014392)
(0.020892)
(0.033718)
0.012474
0.015809
0.022230
0.014013
0.020464
0.033224
(0.012585)
(0.015879)
(0.022273)
(0.014429)
(0.020964)
(0.033895)
Estimators without Measurement Error
10% non-response
20% non-response
2
4
8
2
4
8
0.079750
0.095863
0.128091
0.087280
0.119695
0.184526
0.001954
0.002369
0.003199
0.002038
0.002794
0.004305
(0.001402)
(0.001226)
(0.005081)
(0.007060)
(0.011019)
0.308768
0.373437
0.502774
0.322149
0.441736
0.680909
(0.089385)
(0.119983)
(0.081194)
(0.111309)
(0.171537)
0.021949
0.026106
0.034421
0.025956
0.035602
0.054895
(0.222047)
(0.288625)
(0.245101)
(0.338267)
(0.524598)
0.689010
0.835090
1.127248
0.706644
0.968914
1.493455
(0.066129)
(0.079822)
(0.107208)
(0.062190)
(0.085148)
(0.131064)
0.001194
0.001576
0.002205
0.001210
0.001686
0.002635
0.014726
0.018444
0.025880
0.010445
0.014281
0.021953
(0.108009)
(0.144843)
(0.099002)
(0.135796)
(0.209385)
0.001190
0.001572
0.002203
0.001205
0.001682
0.002625
(0.001574)
(0.002202)
(0.001256)
(0.001760)
(0.002759)
0.001191
0.001573
0.002204
0.001206
0.001683
0.002626
(0.001575)
(0.002203)
(0.001257)
(0.001761)
(0.002760)
0.001192
0.001574
0.002205
0.001207
0.001684
0.002627
(0.001576)
(0.002204)
(0.001258)
(0.001762)
(0.002760)
0.001184
0.001564
0.002189
0.001200
0.001673
0.002604
(0.001566)
(0.002189)
(0.001251)
(0.001751)
(0.002737)
0.001189
0.001570
0.002202
0.001204
0.001680
0.002622
(0.001573)
(0.002200)
(0.001254)
(0.001759)
(0.002760)
4.1 Simulation Study
We have generated two populations (Population I and II) from normal distribution by using R language program, which are given in Appendix A. The results based on these population are given in Tables 1 and 2.
Tables 1 and 2 show that the generalized class of proposed estimators perform better than other existing estimators for both with and without measurement errors. The values of the absolute biases are given in brackets. In Table 1 the MSE for the generalized proposed estimator, when is 0.021993 for 10% of non-response rate. When the non-response rate increases to 20%, the MSE for generalized proposed estimator increases to 0.024394. It is also observed that is less biased and is highly biased among all other considered estimator. Table 1 shows the same pattern of results for the case of no measurement error.
In Table 2 the MSE for the generalized proposed estimator, when is 0.012455 for 10% non-response rate. When the non-response rate increases to 20%, the MSE for generalized proposed estimator increases to 0.013989. It is also observed that is less biased and is highly biased among all other considered estimator. Table 2 shows the same pattern of results for the case of no measurement error.
Through the simulation study it is concluded that the generalized proposed class of estimators perform better as compared to the all other existing estimators. For 10% non-response rate, the MSE is minimum as compared to 20% of the non-response rate. The MSE also increases as the value of constant increases.
4.2 Application to Real Data Set
In this section we consider two real life data sets for numerical comparisons, Population III is taken from Rosner, 2015, Population IV is obtained by conducting a survey at Quaid-i-Azam University, Islamabad 4.2.1. The results based on these data sets are given in Tables 5 and 6.
Population III. [Source:Rosner, 2015].
Strata I consist of 318 observations and strata II contains 336 observations. The data summary is given in Tables 3 and 4.
Variable
Mean
st.Dev
Min
Med
Max
Forced expiratory volume (
)
2.45
0.65
0.79
2.48
3.83
Age (
)
9.84
2.93
3.00
10.00
19.00
Smoke (
) 0,1
0.12
0.32
0.00
0.00
1.00
Variable
Mean
st.Dev
Min
Med
Max
Forced expiratory volume (
)
2.68
1.00
0.79
2.61
5.79
Age (
)
10.01
2.97
3.00
10.00
19.00
Sex (
)0,1
0.07
0.27
0.00
0.00
1.00
and .
and .
4.2.1 Data Collection
To see the practical implication of measurement error, we conducted a study based on real data set at Quaid-i-Azam University, Islamabad. We distributed 55 questionnaires to the students of BS Statistics (5th Semester Fall, 2018) and M.Phil Statistics (1st and 2nd Semesters, Fall 2018) of Quaid-i-Azam University, Islamabad. We consider our population of those students who gave the false response, which comes out to be 23. As we already have the true response from their academic record. In question (i) we asked for = Age, = Marks in A level or Intermediate (in percentage). In question (ii) = Social media effects the academic result is asked, where Y is the study variable, X is the auxiliary variable and S is the scrambling response variable. We have 23 students ( ), including 8 male students and 15 female students who gave the false response.
Population IV. [Source: Section 4.2.1].
Let, Y: Age of BS and Mphil Students of Statistics department, X: Marks in A level or Intermediate, S:Social media effects on the academic result
(Male and Female).
.
Tables 5 and 6 show that the generalized class of proposed estimators
perform better than other existing estimators for both with and without measurement error. The values of the absolute biases are given in brackets in the tables. In Table 5 the MSE for the generalized proposed estimator, when
is 0.004734 for 10% non-response rate. When the non-response rate becomes 20%, the MSE for generalized proposed estimator increases to 0.005848. It is also observed that
is less biased and
is highly biased among all other considered estimator. Table 5 shows the same pattern of results in case for no measurement error.
Estimators
10% non-response
20% non-response
2
4
8
2
4
8
0.009864
0.011976
0.016201
0.012941
0.017808
0.027542
0.009932
0.012202
0.016742
0.009053
0.012444
0.019226
(0.003997)
(0.004906)
(0.006722)
(0.003688)
(0.005117)
(0.007975)
0.037387
0.045484
0.061677
0.044100
0.060841
0.094323
(0.004802)
(0.005828)
(0.007881)
(0.006228)
(0.008600)
(0.013344)
0.006449
0.007872
0.010719
0.007588
0.010418
0.016076
(0.005316)
(0.006551)
(0.009022)
(0.003434)
(0.004720)
(0.007291)
0.092504
0.112726
0.153171
0.102530
0.141543
0.219569
(0.009534)
(0.011631)
(0.015826)
(0.010642)
(0.014727)
(0.022896)
0.006262
0.007627
0.010355
0.006917
0.009419
0.014419
0.016593
0.020397
0.028004
0.014191
0.019546
0.030256
(0.004941)
(0.005987)
(0.008080)
(0.006670)
(0.009207)
(0.014279)
0.006252
0.007612
0.010329
0.006904
0.009395
0.014362
(0.004152)
(0.005037)
(0.006804)
(0.004963)
(0.006760)
(0.010341)
0.006252
0.007613
0.010329
0.006904
0.009395
0.014362
(0.004153)
(0.005038)
(0.006804)
(0.004963)
(0.006760)
(0.010342)
0.006252
0.007613
0.010329
0.006904
0.009396
0.014363
(0.004153)
(0.005038)
(0.006804)
(0.004963)
(0.006760)
(0.010342)
0.004734
0.005749
0.007771
0.005848
0.007960
0.012160
(0.003277)
(0.003966)
(0.005342)
(0.004165)
(0.005672)
(0.008669)
0.004737
0.005752
0.007777
0.005851
0.007966
0.012176
(0.003278)
(0.003969)
(0.005346)
(0.004168)
(0.005677)
(0.008681)
Estimators without Measurement Error
10% non-response
20% non-response
2
4
8
2
4
8
0.008595
0.010446
0.014149
0.010151
0.014012
0.021732
0.001193
0.001468
0.002018
0.000905
0.001243
0.001920
(0.000333)
(0.000407)
(0.000556)
(0.000353)
(0.000485)
(0.000750)
0.028648
0.034750
0.046953
0.035952
0.049640
0.077016
(0.004802)
(0.005828)
(0.007881)
(0.006228)
(0.008600)
(0.013344)
0.003313
0.004041
0.005499
0.003459
0.004770
0.007392
(0.004884)
(0.005936)
(0.008042)
(0.006588)
(0.009102)
(0.014129)
0.061352
0.074378
0.100428
0.078308
0.108130
0.167773
(0.004815)
(0.005841)
(0.007893)
(0.006268)
(0.008659)
(0.013441)
0.001144
0.001407
0.001934
0.0008737
0.001201
0.001856
0.001604
0.001962
0.002677
0.001586
0.002183
0.003379
(0.005709)
(0.006930)
(0.009371)
(0.007385)
(0.010198)
(0.015823)
0.001143
0.001407
0.001933
0.000873
0.001200
0.001855
(0.000677)
(0.000833)
(0.001145)
(0.000651)
(0.000893)
(0.001378)
0.001143
0.001407
0.001933
0.000873
0.001201
0.001855
(0.000677)
(0.000833)
(0.001145)
(0.000651)
(0.000893)
(0.001378)
0.001143
0.001407
0.001933
0.000873
0.001201
0.001855
(0.000677)
(0.000833)
(0.001145)
(0.000651)
(0.000893)
(0.001378)
0.001060
0.001307
0.001800
0.000822
0.001128
0.001742
(0.000631)
(0.000777)
(0.001069)
(0.000611)
(0.000839)
(0.001293)
0.001063
0.001309
0.001801
0.000824
0.001129
0.001743
(0.000631)
(0.000777)
(0.001070)
(0.000611)
(0.000839)
(0.001294)
Estimators
10% non-response
20% non-response
2
4
8
2
4
8
0.630021
0.837808
1.045595
0.630686
0.839804
1.048922
1.921722
2.793240
3.464758
2.121722
3.170594
4.119467
(0.130760)
(0.157226)
(0.104294)
(0.148446)
(0.192598)
2.506471
3.404393
4.302315
2.735328
4.090965
5.446601
(0.013928)
(0.017060)
(0.011940)
(0.017360)
(0.022779)
0.906671
1.228910
1.551149
0.945071
1.344110
1.743148
(307.1653)
(395.3818)
(248.3003)
(395.2198)
(542.1394)
7.622591
10.43603
13.24947
8.484520
13.02182
17.55912
(0.567979)
(0.727644)
(0.461703)
(0.728147)
(0.994592)
0.619716
0.824887
1.029976
0.619579
0.824426
1.029194
4.905149
6.754218
8.603288
5.446953
8.379632
11.31231
(0.030936)
(0.044995)
(0.059054)
(0.035498)
(0.058680)
(0.081861)
0.617825
0.821560
1.024813
0.617755
0.821341
1.024502
(0.076580)
(0.095337)
(0.056926)
(0.074051)
(0.091135)
0.617948
0.821788
1.025177
0.617894
0.821628
1.024985
(0.076603)
(0.096939)
(0.064079)
(0.091183)
(0.145374)
0.617950
0.821792
1.025185
0.617896
0.821632
1.024994
(0.076603)
(0.095375)
(0.056940)
(0.074080)
(0.091184)
0.334101
0.444233
0.552328
0.335842
0.447262
0.554985
(0.040907)
(0.050769)
(0.030738)
(0.040354)
(0.049608)
0.336942
0.449357
0.560385
0.339090
0.453985
0.566332
(0.041422)
(0.051582)
(0.031060)
(0.041023)
(0.050737)
Estimators without Measurement Error
10% non-response
20% non-response
2
4
8
2
4
8
0.531888
0.706251
0.880614
0.534662
0.714573
0.894483
0.847220
1.129459
1.411698
0.893275
1.267623
1.641971
(0.043594)
(0.054407)
(0.037339)
(0.057267)
(0.077194)
1.360452
1.797576
2.234700
1.458010
2.090249
2.722487
(0.013928)
(0.017060)
(0.011940)
(0.017360)
(0.022779)
0.546567
0.728539
0.910510
0.553723
0.750007
0.946291
(70.46203)
(88.11975)
(60.15501)
(92.51417)
(124.8733)
3.332912
4.403433
5.473954
3.663318
5.394650
7.125983
(0.172948)
(0.214997)
(0.147613)
(0.223090)
(0.298568)
0.499049
0.664792
0.830500
0.499692
0.666329
0.832772
1.780605
2.369676
2.958748
1.941566
2.852558
3.763550
(0.003230)
(0.003599)
(0.002798)
(0.003040)
(0.003283)
0.497680
0.662389
0.826776
0.498415
0.664262
0.829706
(0.064500)
(0.080376)
(0.047678)
(0.061733)
(0.075745)
0.497707
0.662436
0.826849
0.498447
0.664327
0.829816
(0.064504)
(0.080382)
(0.047681)
(0.061739)
(0.075755)
0.497708
0.662437
0.826850
0.498447
0.664328
0.829818
(0.064505)
(0.080382)
(0.047681)
(0.061739)
(0.075755)
0.222332
0.293731
0.363450
0.223663
0.296212
0.366048
(0.030276)
(0.037404)
(0.022712)
(0.029331)
(0.035698)
0.223172
0.295152
0.365602
0.224612
0.298063
0.369087
(0.030408)
(0.037603)
(0.022800)
(0.029499)
(0.035973)
In Table 6 the MSE for the generalized proposed estimator, when is 0.334101 for 10% non-response rate. When the non-response rate becomes 20%, the MSE for generalized proposed estimator increases to 0.335842. It is also observed that is less biased and is most biased among all other considered estimator. Table 6 shows the same pattern of results in case for no measurement error.
Through real data sets it is concluded that the generalized proposed estimator performs better as compared to the other existing estimators. For 10% non-response rate the MSE is minimum. The MSE also increases as the value of constant increases.
5 Conclusion
In the present study, we proposed a generalized class of estimators in estimating the finite population mean for the sensitive variable in the presence of measurement error and non-response under stratified random sampling. Through simulation study and real life data sets it is observed that the proposed class of estimators perform better than the existing estimators. The MSE values are generally smaller under 10% of non-response as compared to 20% of non-response, which are expected results. Generally as the non-response rate increases, MSE also increases. Based on numerical findings, it turns out that the generalized proposed class of estimators is more efficient as compared to the other existing estimators, under certain situations.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References
- A review of hot deck imputation for survey non-response. Int. Stat. Rev.. 2010;78(1):40-64.
- [Google Scholar]
- Joint influence of measurement error and non response on estimation of population mean. Commun. Statistics-Theory Methods. 2017;46(4):1679-1693.
- [Google Scholar]
- Ratio and product type exponential estimators. J. Inform. Optim. Sci.. 1991;12(1):159-164.
- [Google Scholar]
- Measurement Errors in Surveys. John Wiley & Sons; 2011.
- Ranked set sampling and optional scrambling randomized response modeling. Investigación Operacional. 2018;39(1):100-107.
- [Google Scholar]
- On efficacy of empirical bayes estimation of a finite population mean of a sensitive variable through randomized responses. Model Assisted Stat. Appl.. 2015;10(4):283-288.
- [Google Scholar]
- Sampling techniques. New York: John Wiley & Sons; 1977.
- New scrambled response models for estimating the mean of a sensitive quantitative character. J. Appl. Stat.. 2010;37(11):1875-1890.
- [Google Scholar]
- Scrambled randomized response methods for obtaining sensitive quantitative data. J. Stat. Planning inference. 1983;7(4):307-316.
- [Google Scholar]
- Estimation in the presence of measurement error. Int. Stat. Rev.. 1995;63(2):121-141.
- [Google Scholar]
- Application of the randomized response technique in obtaining quantitative data. J. Am. Stat. Assoc.. 1971;66(334):243-250.
- [Google Scholar]
- An improved estimator of the finite population mean in simple random sampling. Model Assisted Stat. Appl.. 2011;6(1):47-55.
- [Google Scholar]
- Sensitivity estimation for personal interview survey questions. Statistica. 2004;64(4):643-653.
- [Google Scholar]
- On improvement in estimating the population mean in simple random sampling. J. Appl. Stat.. 2008;35(5):559-566.
- [Google Scholar]
- Mean and sensitivity estimation in optional randomized response models. J. Stat. Planning Inference. 2010;140(10):2870-2874.
- [Google Scholar]
- Improved exponential type estimators of the mean of a sensitive variable in the presence of nonsensitive auxiliary information. Commun. Statistics-Simul. Comput.. 2016;45(9):3317-3328.
- [Google Scholar]
- The problem of non-response in sample surveys. J. Am. Stat. Assoc.. 1946;41(236):517-529.
- [Google Scholar]
- Estimation of finite population mean in stratified sampling using scrambled responses in the presence of measurement errors. Commun. Statistics-Theory Methods 2018:1-9.
- [Google Scholar]
- Estimation of population mean in sample surveys using auxiliary character, method of call backs and subsampling from non-respondents. Proceedings of the National Academy of Sciences, India Section A: Physical Sciences. 2013;83(1):49-54.
- [Google Scholar]
- Generalized two phase sampling estimators for the population mean in the presence of nonresponse. Aligarh Jouranal of Statistics. 2010;30:39-54.
- [Google Scholar]
- A stratified warner’s randomized response model. J. Stat. Planning Inference. 2004;120(1–2):155-165.
- [Google Scholar]
- Improved estimation of population mean in presence of non-response and measurement error. J. Stat. Theory Practice 2016 just-accepted
- [Google Scholar]
- Estimation of population mean in the presence of non-response and measurement error. Revista Colombiana de EstadÝstica. 2015;38(1):145-161.
- [Google Scholar]
- Ratio estimation with subsampling the nonrespondents. Survey Methodology. 1986;12(2):217-230.
- [Google Scholar]
- Fundamentals of biostatistics. Duxbury Press; 2015.
- A generalized class of estimators under two-phase stratified sampling for non response. Commun. Stat.-Theory Methods 2018:1-17.
- [Google Scholar]
- Some modified exponential-ratio type estimators in the presence of non-response under two-phase sampling scheme. Electronic J. Appl. Stati. Anal.. 2013;6(1):1-17.
- [Google Scholar]
- Ratio method of estimation in the presence of measurement errors. J. Indian Society Agric. Stat.. 1997;52:150-155.
- [Google Scholar]
- An estimator for mean estimation in presence of measurement error. Res. Rev.: A J. Stat.. 2012;1(1):1-8.
- [Google Scholar]
- Some imputation methods to compensate with non-response for estimation of population mean in two-occasion successive sampling. Commun. Statistics-Theory Methods. 2020;49(14):3329-3351.
- [Google Scholar]
- Estimation of mean in presence of non-response using two phase sampling scheme. Stat. Pap.. 2010;51(3):559-582.
- [Google Scholar]
- Combination of regression and ratio estimate in presence of nonresponse. Brazilian J. Prob. Stat.. 2011;25(2):205-217.
- [Google Scholar]
- Estimation of population mean when coefficient of variation is known using scrambled response technique. J. Stat. Planning Inference. 2005;131(1):135-144.
- [Google Scholar]
- Method of estimation in the presence of non-response and measurement errors simultaneously. J. Modern Appl. Stat. Methods. 2015;14(1):12.
- [Google Scholar]
- Randomized response: A survey technique for eliminating evasive answer bias. J. Am. Stat. Assoc.. 1965;60(309):63-69.
- [Google Scholar]
- Estimation of population mean in the presence of measurement error and non response under stratified random sampling. PloS one. 2018;13(2):e0191572
- [Google Scholar]
- Estimation of finite population mean for a sensitive variable using dual auxiliary information in the presence of measurement errors. PloS one. 2019;14(2):e0212111
- [Google Scholar]
Appendix A
Simplification of MSE
Squaring both sides of Eq. (25), and keeping the terms up to power two in errors, and then taking expectations, the MSE of is given by where .
Population
Population I.
, .
, .
, .
, .
, , , , , , , , , .
Population II.
, .
, .
, .
, .
, , , , , , , , , , .
Members of Generalized Proposed Class of Estimators for Different Choices of ( ) Members of the class of estimators by choosing different values of and are given below
1. For and in Eq. (20), the generalized proposed class of estimators reduces to usual mean estimator as:
2. For and in Eq. (20), the generalized proposed class of estimators reduces to usual ratio estimator:
3. For and in Eq. (20), the generalized proposed class of estimators reduces to usual product estimator:
4. For and in Eq. (20), the generalized proposed class of estimators reduces to Bahl and Tuteja, 1991 estimator:
5. For and in Eq. (20), the generalized proposed class of estimators reduces to Gupta and Shabbir, 2008 estimator:
6. For and in Eq. (20), the generalized proposed class of estimators reduces to Singh and Kumar, 2010 estimator:
7. For and in Eq. (20), the generalized proposed class of estimators reduces to Grover and Kaur, 2011 estimator:
8. For and in Eq. (20), the generalized proposed class of estimators reduces to difference estimator:
9. For
and
in Eq. (20), the generalized proposed class of estimators
reduces to Khalil et al., 2018 estimator given by,
10. For
and
in Eq. (20), the generalized proposed estimator
reduces to the proposed estimator.