7.2
CiteScore
3.7
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
ABUNDANCE ESTIMATION IN AN ARID ENVIRONMENT
Case Study
Correspondence
Corrigendum
Editorial
Full Length Article
Invited review
Letter to the Editor
Original Article
Research Article
Retraction notice
REVIEW
Review Article
SHORT COMMUNICATION
Short review
7.2
CiteScore
3.7
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
ABUNDANCE ESTIMATION IN AN ARID ENVIRONMENT
Case Study
Correspondence
Corrigendum
Editorial
Full Length Article
Invited review
Letter to the Editor
Original Article
Research Article
Retraction notice
REVIEW
Review Article
SHORT COMMUNICATION
Short review
View/Download PDF

A generalized class of estimators for sensitive variable in the presence of measurement error and non-response under stratified random sampling

Department of Applied Mathematics and Statistics, Institute of Space Technology, Islamabad, Pakistan
Department of Statistics, Quaid-i-Azam University, Islamabad, Pakistan
Department of Statistics, University of Tabuk, Saudi Arabia

⁎Corresponding author. erumzahid22@gmail.com (Erum Zahid)

Disclaimer:
This article was originally published by Elsevier and was migrated to Scientific Scholar after the change of Publisher.

Peer review under responsibility of King Saud University.

Abstract

In survey sampling an investigator may be unable to get the complete and correct information at the same time. So non-response and measurement error occur simultaneously and consequently may effect the estimator. Considering this problem, a generalized class of estimators is proposed for estimating the finite population mean for sensitive variable in the presence of measurement error and non-response under stratified random sampling. We conducted a study based on real data set at Quaid-i-Azam University, Islamabad. Simulation and real life data sets are used to observe the performances of the estimators. Bias and MSE values are given for the comparison of estimators.

Keywords

Auxiliary variable
Measurement error
Non-response
Randomized response
Stratified random sampling
1

1 Introduction

In survey sampling, certain surveys cause some problems for the researchers due to the fact that the respondents are reluctant to discuss sensitive topics such as drug use, abortion, sexually transmitted diseases etc. When surveying on those topics, measurement error and non-response can occur since the respondent may choose, not to respond some specific questions, not to give the accurate answers, or not to take part in the survey. The problem of measurement error is usually ignored during the sensitive surveys and the assumption is made that the information obtained through surveys is free from error. Another important factor in surveys is non-response, which may arises due to refusal of respondents to give the information or not at home or lack of interest due to some sensitive issues. Usually measurement error and non-response are studied separately for the sensitive variable using the known auxiliary or additional information. In reality, when the variable of interest is sensitive, the respondents hesitate to provide the personal information, which give rise to measurement error. In most of the cases, the information is not obtained from all units in surveys, specially when the variable of interest is stigmatizing in nature. Many researchers studied the problem of non-response, including (Hansen and Hurwitz, 1946; Cochran, 1977; Rao, 1986; Khare and Srivastava, 2010; Andridge and Little, 2010; Singh et al., 2011; Khare et al., 2013; Shabbir and Khan, 2013; Shabbir et al., 2018 and Singh and Khalid, 2020). In survey sampling, when the variable under study contains social stigma, then the respondents are not comfortable to provide their personal information. Direct survey on sensitive question increases the relative bias. Warner (1965) introduced the randomized response technique (RRT), which reduces the possible bias and is used to obtain the true information while insuring the privacy of the respondents. For estimation of mean of a sensitive quantitative variable the Randomized Response model (RRM) is extended by Greenberg et al. (1971). Further work is done by Eichhorn and Hayre (1983); Gupta and Shabbir (2004), Kim and Warde (2004); Singh and Mathur (2005), Gjestvang and Singh (2006); Diana and Perri (2010), Gupta et al. (2010); Chaudhuri and Pal (2015), Gupta et al. (2016) and Bouza et al. (2018).

The researchers dealt with the problem of measurement error for estimating the population mean. For more details, see Cochran (1968); Fuller (1995); Shalabh (1997); Biemer et al. (2011); Shukla et al. (2012), etc. Recently few researchers studied the problem of measurement error and non-response together like Kumar et al. (2015); Singh and Sharma (2015); Azeem and Hanif (2017) and Kumar (2016). Zahid and Shabbir (2018); Khalil et al. (2018) and Zahid and Shabbir (2019) have discussed the problem of measurement error and non-response under stratified random sampling.

In practice, the researchers who have studied measurement error, have ignored the presence of non-response and randomized response at the same time. In this study, we have proposed a class of estimators for estimating the population mean of a sensitive variable in the presence of measurement error and non-response simultaneously under stratified random sampling. The efficiency of the suggested class of estimators over the existing estimators is shown through simulation study and real data sets.

Consider a finite population of N identifiable units which are partitioned into L homogeneous subgroups called strata, such that the hth strata consist of Nh units, where h=1,2,,L and h=1LNh=N . It is assumed that N consists of two mutually exclusive groups called response and non-response groups. Let N1h and N2h are the responding and non-responding units in the hth stratum respectively. We select a sample of size nh from Nh by using simple random sampling without replacement (SRSWOR) and assume that n1h units respond and n2h units do not respond. We select a sub-sample of size kh,kh=n2hgh,gh>1 from n2h non-responding units in the hth stratum.

Let (zhi,yhi,xhi,rxhi) be the observed values and (Zhi,Yhi,Xhi,Rxhi) be the actual values of the ith(i=1,2,,n) sampled units in the hth stratum. Let rxhi , and Rxhi be the corresponding ranks of xhi and Xhi respectively, then the measurement errors be.

Qhi=zhi-Zhi,Vhi=xhi-Xhi and Thi=rxhi-Rxhi .

Let ShZ2,ShX2 and ShRx2 be the population variances for the responding units and ShZ(2)2,ShX(2)2 and ShRx(2)2 be the population variances for non-responding units. Let ShQ2,ShV2 and ShT2 be the population variances associated with the measurement error for responding units. Let ShQ(2)2,ShV(2)2 and ShT(2)2 be the population variances associated with measurement error for the non-responding part of the population. Let ρhZX,ρhZRx,ρhXRx be the coefficients of correlation, between their subscripts for respondents and ρhZX(2),ρhZRx(2),ρhXRx(2) be the coefficients of correlation, between their subscripts for non-respondents in the population.

In Section 2, some existing estimators of the finite population mean are given. In Section 3, a generalized class of estimators is suggested for estimating the finite population mean by incorporating both measurement error and non-response information simultaneously. Numerical results and simulation study are presented in Section 4. Conclusion is given in Section 5.

2

2 Existing Estimators in Literature

In this section we consider the following existing estimators.

2.1

2.1 Hansen and Hurwitz (1946) Estimator

In stratified random sampling, the Hansen and Hurwitz (1946) estimator for population mean Y , is given by

(1)
yS(HH)*=h=1LPhzh, where zh=(n1hnh)zn1h+(n2hnh)zkh and Ph=NhN .

Here zn1h=1n1hi=1n1hzhi and zkh=1khi=1khyhi are the sample means based on n1h of responding and kh units of sub-samples from n2h non-responding groups, respectively.

The variance of yS(HH)* , is given by

(2)
Var(yS(HH)*)=h=1LPh2Ah*, where Ah*=λ2h(ShZ2+ShQ2)+θh(ShZ(2)2+ShQ(2)2) , θh=P2h(gh-1)nh,P2h=N2hNh ,

λ2h=(nh-1-Nh-1) .

2.2

2.2 Ratio Estimator

The usual ratio estimator under stratified random sampling, is given by

(3)
yS(R)*=h=1LPhzhxhXh, where Xh=1Nhi=1Nhxhi is known population mean and xh=Xh+1nh(δhX+δhV) be the sample mean, given in Eq. (22).

The bias and MSE of yS(R)* , are given by

(4)
B(yS(R)*)h=1LPhXh[RhBh*-Ch*] and
(5)
MSE(yS(R)*)h=1LPh2[Ah*+Rh2Bh*-2RhCh*],
where

Rh=ZhXh ,

Bh*=λ2h(ShX2+ShV2)+θh(ShX(2)2+ShV(2)2) ,

Ch*=λ2hρhYXShYShX+θhρhYX(2)ShY(2)ShX(2) .

2.3

2.3 Product Estimator

The product estimator under stratified random sampling, is given by

(6)
yS(Pr)*=h=1LPhzhxhXh.

The bias and MSE of yS(Pr) , are given by

(7)
B(yS(Pr)*)h=1LPhCh*Xh and
(8)
MSE(yS(Pr)*)h=1LPh2[Ah*+Rh2Bh*+2RhCh*].

2.4

2.4 Bahl and Tuteja, 1991 Estimator

Bahl and Tuteja, 1991 estimator under stratified random sampling, is given by

(9)
yS(BT)*=h=1LPhzhexpXh-xhXh+xh.

The bias and MSE of yS(BT) , are given by

(10)
B(yS(BT)*)h=1LPh1Xh3RhBh*8-Ch*2 and
(11)
MSE(yS(BT)*)h=1LPh2Ah*+Rh2Bh*4-RhCh*.

2.5

2.5 Singh and Kumar, 2010 Estimator

Singh and Kumar, 2010 estimator under stratified random sampling, is given by

(12)
yS(SK)*=h=1LPhzhXhxh2.

The bias and MSE of yS(SK) , are given by

(13)
B(yS(SK)*)h=1LPh1Xh(3RhBh*-2Ch*) and
(14)
MSE(yS(SK)*)h=1LPh2[Ah*+4Rh2Bh*-4RhCh*].

2.6

2.6 Difference Estimator

The usual difference estimator under stratified random sampling, is given by

(15)
yS(D)*=h=1LPh[yh+dh(Xh-xh*)], where xh*=NhXh-nhxhNh-nh and dh is the constant.

The minimum variance of yS(D) , is given by

(16)
Var(yS(D)*)min=h=1LPh2Ah*-Ch*2Bh*.

The optimum value of dh is dh(opt)=-Ch*thBh* , where th=nhNh-nh .

2.7

2.7 Azeem and Hanif (2017) Estimator

Azeem and Hanif (2016) estimator under stratified random sampling, is given by

(17)
yS(AH)*=h=1LPhyhxh*Xhxh*xh.

The bias and MSE of yS(AH)* , are given by

(18)
B(yS(AH)*)h=1LPhXh[th2RhBh*-qhCh*] and
(19)
MSE(yS(AH)*))h=1LPh2[Ah*+qh2Rh2Bh*-2qhRhCh*],
where qh=Nh+nhNh-nh .

3

3 Proposed Generalized Class of Estimators

We suggest a generalized class of estimators for estimating the finite population mean for a sensitive variable, considering the problem of measurement error and non-response simultaneously in stratified random sampling. Measurement error and non-response are present on both the study variable and the auxiliary variable. The suggested estimator, is given by

(20)
yS(GP)*=h=1LPhm1hzhXhxh*α1+m2h(Xh-xh*)Xhxh*α2+m3h(Rxh-rxh)Xhxh*α3exp(1-α0)Xh-xh*Xh+xh*, where, m1h,m2h and m3h are constants whose values are to be determined, and αr(r=0,1,2,3) are the scalars, chosen arbitrary. For obtaining the bias and MSE, we assume that

δhZ=i=1nh(Yhi-Yh) , δhU=i=1nhhUhi ,

δhX=i=1nh(Xhi-Xh) , δhV=i=1nhhVhi ,

δRhx=i=1n(Rxhi-Rxh) , δT=i=1nhThi .

Adding δhY and δhU , we get.

δhZ+δhU=i=1nh(Zhi-Zh)+i=1nhUhi .

Dividing both sides by nh , and then simplifying, we get

(21)
zh=Zh+1nh(δhY+δhU).

Similarly, we can get

(22)
xh=Xh+1nh(δhX+δhV) and
(23)
rxh=Rxh+1nh(δhRx+δhT).

Further

EδhZ+δhQnh2=λ2h(ShZ2+ShQ2)+θh(ShZ(2)2+ShQ(2)2)=Ah* ,

EδhX+δhVnh2=λ2h(ShX2+ShV2)+θh(ShX(2)2+ShV(2)2)=Bh* ,

EδhRx+δhTnh2=λ2h(ShRx2+ShT2)+θh(ShRx(2)2+ShT(2)2)=Dh* ,

EδhZ+δhQnhδhX+δhVn=λ2hρhZXShZShX+θhρhZX(2)ShZ(2)ShX(2)=Ch* ,

EδhZ+δhQnhδhRx+δhTn=λ2hρhZRxShZShRx+θhρhZRx(2)ShZ(2)ShRx(2)=Eh* ,

EδhX+δhVnhδhRx+δhTnh=λ2hρhXRxShXShRx+θhρhXRx(2)ShX(2)ShRx(2)=Fh* .

On simplifying, we get

(24)
yS(GP)*=h=1LPhm1hZh+WhZ+e*RhthWhX+f*th2RhWhX2Xh+e*thWhXWhZXh+m2hthWhX+d*th2WhX2Xh+m3hthWhRx+c*thWhXWhRxXh+b*th2WhRx2Rxh, where

b*=α3 ,

c*=1-α02 ,

d*=α2+1-α02 ,

e*=α1+1-α02 , and.

f*=α02-4α0+38+α1(2-α0+α1)2 .

WhZ=δZ+δQn,WhX=δX+δVn and WhRx=δRx+δTn .

Further simplifying, and ignoring error terms greater than two, we have

(25)
yS(GP)*-Z=h=1LPh(m1h-1)Zh+m2hthWhX+d*th2WhX2Xh+m1hWhY+e*RhthWhX+f*th2RhWhX2+e*thWhXWhYXh+m3hthWhRx+c*thWhXWhRxXh+b*th2WhRx2Rxh.

Using Eq. (25), the bias and MSE of yS(GP)* to first order of approximation, is given by

(26)
B(yS(GP)*)h=1LPh(m1h-1)Zh+m1hf*th2RhBhXh+e*thChXh+m2hd*th2BhXh+m3hc*thFhXh+b*th2DhRxh and
(27)
MSE(yS(GP)*)h=1LPh2[Zh2+m1h2Ah1*+m2h2Bh1*+2m1hm2hCh1*-2m1hDh1*-2m2hEh1*+m3h2Fh1*+2m1hm3hGh1*+2m2hm3hHh1*-2m3hIh1*]
where,

Ah1*=Zh2+Ah+e*2th2Rh2Bh+4e*thRhCh+2f*th2Rh2Bh ,

Bh1*=th2Bh ,

Ch1*=thCh+th2RhBh(e*+d*) ,

Dh1*=Zh2+e*thRhCh+f*th2Rh2Bh ,

Eh1*=d*th2RhBh ,

Fh1*=th2Dh ,

Gh1*=c*thRhFh+e*th2RhFh+thEh+b*th2R1hDh ,

Hh1*=th2Fh ,

Ih1*=c*thRhFh+b*th2R1hDh .

For finding the optimal values of m1h,m2h and m3h , we differentiate Eq. (27) with respect to m1h,m2h and m3h respectively. The optimal values, are given by.

m1h(opt)=Bh1*Dh1*Fh1*-Ch1*Eh1*Fh1*+Eh1*Gh1*Hh1*-Dh1*Hh1*2-Bh1*Gh1*Ih1*+Ch1*Hh1*Ih1*Ah1*Bh1*Fh1*-Ch1*2Fh1*+2Ch1*Gh1*Hh1*-Ah1*Hh1*2 ,

m2h(opt)=Ah1*Eh1*Fh1*-Ch1*Dh1*Fh1*-Eh1*Gh1*2+Dh1*Gh1*Hh1*+Ch1*Gh1*Ih1*-Ah1*Hh1*Ih1*Ah1*Bh1*Fh1*-Ch1*2Fh1*+2Ch1*Gh1*Hh1*-Ah1*Hh1*2 , and.

m3(opt)=Ch1*Eh1*Gh1*-Bh1*Dh1*Gh1*+Ch1*Dh1*Hh1*-Ah1*Eh1*Hh1*+Ah1*Bh1*Ih1*-Ch1*2Ih1*Ah1*Bh1*Fh1*-Ch1*2Fh1*+2Ch1*Gh1*Hh1*-Ah1*Hh1*2 .

Substituting these optimum values in Eq. (27), we get the minimum MSE of yS(GP)* as:

(28)
MSE(yS(GP)*)minh=1LPh2Zh2-Lh1*Lh2*, where

Lh1*=Ah1*Eh1*2Fh1*-2Ch1*Dh1*Eh1*Fh1*-Eh1*2Gh1*2+2Dh1*Eh1*Gh1*Hh1*-Dh1*2Hh1*2+2Ch1*Eh1*Gh1*Ih1*+2Ch1*Dh1*Hh1*Ih1*-2Ah1*Eh1*Hh1*Ih1*-Ch1*2Ih1*2+Bh1*Dh1*2Fh1*-2Bh1*Dh1*Gh1*Ih1*+Bh1*Ah1*Ih1*2 and.

Lh2*=Ah1*Bh1*Fh1*-Ch1*2Fh1*+2Ch1*Gh1*Hh1*-Ah1*Hh1*2-Bh1*Gh1*2 .

4

4 Numerical Results

In this section simulated data and two real data sets are used to show the performance of the generalized class of proposed estimator. The results are given in Tables 1, 2 (simulation) and 5, 6 (real data).

Table 1 Mean squared error and Bias (in brackets) values of different estimators for Population I with and without measurement error.
Estimators with Measurement Error 10% non-response 20% non-response
gh gh
2 4 8 2 4 8
yS(HH)* 0.107125 0.128072 0.169966 0.117775 0.160022 0.244517
yS(R)* 0.030971 0.037839 0.051576 0.036732 0.055122 0.091904
(0.026764) (0.034223) (0.049143) (0.034202) 0.056537) (0.101206)
yS(Pr)* 0.458469 0.555397 0.749251 0.517367 0.732089 1.161533
(0.087794) (0.106418) (0.143665) (0.098841) (0.139558) (0.220992)
yS(BT)* 0.034649 0.040819 0.053159 0.037435 0.049176 0.072660
(0.065539) (0.069173) (0.076441) (0.053816) (0.034004) (0.005620)
yS(SK)* 1.085004 1.319813 1.789432 1.235507 1.771322 2.842953
(0.157364) (0.193539) (0.265890) (0.183396) (0.271635) (0.448113)
yS(D)* 0.022166 0.026426 0.034858 0.024615 0.033426 0.050598
yS(AH)* 0.096090 0.119132 0.165217 0.115666 0.177860 0.302249
(0.103052) (0.124466) (0.167295) (0.115001) (0.160313) (0.250939)
α=0,yS(P1)* 0.022067 0.026286 0.034616 0.024492 0.033201 0.050087
(0.018672) (0.022202) (0.029160) (0.020721) (0.028004) (0.042072)
α=1,yS(P1)* 0.022075 0.026298 0.034638 0.024503 0.033223 0.050141
(0.018679) (0.022213) (0.029180) (0.020731) (0.028023) (0.042119)
α=-1,yS(P1)* 0.022076 0.026299 0.034638 0.024503 0.033223 0.050143
(0.018679) (0.022213) (0.029180) (0.020731) (0.028024) (0.042121)
αr=1,r=0,1,2,3yS(GP)* 0.021993 0.026179 0.034423 0.024394 0.033006 0.049605
(0.018608) (0.022109) (0.028993) (0.020636) (0.027834) (0.041651)
α0=0,α1,2,3=1yS(GP)* 0.022057 0.026230 0.034607 0.024428 0.033176 0.050079
(0.018698) (0.022240) (0.029231) (0.020752) (0.028070) (0.042240)
Estimators without Measurement Error 10% non-response 20% non-response
gh gh
2 4 8 2 4 8
yS(HH)* 0.096942 0.115829 0.153602 0.106425 0.144277 0.219980
yS(R)* 0.004814 0.006475 0.009796 0.007385 0.014187 0.027792
(0.013029) (0.017786) (0.027300) (0.018722) (0.034866) (0.067153)
yS(Pr)* 0.432313 0.524032 0.707471 0.488020 0.691154 1.097421
(0.087794) (0.106418) (0.143665) (0.098841) (0.139558) (0.220992)
yS(BT)* 0.020473 0.023796 0.030441 0.021586 0.027134 0.038230
(0.141554) (0.160285) (0.197747) (0.139441) (0.153945) (0.182954)
yS(SK)* 1.010927 1.231085 1.671403 1.152171 1.654819 2.660115
(0.138073) (0.170463) (0.235244) (0.161654) (0.241207) (0.400314)
yS(D)* 0.001424 0.001860 0.002574 0.002043 0.003129 0.004569
yS(AH)* 0.049966 0.063866 0.091666 0.063824 0.105438 0.188667
(0.106485) (0.128576) (0.172756) (0.118871) (0.165731) (0.259452)
α=0,yS(P1)* 0.001423 0.001858 0.002570 0.002040 0.003124 0.004558
(0.001219) (0.001607) (0.002228) (0.001801) (0.002785) (0.004044)
α=1,yS(P1)* 0.001424 0.001859 0.002572 0.002042 0.003127 0.004564
(0.001220) (0.001608) (0.002230) (0.001802) (0.002787) (0.004050)
α=-1,yS(P1)* 0.001424 0.001859 0.002572 0.002042 0.003127 0.004564
(0.001220) (0.001608) (0.00223) (0.001802) (0.002787) (0.004050)
αr=1,r=0,1,2,3yS(GP)* 0.001409 0.001837 0.002530 0.002019 0.003079 0.004440
(0.001207) (0.001589) (0.002193) (0.001783) (0.002745) (0.003941)
α0=0,α1,2,3=1yS(GP)* 0.001422 0.001858 0.002571 0.002040 0.003126 0.004564
(0.001219) (0.001607) (0.002229) (0.001801) (0.002787) (0.004051)
Table 2 Mean squared error and Bias (in brackets) values of different estimators for Population II with and without measurement error
Estimators 10% non-response 20% non-response
gh gh
2 4 8 2 4 8
yS(HH)* 0.089797 0.108185 0.144961 0.098465 0.135429 0.209356
yS(R)* 0.013224 0.016604 0.023364 0.014699 0.021331 0.034594
(0.000313) (0.000378) (0.001761) (0.003640) (0.004433) (0.006020)
yS(Pr)* 0.320038 0.387672 0.522939 0.334810 0.460273 0.711199
(0.074086) (0.089385) (0.119983) (0.081194) (0.111309) (0.171537)
yS(BT)* 0.032302 0.038906 0.052115 0.037509 0.052036 0.081090
(0.180648) (0.209399) (0.266902) (0.235166) (0.319488) (0.488131)
yS(SK)* 0.703946 0.855063 1.157297 0.723734 0.995864 1.540122
(0.067164) (0.081340) (0.109694) (0.063355) (0.087213) (0.134930)
yS(D)* 0.012522 0.015889 0.022418 0.014072 0.020597 0.033554
yS(AH)* 0.027173 0.034466 0.049053 0.024482 0.035358 0.057109
(0.089356) (0.107659) (0.144267) (0.098718) (0.135291) (0.208437)
α=0,yS(P1)* 0.012476 0.015816 0.022274 0.014014 0.020474 0.033231
(0.012576) (0.015866) (0.022247) (0.014418) (0.020942) (0.033839)
α=1,yS(P1)* 0.012477 0.015820 0.022281 0.014016 0.020480 0.033245
(0.012578) (0.015868) (0.022253) (0.014420) (0.020948) (0.033853)
α=-1,yS(P1)* 0.012478 0.015821 0.022282 0.014017 0.020481 0.033246
(0.012579) (0.015869) (0.022254) (0.014421) (0.020949) (0.033854)
αr=1,r=0,1,2,3yS(GP)* 0.012455 0.015786 0.022218 0.013989 0.020426 0.033115
(0.012556) (0.015835) (0.022191) (0.014392) (0.020892) (0.033718)
α0=0,α1,2,3=1yS(GP)* 0.012474 0.015809 0.022230 0.014013 0.020464 0.033224
(0.012585) (0.015879) (0.022273) (0.014429) (0.020964) (0.033895)
Estimators without Measurement Error 10% non-response 20% non-response
gh gh
2 4 8 2 4 8
yS(HH)* 0.079750 0.095863 0.128091 0.087280 0.119695 0.184526
yS(R)* 0.001954 0.002369 0.003199 0.002038 0.002794 0.004305
(0.001402) (0.001226) (0.005081) (0.007060) (0.011019)
yS(Pr)* 0.308768 0.373437 0.502774 0.322149 0.441736 0.680909
(0.089385) (0.119983) (0.081194) (0.111309) (0.171537)
yS(BT)* 0.021949 0.026106 0.034421 0.025956 0.035602 0.054895
(0.222047) (0.288625) (0.245101) (0.338267) (0.524598)
yS(SK)* 0.689010 0.835090 1.127248 0.706644 0.968914 1.493455
(0.066129) (0.079822) (0.107208) (0.062190) (0.085148) (0.131064)
0.001194 0.001576 0.002205 0.001210 0.001686 0.002635
yS(D)* 0.014726 0.018444 0.025880 0.010445 0.014281 0.021953
(0.108009) (0.144843) (0.099002) (0.135796) (0.209385)
yS(AH)* 0.001190 0.001572 0.002203 0.001205 0.001682 0.002625
(0.001574) (0.002202) (0.001256) (0.001760) (0.002759)
α=0,yS(P1)* 0.001191 0.001573 0.002204 0.001206 0.001683 0.002626
(0.001575) (0.002203) (0.001257) (0.001761) (0.002760)
α=1,yS(P1)* 0.001192 0.001574 0.002205 0.001207 0.001684 0.002627
(0.001576) (0.002204) (0.001258) (0.001762) (0.002760)
αr=1,r=0,1,2,3yS(GP)* 0.001184 0.001564 0.002189 0.001200 0.001673 0.002604
(0.001566) (0.002189) (0.001251) (0.001751) (0.002737)
α0=0,α1,2,3=1yS(GP)* 0.001189 0.001570 0.002202 0.001204 0.001680 0.002622
(0.001573) (0.002200) (0.001254) (0.001759) (0.002760)

4.1

4.1 Simulation Study

We have generated two populations (Population I and II) from normal distribution by using R language program, which are given in Appendix A. The results based on these population are given in Tables 1 and 2.

Tables 1 and 2 show that the generalized class of proposed estimators yS(GP)* perform better than other existing estimators for both with and without measurement errors. The values of the absolute biases are given in brackets. In Table 1 the MSE for the generalized proposed estimator, when αr=1,r=0,1,2,3 is 0.021993 for 10% of non-response rate. When the non-response rate increases to 20%, the MSE for generalized proposed estimator increases to 0.024394. It is also observed that yS(P1)* is less biased and yS(SK)* is highly biased among all other considered estimator. Table 1 shows the same pattern of results for the case of no measurement error.

In Table 2 the MSE for the generalized proposed estimator, when αr=1,r=0,1,2,3 is 0.012455 for 10% non-response rate. When the non-response rate increases to 20%, the MSE for generalized proposed estimator increases to 0.013989. It is also observed that yS(R)* is less biased and yS(BT)* is highly biased among all other considered estimator. Table 2 shows the same pattern of results for the case of no measurement error.

Through the simulation study it is concluded that the generalized proposed class of estimators perform better as compared to the all other existing estimators. For 10% non-response rate, the MSE is minimum as compared to 20% of the non-response rate. The MSE also increases as the value of constant gh increases.

4.2

4.2 Application to Real Data Set

In this section we consider two real life data sets for numerical comparisons, Population III is taken from Rosner, 2015, Population IV is obtained by conducting a survey at Quaid-i-Azam University, Islamabad 4.2.1. The results based on these data sets are given in Tables 5 and 6.

Population III. [Source:Rosner, 2015].

Strata I consist of 318 observations and strata II contains 336 observations. The data summary is given in Tables 3 and 4.

Table 3 Data summary of Strata I.
Variable Mean st.Dev Min Med Max
Forced expiratory volume ( Y1 ) 2.45 0.65 0.79 2.48 3.83
Age ( X1 ) 9.84 2.93 3.00 10.00 19.00
Smoke ( S1 ) 0,1 0.12 0.32 0.00 0.00 1.00
Table 4 Data summary of Strata II.
Variable Mean st.Dev Min Med Max
Forced expiratory volume ( Y2 ) 2.68 1.00 0.79 2.61 5.79
Age ( X2 ) 10.01 2.97 3.00 10.00 19.00
Sex ( S2 )0,1 0.07 0.27 0.00 0.00 1.00

ρ1XY=0.7564,ρ1XRx=0.7831 and ρ1YRx=0.6151 .

ρ2XY=0.8109,ρ2XRx=0.7765 and ρ2YRx=0.6575 .

4.2.1

4.2.1 Data Collection

To see the practical implication of measurement error, we conducted a study based on real data set at Quaid-i-Azam University, Islamabad. We distributed 55 questionnaires to the students of BS Statistics (5th Semester Fall, 2018) and M.Phil Statistics (1st and 2nd Semesters, Fall 2018) of Quaid-i-Azam University, Islamabad. We consider our population of those students who gave the false response, which comes out to be 23. As we already have the true response from their academic record. In question (i) we asked for Y = Age, X = Marks in A level or Intermediate (in percentage). In question (ii) S = Social media effects the academic result is asked, where Y is the study variable, X is the auxiliary variable and S is the scrambling response variable. We have 23 students ( N=23 ), including 8 male students and 15 female students who gave the false response.

Population IV. [Source: Section 4.2.1].

Let, Y: Age of BS 5th and Mphil Students of Statistics department, X: Marks in A level or Intermediate,    S:Social media effects on the academic result

NumberofStrata=2 (Male and Female).

N1=8,N2=15,Z1=24.25,Z2=23.53,X1=54.25,X2=63.67,Rx1=4.5,Rx2=8,S1Z2=6.39,SZY2=3.75,S1X2=73.36,S2X2=76.95238,S1Rx2=6,S2Rx2=20,ρ1ZX=0.37740,ρ2ZX=0.15107,ρ1XRx=-0.25875,ρ2XRx=-0.10014,ρ1ZRx=-0.69314,ρ2ZRx=-0.62315 .

Tables 5 and 6 show that the generalized class of proposed estimators yS(GP)* perform better than other existing estimators for both with and without measurement error. The values of the absolute biases are given in brackets in the tables. In Table 5 the MSE for the generalized proposed estimator, when αr=1,r=0,1,2,3 is 0.004734 for 10% non-response rate. When the non-response rate becomes 20%, the MSE for generalized proposed estimator increases to 0.005848. It is also observed that yS(GP)* is less biased and yS(SK)* is highly biased among all other considered estimator. Table 5 shows the same pattern of results in case for no measurement error.

Table 5 Mean squared error and Bias (in brackets) values of different estimators for Population III with and without measurement error
Estimators 10% non-response 20% non-response
gh gh
2 4 8 2 4 8
yS(HH)* 0.009864 0.011976 0.016201 0.012941 0.017808 0.027542
yS(R)* 0.009932 0.012202 0.016742 0.009053 0.012444 0.019226
(0.003997) (0.004906) (0.006722) (0.003688) (0.005117) (0.007975)
yS(Pr)* 0.037387 0.045484 0.061677 0.044100 0.060841 0.094323
(0.004802) (0.005828) (0.007881) (0.006228) (0.008600) (0.013344)
yS(BT)* 0.006449 0.007872 0.010719 0.007588 0.010418 0.016076
(0.005316) (0.006551) (0.009022) (0.003434) (0.004720) (0.007291)
yS(SK)* 0.092504 0.112726 0.153171 0.102530 0.141543 0.219569
(0.009534) (0.011631) (0.015826) (0.010642) (0.014727) (0.022896)
yS(D)* 0.006262 0.007627 0.010355 0.006917 0.009419 0.014419
yS(AH)* 0.016593 0.020397 0.028004 0.014191 0.019546 0.030256
(0.004941) (0.005987) (0.008080) (0.006670) (0.009207) (0.014279)
α=0,yS(P1)* 0.006252 0.007612 0.010329 0.006904 0.009395 0.014362
(0.004152) (0.005037) (0.006804) (0.004963) (0.006760) (0.010341)
α=1,yS(P1)* 0.006252 0.007613 0.010329 0.006904 0.009395 0.014362
(0.004153) (0.005038) (0.006804) (0.004963) (0.006760) (0.010342)
α=-1,yS(P1)* 0.006252 0.007613 0.010329 0.006904 0.009396 0.014363
(0.004153) (0.005038) (0.006804) (0.004963) (0.006760) (0.010342)
αr=1,r=0,1,2,3yS(GP)* 0.004734 0.005749 0.007771 0.005848 0.007960 0.012160
(0.003277) (0.003966) (0.005342) (0.004165) (0.005672) (0.008669)
α0=0,α1,2,3=1yS(GP)* 0.004737 0.005752 0.007777 0.005851 0.007966 0.012176
(0.003278) (0.003969) (0.005346) (0.004168) (0.005677) (0.008681)
Estimators without Measurement Error 10% non-response 20% non-response
gh gh
2 4 8 2 4 8
yS(HH)* 0.008595 0.010446 0.014149 0.010151 0.014012 0.021732
yS(R)* 0.001193 0.001468 0.002018 0.000905 0.001243 0.001920
(0.000333) (0.000407) (0.000556) (0.000353) (0.000485) (0.000750)
yS(Pr)* 0.028648 0.034750 0.046953 0.035952 0.049640 0.077016
(0.004802) (0.005828) (0.007881) (0.006228) (0.008600) (0.013344)
yS(BT)* 0.003313 0.004041 0.005499 0.003459 0.004770 0.007392
(0.004884) (0.005936) (0.008042) (0.006588) (0.009102) (0.014129)
yS(SK)* 0.061352 0.074378 0.100428 0.078308 0.108130 0.167773
(0.004815) (0.005841) (0.007893) (0.006268) (0.008659) (0.013441)
yS(D)* 0.001144 0.001407 0.001934 0.0008737 0.001201 0.001856
yS(AH)* 0.001604 0.001962 0.002677 0.001586 0.002183 0.003379
(0.005709) (0.006930) (0.009371) (0.007385) (0.010198) (0.015823)
α=0,yS(P1)* 0.001143 0.001407 0.001933 0.000873 0.001200 0.001855
(0.000677) (0.000833) (0.001145) (0.000651) (0.000893) (0.001378)
α=1,yS(P1)* 0.001143 0.001407 0.001933 0.000873 0.001201 0.001855
(0.000677) (0.000833) (0.001145) (0.000651) (0.000893) (0.001378)
α=-1,yS(P1)* 0.001143 0.001407 0.001933 0.000873 0.001201 0.001855
(0.000677) (0.000833) (0.001145) (0.000651) (0.000893) (0.001378)
αr=1,r=0,1,2,3yS(GP)* 0.001060 0.001307 0.001800 0.000822 0.001128 0.001742
(0.000631) (0.000777) (0.001069) (0.000611) (0.000839) (0.001293)
α0=0,α1,2,3=1(yS(GP)* 0.001063 0.001309 0.001801 0.000824 0.001129 0.001743
(0.000631) (0.000777) (0.001070) (0.000611) (0.000839) (0.001294)
Table 6 Mean squared error and Bias (in brackets) values of different estimators for Population IV with and without measurement error.
Estimators 10% non-response 20% non-response
gh gh
2 4 8 2 4 8
yS(HH)* 0.630021 0.837808 1.045595 0.630686 0.839804 1.048922
yS(R)* 1.921722 2.793240 3.464758 2.121722 3.170594 4.119467
(0.130760) (0.157226) (0.104294) (0.148446) (0.192598)
yS(Pr)* 2.506471 3.404393 4.302315 2.735328 4.090965 5.446601
(0.013928) (0.017060) (0.011940) (0.017360) (0.022779)
yS(BT)* 0.906671 1.228910 1.551149 0.945071 1.344110 1.743148
(307.1653) (395.3818) (248.3003) (395.2198) (542.1394)
yS(SK)* 7.622591 10.43603 13.24947 8.484520 13.02182 17.55912
(0.567979) (0.727644) (0.461703) (0.728147) (0.994592)
yS(D)* 0.619716 0.824887 1.029976 0.619579 0.824426 1.029194
4.905149 6.754218 8.603288 5.446953 8.379632 11.31231
(0.030936) (0.044995) (0.059054) (0.035498) (0.058680) (0.081861)
α=0,yS(P1)* 0.617825 0.821560 1.024813 0.617755 0.821341 1.024502
(0.076580) (0.095337) (0.056926) (0.074051) (0.091135)
α=1,yS(P1)* 0.617948 0.821788 1.025177 0.617894 0.821628 1.024985
(0.076603) (0.096939) (0.064079) (0.091183) (0.145374)
α=-1,yS(P1)* 0.617950 0.821792 1.025185 0.617896 0.821632 1.024994
(0.076603) (0.095375) (0.056940) (0.074080) (0.091184)
αr=1,r=0,1,2,3(yS(GP)* 0.334101 0.444233 0.552328 0.335842 0.447262 0.554985
(0.040907) (0.050769) (0.030738) (0.040354) (0.049608)
α0=0,α1,2,3=1yS(GP)* 0.336942 0.449357 0.560385 0.339090 0.453985 0.566332
(0.041422) (0.051582) (0.031060) (0.041023) (0.050737)
Estimators without Measurement Error 10% non-response 20% non-response
gh gh
2 4 8 2 4 8
yS(HH)* 0.531888 0.706251 0.880614 0.534662 0.714573 0.894483
yS(R)* 0.847220 1.129459 1.411698 0.893275 1.267623 1.641971
(0.043594) (0.054407) (0.037339) (0.057267) (0.077194)
yS(Pr)* 1.360452 1.797576 2.234700 1.458010 2.090249 2.722487
(0.013928) (0.017060) (0.011940) (0.017360) (0.022779)
yS(BT)* 0.546567 0.728539 0.910510 0.553723 0.750007 0.946291
(70.46203) (88.11975) (60.15501) (92.51417) (124.8733)
yS(SK)* 3.332912 4.403433 5.473954 3.663318 5.394650 7.125983
(0.172948) (0.214997) (0.147613) (0.223090) (0.298568)
yS(D)* 0.499049 0.664792 0.830500 0.499692 0.666329 0.832772
yS(AH)* 1.780605 2.369676 2.958748 1.941566 2.852558 3.763550
(0.003230) (0.003599) (0.002798) (0.003040) (0.003283)
α=0,yS(P1)* 0.497680 0.662389 0.826776 0.498415 0.664262 0.829706
(0.064500) (0.080376) (0.047678) (0.061733) (0.075745)
α=1,yS(P1)* 0.497707 0.662436 0.826849 0.498447 0.664327 0.829816
(0.064504) (0.080382) (0.047681) (0.061739) (0.075755)
α=-1,yS(P1)* 0.497708 0.662437 0.826850 0.498447 0.664328 0.829818
(0.064505) (0.080382) (0.047681) (0.061739) (0.075755)
αr=1,r=0,1,2,3yS(GP)* 0.222332 0.293731 0.363450 0.223663 0.296212 0.366048
(0.030276) (0.037404) (0.022712) (0.029331) (0.035698)
α0=0,α1,2,3=1yS(GP)* 0.223172 0.295152 0.365602 0.224612 0.298063 0.369087
(0.030408) (0.037603) (0.022800) (0.029499) (0.035973)

In Table 6 the MSE for the generalized proposed estimator, when αr=1,r=0,1,2,3 is 0.334101 for 10% non-response rate. When the non-response rate becomes 20%, the MSE for generalized proposed estimator increases to 0.335842. It is also observed that yS(R)* is less biased and yS(BT)* is most biased among all other considered estimator. Table 6 shows the same pattern of results in case for no measurement error.

Through real data sets it is concluded that the generalized proposed estimator performs better as compared to the other existing estimators. For 10% non-response rate the MSE is minimum. The MSE also increases as the value of constant gh increases.

5

5 Conclusion

In the present study, we proposed a generalized class of estimators in estimating the finite population mean for the sensitive variable in the presence of measurement error and non-response under stratified random sampling. Through simulation study and real life data sets it is observed that the proposed class of estimators perform better than the existing estimators. The MSE values are generally smaller under 10% of non-response as compared to 20% of non-response, which are expected results. Generally as the non-response rate increases, MSE also increases. Based on numerical findings, it turns out that the generalized proposed class of estimators is more efficient as compared to the other existing estimators, under certain situations.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. , , . A review of hot deck imputation for survey non-response. Int. Stat. Rev.. 2010;78(1):40-64.
    [Google Scholar]
  2. , , . Joint influence of measurement error and non response on estimation of population mean. Commun. Statistics-Theory Methods. 2017;46(4):1679-1693.
    [Google Scholar]
  3. , , . Ratio and product type exponential estimators. J. Inform. Optim. Sci.. 1991;12(1):159-164.
    [Google Scholar]
  4. , , , , , . Measurement Errors in Surveys. John Wiley & Sons; .
  5. , , , . Ranked set sampling and optional scrambling randomized response modeling. Investigación Operacional. 2018;39(1):100-107.
    [Google Scholar]
  6. , , . On efficacy of empirical bayes estimation of a finite population mean of a sensitive variable through randomized responses. Model Assisted Stat. Appl.. 2015;10(4):283-288.
    [Google Scholar]
  7. , . Errors of measurement in statistics. Technometrics. 1968;10(4):637-666.
    [Google Scholar]
  8. , . Sampling techniques. New York: John Wiley & Sons; .
  9. , , . New scrambled response models for estimating the mean of a sensitive quantitative character. J. Appl. Stat.. 2010;37(11):1875-1890.
    [Google Scholar]
  10. , , . Scrambled randomized response methods for obtaining sensitive quantitative data. J. Stat. Planning inference. 1983;7(4):307-316.
    [Google Scholar]
  11. , . Estimation in the presence of measurement error. Int. Stat. Rev.. 1995;63(2):121-141.
    [Google Scholar]
  12. , , . A new randomized response model. J. Roy. Stat. Soc.. 2006;68(3):523-530.
    [Google Scholar]
  13. , , , , . Application of the randomized response technique in obtaining quantitative data. J. Am. Stat. Assoc.. 1971;66(334):243-250.
    [Google Scholar]
  14. , , . An improved estimator of the finite population mean in simple random sampling. Model Assisted Stat. Appl.. 2011;6(1):47-55.
    [Google Scholar]
  15. , , . Sensitivity estimation for personal interview survey questions. Statistica. 2004;64(4):643-653.
    [Google Scholar]
  16. , , . On improvement in estimating the population mean in simple random sampling. J. Appl. Stat.. 2008;35(5):559-566.
    [Google Scholar]
  17. , , , . Mean and sensitivity estimation in optional randomized response models. J. Stat. Planning Inference. 2010;140(10):2870-2874.
    [Google Scholar]
  18. , , , , . Improved exponential type estimators of the mean of a sensitive variable in the presence of nonsensitive auxiliary information. Commun. Statistics-Simul. Comput.. 2016;45(9):3317-3328.
    [Google Scholar]
  19. , , . The problem of non-response in sample surveys. J. Am. Stat. Assoc.. 1946;41(236):517-529.
    [Google Scholar]
  20. , , , . Estimation of finite population mean in stratified sampling using scrambled responses in the presence of measurement errors. Commun. Statistics-Theory Methods 2018:1-9.
    [Google Scholar]
  21. , , , . Estimation of population mean in sample surveys using auxiliary character, method of call backs and subsampling from non-respondents. Proceedings of the National Academy of Sciences, India Section A: Physical Sciences. 2013;83(1):49-54.
    [Google Scholar]
  22. , , . Generalized two phase sampling estimators for the population mean in the presence of nonresponse. Aligarh Jouranal of Statistics. 2010;30:39-54.
    [Google Scholar]
  23. , , . A stratified warner’s randomized response model. J. Stat. Planning Inference. 2004;120(1–2):155-165.
    [Google Scholar]
  24. , . Improved estimation of population mean in presence of non-response and measurement error. J. Stat. Theory Practice 2016 just-accepted
    [Google Scholar]
  25. , , , , . Estimation of population mean in the presence of non-response and measurement error. Revista Colombiana de EstadÝstica. 2015;38(1):145-161.
    [Google Scholar]
  26. , . Ratio estimation with subsampling the nonrespondents. Survey Methodology. 1986;12(2):217-230.
    [Google Scholar]
  27. , . Fundamentals of biostatistics. Duxbury Press; .
  28. , , , . A generalized class of estimators under two-phase stratified sampling for non response. Commun. Stat.-Theory Methods 2018:1-17.
    [Google Scholar]
  29. , , . Some modified exponential-ratio type estimators in the presence of non-response under two-phase sampling scheme. Electronic J. Appl. Stati. Anal.. 2013;6(1):1-17.
    [Google Scholar]
  30. , . Ratio method of estimation in the presence of measurement errors. J. Indian Society Agric. Stat.. 1997;52:150-155.
    [Google Scholar]
  31. , , , . An estimator for mean estimation in presence of measurement error. Res. Rev.: A J. Stat.. 2012;1(1):1-8.
    [Google Scholar]
  32. , , . Some imputation methods to compensate with non-response for estimation of population mean in two-occasion successive sampling. Commun. Statistics-Theory Methods. 2020;49(14):3329-3351.
    [Google Scholar]
  33. , , . Estimation of mean in presence of non-response using two phase sampling scheme. Stat. Pap.. 2010;51(3):559-582.
    [Google Scholar]
  34. , , . Combination of regression and ratio estimate in presence of nonresponse. Brazilian J. Prob. Stat.. 2011;25(2):205-217.
    [Google Scholar]
  35. , , . Estimation of population mean when coefficient of variation is known using scrambled response technique. J. Stat. Planning Inference. 2005;131(1):135-144.
    [Google Scholar]
  36. , , . Method of estimation in the presence of non-response and measurement errors simultaneously. J. Modern Appl. Stat. Methods. 2015;14(1):12.
    [Google Scholar]
  37. , . Randomized response: A survey technique for eliminating evasive answer bias. J. Am. Stat. Assoc.. 1965;60(309):63-69.
    [Google Scholar]
  38. , , . Estimation of population mean in the presence of measurement error and non response under stratified random sampling. PloS one. 2018;13(2):e0191572
    [Google Scholar]
  39. , , . Estimation of finite population mean for a sensitive variable using dual auxiliary information in the presence of measurement errors. PloS one. 2019;14(2):e0212111
    [Google Scholar]

Appendix A

Simplification of MSE

Squaring both sides of Eq. (25), and keeping the terms up to power two in errors, and then taking expectations, the MSE of yS(GP)* is given by MSE(yS(GP)*)h=1LPh2[Zh2+m1h2(Zh2+Ah+e*2th2Rh2Bh+4e*thRhCh+2f*th2Rh2Bh)+m2h2th2Bh+2m1hm2h(thCh+t2RhBh(e*+d*))-2mh1(Zh2+e*thRhCh+f*th2Rh2Bh)-2m2hd*th2RhBh+m3h2th2Dh+2m1hm3h(c*thRhFh+e*th2RhFh+thEh+b*th2R1hDh)+2m2hm3ht2Fh-2m3h(c*thRhFh+b*th2R1hDh)]. where R1h=ZhRxh .

Population

Population I.

X1=rnorm(1000,5,10),Y1=X1+rnorm(1000,0,1),y1=Y1+rnorm(1000,1,3) , x1=X1+rnorm(1000,1,3) .

X2=rnorm(1000,4,8),Y2=X2+rnorm(1000,0,1),y2=Y2+rnorm(1000,1,3) , x2=X2+rnorm(1000,1,3) .

X3=rnorm(1000,4,9),Y3=X3+rnorm(1000,0,1),y3=Y3+rnorm(1000,1,3) , x3=X3+rnorm(1000,1,3) .

X4=rnorm(1000,3,7),Y4=X4+rnorm(1000,0,1),y4=Y4+rnorm(1000,1,3) , x4=X4+rnorm(1000,1,3) .

NumberofStrata=4

N1=1000,N2=1000,N3=1000,N4=1000 , n1=200,n2=200,n3=200,n4=200 , Z1=5.719824,Z2=4.985474,Z3=4.85276,Z4=3.835371 , X1=5.666893,X2=3.643237,X3=3.968049,X4=2.918596 , Rxi=500.5,i=1,2,3,4 , S1Z2=124.6685,S2Z2=73.65976,S3Z2=90.66835,S4Z2=61.00159 , S1X2=104.2774,S2X2=66.19725,S3X2=81.06883,S4X2=45.99937,SiRx2=83416.67,i=1,2,3,4 , ρ1ZX=0.9953966,ρ2ZX=0.9927347,ρ3ZX=0.9940606,ρ4ZX=0.9891463 , ρ1ZRx=-0.003890,ρ2ZRx=0.016016,ρ3ZRx=0.062953,ρ4ZRx=-0.031585 , ρ1XRx=-0.044153,ρ2XRx=-0.011336,ρ3XRx=0.022509,ρ4XRx=0.033215 .

Population II.

X1=rnorm(1000,5,10),Y1=X1+rnorm(1000,0,1),y1=Y1+rnorm(1000,1,3) , x1=X1+rnorm(1000,0,1) .

X2=rnorm(1200,4,8),Y2=X2+rnorm(1200,0,1),y2=Y2+rnorm(1200,1,3) , x2=X2+rnorm(1200,0,1) .

X3=rnorm(1300,4,9),Y3=X3+rnorm(1300,0,1),y3=Y3+rnorm(1300,1,3) , x3=X3+rnorm(1300,0,1) .

X4=rnorm(1500,3,7),Y4=X4+rnorm(1500,0,1),y4=Y4+rnorm(1500,1,3) , x4=X4+rnorm(1500,1,3) .

NumberofStrata=4

N1=1000,N2=1200,N3=1300,N4=1500 , n1=200,n2=210,n3=220,n4=215 , Z1=4.648022,Z2=4.036113,Z3=4.032501,Z4=2.969091 , X1=5.666893,X2=3.807569,X3=4.627208,X4=3.241139 , Rx1=500.5,Rx2=600.5,Rx3=650.5,Rx4=750.5 , S1Z2=94.19621,S2Z2=66.76728,S3Z2=80.80177,S4Z2=51.14123 , S1X2=104.2774,S2X2=65.46337,S3X2=82.98812,S4X2=52.84269 , S1Rx2=83416.67,S2Rx2=120100,S3Rx2=140941.7,S4Rx2=187625 , ρ1ZX=0.984077,ρ2ZX=0.987461,ρ3ZX=0.991750,ρ4ZX=0.989362 , ρ1ZRx=0.015173,ρ2ZRx=-0.008185,ρ3ZRx=0.009465,ρ4ZRx=0.006319 , ρ1XRx=-0.105091,ρ2XRx=-0.124294,ρ3XRx=0.002547,ρ4XRx=-0.013688 .

Members of Generalized Proposed Class of Estimators yS(GP)* for Different Choices of ( α0,α1,α2,α3,m1h,m2h,m3h ) Members of the class of estimators yS(GP)* by choosing different values of α0,α1,α2,α3,m1h,m2h and m3h are given below

1. For α1=m2h=m3h=0 and α0=m1h=1 in Eq. (20), the generalized proposed class of estimators yS(GP)* reduces to usual mean estimator as: yS(0)*=h=1LPhzh.

2. For α0=α1=m1h=1 and m2h=m3h=0 in Eq. (20), the generalized proposed class of estimators yS(GP)* reduces to usual ratio estimator: yS(R)*=h=1LPhzhxhXh.

3. For α0=m1h=1,α1=-1 and m2h=m3h=0 in Eq. (20), the generalized proposed class of estimators yS(GP)* reduces to usual product estimator: yS(Pr)*=h=1LPhzhxhXh.

4. For α0=α1=m2h=m3h=0 and m1h=1 in Eq. (20), the generalized proposed class of estimators yS(GP)* reduces to Bahl and Tuteja, 1991 estimator: yS(BT)*=h=1LPhzexpXh-xhXh+xh.

5. For α0=α1=α2=1,m3h=0,m1h=m4h and m2h=m5h in Eq. (20), the generalized proposed class of estimators yS(GP)* reduces to Gupta and Shabbir, 2008 estimator: yS(GS)*=h=1LPhm4hzhXhxh+m5h(Xh-xh)Xhxh.

6. For α0=m2h=m3h=0,α1=2 and m1h=1 in Eq. (20), the generalized proposed class of estimators yS(GP)* reduces to Singh and Kumar, 2010 estimator: yS(SK)*=h=1LPhzhXhxh2.

7. For α0=α1=α2=0,m3h=0,m1h=m6h and m2h=m7h in Eq. (20), the generalized proposed class of estimators yS(GP)* reduces to Grover and Kaur, 2011 estimator: yS(GK)*=h=1LPhm6hzh+m7h(Xh-xh)expXh-xhXh+xh.

8. For α1=α2=m3h=0,α0=m1h=1 and m2h=dh* in Eq. (20), the generalized proposed class of estimators yS(GP)* reduces to difference estimator: yS(D)*=h=1LPh[zh+dh*(Xh-xh*)].

9. For α1=α2=g,m1h=1,m2h=kh and m3h=0 in Eq. (20), the generalized proposed class of estimators yS(GP)* reduces to Khalil et al., 2018 estimator given by,

(29)
yS(K)*=h=1LPh[zh+kh(Xh-xh)]Wwg.

10. For α0=α1=α2=α,m1h=m8h,m2h=m9h and m3h=0 in Eq. (20), the generalized proposed estimator yS(GP)* reduces to the proposed estimator.

(30)
yS(P1)*=h=1LPhm8hzh+m9h(Xh-xh*)Xhxh*αexp(1-α)Xh-xh*Xh+xh*.

Show Sections