A generalized class of estimators for sensitive variable in the presence of measurement error and non-response under stratified random sampling

Erum Zahid; Javid Shabbir; Osama Abdulaziz Alamri

doi:10.1016/j.jksus.2021.101741

View/Download PDF

A generalized class of estimators for sensitive variable in the presence of measurement error and non-response under stratified random sampling

Erum Zahid^{^a,⁎}, Javid Shabbir^{^b}, Osama Abdulaziz Alamri^{^c}

a Department of Applied Mathematics and Statistics, Institute of Space Technology, Islamabad, Pakistan

b Department of Statistics, Quaid-i-Azam University, Islamabad, Pakistan

c Department of Statistics, University of Tabuk, Saudi Arabia

⁎Corresponding author. erumzahid22@gmail.com (Erum Zahid)

Received: 2020-8-29, Accepted: 2021-11-25,

Disclaimer:
This article was originally published by Elsevier and was migrated to Scientific Scholar after the change of Publisher.

Peer review under responsibility of King Saud University.

Abstract

In survey sampling an investigator may be unable to get the complete and correct information at the same time. So non-response and measurement error occur simultaneously and consequently may effect the estimator. Considering this problem, a generalized class of estimators is proposed for estimating the finite population mean for sensitive variable in the presence of measurement error and non-response under stratified random sampling. We conducted a study based on real data set at Quaid-i-Azam University, Islamabad. Simulation and real life data sets are used to observe the performances of the estimators. Bias and MSE values are given for the comparison of estimators.

Keywords

Auxiliary variable

Measurement error

Non-response

Randomized response

Stratified random sampling

Show Related Articles from PubMed

1 Introduction

In survey sampling, certain surveys cause some problems for the researchers due to the fact that the respondents are reluctant to discuss sensitive topics such as drug use, abortion, sexually transmitted diseases etc. When surveying on those topics, measurement error and non-response can occur since the respondent may choose, not to respond some specific questions, not to give the accurate answers, or not to take part in the survey. The problem of measurement error is usually ignored during the sensitive surveys and the assumption is made that the information obtained through surveys is free from error. Another important factor in surveys is non-response, which may arises due to refusal of respondents to give the information or not at home or lack of interest due to some sensitive issues. Usually measurement error and non-response are studied separately for the sensitive variable using the known auxiliary or additional information. In reality, when the variable of interest is sensitive, the respondents hesitate to provide the personal information, which give rise to measurement error. In most of the cases, the information is not obtained from all units in surveys, specially when the variable of interest is stigmatizing in nature. Many researchers studied the problem of non-response, including (Hansen and Hurwitz, 1946; Cochran, 1977; Rao, 1986; Khare and Srivastava, 2010; Andridge and Little, 2010; Singh et al., 2011; Khare et al., 2013; Shabbir and Khan, 2013; Shabbir et al., 2018 and Singh and Khalid, 2020 ). In survey sampling, when the variable under study contains social stigma, then the respondents are not comfortable to provide their personal information. Direct survey on sensitive question increases the relative bias. Warner (1965) introduced the randomized response technique (RRT), which reduces the possible bias and is used to obtain the true information while insuring the privacy of the respondents. For estimation of mean of a sensitive quantitative variable the Randomized Response model (RRM) is extended by Greenberg et al. (1971). Further work is done by Eichhorn and Hayre (1983); Gupta and Shabbir (2004), Kim and Warde (2004); Singh and Mathur (2005), Gjestvang and Singh (2006); Diana and Perri (2010), Gupta et al. (2010); Chaudhuri and Pal (2015), Gupta et al. (2016) and Bouza et al. (2018).

The researchers dealt with the problem of measurement error for estimating the population mean. For more details, see Cochran (1968); Fuller (1995); Shalabh (1997); Biemer et al. (2011); Shukla et al. (2012), etc. Recently few researchers studied the problem of measurement error and non-response together like Kumar et al. (2015); Singh and Sharma (2015); Azeem and Hanif (2017) and Kumar (2016). Zahid and Shabbir (2018); Khalil et al. (2018) and Zahid and Shabbir (2019) have discussed the problem of measurement error and non-response under stratified random sampling.

In practice, the researchers who have studied measurement error, have ignored the presence of non-response and randomized response at the same time. In this study, we have proposed a class of estimators for estimating the population mean of a sensitive variable in the presence of measurement error and non-response simultaneously under stratified random sampling. The efficiency of the suggested class of estimators over the existing estimators is shown through simulation study and real data sets.

Consider a finite population of N identifiable units which are partitioned into L homogeneous subgroups called strata, such that the $h^{th}$ strata consist of $N_{h}$ units, where $h = 1, 2, \dots, L$ and $\sum_{h = 1}^{L} N_{h} = N$ . It is assumed that N consists of two mutually exclusive groups called response and non-response groups. Let $N_{1 h}$ and $N_{2 h}$ are the responding and non-responding units in the $h^{th}$ stratum respectively. We select a sample of size $n_{h}$ from $N_{h}$ by using simple random sampling without replacement (SRSWOR) and assume that $n_{1 h}$ units respond and $n_{2 h}$ units do not respond. We select a sub-sample of size $k_{h}, (k_{h} = \frac{n_{2 h}}{g_{h}}, g_{h} > 1)$ from $n_{2 h}$ non-responding units in the $h^{th}$ stratum.

Let $(z_{hi}^{*}, y_{hi}^{*}, x_{hi}^{*}, r_{x_{hi}}^{*})$ be the observed values and $(Z_{hi}^{*}, Y_{hi}^{*}, X_{hi}^{*}, R_{x_{hi}}^{*})$ be the actual values of the $i^{th} (i = 1, 2, \dots, n)$ sampled units in the $h^{th}$ stratum. Let $r_{x_{hi}}^{*}$ , and $R_{x_{hi}}^{*}$ be the corresponding ranks of $x_{hi}^{*}$ and $X_{hi}^{*}$ respectively, then the measurement errors be.

$Q_{hi}^{*} = z_{hi}^{*} - Z_{hi}^{*}, V_{hi}^{*} = x_{hi}^{*} - X_{hi}^{*}$ and $T_{hi} = r_{x_{hi}} - R_{x_{hi}}$ .

Let $S_{hZ}^{2}, S_{hX}^{2}$ and $S_{{hR}_{x}}^{2}$ be the population variances for the responding units and $S_{hZ (2)}^{2}, S_{hX (2)}^{2}$ and $S_{{hR}_{x} (2)}^{2}$ be the population variances for non-responding units. Let $S_{hQ}^{2}, S_{hV}^{2}$ and $S_{hT}^{2}$ be the population variances associated with the measurement error for responding units. Let $S_{hQ (2)}^{2}, S_{hV (2)}^{2}$ and $S_{hT (2)}^{2}$ be the population variances associated with measurement error for the non-responding part of the population. Let $ρ_{hZX}, ρ_{{hZR}_{x}}, ρ_{{hXR}_{x}}$ be the coefficients of correlation, between their subscripts for respondents and $ρ_{hZX (2)}, ρ_{{hZR}_{x} (2)}, ρ_{{hXR}_{x} (2)}$ be the coefficients of correlation, between their subscripts for non-respondents in the population.

In Section 2, some existing estimators of the finite population mean are given. In Section 3, a generalized class of estimators is suggested for estimating the finite population mean by incorporating both measurement error and non-response information simultaneously. Numerical results and simulation study are presented in Section 4. Conclusion is given in Section 5.

2 Existing Estimators in Literature

In this section we consider the following existing estimators.

2.1

2.1 Hansen and Hurwitz (1946) Estimator

In stratified random sampling, the Hansen and Hurwitz (1946) estimator for population mean $\overline{Y}$ , is given by

(1)

{\overline{y}}_{S (HH)}^{*'} = \sum_{h = 1}^{L} P_{h} {\overline{z}}_{h}^{*},

where

{\overline{z}}_{h}^{*} = (\frac{n_{1 h}}{n_{h}}) {\overline{z}}_{n_{1 h}} + (\frac{n_{2 h}}{n_{h}}) {\overline{z}}_{k_{h}}

and

P_{h} = \frac{N_{h}}{N}

Here ${\overline{z}}_{n_{1 h}} = \frac{1}{n_{1 h}} \sum_{i = 1}^{n_{1 h}} z_{hi}$ and ${\overline{z}}_{k_{h}} = \frac{1}{k_{h}} \sum_{i = 1}^{k_{h}} y_{h_{i}}$ are the sample means based on $n_{1 h}$ of responding and $k_{h}$ units of sub-samples from $n_{2 h}$ non-responding groups, respectively.

The variance of ${\overline{y}}_{S (HH)}^{*'}$ , is given by

(2)

Var ({\overline{y}}_{S (HH)}^{*'}) = \sum_{h = 1}^{L} P_{h}^{2} A_{h}^{*'},

where

A_{h}^{*'} = λ_{2 h} (S_{hZ}^{2} + S_{hQ}^{2}) + θ_{h} (S_{hZ (2)}^{2} + S_{hQ (2)}^{2})

θ_{h} = \frac{P_{2 h} (g_{h} - 1)}{n_{h}}, P_{2 h} = \frac{N_{2 h}}{N_{h}}

$λ_{2 h} = (n_{h}^{- 1} - N_{h}^{- 1})$ .

2.2

2.2 Ratio Estimator

The usual ratio estimator under stratified random sampling, is given by

(3)

{\overline{y}}_{S (R)}^{*'} = \sum_{h = 1}^{L} P_{h} \frac{{\overline{z}}_{h}^{*}}{{\overline{x}}_{h}^{*}} {\overline{X}}_{h},

where

{\overline{X}}_{h} = \frac{1}{N_{h}} \sum_{i = 1}^{N_{h}} x_{hi}

is known population mean and

{\overline{x}}_{h}^{*} = {\overline{X}}_{h} + \frac{1}{n_{h}} (δ_{hX}^{*} + δ_{hV}^{*})

be the sample mean, given in Eq. (22).

The bias and MSE of ${\overline{y}}_{S (R)}^{*'}$ , are given by

(4)

B ({\overline{y}}_{S (R)}^{*'}) ≅ \sum_{h = 1}^{L} \frac{P_{h}}{{\overline{X}}_{h}} [R_{h}^{'} B_{h}^{*'} - C_{h}^{*'}]

and

(5)

MSE ({\overline{y}}_{S (R)}^{*'}) ≅ \sum_{h = 1}^{L} P_{h}^{2} [A_{h}^{*'} + R_{h}^{' 2} B_{h}^{*'} - 2 R_{h}^{'} C_{h}^{*'}],

where

$R_{h}^{'} = \frac{{\overline{Z}}_{h}}{{\overline{X}}_{h}}$ ,

$B_{h}^{*'} = λ_{2 h} (S_{hX}^{2} + S_{hV}^{2}) + θ_{h} (S_{hX (2)}^{2} + S_{hV (2)}^{2})$ ,

$C_{h}^{*'} = λ_{2 h} ρ_{hYX} S_{hY} S_{hX} + θ_{h} ρ_{hYX (2)} S_{hY (2)} S_{hX (2)}$ .

2.3

2.3 Product Estimator

The product estimator under stratified random sampling, is given by

(6)

{\overline{y}}_{S (\Pr)}^{*'} = \sum_{h = 1}^{L} P_{h} [{\overline{z}}_{h}^{*} \frac{{\overline{x}}_{h}^{*}}{{\overline{X}}_{h}}] .

The bias and MSE of ${\overline{y}}_{S (\Pr)}^{*}$ , are given by

(7)

B ({\overline{y}}_{S (\Pr)}^{*'}) ≅ \sum_{h = 1}^{L} P_{h} [\frac{C_{h}^{*'}}{\overline{X_{h}}}]

and

(8)

MSE ({\overline{y}}_{S (\Pr)}^{*'}) ≅ \sum_{h = 1}^{L} P_{h}^{2} [A_{h}^{*'} + R_{h}^{' 2} B_{h}^{*'} + 2 R_{h}^{'} C_{h}^{*'}] .

2.4

2.4 Bahl and Tuteja, 1991 Estimator

Bahl and Tuteja, 1991 estimator under stratified random sampling, is given by

(9)

{\overline{y}}_{S (BT)}^{*'} = \sum_{h = 1}^{L} P_{h} [{\overline{z}}_{h}^{*} \exp (\frac{{\overline{X}}_{h} - {\overline{x}}_{h}^{*}}{{\overline{X}}_{h} + {\overline{x}}_{h}^{*}})] .

The bias and MSE of ${\overline{y}}_{S (BT)}^{*}$ , are given by

(10)

B ({\overline{y}}_{S (BT)}^{*'}) ≅ \sum_{h = 1}^{L} P_{h} [\frac{1}{{\overline{X}}_{h}} (\frac{3 R_{h}^{'} B_{h}^{*'}}{8} - \frac{C_{h}^{*'}}{2})]

and

(11)

MSE ({\overline{y}}_{S (BT)}^{*'}) ≅ \sum_{h = 1}^{L} P_{h}^{2} [A_{h}^{*'} + \frac{R_{h}^{' 2} B_{h}^{*'}}{4} - R_{h}^{'} C_{h}^{*'}] .

2.5

2.5 Singh and Kumar, 2010 Estimator

Singh and Kumar, 2010 estimator under stratified random sampling, is given by

(12)

{\overline{y}}_{S (SK)}^{*'} = \sum_{h = 1}^{L} P_{h} [{\overline{z}}_{h}^{*} {(\frac{{\overline{X}}_{h}}{{\overline{x}}_{h}^{*}})}^{2}] .

The bias and MSE of ${\overline{y}}_{S (SK)}^{*}$ , are given by

(13)

B ({\overline{y}}_{S (SK)}^{*'}) ≅ \sum_{h = 1}^{L} P_{h} [\frac{1}{{\overline{X}}_{h}} (3 R_{h}^{'} B_{h}^{*'} - 2 C_{h}^{*'})]

and

(14)

MSE ({\overline{y}}_{S (SK)}^{*'}) ≅ \sum_{h = 1}^{L} P_{h}^{2} [A_{h}^{*'} + 4 R_{h}^{' 2} B_{h}^{*'} - 4 R_{h}^{'} C_{h}^{*'}] .

2.6

2.6 Difference Estimator

The usual difference estimator under stratified random sampling, is given by

(15)

{\overline{y}}_{S (D)}^{*'} = \sum_{h = 1}^{L} P_{h} [{\overline{y}}_{h}^{*} + d_{h} ({\overline{X}}_{h} - {\overline{x}}_{h}^{*'})],

where

{\overline{x}}_{h}^{*'} = \frac{N_{h} {\overline{X}}_{h} - n_{h} {\overline{x}}_{h}^{*}}{N_{h} - n_{h}}

and

d_{h}

is the constant.

The minimum variance of ${\overline{y}}_{S (D)}^{*}$ , is given by

(16)

Var {({\overline{y}}_{S (D)}^{*'})}_{\min} = \sum_{h = 1}^{L} P_{h}^{2} [A_{h}^{*'} - \frac{C_{h}^{*' 2}}{B_{h}^{*'}}] .

The optimum value of $d_{h}$ is $d_{h (opt)} = - \frac{C_{h}^{*'}}{t_{h} B_{h}^{*'}}$ , where $t_{h} = \frac{nh}{N_{h} - n_{h}}$ .

2.7

2.7 Azeem and Hanif (2017) Estimator

Azeem and Hanif (2016) estimator under stratified random sampling, is given by

(17)

{\overline{y}}_{S (AH)}^{*'} = \sum_{h = 1}^{L} P_{h} {\overline{y}}_{h}^{*} (\frac{{\overline{x}}_{h}^{*'}}{{\overline{X}}_{h}}) (\frac{{\overline{x}}_{h}^{*'}}{{\overline{x}}_{h}^{*}}) .

The bias and MSE of ${\overline{y}}_{S (AH)}^{*'}$ , are given by

(18)

B ({\overline{y}}_{S (AH)}^{*'}) ≅ \sum_{h = 1}^{L} \frac{P_{h}}{{\overline{X}}_{h}} [t_{h}^{2} R_{h} B_{h}^{*'} - q_{h} C_{h}^{*'}]

and

(19)

MSE ({\overline{y}}_{S (AH)}^{*'})) ≅ \sum_{h = 1}^{L} P_{h}^{2} [A_{h}^{*'} + q_{h}^{2} R_{h}^{' 2} B_{h}^{*'} - 2 q_{h} R_{h}^{'} C_{h}^{*'}],

where

q_{h} = \frac{N_{h} + nh}{N_{h} - n_{h}}

3 Proposed Generalized Class of Estimators

We suggest a generalized class of estimators for estimating the finite population mean for a sensitive variable, considering the problem of measurement error and non-response simultaneously in stratified random sampling. Measurement error and non-response are present on both the study variable and the auxiliary variable. The suggested estimator, is given by

(20)

\begin{matrix} {\overline{y}}_{S (GP)}^{*'} = \sum_{h = 1}^{L} P_{h} \{m_{1 h} {\overline{z}}_{h}^{*} {\{\frac{{\overline{X}}_{h}}{{\overline{x}}_{h}^{*'}}\}}^{α_{1}} + m_{2 h} ({\overline{X}}_{h} - {\overline{x}}_{h}^{*'}) {\{\frac{{\overline{X}}_{h}}{{\overline{x}}_{h}^{*'}}\}}^{α_{2}} + m_{3 h} ({\overline{R}}_{xh} - {\overline{r}}_{xh}^{*}) {\{\frac{{\overline{X}}_{h}}{{\overline{x}}_{h}^{*'}}\}}^{α_{3}}\} \\ \exp (1 - α_{0}) (\frac{{\overline{X}}_{h} - {\overline{x}}_{h}^{*'}}{{\overline{X}}_{h} + {\overline{x}}_{h}^{*'}}), \end{matrix}

where,

m_{1 h}, m_{2 h}

and

m_{3 h}

are constants whose values are to be determined, and

α_{r} (r = 0, 1, 2, 3)

are the scalars, chosen arbitrary. For obtaining the bias and MSE, we assume that

$δ_{hZ}^{*} = \sum_{i = 1}^{n_{h}} (Y_{hi}^{*} - {\overline{Y}}_{h})$ , $δ_{hU}^{*} = \sum_{i = 1}^{n_{h_{h}}} U_{hi}^{*}$ ,

$δ_{hX}^{*} = \sum_{i = 1}^{n_{h}} (X_{hi}^{*} - {\overline{X}}_{h})$ , $δ_{hV}^{*} = \sum_{i = 1}^{n_{h_{h}}} V_{hi}^{*}$ ,

$δ_{R_{hx}}^{*} = \sum_{i = 1}^{n} (R_{x_{hi}}^{*} - {\overline{R}}_{x_{h}})$ , $δ_{T}^{*} = \sum_{i = 1}^{n_{h}} T_{hi}^{*}$ .

Adding $δ_{hY}^{*}$ and $δ_{hU}^{*}$ , we get.

$δ_{hZ}^{*} + δ_{hU}^{*} = \sum_{i = 1}^{n_{h}} (Z_{hi}^{*} - {\overline{Z}}_{h}) + \sum_{i = 1}^{n_{h}} U_{hi}^{*}$ .

Dividing both sides by $n_{h}$ , and then simplifying, we get

(21)

{\overline{z}}_{h}^{*} = {\overline{Z}}_{h} + \frac{1}{n_{h}} (δ_{hY}^{*} + δ_{hU}^{*}) .

Similarly, we can get

(22)

{\overline{x}}_{h}^{*} = {\overline{X}}_{h} + \frac{1}{n_{h}} (δ_{hX}^{*} + δ_{hV}^{*})

and

(23)

{\overline{r}}_{xh}^{*} = {\overline{R}}_{x_{h}} + \frac{1}{n_{h}} (δ_{{hR}_{x}}^{*} + δ_{hT}^{*}) .

Further

$E {(\frac{δ_{hZ}^{*} + δ_{hQ}^{*}}{n_{h}})}^{2} = λ_{2 h} (S_{hZ}^{2} + S_{hQ}^{2}) + θ_{h} (S_{hZ (2)}^{2} + S_{hQ (2)}^{2}) = A_{h}^{*'}$ ,

$E {(\frac{δ_{hX}^{*} + δ_{hV}^{*}}{n_{h}})}^{2} = λ_{2 h} (S_{hX}^{2} + S_{hV}^{2}) + θ_{h} (S_{hX (2)}^{2} + S_{hV (2)}^{2}) = B_{h}^{*'}$ ,

$E {(\frac{δ_{{hR}_{x}}^{*} + δ_{hT}^{*}}{n_{h}})}^{2} = λ_{2 h} (S_{{hR}_{x}}^{2} + S_{hT}^{2}) + θ_{h} (S_{{hR}_{x} (2)}^{2} + S_{hT (2)}^{2}) = D_{h}^{*'}$ ,

$E (\frac{δ_{hZ}^{*} + δ_{hQ}^{*}}{n_{h}}) (\frac{δ_{hX}^{*} + δ_{hV}^{*}}{n}) = λ_{2 h} ρ_{hZX} S_{hZ} S_{hX} + θ_{h} ρ_{hZX (2)} S_{hZ (2)} S_{hX (2)} = C_{h}^{*'}$ ,

$E (\frac{δ_{hZ}^{*} + δ_{hQ}^{*}}{n_{h}}) (\frac{δ_{{hR}_{x}}^{*} + δ_{hT}^{*}}{n}) = λ_{2 h} ρ_{{hZR}_{x}} S_{hZ} S_{{hR}_{x}} + θ_{h} ρ_{{hZR}_{x} (2)} S_{hZ (2)} S_{{hR}_{x} (2)} = E_{h}^{*'}$ ,

$E (\frac{δ_{hX}^{*} + δ_{hV}^{*}}{n_{h}}) (\frac{δ_{{hR}_{x}}^{*} + δ_{hT}^{*}}{n_{h}}) = λ_{2 h} ρ_{{hXR}_{x}} S_{hX} S_{{hR}_{x}} + θ_{h} ρ_{{hXR}_{x} (2)} S_{hX (2)} S_{{hR}_{x} (2)} = F_{h}^{*'}$ .

On simplifying, we get

(24)

\begin{matrix} {\overline{y}}_{S (GP)}^{*'} = \sum_{h = 1}^{L} P_{h} [m_{1 h} ({\overline{Z}}_{h} + W_{hZ} + e^{*'} R_{h}^{'} t_{h} W_{hX} + \frac{f^{*'} t_{h}^{2} R_{h}^{'} W_{hX}^{2}}{{\overline{X}}_{h}} + e^{*'} t_{h} \frac{W_{hX} W_{hZ}}{{\overline{X}}_{h}}) \\ + m_{2 h} (t_{h} W_{hX} + d^{*'} t_{h}^{2} \frac{W_{hX}^{2}}{{\overline{X}}_{h}}) + m_{3 h} (t_{h} W_{{hR}_{x}} + c^{*'} t_{h} \frac{W_{hX} W_{{hR}_{x}}}{{\overline{X}}_{h}} + b^{*'} t_{h}^{2} \frac{W_{{hR}_{x}}^{2}}{{\overline{R}}_{xh}})], \end{matrix}

where

$b^{*'} = α_{3}$ ,

$c^{*'} = \frac{1 - α_{0}}{2}$ ,

$d^{*'} = α_{2} + \frac{1 - α_{0}}{2}$ ,

$e^{*'} = α_{1} + \frac{1 - α_{0}}{2}$ , and.

$f^{*'} = \frac{α_{0}^{2} - 4 α_{0} + 3}{8} + \frac{α_{1} (2 - α_{0} + α_{1})}{2}$ .

$W_{hZ} = \frac{δ_{Z}^{*} + δ_{Q}^{*}}{n}, W_{hX} = \frac{δ_{X}^{*} + δ_{V}^{*}}{n}$ and $W_{{hR}_{x}} = \frac{δ_{R_{x}}^{*} + δ_{T}^{*}}{n}$ .

Further simplifying, and ignoring error terms greater than two, we have

(25)

\begin{matrix} {\overline{y}}_{S (GP)}^{*'} - \overline{Z} = \sum_{h = 1}^{L} P_{h} [(m_{1 h} - 1) {\overline{Z}}_{h} + m_{2 h} (t_{h} W_{hX} + d^{*'} t_{h}^{2} \frac{W_{hX}^{2}}{{\overline{X}}_{h}}) \\ + m_{1 h} (W_{hY} + e^{*'} R_{h}^{'} t_{h} W_{hX} + f^{*'} t_{h}^{2} R_{h}^{'} W_{hX}^{2} + e^{*'} t_{h} \frac{W_{hX} W_{hY}}{{\overline{X}}_{h}}) \\ + m_{3 h} (t_{h} W_{{hR}_{x}} + c^{*'} t_{h} \frac{W_{hX} W_{{hR}_{x}}}{{\overline{X}}_{h}} + b^{*'} t_{h}^{2} \frac{W_{{hR}_{x}}^{2}}{{\overline{R}}_{xh}})] . \end{matrix}

Using Eq. (25), the bias and MSE of ${\overline{y}}_{S (GP)}^{*'}$ to first order of approximation, is given by

(26)

\begin{matrix} B ({\overline{y}}_{S (GP)}^{*'}) ≅ \sum_{h = 1}^{L} P_{h} [(m_{1 h} - 1) {\overline{Z}}_{h} + m_{1 h} (\frac{f^{*'} t_{h}^{2} R_{h} B_{h}}{{\overline{X}}_{h}} + \frac{e^{*'} t_{h} C_{h}}{{\overline{X}}_{h}}) \\ + m_{2 h} (\frac{d^{*'} t_{h}^{2} B_{h}}{{\overline{X}}_{h}}) + m_{3 h} (\frac{c^{*'} t_{h} F_{h}}{{\overline{X}}_{h}} + \frac{b^{*'} t_{h}^{2} D_{h}}{{\overline{R}}_{xh}})] \end{matrix}

and

(27)

\begin{matrix} MSE ({\overline{y}}_{S (GP)}^{*'}) ≅ \sum_{h = 1}^{L} P_{h}^{2} [{\overline{Z}}_{h}^{2} + m_{1 h}^{2} A_{h 1}^{*'} + m_{2 h}^{2} B_{h 1}^{*'} + 2 m_{1 h} m_{2 h} C_{h 1}^{*'} - 2 m_{1 h} D_{h 1}^{*'} \\ - 2 m_{2 h} E_{h 1}^{*'} + m_{3 h}^{2} F_{h 1}^{*'} + 2 m_{1 h} m_{3 h} G_{h 1}^{*'} + 2 m_{2 h} m_{3 h} H_{h 1}^{*'} - 2 m_{3 h} I_{h 1}^{*'}] \end{matrix}

where,

$A_{h 1}^{*'} = {\overline{Z}}_{h}^{2} + A_{h} + e^{*' 2} t_{h}^{2} R_{h}^{' 2} B_{h} + 4 e^{*'} t_{h} R_{h}^{'} C_{h} + 2 f^{*'} t_{h}^{2} R_{h}^{' 2} B_{h}$ ,

$B_{h 1}^{*'} = t_{h}^{2} B_{h}$ ,

$C_{h 1}^{*'} = t_{h} C_{h} + t_{h}^{2} R_{h}^{'} B_{h} (e^{*'} + d^{*'})$ ,

$D_{h 1}^{*'} = {\overline{Z}}_{h}^{2} + e^{*'} t_{h} R_{h}^{'} C_{h} + f^{*'} t_{h}^{2} R_{h}^{' 2} B_{h}$ ,

$E_{h 1}^{*'} = d^{*'} t_{h}^{2} R_{h}^{'} B_{h}$ ,

$F_{h 1}^{*'} = t_{h}^{2} D_{h}$ ,

$G_{h 1}^{*'} = c^{*'} t_{h} R_{h}^{'} F_{h} + e^{*'} t_{h}^{2} R_{h}^{'} F_{h} + t_{h} E_{h} + b^{*'} t_{h}^{2} R_{1 h}^{'} D_{h}$ ,

$H_{h 1}^{*'} = t_{h}^{2} F_{h}$ ,

$I_{h 1}^{*'} = c^{*'} t_{h} R_{h}^{'} F_{h} + b^{*'} t_{h}^{2} R_{1 h}^{'} D_{h}$ .

For finding the optimal values of $m_{1 h}, m_{2 h}$ and $m_{3 h}$ , we differentiate Eq. (27) with respect to $m_{1 h}, m_{2 h}$ and $m_{3 h}$ respectively. The optimal values, are given by.

$m_{1 h (opt)} = \frac{B_{h 1}^{*'} D_{h 1}^{*'} F_{h 1}^{*'} - C_{h 1}^{*'} E_{h 1}^{*'} F_{h 1}^{*'} + E_{h 1}^{*'} G_{h 1}^{*'} H_{h 1}^{*'} - D_{h 1}^{*'} H_{h 1}^{* 2} - B_{h 1}^{*'} G_{h 1}^{*'} I_{h 1}^{*'} + C_{h 1}^{*'} H_{h 1}^{*'} I_{h 1}^{*'}}{A_{h 1}^{*'} B_{h 1}^{*'} F_{h 1}^{*'} - C_{h 1}^{* 2} F_{h 1}^{*'} + 2 C_{h 1}^{*'} G_{h 1}^{*'} H_{h 1}^{*'} - A_{h 1}^{*'} H_{h 1}^{* 2}}$ ,

$m_{2 h (opt)} = \frac{A_{h 1}^{*'} E_{h 1}^{*'} F_{h 1}^{*'} - C_{h 1}^{*'} D_{h 1}^{*'} F_{h 1}^{*'} - E_{h 1}^{*'} G_{h 1}^{* 2} + D_{h 1}^{*'} G_{h 1}^{*'} H_{h 1}^{*'} + C_{h 1}^{*'} G_{h 1}^{*'} I_{h 1}^{*'} - A_{h 1}^{*'} H_{h 1}^{*'} I_{h 1}^{*'}}{A_{h 1}^{*'} B_{h 1}^{*'} F_{h 1}^{*'} - C_{h 1}^{* 2} F_{h 1}^{*'} + 2 C_{h 1}^{*'} G_{h 1}^{*'} H_{h 1}^{*'} - A_{h 1}^{*'} H_{h 1}^{* 2}}$ , and.

$m_{3 (opt)} = \frac{C_{h 1}^{*'} E_{h 1}^{*'} G_{h 1}^{*'} - B_{h 1}^{*'} D_{h 1}^{*'} G_{h 1}^{*'} + C_{h 1}^{*'} D_{h 1}^{*'} H_{h 1}^{*'} - A_{h 1}^{*'} E_{h 1}^{*'} H_{h 1}^{*'} + A_{h 1}^{*'} B_{h 1}^{*'} I_{h 1}^{*'} - C_{h 1}^{* 2} I_{h 1}^{*'}}{A_{h 1}^{*'} B_{h 1}^{*'} F_{h 1}^{*'} - C_{h 1}^{* 2} F_{h 1}^{*'} + 2 C_{h 1}^{*'} G_{h 1}^{*'} H_{h 1}^{*'} - A_{h 1}^{*'} H_{h 1}^{* 2}}$ .

Substituting these optimum values in Eq. (27), we get the minimum MSE of ${\overline{y}}_{S (GP)}^{*'}$ as:

(28)

MSE {({\overline{y}}_{S (GP)}^{*'})}_{\min} ≅ \sum_{h = 1}^{L} P_{h}^{2} [{\overline{Z}}_{h}^{2} - \frac{L_{h 1}^{*'}}{L_{h 2}^{*'}}],

where

$L_{h 1}^{*'} = A_{h 1}^{*'} E_{h 1}^{* 2} F_{h 1}^{*'} - 2 C_{h 1}^{*'} D_{h 1}^{*'} E_{h 1}^{*'} F_{h 1}^{*'} - E_{h 1}^{*'} 2 G_{h 1}^{*'} 2 + 2 D_{h 1}^{*'} E_{h 1}^{*'} G_{h 1}^{*'} H_{h 1}^{*'} - D_{h 1} * 2 H_{h 1}^{* 2} + 2 C_{h 1}^{*'} E_{h 1}^{*'} G_{h 1}^{*'} I_{h 1}^{*'} + 2 C_{h 1}^{*'} D_{h 1}^{*'} H_{h 1}^{*'} I_{h 1}^{*'} - 2 A_{h 1}^{*'} E_{h 1}^{*'} H_{h 1}^{*'} I_{h 1}^{*'} - C_{h 1}^{* 2} I_{h 1}^{* 2} + B_{h 1}^{*'} D_{h 1}^{* 2} F_{h 1}^{*'} - 2 B_{h 1}^{*'} D_{h 1}^{*'} G_{h 1}^{*'} I_{h 1}^{*'} + B_{h 1}^{*'} A_{h 1}^{*'} I_{h 1}^{* 2}$ and.

$L_{h 2}^{*'} = A_{h 1}^{*'} B_{h 1}^{*'} F_{h 1}^{*'} - C_{h 1}^{* 2} F_{h 1}^{*'} + 2 C_{h 1}^{*'} G_{h 1}^{*'} H_{h 1}^{*'} - A_{h 1}^{*'} H_{h 1}^{* 2} - B_{h 1}^{*'} G_{h 1}^{* 2}$ .

4 Numerical Results

In this section simulated data and two real data sets are used to show the performance of the generalized class of proposed estimator. The results are given in Tables 1, 2 (simulation) and 5, 6 (real data).

Table 1 Mean squared error and

|Bias|

(in brackets) values of different estimators for Population I with and without measurement error.

Estimators with Measurement Error	10% non-response			20% non-response
	$g_{h}$			$g_{h}$
	2	4	8	2	4	8
${\overline{y}}_{S (HH)}^{*'}$	0.107125	0.128072	0.169966	0.117775	0.160022	0.244517
${\overline{y}}_{S (R)}^{*'}$	0.030971	0.037839	0.051576	0.036732	0.055122	0.091904
${\overline{y}}_{S (R)}^{*'}$	(0.026764)	(0.034223)	(0.049143)	(0.034202)	0.056537)	(0.101206)
${\overline{y}}_{S (\Pr)}^{*'}$	0.458469	0.555397	0.749251	0.517367	0.732089	1.161533
${\overline{y}}_{S (\Pr)}^{*'}$	(0.087794)	(0.106418)	(0.143665)	(0.098841)	(0.139558)	(0.220992)
${\overline{y}}_{S (BT)}^{*'}$	0.034649	0.040819	0.053159	0.037435	0.049176	0.072660
${\overline{y}}_{S (BT)}^{*'}$	(0.065539)	(0.069173)	(0.076441)	(0.053816)	(0.034004)	(0.005620)
${\overline{y}}_{S (SK)}^{*'}$	1.085004	1.319813	1.789432	1.235507	1.771322	2.842953
${\overline{y}}_{S (SK)}^{*'}$	(0.157364)	(0.193539)	(0.265890)	(0.183396)	(0.271635)	(0.448113)
${\overline{y}}_{S (D)}^{*'}$	0.022166	0.026426	0.034858	0.024615	0.033426	0.050598
${\overline{y}}_{S (AH)}^{*'}$	0.096090	0.119132	0.165217	0.115666	0.177860	0.302249
${\overline{y}}_{S (AH)}^{*'}$	(0.103052)	(0.124466)	(0.167295)	(0.115001)	(0.160313)	(0.250939)
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	0.022067	0.026286	0.034616	0.024492	0.033201	0.050087
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	(0.018672)	(0.022202)	(0.029160)	(0.020721)	(0.028004)	(0.042072)
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	0.022075	0.026298	0.034638	0.024503	0.033223	0.050141
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.018679)	(0.022213)	(0.029180)	(0.020731)	(0.028023)	(0.042119)
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	0.022076	0.026299	0.034638	0.024503	0.033223	0.050143
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.018679)	(0.022213)	(0.029180)	(0.020731)	(0.028024)	(0.042121)
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	0.021993	0.026179	0.034423	0.024394	0.033006	0.049605
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	(0.018608)	(0.022109)	(0.028993)	(0.020636)	(0.027834)	(0.041651)
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	0.022057	0.026230	0.034607	0.024428	0.033176	0.050079
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	(0.018698)	(0.022240)	(0.029231)	(0.020752)	(0.028070)	(0.042240)

Estimators without Measurement Error	10% non-response			20% non-response
	$g_{h}$			$g_{h}$
	2	4	8	2	4	8
${\overline{y}}_{S (HH)}^{*'}$	0.096942	0.115829	0.153602	0.106425	0.144277	0.219980
${\overline{y}}_{S (R)}^{*'}$	0.004814	0.006475	0.009796	0.007385	0.014187	0.027792
${\overline{y}}_{S (R)}^{*'}$	(0.013029)	(0.017786)	(0.027300)	(0.018722)	(0.034866)	(0.067153)
${\overline{y}}_{S (\Pr)}^{*'}$	0.432313	0.524032	0.707471	0.488020	0.691154	1.097421
${\overline{y}}_{S (\Pr)}^{*'}$	(0.087794)	(0.106418)	(0.143665)	(0.098841)	(0.139558)	(0.220992)
${\overline{y}}_{S (BT)}^{*'}$	0.020473	0.023796	0.030441	0.021586	0.027134	0.038230
${\overline{y}}_{S (BT)}^{*'}$	(0.141554)	(0.160285)	(0.197747)	(0.139441)	(0.153945)	(0.182954)
${\overline{y}}_{S (SK)}^{*'}$	1.010927	1.231085	1.671403	1.152171	1.654819	2.660115
${\overline{y}}_{S (SK)}^{*'}$	(0.138073)	(0.170463)	(0.235244)	(0.161654)	(0.241207)	(0.400314)
${\overline{y}}_{S (D)}^{*'}$	0.001424	0.001860	0.002574	0.002043	0.003129	0.004569
${\overline{y}}_{S (AH)}^{*'}$	0.049966	0.063866	0.091666	0.063824	0.105438	0.188667
${\overline{y}}_{S (AH)}^{*'}$	(0.106485)	(0.128576)	(0.172756)	(0.118871)	(0.165731)	(0.259452)
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	0.001423	0.001858	0.002570	0.002040	0.003124	0.004558
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	(0.001219)	(0.001607)	(0.002228)	(0.001801)	(0.002785)	(0.004044)
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	0.001424	0.001859	0.002572	0.002042	0.003127	0.004564
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.001220)	(0.001608)	(0.002230)	(0.001802)	(0.002787)	(0.004050)
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	0.001424	0.001859	0.002572	0.002042	0.003127	0.004564
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.001220)	(0.001608)	(0.00223)	(0.001802)	(0.002787)	(0.004050)
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	0.001409	0.001837	0.002530	0.002019	0.003079	0.004440
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	(0.001207)	(0.001589)	(0.002193)	(0.001783)	(0.002745)	(0.003941)
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	0.001422	0.001858	0.002571	0.002040	0.003126	0.004564
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	(0.001219)	(0.001607)	(0.002229)	(0.001801)	(0.002787)	(0.004051)

Table 2 Mean squared error and

|Bias|

(in brackets) values of different estimators for Population II with and without measurement error

Estimators	10% non-response			20% non-response
	$g_{h}$			$g_{h}$
	2	4	8	2	4	8
${\overline{y}}_{S (HH)}^{*'}$	0.089797	0.108185	0.144961	0.098465	0.135429	0.209356
${\overline{y}}_{S (R)}^{*'}$	0.013224	0.016604	0.023364	0.014699	0.021331	0.034594
${\overline{y}}_{S (R)}^{*'}$	(0.000313)	(0.000378)	(0.001761)	(0.003640)	(0.004433)	(0.006020)
${\overline{y}}_{S (\Pr)}^{*'}$	0.320038	0.387672	0.522939	0.334810	0.460273	0.711199
${\overline{y}}_{S (\Pr)}^{*'}$	(0.074086)	(0.089385)	(0.119983)	(0.081194)	(0.111309)	(0.171537)
${\overline{y}}_{S (BT)}^{*'}$	0.032302	0.038906	0.052115	0.037509	0.052036	0.081090
${\overline{y}}_{S (BT)}^{*'}$	(0.180648)	(0.209399)	(0.266902)	(0.235166)	(0.319488)	(0.488131)
${\overline{y}}_{S (SK)}^{*'}$	0.703946	0.855063	1.157297	0.723734	0.995864	1.540122
${\overline{y}}_{S (SK)}^{*'}$	(0.067164)	(0.081340)	(0.109694)	(0.063355)	(0.087213)	(0.134930)
${\overline{y}}_{S (D)}^{*'}$	0.012522	0.015889	0.022418	0.014072	0.020597	0.033554
${\overline{y}}_{S (AH)}^{*'}$	0.027173	0.034466	0.049053	0.024482	0.035358	0.057109
${\overline{y}}_{S (AH)}^{*'}$	(0.089356)	(0.107659)	(0.144267)	(0.098718)	(0.135291)	(0.208437)
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	0.012476	0.015816	0.022274	0.014014	0.020474	0.033231
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	(0.012576)	(0.015866)	(0.022247)	(0.014418)	(0.020942)	(0.033839)
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	0.012477	0.015820	0.022281	0.014016	0.020480	0.033245
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.012578)	(0.015868)	(0.022253)	(0.014420)	(0.020948)	(0.033853)
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	0.012478	0.015821	0.022282	0.014017	0.020481	0.033246
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.012579)	(0.015869)	(0.022254)	(0.014421)	(0.020949)	(0.033854)
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	0.012455	0.015786	0.022218	0.013989	0.020426	0.033115
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	(0.012556)	(0.015835)	(0.022191)	(0.014392)	(0.020892)	(0.033718)
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	0.012474	0.015809	0.022230	0.014013	0.020464	0.033224
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	(0.012585)	(0.015879)	(0.022273)	(0.014429)	(0.020964)	(0.033895)

Estimators without Measurement Error	10% non-response			20% non-response
	$g_{h}$			$g_{h}$
	2	4	8	2	4	8
${\overline{y}}_{S (HH)}^{*'}$	0.079750	0.095863	0.128091	0.087280	0.119695	0.184526
${\overline{y}}_{S (R)}^{*'}$	0.001954	0.002369	0.003199	0.002038	0.002794	0.004305
${\overline{y}}_{S (R)}^{*'}$	(0.001402)	(0.001226)	(0.005081)	(0.007060)	(0.011019)
${\overline{y}}_{S (\Pr)}^{*'}$	0.308768	0.373437	0.502774	0.322149	0.441736	0.680909
${\overline{y}}_{S (\Pr)}^{*'}$	(0.089385)	(0.119983)	(0.081194)	(0.111309)	(0.171537)
${\overline{y}}_{S (BT)}^{*'}$	0.021949	0.026106	0.034421	0.025956	0.035602	0.054895
${\overline{y}}_{S (BT)}^{*'}$	(0.222047)	(0.288625)	(0.245101)	(0.338267)	(0.524598)
${\overline{y}}_{S (SK)}^{*'}$	0.689010	0.835090	1.127248	0.706644	0.968914	1.493455
(0.066129)	(0.079822)	(0.107208)	(0.062190)	(0.085148)	(0.131064)
	0.001194	0.001576	0.002205	0.001210	0.001686	0.002635
${\overline{y}}_{S (D)}^{*'}$	0.014726	0.018444	0.025880	0.010445	0.014281	0.021953
${\overline{y}}_{S (D)}^{*'}$	(0.108009)	(0.144843)	(0.099002)	(0.135796)	(0.209385)
${\overline{y}}_{S (AH)}^{*'}$	0.001190	0.001572	0.002203	0.001205	0.001682	0.002625
${\overline{y}}_{S (AH)}^{*'}$	(0.001574)	(0.002202)	(0.001256)	(0.001760)	(0.002759)
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	0.001191	0.001573	0.002204	0.001206	0.001683	0.002626
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	(0.001575)	(0.002203)	(0.001257)	(0.001761)	(0.002760)
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	0.001192	0.001574	0.002205	0.001207	0.001684	0.002627
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.001576)	(0.002204)	(0.001258)	(0.001762)	(0.002760)
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	0.001184	0.001564	0.002189	0.001200	0.001673	0.002604
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	(0.001566)	(0.002189)	(0.001251)	(0.001751)	(0.002737)
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	0.001189	0.001570	0.002202	0.001204	0.001680	0.002622
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	(0.001573)	(0.002200)	(0.001254)	(0.001759)	(0.002760)

4.1

4.1 Simulation Study

We have generated two populations (Population I and II) from normal distribution by using R language program, which are given in Appendix A. The results based on these population are given in Tables 1 and 2.

Tables 1 and 2 show that the generalized class of proposed estimators ${\overline{y}}_{S (GP)}^{*'}$ perform better than other existing estimators for both with and without measurement errors. The values of the absolute biases are given in brackets. In Table 1 the MSE for the generalized proposed estimator, when $α_{r} = 1, r = 0, 1, 2, 3$ is 0.021993 for 10% of non-response rate. When the non-response rate increases to 20%, the MSE for generalized proposed estimator increases to 0.024394. It is also observed that ${\overline{y}}_{S (P 1)}^{*'}$ is less biased and ${\overline{y}}_{S (SK)}^{*'}$ is highly biased among all other considered estimator. Table 1 shows the same pattern of results for the case of no measurement error.

In Table 2 the MSE for the generalized proposed estimator, when $α_{r} = 1, r = 0, 1, 2, 3$ is 0.012455 for 10% non-response rate. When the non-response rate increases to 20%, the MSE for generalized proposed estimator increases to 0.013989. It is also observed that ${\overline{y}}_{S (R)}^{*'}$ is less biased and ${\overline{y}}_{S (BT)}^{*'}$ is highly biased among all other considered estimator. Table 2 shows the same pattern of results for the case of no measurement error.

Through the simulation study it is concluded that the generalized proposed class of estimators perform better as compared to the all other existing estimators. For 10% non-response rate, the MSE is minimum as compared to 20% of the non-response rate. The MSE also increases as the value of constant $g_{h}$ increases.

4.2

4.2 Application to Real Data Set

In this section we consider two real life data sets for numerical comparisons, Population III is taken from Rosner, 2015, Population IV is obtained by conducting a survey at Quaid-i-Azam University, Islamabad 4.2.1. The results based on these data sets are given in Tables 5 and 6.

Population III. [Source:Rosner, 2015].

Strata I consist of 318 observations and strata II contains 336 observations. The data summary is given in Tables 3 and 4.

Table 3 Data summary of Strata I.

Variable	Mean	st.Dev	Min	Med	Max
Forced expiratory volume ( $Y_{1}$ )	2.45	0.65	0.79	2.48	3.83
Age ( $X_{1}$ )	9.84	2.93	3.00	10.00	19.00
Smoke ( $S_{1}$ ) 0,1	0.12	0.32	0.00	0.00	1.00

Table 4 Data summary of Strata II.

Variable	Mean	st.Dev	Min	Med	Max
Forced expiratory volume ( $Y_{2}$ )	2.68	1.00	0.79	2.61	5.79
Age ( $X_{2}$ )	10.01	2.97	3.00	10.00	19.00
Sex ( $S_{2}$ )0,1	0.07	0.27	0.00	0.00	1.00

$ρ_{1 XY} = 0.7564, ρ_{1 {XR}_{x}} = 0.7831$ and $ρ_{1 {YR}_{x}} = 0.6151$ .

$ρ_{2 XY} = 0.8109, ρ_{2 {XR}_{x}} = 0.7765$ and $ρ_{2 {YR}_{x}} = 0.6575$ .

4.2.1

4.2.1 Data Collection

To see the practical implication of measurement error, we conducted a study based on real data set at Quaid-i-Azam University, Islamabad. We distributed 55 questionnaires to the students of BS Statistics (5th Semester Fall, 2018) and M.Phil Statistics (1st and 2nd Semesters, Fall 2018) of Quaid-i-Azam University, Islamabad. We consider our population of those students who gave the false response, which comes out to be 23. As we already have the true response from their academic record. In question (i) we asked for $Y$ = Age, $X$ = Marks in A level or Intermediate (in percentage). In question (ii) $S$ = Social media effects the academic result is asked, where Y is the study variable, X is the auxiliary variable and S is the scrambling response variable. We have 23 students ( $N = 23$ ), including 8 male students and 15 female students who gave the false response.

Population IV. [Source: Section 4.2.1].

Let, Y: Age of BS $5^{th}$ and Mphil Students of Statistics department, X: Marks in A level or Intermediate, S:Social media effects on the academic result

$NumberofStrata = 2$ (Male and Female).

$N_{1} = 8, N_{2} = 15, {\overline{Z}}_{1} = 24.25, {\overline{Z}}_{2} = 23.53, {\overline{X}}_{1} = 54.25, {\overline{X}}_{2} = 63.67, {\overline{R}}_{x 1} = 4.5, {\overline{R}}_{x 2} = 8, S_{1 Z}^{2} = 6.39, S_{ZY}^{2} = 3.75, S_{1 X}^{2} = 73.36, S_{2 X}^{2} = 76.95238, S_{1 R_{x}}^{2} = 6, S_{2 R_{x}}^{2} = 20, ρ_{1 ZX} = 0.37740, ρ_{2 ZX} = 0.15107, ρ_{1 {XR}_{x}} = - 0.25875, ρ_{2 {XR}_{x}} = - 0.10014, ρ_{1 {ZR}_{x}} = - 0.69314, ρ_{2 {ZR}_{x}} = - 0.62315$ .

Tables 5 and 6 show that the generalized class of proposed estimators ${\overline{y}}_{S (GP)}^{*'}$ perform better than other existing estimators for both with and without measurement error. The values of the absolute biases are given in brackets in the tables. In Table 5 the MSE for the generalized proposed estimator, when $α_{r} = 1, r = 0, 1, 2, 3$ is 0.004734 for 10% non-response rate. When the non-response rate becomes 20%, the MSE for generalized proposed estimator increases to 0.005848. It is also observed that ${\overline{y}}_{S (GP)}^{*'}$ is less biased and ${\overline{y}}_{S (SK)}^{*'}$ is highly biased among all other considered estimator. Table 5 shows the same pattern of results in case for no measurement error.

Table 5 Mean squared error and

|Bias|

(in brackets) values of different estimators for Population III with and without measurement error

Estimators	10% non-response			20% non-response
	$g_{h}$			$g_{h}$
	2	4	8	2	4	8
${\overline{y}}_{S (HH)}^{*'}$	0.009864	0.011976	0.016201	0.012941	0.017808	0.027542
${\overline{y}}_{S (R)}^{*'}$	0.009932	0.012202	0.016742	0.009053	0.012444	0.019226
${\overline{y}}_{S (R)}^{*'}$	(0.003997)	(0.004906)	(0.006722)	(0.003688)	(0.005117)	(0.007975)
${\overline{y}}_{S (\Pr)}^{*'}$	0.037387	0.045484	0.061677	0.044100	0.060841	0.094323
${\overline{y}}_{S (\Pr)}^{*'}$	(0.004802)	(0.005828)	(0.007881)	(0.006228)	(0.008600)	(0.013344)
${\overline{y}}_{S (BT)}^{*'}$	0.006449	0.007872	0.010719	0.007588	0.010418	0.016076
${\overline{y}}_{S (BT)}^{*'}$	(0.005316)	(0.006551)	(0.009022)	(0.003434)	(0.004720)	(0.007291)
${\overline{y}}_{S (SK)}^{*'}$	0.092504	0.112726	0.153171	0.102530	0.141543	0.219569
${\overline{y}}_{S (SK)}^{*'}$	(0.009534)	(0.011631)	(0.015826)	(0.010642)	(0.014727)	(0.022896)
${\overline{y}}_{S (D)}^{*'}$	0.006262	0.007627	0.010355	0.006917	0.009419	0.014419
${\overline{y}}_{S (AH)}^{*'}$	0.016593	0.020397	0.028004	0.014191	0.019546	0.030256
${\overline{y}}_{S (AH)}^{*'}$	(0.004941)	(0.005987)	(0.008080)	(0.006670)	(0.009207)	(0.014279)
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	0.006252	0.007612	0.010329	0.006904	0.009395	0.014362
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	(0.004152)	(0.005037)	(0.006804)	(0.004963)	(0.006760)	(0.010341)
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	0.006252	0.007613	0.010329	0.006904	0.009395	0.014362
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.004153)	(0.005038)	(0.006804)	(0.004963)	(0.006760)	(0.010342)
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	0.006252	0.007613	0.010329	0.006904	0.009396	0.014363
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.004153)	(0.005038)	(0.006804)	(0.004963)	(0.006760)	(0.010342)
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	0.004734	0.005749	0.007771	0.005848	0.007960	0.012160
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	(0.003277)	(0.003966)	(0.005342)	(0.004165)	(0.005672)	(0.008669)
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S {(GP)}^{*'}}$	0.004737	0.005752	0.007777	0.005851	0.007966	0.012176
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S {(GP)}^{*'}}$	(0.003278)	(0.003969)	(0.005346)	(0.004168)	(0.005677)	(0.008681)

Estimators without Measurement Error	10% non-response			20% non-response
	$g_{h}$			$g_{h}$
	2	4	8	2	4	8
${\overline{y}}_{S (HH)}^{*'}$	0.008595	0.010446	0.014149	0.010151	0.014012	0.021732
${\overline{y}}_{S (R)}^{*'}$	0.001193	0.001468	0.002018	0.000905	0.001243	0.001920
${\overline{y}}_{S (R)}^{*'}$	(0.000333)	(0.000407)	(0.000556)	(0.000353)	(0.000485)	(0.000750)
${\overline{y}}_{S (\Pr)}^{*'}$	0.028648	0.034750	0.046953	0.035952	0.049640	0.077016
${\overline{y}}_{S (\Pr)}^{*'}$	(0.004802)	(0.005828)	(0.007881)	(0.006228)	(0.008600)	(0.013344)
${\overline{y}}_{S (BT)}^{*'}$	0.003313	0.004041	0.005499	0.003459	0.004770	0.007392
${\overline{y}}_{S (BT)}^{*'}$	(0.004884)	(0.005936)	(0.008042)	(0.006588)	(0.009102)	(0.014129)
${\overline{y}}_{S (SK)}^{*'}$	0.061352	0.074378	0.100428	0.078308	0.108130	0.167773
${\overline{y}}_{S (SK)}^{*'}$	(0.004815)	(0.005841)	(0.007893)	(0.006268)	(0.008659)	(0.013441)
${\overline{y}}_{S (D)}^{*'}$	0.001144	0.001407	0.001934	0.0008737	0.001201	0.001856
${\overline{y}}_{S (AH)}^{*'}$	0.001604	0.001962	0.002677	0.001586	0.002183	0.003379
${\overline{y}}_{S (AH)}^{*'}$	(0.005709)	(0.006930)	(0.009371)	(0.007385)	(0.010198)	(0.015823)
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	0.001143	0.001407	0.001933	0.000873	0.001200	0.001855
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	(0.000677)	(0.000833)	(0.001145)	(0.000651)	(0.000893)	(0.001378)
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	0.001143	0.001407	0.001933	0.000873	0.001201	0.001855
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.000677)	(0.000833)	(0.001145)	(0.000651)	(0.000893)	(0.001378)
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	0.001143	0.001407	0.001933	0.000873	0.001201	0.001855
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.000677)	(0.000833)	(0.001145)	(0.000651)	(0.000893)	(0.001378)
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	0.001060	0.001307	0.001800	0.000822	0.001128	0.001742
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	(0.000631)	(0.000777)	(0.001069)	(0.000611)	(0.000839)	(0.001293)
$α_{0} = 0, α_{1, 2, 3} = 1 ({\overline{y}}_{S (GP)}^{*'}$	0.001063	0.001309	0.001801	0.000824	0.001129	0.001743
$α_{0} = 0, α_{1, 2, 3} = 1 ({\overline{y}}_{S (GP)}^{*'}$	(0.000631)	(0.000777)	(0.001070)	(0.000611)	(0.000839)	(0.001294)

Table 6 Mean squared error and

|Bias|

(in brackets) values of different estimators for Population IV with and without measurement error.

Estimators	10% non-response			20% non-response
	$g_{h}$			$g_{h}$
	2	4	8	2	4	8
${\overline{y}}_{S (HH)}^{*'}$	0.630021	0.837808	1.045595	0.630686	0.839804	1.048922
${\overline{y}}_{S (R)}^{*'}$	1.921722	2.793240	3.464758	2.121722	3.170594	4.119467
${\overline{y}}_{S (R)}^{*'}$	(0.130760)	(0.157226)	(0.104294)	(0.148446)	(0.192598)
${\overline{y}}_{S (\Pr)}^{*'}$	2.506471	3.404393	4.302315	2.735328	4.090965	5.446601
${\overline{y}}_{S (\Pr)}^{*'}$	(0.013928)	(0.017060)	(0.011940)	(0.017360)	(0.022779)
${\overline{y}}_{S (BT)}^{*'}$	0.906671	1.228910	1.551149	0.945071	1.344110	1.743148
${\overline{y}}_{S (BT)}^{*'}$	(307.1653)	(395.3818)	(248.3003)	(395.2198)	(542.1394)
${\overline{y}}_{S (SK)}^{*'}$	7.622591	10.43603	13.24947	8.484520	13.02182	17.55912
${\overline{y}}_{S (SK)}^{*'}$	(0.567979)	(0.727644)	(0.461703)	(0.728147)	(0.994592)
${\overline{y}}_{S (D)}^{*'}$	0.619716	0.824887	1.029976	0.619579	0.824426	1.029194
${\overline{y}}_{S (D)}^{*'}$	4.905149	6.754218	8.603288	5.446953	8.379632	11.31231
(0.030936)	(0.044995)	(0.059054)	(0.035498)	(0.058680)	(0.081861)
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	0.617825	0.821560	1.024813	0.617755	0.821341	1.024502
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	(0.076580)	(0.095337)	(0.056926)	(0.074051)	(0.091135)
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	0.617948	0.821788	1.025177	0.617894	0.821628	1.024985
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.076603)	(0.096939)	(0.064079)	(0.091183)	(0.145374)
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	0.617950	0.821792	1.025185	0.617896	0.821632	1.024994
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.076603)	(0.095375)	(0.056940)	(0.074080)	(0.091184)
$α_{r} = 1, r = 0, 1, 2, 3 ({\overline{y}}_{S (GP)}^{*'}$	0.334101	0.444233	0.552328	0.335842	0.447262	0.554985
$α_{r} = 1, r = 0, 1, 2, 3 ({\overline{y}}_{S (GP)}^{*'}$	(0.040907)	(0.050769)	(0.030738)	(0.040354)	(0.049608)
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	0.336942	0.449357	0.560385	0.339090	0.453985	0.566332
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	(0.041422)	(0.051582)	(0.031060)	(0.041023)	(0.050737)

Estimators without Measurement Error	10% non-response			20% non-response
	$g_{h}$			$g_{h}$
	2	4	8	2	4	8
${\overline{y}}_{S (HH)}^{*'}$	0.531888	0.706251	0.880614	0.534662	0.714573	0.894483
${\overline{y}}_{S (R)}^{*'}$	0.847220	1.129459	1.411698	0.893275	1.267623	1.641971
${\overline{y}}_{S (R)}^{*'}$	(0.043594)	(0.054407)	(0.037339)	(0.057267)	(0.077194)
${\overline{y}}_{S (\Pr)}^{*'}$	1.360452	1.797576	2.234700	1.458010	2.090249	2.722487
${\overline{y}}_{S (\Pr)}^{*'}$	(0.013928)	(0.017060)	(0.011940)	(0.017360)	(0.022779)
${\overline{y}}_{S (BT)}^{*'}$	0.546567	0.728539	0.910510	0.553723	0.750007	0.946291
${\overline{y}}_{S (BT)}^{*'}$	(70.46203)	(88.11975)	(60.15501)	(92.51417)	(124.8733)
${\overline{y}}_{S (SK)}^{*'}$	3.332912	4.403433	5.473954	3.663318	5.394650	7.125983
${\overline{y}}_{S (SK)}^{*'}$	(0.172948)	(0.214997)	(0.147613)	(0.223090)	(0.298568)
${\overline{y}}_{S (D)}^{*'}$	0.499049	0.664792	0.830500	0.499692	0.666329	0.832772
${\overline{y}}_{S (AH)}^{*'}$	1.780605	2.369676	2.958748	1.941566	2.852558	3.763550
${\overline{y}}_{S (AH)}^{*'}$	(0.003230)	(0.003599)	(0.002798)	(0.003040)	(0.003283)
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	0.497680	0.662389	0.826776	0.498415	0.664262	0.829706
$α = 0, {\overline{y}}_{S (P 1)}^{*'}$	(0.064500)	(0.080376)	(0.047678)	(0.061733)	(0.075745)
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	0.497707	0.662436	0.826849	0.498447	0.664327	0.829816
$α = 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.064504)	(0.080382)	(0.047681)	(0.061739)	(0.075755)
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	0.497708	0.662437	0.826850	0.498447	0.664328	0.829818
$α = - 1, {\overline{y}}_{S (P 1)}^{*'}$	(0.064505)	(0.080382)	(0.047681)	(0.061739)	(0.075755)
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	0.222332	0.293731	0.363450	0.223663	0.296212	0.366048
$α_{r} = 1, r = 0, 1, 2, 3 {\overline{y}}_{S (GP)}^{*'}$	(0.030276)	(0.037404)	(0.022712)	(0.029331)	(0.035698)
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	0.223172	0.295152	0.365602	0.224612	0.298063	0.369087
$α_{0} = 0, α_{1, 2, 3} = 1 {\overline{y}}_{S (GP)}^{*'}$	(0.030408)	(0.037603)	(0.022800)	(0.029499)	(0.035973)

In Table 6 the MSE for the generalized proposed estimator, when $α_{r} = 1, r = 0, 1, 2, 3$ is 0.334101 for 10% non-response rate. When the non-response rate becomes 20%, the MSE for generalized proposed estimator increases to 0.335842. It is also observed that ${\overline{y}}_{S (R)}^{*'}$ is less biased and ${\overline{y}}_{S (BT)}^{*'}$ is most biased among all other considered estimator. Table 6 shows the same pattern of results in case for no measurement error.

Through real data sets it is concluded that the generalized proposed estimator performs better as compared to the other existing estimators. For 10% non-response rate the MSE is minimum. The MSE also increases as the value of constant $g_{h}$ increases.

5 Conclusion

In the present study, we proposed a generalized class of estimators in estimating the finite population mean for the sensitive variable in the presence of measurement error and non-response under stratified random sampling. Through simulation study and real life data sets it is observed that the proposed class of estimators perform better than the existing estimators. The MSE values are generally smaller under 10% of non-response as compared to 20% of non-response, which are expected results. Generally as the non-response rate increases, MSE also increases. Based on numerical findings, it turns out that the generalized proposed class of estimators is more efficient as compared to the other existing estimators, under certain situations.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

Andridge R.R., Little R.J., . A review of hot deck imputation for survey non-response. Int. Stat. Rev.. 2010;78(1):40-64.
[Google Scholar]
Azeem M., Hanif M., . Joint influence of measurement error and non response on estimation of population mean. Commun. Statistics-Theory Methods. 2017;46(4):1679-1693.
[Google Scholar]
Bahl S., Tuteja R., . Ratio and product type exponential estimators. J. Inform. Optim. Sci.. 1991;12(1):159-164.
[Google Scholar]
Biemer P.P., Groves R.M., Lyberg L.E., Mathiowetz N.A., Sudman S., . Measurement Errors in Surveys. John Wiley & Sons; 2011.
Bouza C.N., Singh P., Singh R., . Ranked set sampling and optional scrambling randomized response modeling. Investigación Operacional. 2018;39(1):100-107.
[Google Scholar]
Chaudhuri A., Pal S., . On efficacy of empirical bayes estimation of a finite population mean of a sensitive variable through randomized responses. Model Assisted Stat. Appl.. 2015;10(4):283-288.
[Google Scholar]
Cochran W.G., . Errors of measurement in statistics. Technometrics. 1968;10(4):637-666.
[Google Scholar]
Cochran W.G., . Sampling techniques. New York: John Wiley & Sons; 1977.
Diana G., Perri P.F., . New scrambled response models for estimating the mean of a sensitive quantitative character. J. Appl. Stat.. 2010;37(11):1875-1890.
[Google Scholar]
Eichhorn B.H., Hayre L.S., . Scrambled randomized response methods for obtaining sensitive quantitative data. J. Stat. Planning inference. 1983;7(4):307-316.
[Google Scholar]
Fuller W.A., . Estimation in the presence of measurement error. Int. Stat. Rev.. 1995;63(2):121-141.
[Google Scholar]
Gjestvang C.R., Singh S., . A new randomized response model. J. Roy. Stat. Soc.. 2006;68(3):523-530.
[Google Scholar]
Greenberg B.G., Kuebler R.R. Jr, Abernathy J.R., Horvitz D.G., . Application of the randomized response technique in obtaining quantitative data. J. Am. Stat. Assoc.. 1971;66(334):243-250.
[Google Scholar]
Grover L.K., Kaur P., . An improved estimator of the finite population mean in simple random sampling. Model Assisted Stat. Appl.. 2011;6(1):47-55.
[Google Scholar]
Gupta S., Shabbir J., . Sensitivity estimation for personal interview survey questions. Statistica. 2004;64(4):643-653.
[Google Scholar]
Gupta S., Shabbir J., . On improvement in estimating the population mean in simple random sampling. J. Appl. Stat.. 2008;35(5):559-566.
[Google Scholar]
Gupta S., Shabbir J., Sehra S., . Mean and sensitivity estimation in optional randomized response models. J. Stat. Planning Inference. 2010;140(10):2870-2874.
[Google Scholar]
Gupta S., Shabbir J., Sousa R., Corte-Real P., . Improved exponential type estimators of the mean of a sensitive variable in the presence of nonsensitive auxiliary information. Commun. Statistics-Simul. Comput.. 2016;45(9):3317-3328.
[Google Scholar]
Hansen M.H., Hurwitz W.N., . The problem of non-response in sample surveys. J. Am. Stat. Assoc.. 1946;41(236):517-529.
[Google Scholar]
Khalil S., Gupta S., Hanif M., . Estimation of finite population mean in stratified sampling using scrambled responses in the presence of measurement errors. Commun. Statistics-Theory Methods 2018:1-9.
[Google Scholar]
Khare B., Pandey S., Kumar A., . Estimation of population mean in sample surveys using auxiliary character, method of call backs and subsampling from non-respondents. Proceedings of the National Academy of Sciences, India Section A: Physical Sciences. 2013;83(1):49-54.
[Google Scholar]
Khare B., Srivastava S., . Generalized two phase sampling estimators for the population mean in the presence of nonresponse. Aligarh Jouranal of Statistics. 2010;30:39-54.
[Google Scholar]
Kim J.-M., Warde W.D., . A stratified warner’s randomized response model. J. Stat. Planning Inference. 2004;120(1–2):155-165.
[Google Scholar]
Kumar S., . Improved estimation of population mean in presence of non-response and measurement error. J. Stat. Theory Practice 2016 just-accepted
[Google Scholar]
Kumar S., Bhougal S., Nataraja N., Viswanathaiah M., . Estimation of population mean in the presence of non-response and measurement error. Revista Colombiana de EstadÝstica. 2015;38(1):145-161.
[Google Scholar]
Rao P., . Ratio estimation with subsampling the nonrespondents. Survey Methodology. 1986;12(2):217-230.
[Google Scholar]
Rosner B., . Fundamentals of biostatistics. Duxbury Press; 2015.
Shabbir J., Gupta S., Ahmed S., . A generalized class of estimators under two-phase stratified sampling for non response. Commun. Stat.-Theory Methods 2018:1-17.
[Google Scholar]
Shabbir J., Khan N.S., . Some modified exponential-ratio type estimators in the presence of non-response under two-phase sampling scheme. Electronic J. Appl. Stati. Anal.. 2013;6(1):1-17.
[Google Scholar]
Shalabh S., . Ratio method of estimation in the presence of measurement errors. J. Indian Society Agric. Stat.. 1997;52:150-155.
[Google Scholar]
Shukla D., Pathak S., Thakur N., . An estimator for mean estimation in presence of measurement error. Res. Rev.: A J. Stat.. 2012;1(1):1-8.
[Google Scholar]
Singh G.N., Khalid M., . Some imputation methods to compensate with non-response for estimation of population mean in two-occasion successive sampling. Commun. Statistics-Theory Methods. 2020;49(14):3329-3351.
[Google Scholar]
Singh H.P., Kumar S., . Estimation of mean in presence of non-response using two phase sampling scheme. Stat. Pap.. 2010;51(3):559-582.
[Google Scholar]
Singh H.P., Kumar S., . Combination of regression and ratio estimate in presence of nonresponse. Brazilian J. Prob. Stat.. 2011;25(2):205-217.
[Google Scholar]
Singh H.P., Mathur N., . Estimation of population mean when coefficient of variation is known using scrambled response technique. J. Stat. Planning Inference. 2005;131(1):135-144.
[Google Scholar]
Singh R.S., Sharma P., . Method of estimation in the presence of non-response and measurement errors simultaneously. J. Modern Appl. Stat. Methods. 2015;14(1):12.
[Google Scholar]
Warner S.L., . Randomized response: A survey technique for eliminating evasive answer bias. J. Am. Stat. Assoc.. 1965;60(309):63-69.
[Google Scholar]
Zahid E., Shabbir J., . Estimation of population mean in the presence of measurement error and non response under stratified random sampling. PloS one. 2018;13(2):e0191572
[Google Scholar]
Zahid E., Shabbir J., . Estimation of finite population mean for a sensitive variable using dual auxiliary information in the presence of measurement errors. PloS one. 2019;14(2):e0212111
[Google Scholar]

Appendix A

Simplification of MSE

Squaring both sides of Eq. (25), and keeping the terms up to power two in errors, and then taking expectations, the MSE of ${\overline{y}}_{S (GP)}^{*'}$ is given by $\begin{matrix} MSE ({\overline{y}}_{S (GP)}^{*'}) ≅ \sum_{h = 1}^{L} P_{h}^{2} [{\overline{Z}}_{h}^{2} + m_{1 h}^{2} ({\overline{Z}}_{h}^{2} + A_{h} + e^{* 2} t_{h}^{2} R_{h}^{' 2} B_{h} + 4 e^{*'} t_{h} R_{h}^{'} C_{h} + 2 f^{*'} t_{h}^{2} R_{h}^{' 2} B_{h}) \\ + m_{2 h}^{2} t_{h}^{2} B_{h} + 2 m_{1 h} m_{2 h} (t_{h} C_{h} + t^{2} R_{h}^{'} B_{h} (e^{*'} + d^{*'})) - 2 m_{h 1} ({\overline{Z}}_{h}^{2} + e^{*'} t_{h} R_{h}^{'} C_{h} + f^{*'} t_{h}^{2} R_{h}^{' 2} B_{h}) \\ - 2 m_{2 h} d^{*'} t_{h}^{2} R_{h}^{'} B_{h} + m_{3 h}^{2} t_{h}^{2} D_{h} + 2 m_{1 h} m_{3 h} (c^{*'} t_{h} R_{h}^{'} F_{h} + e^{*'} t_{h}^{2} R_{h}^{'} F_{h} + t_{h} E_{h} + b^{*'} t_{h}^{2} R_{1 h}^{'} D_{h}) \\ + 2 m_{2 h} m_{3 h} t^{2} F_{h} - 2 m_{3 h} (c^{*'} t_{h} R_{h}^{'} F_{h} + b^{*'} t_{h}^{2} R_{1 h}^{'} D_{h})] . \end{matrix}$ where $R_{1 h}^{'} = \frac{{\overline{Z}}_{h}}{{\overline{R}}_{xh}}$ .

Population

Population I.

$X_{1} = rnorm (1000, 5, 10), Y_{1} = X_{1} + rnorm (1000, 0, 1), y_{1} = Y_{1} + rnorm (1000, 1, 3)$ , $x_{1} = X_{1} + rnorm (1000, 1, 3)$ .

$X_{2} = rnorm (1000, 4, 8), Y_{2} = X_{2} + rnorm (1000, 0, 1), y_{2} = Y_{2} + rnorm (1000, 1, 3)$ , $x_{2} = X_{2} + rnorm (1000, 1, 3)$ .

$X_{3} = rnorm (1000, 4, 9), Y_{3} = X_{3} + rnorm (1000, 0, 1), y_{3} = Y_{3} + rnorm (1000, 1, 3)$ , $x_{3} = X_{3} + rnorm (1000, 1, 3)$ .

$X_{4} = rnorm (1000, 3, 7), Y_{4} = X_{4} + rnorm (1000, 0, 1), y_{4} = Y_{4} + rnorm (1000, 1, 3)$ , $x_{4} = X_{4} + rnorm (1000, 1, 3)$ .

$NumberofStrata = 4$

$N_{1} = 1000, N_{2} = 1000, N_{3} = 1000, N_{4} = 1000$ , $n_{1} = 200, n_{2} = 200, n_{3} = 200, n_{4} = 200$ , ${\overline{Z}}_{1} = 5.719824, {\overline{Z}}_{2} = 4.985474, {\overline{Z}}_{3} = 4.85276, {\overline{Z}}_{4} = 3.835371$ , ${\overline{X}}_{1} = 5.666893, {\overline{X}}_{2} = 3.643237, {\overline{X}}_{3} = 3.968049, {\overline{X}}_{4} = 2.918596$ , ${\overline{R}}_{xi} = 500.5, i = 1, 2, 3, 4$ , $S_{1 Z}^{2} = 124.6685, S_{2 Z}^{2} = 73.65976, S_{3 Z}^{2} = 90.66835, S_{4 Z}^{2} = 61.00159$ , $S_{1 X}^{2} = 104.2774, S_{2 X}^{2} = 66.19725, S_{3 X}^{2} = 81.06883, S_{4 X}^{2} = 45.99937, S_{{iR}_{x}}^{2} = 83416.67, i = 1, 2, 3, 4$ , $ρ_{1 ZX} = 0.9953966, ρ_{2 ZX} = 0.9927347, ρ_{3 ZX} = 0.9940606, ρ_{4 ZX} = 0.9891463$ , $ρ_{1 {ZR}_{x}} = - 0.003890, ρ_{2 {ZR}_{x}} = 0.016016, ρ_{3 {ZR}_{x}} = 0.062953, ρ_{4 {ZR}_{x}} = - 0.031585$ , $ρ_{1 {XR}_{x}} = - 0.044153, ρ_{2 {XR}_{x}} = - 0.011336, ρ_{3 {XR}_{x}} = 0.022509, ρ_{4 {XR}_{x}} = 0.033215$ .

Population II.

$X_{1} = rnorm (1000, 5, 10), Y_{1} = X_{1} + rnorm (1000, 0, 1), y_{1} = Y_{1} + rnorm (1000, 1, 3)$ , $x_{1} = X_{1} + rnorm (1000, 0, 1)$ .

$X_{2} = rnorm (1200, 4, 8), Y_{2} = X_{2} + rnorm (1200, 0, 1), y_{2} = Y_{2} + rnorm (1200, 1, 3)$ , $x_{2} = X_{2} + rnorm (1200, 0, 1)$ .

$X_{3} = rnorm (1300, 4, 9), Y_{3} = X_{3} + rnorm (1300, 0, 1), y_{3} = Y_{3} + rnorm (1300, 1, 3)$ , $x_{3} = X_{3} + rnorm (1300, 0, 1)$ .

$X_{4} = rnorm (1500, 3, 7), Y_{4} = X_{4} + rnorm (1500, 0, 1), y_{4} = Y_{4} + rnorm (1500, 1, 3)$ , $x_{4} = X_{4} + rnorm (1500, 1, 3)$ .

$NumberofStrata = 4$

$N_{1} = 1000, N_{2} = 1200, N_{3} = 1300, N_{4} = 1500$ , $n_{1} = 200, n_{2} = 210, n_{3} = 220, n_{4} = 215$ , ${\overline{Z}}_{1} = 4.648022, {\overline{Z}}_{2} = 4.036113, {\overline{Z}}_{3} = 4.032501, {\overline{Z}}_{4} = 2.969091$ , ${\overline{X}}_{1} = 5.666893, {\overline{X}}_{2} = 3.807569, {\overline{X}}_{3} = 4.627208, {\overline{X}}_{4} = 3.241139$ , ${\overline{R}}_{x 1} = 500.5, {\overline{R}}_{x 2} = 600.5, {\overline{R}}_{x 3} = 650.5, {\overline{R}}_{x 4} = 750.5$ , $S_{1 Z}^{2} = 94.19621, S_{2 Z}^{2} = 66.76728, S_{3 Z}^{2} = 80.80177, S_{4 Z}^{2} = 51.14123$ , $S_{1 X}^{2} = 104.2774, S_{2 X}^{2} = 65.46337, S_{3 X}^{2} = 82.98812, S_{4 X}^{2} = 52.84269$ , $S_{1 R_{x}}^{2} = 83416.67, S_{2 R_{x}}^{2} = 120100, S_{3 R_{x}}^{2} = 140941.7, S_{4 R_{x}}^{2} = 187625$ , $ρ_{1 ZX} = 0.984077, ρ_{2 ZX} = 0.987461, ρ_{3 ZX} = 0.991750, ρ_{4 ZX} = 0.989362$ , $ρ_{1 {ZR}_{x}} = 0.015173, ρ_{2 {ZR}_{x}} = - 0.008185, ρ_{3 {ZR}_{x}} = 0.009465, ρ_{4 {ZR}_{x}} = 0.006319$ , $ρ_{1 {XR}_{x}} = - 0.105091, ρ_{2 {XR}_{x}} = - 0.124294, ρ_{3 {XR}_{x}} = 0.002547, ρ_{4 {XR}_{x}} = - 0.013688$ .

Members of Generalized Proposed Class of Estimators ${\overline{y}}_{S (GP)}^{*'}$ for Different Choices of ( $α_{0}, α_{1}, α_{2}, α_{3}, m_{1 h}, m_{2 h}, m_{3 h}$ ) Members of the class of estimators ${\overline{y}}_{S (GP)}^{*'}$ by choosing different values of $α_{0}, α_{1}, α_{2}, α_{3}, m_{1 h}, m_{2 h}$ and $m_{3 h}$ are given below

1. For $α_{1} = m_{2 h} = m_{3 h} = 0$ and $α_{0} = m_{1 h} = 1$ in Eq. (20), the generalized proposed class of estimators ${\overline{y}}_{S (GP)}^{*'}$ reduces to usual mean estimator as: ${\overline{y}}_{S (0)}^{*'} = \sum_{h = 1}^{L} P_{h} {\overline{z}}_{h}^{*} .$

2. For $α_{0} = α_{1} = m_{1 h} = 1$ and $m_{2 h} = m_{3 h} = 0$ in Eq. (20), the generalized proposed class of estimators ${\overline{y}}_{S (GP)}^{*'}$ reduces to usual ratio estimator: ${\overline{y}}_{S (R)}^{*'} = \sum_{h = 1}^{L} P_{h} (\frac{{\overline{z}}_{h}^{*}}{{\overline{x}}_{h}^{*}} {\overline{X}}_{h}) .$

3. For $α_{0} = m_{1 h} = 1, α_{1} = - 1$ and $m_{2 h} = m_{3 h} = 0$ in Eq. (20), the generalized proposed class of estimators ${\overline{y}}_{S (GP)}^{*'}$ reduces to usual product estimator: ${\overline{y}}_{S (\Pr)}^{*'} = \sum_{h = 1}^{L} P_{h} ({\overline{z}}_{h}^{*} \frac{{\overline{x}}_{h}^{*}}{{\overline{X}}_{h}}) .$

4. For $α_{0} = α_{1} = m_{2 h} = m_{3 h} = 0$ and $m_{1 h} = 1$ in Eq. (20), the generalized proposed class of estimators ${\overline{y}}_{S (GP)}^{*'}$ reduces to Bahl and Tuteja, 1991 estimator: ${\overline{y}}_{S (BT)}^{*'} = \sum_{h = 1}^{L} P_{h} {\overline{z}}^{*} \exp (\frac{{\overline{X}}_{h} - {\overline{x}}_{h}^{*}}{{\overline{X}}_{h} + {\overline{x}}_{h}^{*}}) .$

5. For $α_{0} = α_{1} = α_{2} = 1, m_{3 h} = 0, m_{1 h} = m_{4 h}$ and $m_{2 h} = m_{5 h}$ in Eq. (20), the generalized proposed class of estimators ${\overline{y}}_{S (GP)}^{*'}$ reduces to Gupta and Shabbir, 2008 estimator: ${\overline{y}}_{S (GS)}^{*'} = \sum_{h = 1}^{L} P_{h} [m_{4 h} {\overline{z}}_{h}^{*} (\frac{{\overline{X}}_{h}}{{\overline{x}}_{h}^{*}}) + m_{5 h} ({\overline{X}}_{h} - {\overline{x}}_{h}^{*}) (\frac{{\overline{X}}_{h}}{{\overline{x}}_{h}^{*}})] .$

6. For $α_{0} = m_{2 h} = m_{3 h} = 0, α_{1} = 2$ and $m_{1 h} = 1$ in Eq. (20), the generalized proposed class of estimators ${\overline{y}}_{S (GP)}^{*'}$ reduces to Singh and Kumar, 2010 estimator: ${\overline{y}}_{S (SK)}^{*'} = \sum_{h = 1}^{L} P_{h} [{\overline{z}}_{h}^{*} {(\frac{{\overline{X}}_{h}}{{\overline{x}}_{h}^{*}})}^{2}] .$

7. For $α_{0} = α_{1} = α_{2} = 0, m_{3 h} = 0, m_{1 h} = m_{6 h}$ and $m_{2 h} = m_{7 h}$ in Eq. (20), the generalized proposed class of estimators ${\overline{y}}_{S (GP)}^{*'}$ reduces to Grover and Kaur, 2011 estimator: ${\overline{y}}_{S (GK)}^{*'} = \sum_{h = 1}^{L} P_{h} [m_{6 h} {\overline{z}}_{h}^{*} + m_{7 h} ({\overline{X}}_{h} - {\overline{x}}_{h}^{*}) \exp (\frac{{\overline{X}}_{h} - {\overline{x}}_{h}^{*}}{{\overline{X}}_{h} + {\overline{x}}_{h}^{*}})] .$

8. For $α_{1} = α_{2} = m_{3 h} = 0, α_{0} = m_{1 h} = 1$ and $m_{2 h} = d_{h}^{*'}$ in Eq. (20), the generalized proposed class of estimators ${\overline{y}}_{S (GP)}^{*'}$ reduces to difference estimator: ${\overline{y}}_{S (D)}^{*'} = \sum_{h = 1}^{L} P_{h} [{\overline{z}}_{h}^{*} + d_{h}^{*'} ({\overline{X}}_{h} - {\overline{x}}_{h}^{*'})] .$

9. For $α_{1} = α_{2} = g, m_{1 h} = 1, m_{2 h} = k_{h}$ and $m_{3 h} = 0$ in Eq. (20), the generalized proposed class of estimators ${\overline{y}}_{S (GP)}^{*'}$ reduces to Khalil et al., 2018 estimator given by,

(29)

{\overline{y}}_{S (K)}^{*'} = \sum_{h = 1}^{L} P_{h} [{\overline{z}}_{h}^{*} + k_{h} ({\overline{X}}_{h}^{*} - {\overline{x}}_{h}^{*})] {(\frac{\overline{W^{*}}}{\overline{w^{*}}})}^{g} .

10. For $α_{0} = α_{1} = α_{2} = α, m_{1 h} = m_{8 h}, m_{2 h} = m_{9 h}$ and $m_{3 h} = 0$ in Eq. (20), the generalized proposed estimator ${\overline{y}}_{S (GP)}^{*'}$ reduces to the proposed estimator.

(30)

{\overline{y}}_{S (P 1)}^{*'} = \sum_{h = 1}^{L} P_{h} [\{m_{8 h} {\overline{z}}_{h}^{*} + m_{9 h} ({\overline{X}}_{h} - {\overline{x}}_{h}^{*'})\} {\{\frac{{\overline{X}}_{h}}{{\overline{x}}_{h}^{*'}}\}}^{α} \exp (1 - α) (\frac{{\overline{X}}_{h} - {\overline{x}}_{h}^{*'}}{{\overline{X}}_{h} + {\overline{x}}_{h}^{*'}})] .

Show Sections

[1] Andridge R.R., Little R.J., . A review of hot deck imputation for survey non-response. Int. Stat. Rev.. 2010;78(1):40-64.
[Google Scholar]

[2] Azeem M., Hanif M., . Joint influence of measurement error and non response on estimation of population mean. Commun. Statistics-Theory Methods. 2017;46(4):1679-1693.
[Google Scholar]

[3] Bahl S., Tuteja R., . Ratio and product type exponential estimators. J. Inform. Optim. Sci.. 1991;12(1):159-164.
[Google Scholar]

[4] Biemer P.P., Groves R.M., Lyberg L.E., Mathiowetz N.A., Sudman S., . Measurement Errors in Surveys. John Wiley & Sons; 2011.

[5] Bouza C.N., Singh P., Singh R., . Ranked set sampling and optional scrambling randomized response modeling. Investigación Operacional. 2018;39(1):100-107.
[Google Scholar]

[6] Chaudhuri A., Pal S., . On efficacy of empirical bayes estimation of a finite population mean of a sensitive variable through randomized responses. Model Assisted Stat. Appl.. 2015;10(4):283-288.
[Google Scholar]

[7] Cochran W.G., . Errors of measurement in statistics. Technometrics. 1968;10(4):637-666.
[Google Scholar]

[8] Cochran W.G., . Sampling techniques. New York: John Wiley & Sons; 1977.

[9] Diana G., Perri P.F., . New scrambled response models for estimating the mean of a sensitive quantitative character. J. Appl. Stat.. 2010;37(11):1875-1890.
[Google Scholar]

[10] Eichhorn B.H., Hayre L.S., . Scrambled randomized response methods for obtaining sensitive quantitative data. J. Stat. Planning inference. 1983;7(4):307-316.
[Google Scholar]

[11] Fuller W.A., . Estimation in the presence of measurement error. Int. Stat. Rev.. 1995;63(2):121-141.
[Google Scholar]

[12] Gjestvang C.R., Singh S., . A new randomized response model. J. Roy. Stat. Soc.. 2006;68(3):523-530.
[Google Scholar]

[13] Greenberg B.G., Kuebler R.R. Jr, Abernathy J.R., Horvitz D.G., . Application of the randomized response technique in obtaining quantitative data. J. Am. Stat. Assoc.. 1971;66(334):243-250.
[Google Scholar]

[14] Grover L.K., Kaur P., . An improved estimator of the finite population mean in simple random sampling. Model Assisted Stat. Appl.. 2011;6(1):47-55.
[Google Scholar]

[15] Gupta S., Shabbir J., . Sensitivity estimation for personal interview survey questions. Statistica. 2004;64(4):643-653.
[Google Scholar]

[16] Gupta S., Shabbir J., . On improvement in estimating the population mean in simple random sampling. J. Appl. Stat.. 2008;35(5):559-566.
[Google Scholar]

[17] Gupta S., Shabbir J., Sehra S., . Mean and sensitivity estimation in optional randomized response models. J. Stat. Planning Inference. 2010;140(10):2870-2874.
[Google Scholar]

[18] Gupta S., Shabbir J., Sousa R., Corte-Real P., . Improved exponential type estimators of the mean of a sensitive variable in the presence of nonsensitive auxiliary information. Commun. Statistics-Simul. Comput.. 2016;45(9):3317-3328.
[Google Scholar]

[19] Hansen M.H., Hurwitz W.N., . The problem of non-response in sample surveys. J. Am. Stat. Assoc.. 1946;41(236):517-529.
[Google Scholar]

[20] Khalil S., Gupta S., Hanif M., . Estimation of finite population mean in stratified sampling using scrambled responses in the presence of measurement errors. Commun. Statistics-Theory Methods 2018:1-9.
[Google Scholar]

[21] Khare B., Pandey S., Kumar A., . Estimation of population mean in sample surveys using auxiliary character, method of call backs and subsampling from non-respondents. Proceedings of the National Academy of Sciences, India Section A: Physical Sciences. 2013;83(1):49-54.
[Google Scholar]

[22] Khare B., Srivastava S., . Generalized two phase sampling estimators for the population mean in the presence of nonresponse. Aligarh Jouranal of Statistics. 2010;30:39-54.
[Google Scholar]

[23] Kim J.-M., Warde W.D., . A stratified warner’s randomized response model. J. Stat. Planning Inference. 2004;120(1–2):155-165.
[Google Scholar]

[24] Kumar S., . Improved estimation of population mean in presence of non-response and measurement error. J. Stat. Theory Practice 2016 just-accepted
[Google Scholar]

[25] Kumar S., Bhougal S., Nataraja N., Viswanathaiah M., . Estimation of population mean in the presence of non-response and measurement error. Revista Colombiana de EstadÝstica. 2015;38(1):145-161.
[Google Scholar]

[26] Rao P., . Ratio estimation with subsampling the nonrespondents. Survey Methodology. 1986;12(2):217-230.
[Google Scholar]

[27] Rosner B., . Fundamentals of biostatistics. Duxbury Press; 2015.

[28] Shabbir J., Gupta S., Ahmed S., . A generalized class of estimators under two-phase stratified sampling for non response. Commun. Stat.-Theory Methods 2018:1-17.
[Google Scholar]

[29] Shabbir J., Khan N.S., . Some modified exponential-ratio type estimators in the presence of non-response under two-phase sampling scheme. Electronic J. Appl. Stati. Anal.. 2013;6(1):1-17.
[Google Scholar]

[30] Shalabh S., . Ratio method of estimation in the presence of measurement errors. J. Indian Society Agric. Stat.. 1997;52:150-155.
[Google Scholar]

[31] Shukla D., Pathak S., Thakur N., . An estimator for mean estimation in presence of measurement error. Res. Rev.: A J. Stat.. 2012;1(1):1-8.
[Google Scholar]

[32] Singh G.N., Khalid M., . Some imputation methods to compensate with non-response for estimation of population mean in two-occasion successive sampling. Commun. Statistics-Theory Methods. 2020;49(14):3329-3351.
[Google Scholar]

[33] Singh H.P., Kumar S., . Estimation of mean in presence of non-response using two phase sampling scheme. Stat. Pap.. 2010;51(3):559-582.
[Google Scholar]

[34] Singh H.P., Kumar S., . Combination of regression and ratio estimate in presence of nonresponse. Brazilian J. Prob. Stat.. 2011;25(2):205-217.
[Google Scholar]

[35] Singh H.P., Mathur N., . Estimation of population mean when coefficient of variation is known using scrambled response technique. J. Stat. Planning Inference. 2005;131(1):135-144.
[Google Scholar]

[36] Singh R.S., Sharma P., . Method of estimation in the presence of non-response and measurement errors simultaneously. J. Modern Appl. Stat. Methods. 2015;14(1):12.
[Google Scholar]

[37] Warner S.L., . Randomized response: A survey technique for eliminating evasive answer bias. J. Am. Stat. Assoc.. 1965;60(309):63-69.
[Google Scholar]

[38] Zahid E., Shabbir J., . Estimation of population mean in the presence of measurement error and non response under stratified random sampling. PloS one. 2018;13(2):e0191572
[Google Scholar]

[39] Zahid E., Shabbir J., . Estimation of finite population mean for a sensitive variable using dual auxiliary information in the presence of measurement errors. PloS one. 2019;14(2):e0212111
[Google Scholar]

A generalized class of estimators for sensitive variable in the presence of measurement error and non-response under stratified random sampling

Abstract

Keywords

Auxiliary variable

Measurement error

Non-response

Randomized response

Stratified random sampling

1 Introduction

2 Existing Estimators in Literature

2.1 Hansen and Hurwitz (1946) Estimator

2.2 Ratio Estimator

2.3 Product Estimator

2.4 Bahl and Tuteja, 1991 Estimator

2.5 Singh and Kumar, 2010 Estimator

2.6 Difference Estimator

2.7 Azeem and Hanif (2017) Estimator

3 Proposed Generalized Class of Estimators

4 Numerical Results

4.1 Simulation Study

4.2 Application to Real Data Set

4.2.1 Data Collection

5 Conclusion

Declaration of Competing Interest

References

Appendix A

Suggested read for related articles: