Full article: Enhanced estimation of population mean using simple random sampling

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

The present study suggests an enhanced class of estimators for the population mean estimation utilizing simple random sampling (SRS). The bias, mean square errors (MSE), and minimum MSE of the suggested estimators are computed to the approximation of order one. The efficiency conditions are obtained by comparing the MSE of the proposed and existing estimators. An empirical investigation is carried out using some real and artificially generated symmetric and asymmetric populations. The empirical findings are observed to be positive, clearly demonstrating the preponderance of the suggested estimators over the existing estimators.

KEYWORDS:

1 Introduction

The traditional ratio, regression, and exponential approaches have been widely utilized for population mean estimation in the recent years due to their simple structure and computational ease. With the help of various auxiliary variable-related parameters, including coefficients of variation, skewness and kurtosis, along with the population mean, the standard ratio and exponential estimators have been improved by various researchers to estimate the unknown population mean of the study variable. When applying the ratio, regression, and exponential estimation methods, the population characteristics of the auxiliary variable must also be available ahead of time. Watson (Citation1937) considered auxiliary information and proposed the conventional regression estimator of population mean of the study variable which is best linear unbiased (BLU) estimator. Cochran (Citation1940) proposed the conventional ratio estimator of population mean under SRS. Srivastava (Citation1967) utilized auxiliary information and suggested a power ratio estimator of population mean under SRS. Walsh (Citation1970) suggested a ratio type estimator for the population mean under SRS. Bahl and Tuteja (Citation1991) proposed an exponential ratio estimator of population mean under SRS. Sisodia and Dwivedi (Citation1981), Singh and Kakran (Citation1993), Upadhyaya and Singh (Citation1999), and Singh (Citation2003a) utilized known parameters of auxiliary variable and suggested some modified ratio estimators of population mean under SRS. Singh et al. (Citation2009) suggested a generalized exponential ratio type estimator of population mean which was later on enhanced by Yadav and Kadilar (Citation2013) by utilizing Searls (Citation1964) technique. Yan and Tian (Citation2010) utilized the coefficient of skewness of auxiliary variable and presented a ratio method of estimation of population mean, whereas Subramani and Kumarapandiyan (Citation2012) considered co-efficient of variation and median of an auxiliary variable and developed an estimation procedure of population mean. Jeelani et al. (Citation2013) developed a modified ratio estimators of population mean utilizing a linear combination of co-efficient of skewness and quartile deviation, while Jerajuddin and Kishun (Citation2016) suggested a modified ratio estimators for population mean utilizing size of the sample chosen from the population. Kadilar (Citation2016) suggested an improved ratio cum exponential ratio estimator of population mean. Soponviwatkul and Lawson (Citation2017) considered utilizing a coefficient of variation, correlation coefficient and a regression coefficient, and construct a new ratio estimator for estimating population mean under SRS. Ijaz and Ali (Citation2018) suggested some improved ratio estimators for estimating population mean utilizing SRS. Yadav et al. (Citation2019) adopted various auxiliary information and developed a class of population mean estimators under SRS. Yadav et al. (Citation2019) examined the efficiency of their developed estimators using primary data of production of peppermint oil obtained from the crop from Banikodar Block of Barabanki District situated in Uttar Pradesh, India.

The studies discussed in previous paragraph are either equally or less efficient than the conventional regression (BLU) estimator. In this article, we propose an enhanced class of estimators for the population mean (PM) estimation of the study variable (SV) employing the data from auxiliary variable (AV) based on SRS. The aim of this paper is to:

Propose an enhanced class of estimators for the PM estimation under SRS that competes with the existing estimators, especially regression (BLU) estimator,
Compare theoretically the efficiency of the suggested estimators with the current estimators,
Exemplify the proposed estimators using some real-life populations,
Perform a simulation study using some artificially generated symmetric and asymmetric populations.

The current paper is further designed in the succeeding sections. In Section 2, methodology and terminologies used are explained. In Section 3, some prominent estimators along with their bias, MSE, and minimum MSE expressions are discussed. In Section 4, we suggest an enhanced class of estimators for PM estimation under SRS along with the bias, MSE, and minimum MSE expressions. The efficiency conditions are obtained in Section 5. In Section 6, an empirical illustration of the efficiency of the suggested estimators is provided by using some examples based on some real populations as well as some artificially generated populations. The paper is concluded in Section 7.

2 Methodology and notations

Let a population $P =$ ( $P_{1}, P_{2}$ ,…, $P_{N})$ be the composition of the distinguishable items with a finite size N. Choose a sample of n size based on a finite population of N length by employing simple random sampling without replacement. Let y_i and x_i symbolize the SV and AV for the i^th (i $=$ 1, 2,…, N) unit from P. Let $\bar{y} = \sum_{i = 1}^{n} y_{i} / n$ and $\bar{Y} = \sum_{i = 1}^{N} y_{i} / N$ be the sample and population mean of SV y, respectively; $\bar{x} =$ $\sum_{i = 1}^{n} x_{i} / n$ and $\bar{X} = \sum_{i = 1}^{N} x_{i} / N$ be the sample and population mean of AV x. Let M_x and M_y be the median of AV and SV, respectively. Let $s_{y =} \sqrt{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2} / (n - 1)}$ and $S_{y =} \sqrt{\sum_{i = 1}^{N} {(y_{i} - \bar{Y})}^{2} / (N - 1)}$ be the sample and population standard deviation of SV y. Let $s_{x} = \sqrt{\sum_{i = 1}^{n} {(x_{i} - \bar{x})}^{2} / (n - 1)}$ and $S_{x} = \sqrt{\sum_{i = 1}^{N} {(x_{i} - \bar{X})}^{2} / (N - 1)}$ be the sample and population standard deviation of AV x. Let C $_{y} =$ S $_{y/} \bar{Y}$ and C_x = S_x/ $\bar{X}$ be the population coefficient of variation of SV and AV, respectively. Let ρ_xy represent the population correlation coefficient between AV and SV.

The characteristics of the suggested estimators are further deduced using the notations below:

Let us consider the error terms as $e_{0} = (\bar{y} - \bar{Y})$ / $\bar{Y}$ and $e_{1} = (\bar{x} - \bar{X}) / \bar{X}$ such that the expected values of these error terms are zero and E(e $_{0}^{2}) =$ $f C_{y}^{2},$ E(e $_{1}^{2}) =$ $f C_{x}^{2}$ , and E(e₀e $_{1}) =$ $f ρ_{xy} C_{x} C_{y}$ , where $f =$ 1/n.

3 Existing estimators

The current section devotes to some prominent available estimators which are discussed below along with their MSE/minimum MSE expressions.

The usual unbiased estimator is given by: $t_{m} = \bar{y},$ with the variance as $V (t_{m}) = f {\bar{Y}}^{2} C_{y}^{2} .$

The usual ratio estimator was proposed by Cochran (Citation1940) which is given by: $t_{r} = \bar{y} \frac{\bar{X}}{\bar{x}},$ with the bias and MSE as $\begin{matrix} Bias (t_{r}) = f \bar{Y} (C_{x}^{2} - ρ_{xy} C_{x} C_{y}), \\ MSE (t_{r}) = f {\bar{Y}}^{2} (C_{x}^{2} + C_{y}^{2} - 2 ρ_{xy} C_{x} C_{y}) . \end{matrix}$

The usual regression estimator was proposed by Watson (Citation1937) which is given by $t_{reg} = \bar{y} + β (\bar{X} - \bar{x}),$ with the bias and minimum MSE at the optimum value of scalar $β_{(opt)} = ρ_{xy} \frac{\bar{Y}}{\bar{X}} \frac{C_{y}}{C_{x}}$ as $\begin{matrix} Bias (t_{reg}) = 0, \\ m inMSE (t_{reg}) = f {\bar{Y}}^{2} C_{y}^{2} (1 - ρ_{xy}^{2}) \end{matrix}$

Srivastava (Citation1967) suggested the power ratio estimator as follows $t_{s} = \bar{y} {(\frac{\bar{X}}{\bar{x}})}^{δ},$ with the bias and minimum MSE at the optimum value of scalar $δ_{(opt)} = ρ_{xy} \frac{C_{y}}{C_{x}}$ as $\begin{matrix} Bias (t_{s}) = f \bar{Y} δ (\frac{(δ + 1)}{2} C_{x}^{2} - ρ_{xy} C_{x} C_{y}), \\ m inMSE (t_{s}) = f {\bar{Y}}^{2} C_{y}^{2} (1 - ρ_{xy}^{2}) . \end{matrix}$

Walsh (Citation1970) suggested the following ratio estimator as $t_{w} = \bar{y} (\frac{\bar{X}}{\bar{X} + θ (\bar{x} - \bar{X}}),$ with the bias and minimum MSE at the optimum value of scalar $θ_{(opt)} = ρ_{xy} \frac{C_{y}}{C_{x}}$ as $\begin{matrix} Bias (t_{w}) = f \bar{Y} θ (θ C_{x}^{2} - ρ_{x y} C_{x} C_{y}), \\ m inMSE (t_{w}) = f {\bar{Y}}^{2} C_{y}^{2} (1 - ρ_{xy}^{2}) . \end{matrix}$

Sisodia and Dwivedi (Citation1981) utilized the auxiliary information and suggested the following ratio estimator $t_{sd} = \bar{y} [\frac{\bar{X} + C_{x}}{\bar{x} + C_{x}}],$ with the bias and MSE as $\begin{matrix} Bias (t_{sd}) = f \bar{Y} λ_{1} (λ_{1} C_{x}^{2} - ρ_{xy} C_{x} C_{y}), \\ MSE (t_{sd}) = f {\bar{Y}}^{2} (C_{y}^{2} + λ_{1}^{2} C_{x}^{2} - 2 λ_{1} ρ_{xy} C_{x} C_{y}), \end{matrix}$ where $λ_{1} = \frac{\bar{X}}{\bar{X} + C_{x}}$ .

Bahl and Tuteja (Citation1991) proposed the exponential ratio estimator as $t_{re} = \bar{y} exp [\frac{\bar{X} - \bar{x}}{\bar{X} + \bar{x}}],$ with the bias and MSE as $\begin{matrix} Bias (t_{re}) = f \bar{Y} (\frac{3}{8} C_{x}^{2} - \frac{1}{2} ρ_{xy} C_{x} C_{y}), \\ MSE (t_{re}) = f {\bar{Y}}^{2} (C_{y}^{2} + \frac{C_{x}^{2}}{4} - ρ_{xy} C_{x} C_{y}) . \end{matrix}$

Singh and Kakran (Citation1993) used the coefficient of kurtosis and construct the following ratio estimator $t_{sk} = \bar{y} [\frac{\bar{X} + β_{2} (x)}{\bar{X} + β_{2} (x)}],$ with the bias and MSE as $\begin{matrix} Bias (t_{sk}) = f \bar{Y} λ_{2} (λ_{2} C_{x}^{2} - ρ_{xy} C_{x} C_{y}), \\ MSE (t_{sk}) = f {\bar{Y}}^{2} (C_{y}^{2} + λ_{2}^{2} C_{x}^{2} - 2 λ_{2} ρ_{xy} C_{x} C_{y}), \end{matrix}$ where $λ_{2} = \frac{\bar{X}}{\bar{X} + β_{2} (x)}$ .

Upadhyaya and Singh (Citation1999) utilized the transformed auxiliary information and suggested the following estimators: $\begin{matrix} t_{u p_{1}} = \bar{y} [\frac{C_{x} \bar{X} + β_{2} (x)}{C_{x} \bar{X} + β_{2} (x)}], \\ t_{u p_{2}} = \bar{y} [\frac{β_{2} (x) \bar{X} + C_{x}}{β_{2} (x) \bar{X} + C_{x}}], \end{matrix}$ with the bias and MSE as $\begin{matrix} Bias (t_{u p_{1}}) = f \bar{Y} λ_{3} (λ_{3} C_{x}^{2} - ρ_{xy} C_{x} C_{y}), \\ Bias (t_{u p_{2}}) = f \bar{Y} λ_{4} (λ_{4} C_{x}^{2} - ρ_{xy} C_{x} C_{y}), \\ MSE (t_{u p_{1}}) = f {\bar{Y}}^{2} (C_{y}^{2} + λ_{3}^{2} C_{x}^{2} - 2 λ_{3} ρ_{xy} C_{x} C_{y}), \\ MSE (t_{u p_{2}}) = f {\bar{Y}}^{2} (C_{y}^{2} + λ_{4}^{2} C_{x}^{2} - 2 λ_{4} ρ_{xy} C_{x} C_{y}), \end{matrix}$ where $λ_{3} = \frac{C_{x} \bar{X}}{C_{x} \bar{X} + β_{2} (x)}$ and $λ_{4} = \frac{β_{2} (x) \bar{X}}{β_{2} (x) \bar{X} + C_{x}}$

Singh (Citation2003a) examined the following ratio estimator: $t_{s_{1}} = \bar{y} [\frac{\bar{X} + ρ_{xy}}{\bar{X} + ρ_{xy}}],$ with the bias and MSE as $\begin{matrix} Bias (t_{s_{1}}) = f \bar{Y} λ_{5} (λ_{5} C_{x}^{2} - ρ_{xy} C_{x} C_{y}), \\ MSE (t_{s_{1}}) = f {\bar{Y}}^{2} (C_{y}^{2} + λ_{5}^{2} C_{x}^{2} - 2 λ_{5} ρ_{xy} C_{x} C_{y}), \end{matrix}$ where $λ_{5} = \frac{\bar{X}}{\bar{X} + ρ_{xy}}$ .

Singh et al. (Citation2009) considered various auxiliary information and suggested following class of exponential estimator: $t_{sn} = \bar{y} exp [\frac{(u \bar{X} + v) - (u \bar{x} + v)}{(u \bar{X} + v) + (u \bar{x} + v)}],$ with the bias and MSE as $\begin{matrix} Bias (t_{sn}) = f \bar{Y} (2 τ^{2} C_{x}^{2} - τ ρ_{xy} C_{x} C_{y}), \\ MSE (t_{sn}) = f {\bar{Y}}^{2} (C_{y}^{2} + τ^{2} C_{x}^{2} - 2 τ ρ_{xy} C_{x} C_{y}), \end{matrix}$ where $τ = \frac{a \bar{X}}{2 (a \bar{X} + b)}$ .

Yadav and Kadilar (Citation2013) suggested an improved class of exponential ratio estimator: $t_{yk} = k \bar{y} exp [\frac{(u \bar{X} + v) - (u \bar{x} + v)}{(u \bar{X} + v) + (u \bar{x} + v)}]$ with the bias and minimum MSE at optimum value of $k_{(opt)} = A / B$ as $\begin{matrix} Bias (t_{yk}) = kf \bar{Y} (2 τ^{2} C_{x}^{2} - τ ρ_{xy} C_{x} C_{y}) + \bar{Y} (k - 1), \\ min . MSE (t_{yk}) = {\bar{Y}}^{2} (1 - \frac{A^{2}}{B}), \end{matrix}$ where $A = 1 + f (2 τ^{2} C_{x}^{2} - τ ρ_{xy} C_{x} C_{y})$ and $B = 1 + f (C_{y}^{2} + 5 τ^{2} C_{x}^{2} - 4 τ ρ_{xy} C_{x} C_{y})$ .

Here, u and v are either real amounts or some available population parametric values of AV x.

Kadilar (Citation2016) developed an improved ratio cum exponential ratio class of estimator: $t_{gk} = \bar{y} {(\frac{\bar{X}}{\bar{x}})}^{α} exp [\frac{\bar{X} - \bar{x}}{\bar{X} + \bar{x}}],$ with the bias and minimum MSE at optimum value of scalar $α_{opt} = \frac{(2 ρ_{xy} C_{y} - C_{x})}{2 C_{x}}$ as $\begin{matrix} Bias (t_{gk}) = f \bar{Y} [\frac{α (1 + α)}{2} C_{x}^{2} + \frac{α}{2} C_{x}^{2} + \frac{3}{8} C_{x}^{2} \\ - (α + \frac{1}{2}) ρ_{xy} C_{x} C_{y}], \\ m inMSE (t_{gk}) = f {\bar{Y}}^{2} C_{y}^{2} (1 - ρ_{xy}^{2}) . \end{matrix}$

Ijaz and Ali (Citation2018) suggested the following class of estimators for the estimation of population mean: $t_{ia} = w \bar{y} + (1 - w) \bar{y} \frac{\bar{X}}{\bar{x}},$ with the bias and minimum MSE at optimum value of scalar $w_{(opt)} = 1 - ρ_{xy} \frac{C_{y}}{C_{x}}$ as $\begin{matrix} Bias (t_{ia}) = (1 - w) f \bar{Y} (C_{x}^{2} - ρ_{xy} C_{x} C_{y}), \\ m inMSE (t_{ia}) = f {\bar{Y}}^{2} C_{y}^{2} (1 - ρ_{xy}^{2}) . \end{matrix}$

Yadav et al. (Citation2019) suggested the following class of estimators for population mean under SRS: $\begin{matrix} t_{p_{1}} = \bar{y} [\frac{β_{2} (x) M_{x} \bar{X} + ρ_{xy}}{β_{2} (x) M_{x} \bar{X} + ρ_{xy}}], \\ t_{p_{2}} = \bar{y} [\frac{β_{2} (x) M_{x} \bar{X} + ρ_{xy} C_{x}}{β_{2} (x) M_{x} \bar{X} + ρ_{xy} C_{x}}]' \\ t_{p_{3}} = \bar{y} [\frac{β_{1} (x) M_{x} \bar{X} + ρ_{xy}}{β_{1} (x) M_{x} \bar{X} + ρ_{xy}}], \\ t_{p_{4}} = \bar{y} [\frac{β_{1} (x) M_{x} \bar{X} + ρ_{xy} C_{x}}{β_{1} (x) M_{x} \bar{X} + ρ_{xy} C_{x}}], \\ t_{p_{5}} = \bar{y} [\frac{n \bar{X} + ρ_{xy}}{n \bar{X} + ρ_{xy}}], \\ t_{p_{6}} = \bar{y} [\frac{n \bar{X} + C_{x}}{n \bar{X} + C_{x}}], \\ t_{p_{7}} = \bar{y} [\frac{n \bar{X} + ρ_{xy} C_{x}}{n \bar{X} + ρ_{xy} C_{x}}], \\ t_{p_{8}} = \bar{y} [\frac{n ρ_{xy} \bar{X} + C_{x}}{n ρ_{xy} \bar{X} + C_{x}}], \\ t_{p_{9}} = \bar{y} [\frac{n C_{x} \bar{X} + ρ_{xy}}{n C_{x} \bar{X} + ρ_{xy}}] . \end{matrix}$ with the bias and MSE as $\begin{matrix} Bias (t_{p_{i}}) = f \bar{Y} φ_{i} (φ_{i} C_{x}^{2} - ρ_{xy} C_{x} C_{y}), i = 1, 2, \dots, 9, \\ MSE (t_{p_{i}}) = f {\bar{Y}}^{2} (C_{y}^{2} + φ_{i}^{2} C_{x}^{2} - 2 φ_{i} ρ_{xy} C_{x} C_{y}), \end{matrix}$

where $φ_{1} = \frac{β_{2} (x) M_{x} \bar{X}}{β_{2} (x) M_{x} \bar{X} + ρ_{xy}}, φ_{2} = \frac{β_{2} (x) M_{x} \bar{X}}{β_{2} (x) M_{x} \bar{X} + ρ_{xy} C_{x}}, φ_{3} = \frac{β_{1} (x) M_{x} \bar{X}}{β_{1} (x) M_{x} \bar{X} + ρ_{xy}}, φ_{4} = \frac{β_{1} (x) M_{x} \bar{X}}{β_{1} (x) M_{x} \bar{X} + ρ_{xy} C_{x}}, φ_{5} = \frac{n \bar{X}}{n \bar{X} + ρ_{xy}}, φ_{6} = \frac{n \bar{X}}{n \bar{X} + C_{x}}, φ_{7} = \frac{n \bar{X}}{n \bar{X} + ρ_{xy} C_{x}}, φ_{8} = \frac{n ρ_{xy} \bar{X}}{n ρ_{xy} \bar{X} + C_{x}}$ , and $φ_{9} = \frac{n C_{x} \bar{X}}{n C_{x} \bar{X} + ρ_{xy}}$

4 Proposed estimators

Almost all estimators reviewed in the previous section are either less or equally efficient to the classical regression (BLU) estimator. This work aims to develop an enhanced class of estimators for the estimation of PM of SV using information on AV. Here is the proposed class of estimators: $T = k_{1} \bar{y} {(\frac{\bar{X}}{\bar{x}})}^{k_{2}} exp {\frac{(u \bar{X} + v) - (u \bar{x} + v)}{(u \bar{X} + v) + (u \bar{x} + v)}},$ where k_j, j $=$ 1, 2 are constant to optimize the MSE, while u and v are either real amounts or some available population parametric values of AV x. Some sub-class of T are tabulated in .

Table 1 Several individuals from the proposed estimators.

Display Table

By employing the notations provided in “Methodology and notations,” we can express the suggested estimator as $\begin{matrix} T = k_{1} \bar{Y} (1 + e_{0}) {1 - k_{2} e_{1} + \frac{k_{2} (k_{2} + 1)}{2!} e_{1}^{2} - \dots} \\ {1 - λ e_{1} (1 - λ e_{1} + \frac{3}{2} λ^{2} e_{1}^{2})} \end{matrix}$ where $λ = 2 (u \bar{X} / (u \bar{X} + v))$ .

Here, Taylor’s series expansion is used and the error terms having power greater than 2 are neglected. This provides the following expression: $\begin{matrix} T - \bar{Y} = \bar{Y} [k_{1} {1 + e_{0} - k_{2} e_{1} - λ e_{1} - k_{2} e_{0} e_{1} - λ e_{0} e_{1} \\ + \frac{k_{2} (k_{2} + 1)}{2} e_{1}^{2} + k_{2} λ e_{1}^{2} + \frac{3}{2} λ^{2} e_{1}^{2}} - 1] . \end{matrix}$

Considering expectation both the sides to the above expression, we get $\begin{matrix} Bias (T) = f \bar{Y} [k_{1} {1 - (k_{2} + λ) ρ_{xy} C_{x} C_{y} \\ + \frac{k_{2} (k_{2} + 1)}{2} C_{x}^{2} + k_{2} λ C_{x}^{2} + \frac{3}{2} λ^{2} C_{x}^{2}} - 1] . \end{matrix}$

Squaring and considering expectation both sides to the above expression provides $\begin{matrix} MSE (T) = {\bar{Y}}^{2} (1 + k_{1}^{2} [1 + f C_{y}^{2} + {k_{2} (2 k_{2} + 1) \\ + 4 λ^{2} + 4 k_{2} λ} f C_{x}^{2} - 4 (k_{2} + λ) f ρ_{xy} C_{x} C_{y}] \\ - 2 k_{1} [1 + {\frac{k_{2} (k_{2} + 1)}{2} + k_{2} λ + \frac{3}{2} λ^{2}} f C_{x}^{2} \\ - (k_{2} + λ) f ρ_{xy} C_{x} C_{y}]) . \end{matrix}$

The above MSE equation can further be expressed as $MSE (T) = {\bar{Y}}^{2} (1 + k_{1}^{2} F_{1} - 2 k_{1} F_{2}),$ where $F_{1} = 1 + f C_{y}^{2} + {k_{2} (2 k_{2} + 1) + 4 λ^{2} + 4 k_{2} λ} f C_{x}^{2} - 4 (k_{2} + λ) f ρ_{xy} C_{x} C_{y}$ and $F_{2} = 1 + {\frac{k_{2} (k_{2} + 1)}{2} + k_{2} λ + \frac{3}{2} λ^{2}} f C_{x}^{2} - (k_{2} + λ) f ρ_{xy} C_{x} C_{y}$ .

Minimizing the MSE(T) against k₁ provides the optimum value of k₁ as $k_{1 (opt)} = \frac{F_{2}}{F_{1}} .$

Using the amount of $k_{1 (opt)}$ in the MSE(T), we obtain $MSE {(T)}_{min} = {\bar{Y}}^{2} (1 - \frac{F_{2}^{2}}{F_{1}}) .$

The optimization of k₁and k₂ simultaneously is very typical. Therefore, putting $k_{1} =$ 1 in the estimator T and minimize the MSE of T against k₂ provides the optimum value of k₂ as $k_{2 (opt)} = \frac{ρ_{xy} C_{y}}{C_{x}} - λ .$

5 Efficiency conditions

The MSE ${(T)}_{min}$ is compared with the variance/MSE/minimum MSE of the estimators discussed in “Existing estimators” and obtained the efficiency conditions mentioned below. $\begin{matrix} MSE (T) < V (t_{m}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > 1 - f C_{y}^{2} . \\ MSE (T) < MSE (t_{r}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > 1 - f (C_{y}^{2} + C_{x}^{2} - 2 ρ_{xy} C_{x} C_{y}) . \\ MSE (T) < MSE (t^{*}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > 1 \\ - f C_{y}^{2} (1 - ρ_{xy}^{2}), where t^{*} = t_{reg}, t_{s}, t_{w}, t_{gk}, and t_{ia} . \\ MSE (T) < MSE (t_{sd}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > 1 \\ - f (C_{y}^{2} + λ_{1}^{2} C_{x}^{2} - 2 λ_{1} ρ_{xy} C_{x} C_{y}) . \\ MSE (T) < MSE (t_{re}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > 1 - f (C_{y}^{2} + \frac{C_{x}^{2}}{4} - ρ_{xy} C_{x} C_{y}) . \\ MSE (T) < MSE (t_{sk}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > 1 \\ - f (C_{y}^{2} + λ_{2}^{2} C_{x}^{2} - 2 λ_{2} ρ_{xy} C_{x} C_{y}) . \\ MSE (T) < MSE (t_{u p_{1}}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > 1 \\ - f (C_{y}^{2} + λ_{3}^{2} C_{x}^{2} - 2 λ_{3} ρ_{xy} C_{x} C_{y}) . \\ MSE (T) < MSE (t_{u p_{2}}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > 1 \\ - f (C_{y}^{2} + λ_{4}^{2} C_{x}^{2} - 2 λ_{4} ρ_{xy} C_{x} C_{y}) . \\ MSE (T) < MSE (t_{s_{1}}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > 1 \\ - f (C_{y}^{2} + λ_{5}^{2} C_{x}^{2} - 2 λ_{5} ρ_{xy} C_{x} C_{y}) . \\ MSE (T) < MSE (t_{sn}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > 1 \\ - f (C_{y}^{2} + τ^{2} C_{x}^{2} - 2 τ ρ_{xy} C_{x} C_{y}) . \\ MSE (T) < MSE (t_{yk}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > \frac{A^{2}}{B} . \\ MSE (T) < MSE (t_{p_{i}}) \Rightarrow \frac{F_{2}^{2}}{F_{1}} > 1 \\ - f (C_{y}^{2} + φ_{i}^{2} C_{x}^{2} - 2 φ_{i} ρ_{xy} C_{x} C_{y}), i = 1, 2, \dots, 9. \end{matrix}$

Under these efficiency conditions, the proposed class of estimators represses the reviewed estimators. In practice, these efficiency conditions are verified through the empirical study performed in the next section.

6 Empirical study

This section presents an empirical study in three subsections. In Section 6.1, a numerical study is presented based on four different real populations, in Section 6.2, a simulation study is presented based on artificially generated symmetric and asymmetric populations, while in Section 6.3, discussion of empirical results is presented.

6.1 Numerical study

To improve the theoretical establishment, we have conducted an empirical investigation on four real populations. The descriptive information of these populations is tabulated in .

Table 2 Descriptive statistics of different populations.

Display Table

The optimum values of k₁ and k₂ are computed and the results are listed in for all populations. Further, the range of k₁ (for fixed values of $k_{2})$ and k₂ (for fixed values of $k_{1})$ are computed and the results are listed in and , respectively, under which the proposed class of estimators T is more efficient than the existing estimators. It is noticed from and that there is sufficient scope of choosing the scalars k₁ and k₂ to get better estimators than the existing estimators. From the common range of k₁ and k₂ and optimum values k₁ and k₂ for all populations, it is noticed that the scope of obtaining better estimators from the proposed class of estimators is wide even if the guessed values of the scalars k₁ and k₂ departs substantially from the exact optimum values k₁ and k₂.

Table 3 Optimum values of $(k_{1}, k_{2})$ of the members of the proposed estimator for different populations.

Display Table

Table 4 Range of k₁ at the fixed value of k₂ for different populations.

Display Table

Table 5 Range of k₂ at the fixed value of k₁ for different populations.

Display Table

We have also computed the bias, MSE, and PRE of the estimators and reported in . The following expression is used to calculate PRE: $PRE = \frac{V (t_{m})}{MSE (T^{*})} \times 100,$ where $\begin{matrix} T^{*} = t_{m}, t_{r} t_{reg}, t_{s}, t_{w}, t_{sd}, t_{re}, t_{sk}, t_{u p_{1}}, t_{u p_{2}}, t_{s_{1}}, t_{sn}, t_{yk}, t_{gk}, t_{ia}, \\ t_{p_{1}}, t_{p_{2}}, t_{p_{3}}, t_{p_{4}}, t_{p_{5}}, t_{p_{6}}, t_{p_{7}}, t_{p_{8}}, t_{p_{9}} and T \end{matrix}$

Table 6 Bias, MSE and PRE for several estimators.

Display Table

6.2 Simulation study

To extrapolate the theoretical findings as well as to strengthen the findings of numerical study, we conducted a simulation study based on hypothetically generated symmetric and asymmetric populations. In order to generated the data, following Singh and Horn (Citation1998), we used the models listed below: $\begin{matrix} y = 3.4 + \sqrt{(1 - ρ_{xy})} y^{*} + ρ_{xy} (\frac{S_{y}}{S_{x}}) x^{*} \\ x = 3.2 + x^{*} \end{matrix}$ where $x^{*}$ and $y^{*}$ are possessing the corresponding distributions. In particular, we generated the following populations.

We generated a normal population of size N $=$ 1000 using $x^{*} \sim N (15, 65)$ and $y^{*} \sim N (20, 70)$ .
We generated a gamma population of size N $=$ 1000 using $x^{*} \sim gamma (6.005, 0.05)$ and $y^{*} \sim gamma (7.009, 1.09)$ .

We have taken a sample of size $n =$ 200 from the above populations. To observe the behavior of the proposed estimators, utilizing 15,000 replications, we have computed MSE and percent relative efficiency (PRE) for the varying values of correlation coefficient as 0.1, 0.5, and 0.9. The PRE is calculated with the help of following expression. $PRE = \frac{\sum_{i = 1}^{15, 000} {(t_{m} - \bar{Y})}^{2}}{\sum_{i = 1}^{15, 000} {(T^{*} - \bar{Y})}^{2}} \times 100$

The simulation findings are reported in and for normal and gamma populations with bias, MSE, and PRE, respectively.

Table 7 Bias, MSE, and PRE of several estimators for simulated normal population.

Display Table

Table 8 Bias, MSE, and PRE of several estimators for simulated gamma population.

Display Table

6.3 Discussion of empirical results

After carefully observing the findings of the numerical and simulation studies, we have drawn the following observations:

From , for each population, the outcomes show that the members $T_{(j)}, j = 1, 2, \dots, 8$ of the suggested estimator T obtain the lowest MSE and highest PRE as compare to the existing estimators such as the conventional mean estimator, usual ratio and regression estimators, Srivastava (Citation1967) estimator, Walsh (Citation1970) estimator, Sisodia and Dwivedi (Citation1981) estimator, Bahl and Tuteja (Citation1991) estimator, Singh and Kakran (Citation1993) estimator, Upadhyaya and Singh (Citation1999) estimators, Singh (Citation2003a) estimator, Singh et al. (Citation2009) estimators, Yadav and Kadilar (Citation2013) estimators, Kadilar (Citation2016) estimator, Ijaz and Ali (Citation2018) estimator, and Yadav et al. (Citation2019) estimators.
From , the member T₍₁₎ is found to be best among the proposed class of estimator T in each population.
The results reported in are based on normal population which reveal that the members $T_{(j),} j = 1, 2, \dots, 8$ of the proposed estimator T repress the reviewed estimators with minimum MSE and maximum PRE for each value of $ρ_{xy}$ . Moreover, as $ρ_{xy}$ increase, the MSE and PRE of the members of the proposed estimators also decreases and increases, respectively.
The results reported in are based on gamma population which demonstrate that the members $T_{(j),} j = 1, 2, \dots, 8$ of the proposed estimator T repress the reviewed estimators with minimum MSE and maximum PRE for each value of $ρ_{xy}$ . Moreover, as $ρ_{xy}$ increases, the MSE and PRE of the members of the proposed estimators also increase, respectively.
From and , the member T₍₁₎ is found to be best among the proposed class of estimator T in normal and gamma populations.

7 Conclusions

This paper suggested an enhanced class of estimators for the population mean estimation utilizing the simple random sampling scheme. The bias, mean square error, and minimum mean square error of the proposed class of estimators are obtained mathematically to the approximation of first order. By comparing the minimum MSE expression of the proposed estimators with the MSE/minimum MSE of the existing estimators, the efficiency conditions are determined. A numerical illustration involving four real populations and a simulation study involving artificially generated normal and gamma populations provide support for the efficiency conditions developed in “Efficiency conditions.” The empirical findings demonstrate that the proposed estimator represses the conventional estimators in each population which are further extended with the simulation findings. Therefore, it is advised to utilize the suggested estimators for the population mean estimation in practical issues.

Further, the suggested estimators can be examined for the population mean estimation using different sampling schemes like, stratified random sampling, stratified ranked set sampling, two-phase sampling, adaptive cluster sampling, probability proportion to size sampling, for more details refer the studies of Ahmad and Shabbir (Citation2018), Ahmad et al. (Citation2021), (2022), Iftikhar et al. (Citation2022), Bhushan et al. (Citation2022a), (2022b), (2023), Rana et al. (Citation2022), Bhushan and Kumar (Citation2023a), (2023b), Qureshi and Hanif (Citation2019).

Acknowledgment

The authors are extremely grateful to the learned referees for their valuable comments and to Editor-in-Chief.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

Alomair MA, Shahzad U. 2023. Compromised-imputation and EWMA-based memory-type mean estimators using quantile regression. Symmetry. 15:1888.
Google Scholar
Ahmad S, Hussain S, Aamir M, Yasmeen U, Shabbir J, Ahmad Z. 2021. Dual use of auxiliary information for estimating the finite population mean under the stratified random sampling scheme. J Math. 2021:1–12.
Web of Science ®Google Scholar
Ahmad S, Hussain S, Shabbir J, Aamir M, El-Morshedy M, Ahmad Z, Alrajhi S. 2022. Improved generalized class of estimators in estimating the finite population mean using two auxiliary variables under two-stage sampling. AIMS Math. 7:10609–10624.
Web of Science ®Google Scholar
Ahmad S, Shabbir J. 2018. Use of extreme values to estimate finite population mean under pps sampling scheme. J Reliab Stat Stud. 11:99–112.
Google Scholar
Bahl S, Tuteja RK. 1991. Ratio and product type exponential estimators. J Inform Optim Sci. 12:159–164.
Google Scholar
Bhushan S, Kumar A. 2023a. Enhanced estimation of population mean under two-phase sampling. Int J Math Modell Numer Optim. 13:34–48.
Google Scholar
Bhushan S, Kumar A. 2023b. New efficient class of estimators of population mean using two-phase sampling. Int J Math Oper Res. 24:155–172.
Google Scholar
Bhushan S, Kumar A, Shahab S, Lone SA, Akhtar MT. 2022a. On efficient estimation of population mean under stratified ranked set sampling. J Math. 2022:1–20.
Web of Science ®Google Scholar
Bhushan S, Kumar A, Shahzad U, Al-Omari AI, Almanjahie AI. 2022b. On some improved class of estimators by using stratified ranked set sampling. Mathematics. 10:3283.
Web of Science ®Google Scholar
Bhushan S, Kumar A, Singh S. 2023. Some efficient classes of estimators under stratified sampling. Commun Stat Theory Methods. 52:1767–1796.
Web of Science ®Google Scholar
Cochran WG. 1940. The estimation of the yields of cereal experiments by sampling for the ratio of grain to total produce. J Agric Sci. 30:262–275.
Google Scholar
Iftikhar S, Khalil A, Ali A. 2022. A novel and improved logarithmic ratio-product type estimator of mean in stratified random sampling. Math Prob Eng. 2022:1–20.
Web of Science ®Google Scholar
Ijaz M, Ali H. 2018. Some improved ratio estimators for estimating mean of finite population. Res Rev J Stat Math Sci. 4:18–23.
Google Scholar
Jeelani MI, Maqbool S, Mir SA. 2013. Modified ratio estimators of population mean using linear combination of co-efficient of skewness and quartile deviation. Int J Mod Math Sci. 6:174–183.
Google Scholar
Jerajuddin M, Kishun J. 2016. Modified ratio estimators for population mean using size of the sample, selected from population. Int J Sci Res Sci Eng Technol. 2:10–16.
Google Scholar
Kadilar C, Cingi H. 2003. Ratio estimators in stratified random sampling. Biometrical J. 45:218–225.
Web of Science ®Google Scholar
Kadilar GO. 2016. A new exponential type estimator for the population mean in simple random sampling. J Mod App Stat Meth. 15:207–214.
Google Scholar
Qureshi MN, Hanif M. 2019. Generalized estimator for the estimation of clustered population mean in adaptive cluster sampling. Commun Stat Theory Method. 50:3262–3275.
Web of Science ®Google Scholar
Rana Q, Qureshi MN, Hanif M. 2022. Generalized estimators for population mean using an auxiliary attribute in stratified two-phase sampling. J Stat Theory Appl. 21:44–57.
Web of Science ®Google Scholar
Sarndal CE, Swensson B, Wretman J. 2003. Model assisted survey sampling. New York (NY): Springer-Verlag.
Google Scholar
Searls DT. 1964. The utilization of a known coefficient of variation in the estimation procedure. J Am Stat Assoc. 59:1225–1226.
Web of Science ®Google Scholar
Singh GN. 2003a. On the improvement of product method of estimation in sample surveys. J Ind Soc Agric Stat. 56:267–275.
Google Scholar
Singh R, Chauhan P, Sawan N, Smarandache F. 2009. Improvement in estimating the population mean using exponential estimator in simple random sampling. Bull Stat Econ. 3:13–18.
Google Scholar
Singh HP, Kakran MS. 1993. A modified ratio estimator using known coefficient of kurtosis of an auxiliary character. Unpublished.
Google Scholar
Singh HP, Horn S. 1998. An alternative estimator for multi-character surveys. Metrika. 48:99–107.
Web of Science ®Google Scholar
Singh S. 2003b. Advanced sampling theory with applications: how Michael “selected” Amy. Vol. 2. Dordrecht: Springer.
Google Scholar
Sisodia BVS, Dwivedi VK. 1981. A modified ratio estimator using coefficient of variation of auxiliary variable. J Ind Soc Agric Stat. 33:13–18.
Google Scholar
Soponviwatkul K, Lawson N. 2017. New ratio estimators for estimating population mean in simple random sampling using a coefficient of variation, correlation coefficient and a regression coefficient. Gazi Univ J Sci. 30:610–621.
Web of Science ®Google Scholar
Srivastava SK. 1967. An estimator using auxiliary information. Calcutta Stat Assoc Bull. 16:121–132.
Google Scholar
Subramani J, Kumarapandiyan G. 2012. Estimation of population mean using co-efficient of variation and median of an auxiliary variable. Int J Prob Stat. 1:36–40.
Google Scholar
Upadhyaya LN, Singh HP. 1999. Use of transformed auxiliary variable in estimating the finite population mean. Biom J. 41:627–636.
Web of Science ®Google Scholar
Walsh JE. 1970. Generalization of ratio estimator for population total. Sankhya A. 32:99–106.
Google Scholar
Watson DJ. 1937. The estimation of leaf area in field crops. J Agric Sci. 27:474–483.
Google Scholar
Yadav SK, Dixit MK, Dungana HN, Mishra SS. 2019. Improved estimators for estimating average yield using auxiliary variable. Int J Math Eng Manag Sci. 4:1228–1238.
Google Scholar
Yadav SK, Kadilar C. 2013. Efficient family of exponential estimators for the population mean. Hacettepe J Math Stat. 42:671–677.
Web of Science ®Google Scholar
Yan Z, Tian B. 2010. Ratio method to the mean estimation using coefficient of skewness of auxiliary variable. In: Zhu R, Zhang Y, Liu B, Liu C, editors. Information Computing and Applications. ICICA 2010. Communications in Computer and Information Science. Vol. 106. Berlin: Springer.
Google Scholar

Enhanced estimation of population mean using simple random sampling

Abstract

1 Introduction

2 Methodology and notations

3 Existing estimators

4 Proposed estimators

Table 1 Several individuals from the proposed estimators.

5 Efficiency conditions

6 Empirical study

6.1 Numerical study

Table 2 Descriptive statistics of different populations.

Table 3 Optimum values of $(k_{1}, k_{2})$ of the members of the proposed estimator for different populations.

Table 4 Range of k₁ at the fixed value of k₂ for different populations.

Table 5 Range of k₂ at the fixed value of k₁ for different populations.

Table 6 Bias, MSE and PRE for several estimators.

6.2 Simulation study

Table 7 Bias, MSE, and PRE of several estimators for simulated normal population.

Table 8 Bias, MSE, and PRE of several estimators for simulated gamma population.

6.3 Discussion of empirical results

7 Conclusions

Acknowledgment

Disclosure statement

References

Information for

Open access

Opportunities

Help and information

Enhanced estimation of population mean using simple random sampling

Abstract

1 Introduction

2 Methodology and notations

3 Existing estimators

4 Proposed estimators

Table 1 Several individuals from the proposed estimators.

5 Efficiency conditions

6 Empirical study

6.1 Numerical study

Table 2 Descriptive statistics of different populations.

Table 3 Optimum values of (k1, k2) of the members of the proposed estimator for different populations.

Table 4 Range of k1 at the fixed value of k2 for different populations.

Table 5 Range of k2 at the fixed value of k1 for different populations.

Table 6 Bias, MSE and PRE for several estimators.

6.2 Simulation study

Table 7 Bias, MSE, and PRE of several estimators for simulated normal population.

Table 8 Bias, MSE, and PRE of several estimators for simulated gamma population.

6.3 Discussion of empirical results

7 Conclusions

Acknowledgment

Disclosure statement

References

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature

Table 3 Optimum values of $(k_{1}, k_{2})$ of the members of the proposed estimator for different populations.

Table 4 Range of k₁ at the fixed value of k₂ for different populations.

Table 5 Range of k₂ at the fixed value of k₁ for different populations.