New Modification of Ranked Set Sampling For Estimating Population Mean: Neutrosophic Median Ranked Set Sampling With An Application To Demographic Data
New Modification of Ranked Set Sampling For Estimating Population Mean: Neutrosophic Median Ranked Set Sampling With An Application To Demographic Data
https://doi.org/10.1007/s44196-024-00548-y
RESEARCH ARTICLE
Abstract
The study addressed the limitations of classical statistical methods when dealing with ambiguous data, emphasizing the
importance of adopting neutrosophic statistics as a more effective alternative. Classical methods falter in managing uncer-
tainty inherent in such data, necessitating a shift towards methodologies like neutrosophic statistics. To address this gap, the
research introduced a novel sampling approach called “neutrosophic median ranked set sampling” and incorporated neu-
trosophic estimators tailored for estimating the population mean in the presence of ambiguity. This modification aims to
address the inherent challenges associated with estimating the population mean when dealing with neutrosophic data. The
methods employed involved modifying traditional ranked set sampling techniques to accommodate neutrosophic data char-
acteristics. Additionally, neutrosophic estimators were developed to leverage auxiliary information within the framework of
median-ranked set sampling, enhancing the accuracy of population mean estimation under uncertain conditions. The methods
employed involved modifying traditional ranked set sampling techniques to accommodate neutrosophic data characteristics.
Bias and mean squared error equations for the suggested estimators were provided, offering insights into their theoretical
underpinnings. To illustrate the effectiveness and practical applications of the proposed methodology and estimators, a numer-
ical demonstration and simulation study have been conducted using the R programming language. The key results highlighted
the superior performance of the proposed estimators compared to existing alternatives, as demonstrated through comprehen-
sive evaluations based on mean squared error and percentage relative efficiency criteria. The conclusions drawn underscored
the effectiveness of the neutrosophic median ranked set sampling approach and suggested estimators in estimating the pop-
ulation mean under conditions of uncertainty, particularly when utilizing neutrosophic auxiliary information and validated
real-life applicability. The methodology and estimators presented in the study were shown to yield interval-based results, pro-
viding a more realistic representation of uncertainty associated with population parameters. This interval estimation, coupled
with minimum mean squared error considerations, enhanced the efficacy of the estimators in determining population mean
values. The novelty of the work lies in its introduction of a tailored sampling approach and estimators designed specifically
for neutrosophic data, filling a significant gap in the literature. By extending classical statistics to accommodate ambiguity,
the study offers a substantial advancement in statistical methodology, particularly in domains where precise data is scarce
and uncertainty is prevalent. Furthermore, the empirical validation through numerical demonstrations and simulation studies
using the R programming language adds robustness to the proposed methodology and contributes to its practical applicability.
Keywords Neutrosophic statistics · Study variable · Auxiliary variable · Bias · Mean squared error · Percentage relative
efficiency
Florentin Smarandache
Rajesh Singh and Florentin Smarandache contributed equally to this
[email protected]
work.
1 Department of Statistics, Banaras Hindu University, Varanasi,
B Anamika Kumari Uttar Pradesh 221005, India
[email protected]
2 Department of Mathematics, University of New Mexico,
Rajesh Singh Gallup, USA
[email protected]
123
International Journal of Computational Intelligence Systems (2024) 17:210 Page 3 of 15 210
tion, was validated through empirical analysis using both low symmetric distribution, making it an attractive avenue
real and simulated population data. Results unequivocally for further exploration in the context of NMRSS.
demonstrated the superiority of the recommended estima- Several factors drive our exploration of NMRSS and its
tors over existing ones, showcasing lower MSEs and higher associated estimators for population parameter estimation.
PREs, with t p2 emerging as the optimal estimator. As sam- A primary motivation is to introduce MRSS and MRSS
ple sizes and correlation coefficients increased, the MSE estimators in a neutrosophic setting. Previous research in
decreased and PRE increased for the recommended esti- survey sampling has predominantly focused on clear, well-
mator, indicating its robustness and efficiency. NMRSS defined data, while classical sampling methods yield precise
estimators surpassed their counterparts NSRS, with RSS results, albeit with potential risks of inaccuracies, overes-
proving to be a superior replacement for SRS. Similarly, timations, or underestimations. However, classical methods
under NRSS, MRSS outperformed RSS, particularly when fall short when handling set-type or undetermined data, char-
data exhibited non-symmetric distributions. Comparative acteristic of neutrosophic data, which is more prevalent in
analysis revealed that MSEs and PREs of suggested esti- real-world scenarios than crisp data. Thus, there is a growing
mators obtained through classical MRSS fell between those need for additional neutrosophic statistical techniques. Tradi-
obtained using NMRSS, suggesting the latter’s superiority. tional statistical approaches are ill-suited to compute accurate
The study underscored the inadequacy of classical RSS in estimates of unknown parameters when dealing with indeter-
handling vague or indeterminate data, emphasizing NMRSS minate, vague, imprecise, or interval-type data. Neutrosophic
as a superior method for estimating uncertain or interval data. statistics serve as a suitable replacement for classical statis-
It can be computationally burdensome to analyze neutro- tics in such scenarios.
sophic data, particularly when processing large datasets with The necessity to close the gap between classical and neu-
advanced models or algorithms. It can be difficult to inter- trosophic statistics, and inspired by [36, 37] work, our work
pret analysis based on neutrosophic data, particularly when introduces a new sampling technique NMRSS, and enhanced
dealing with contradicting or ambiguous results. Compared NMRSS estimators for population mean estimation. Despite
to other approaches or even just SRS, RSS implementation thorough research in the field, we found no prior studies
can be more complicated. Before sampling, the population in survey sampling that addressed the estimation of pop-
must be carefully ranked and sorted, which can take time and ulation means in the presence of auxiliary variables using
require specialized knowledge. neutrosophic data under MRSS. This research represents a
Our study is designed as follows: Sect. 1 presents an significant step toward filling this gap and contributes to the
introduction, and Sect. 2 outlines motivation, needs, and evolving domain of neutrosophic statistics.
research gaps. Section 3 outlines the modified and novel It has been well established by multiple authors that RSS
sampling method, neutrosophic median ranked set sampling is a more suitable option than SRS when dealing with cum-
method. Section 4 presents existing NMRSS estimators. Sec- bersome, expensive, or time-consuming measurements and
tion 5 presents proposed NMRSS estimators, while Sect. 6 MRSS is a better option than RSS when data does not follow
presents an application with the help of empirical study using the symmetric distribution. The challenges associated with
demographic data, and Sect. 7 offers a simulation study using measurements in a neutrosophic context exacerbate these
artificial data for different sets of values. Section 8 is dedi- problems. Therefore, our research introduces an NMRSS
cated to a discussion, and Sect. 9 covers concluding remarks method to enhance the accuracy of the population mean esti-
and future research directions. mators in this unique context.
This article’s main objective is to introduce a novel approach Numerous methods can be used to display the neutrosophic
known as the “neutrosophic median ranked set sampling” for observations, and the neutrosophic numbers may include an
dealing with interval-type or neutrosophic data. Our study unknown interval [a, b]. We are describing neutrosophic
focuses on sampling theory, marking the first instance of values as Z r ss N ≡ Z r ss L + Z r ssU Ir ss N with Ir ss N ∈
proposing an MRSS technique tailored to neutrosophic data, [Ir ss L , Ir ssU ], N is here to represent the neutrosophic num-
along with the development of NMRSS estimators for pop- ber. Hence, our neutrosophic observations will lie in an
ulation mean estimation. This marks a significant step in interval Z r ss N ∈ [a, b], where ‘a’ and ‘b’ denote the neu-
expanding the field of sampling theory by comparing these trosophic data’s lower and upper values.
estimators with existing neutrosophic methods such as ratio, In MRSS, a small subset of randomly chosen popula-
product, and regression estimators. MRSS is considered a tion units are measured after they have been ranked solely
superior alternative to SRS and RSS when data does not fol- based on observation or experience. In the context of MRSS,
123
210 Page 4 of 15 International Journal of Computational Intelligence Systems (2024) 17:210
1
r mN
m independent sets of random samples, each consisting of
m units, are drawn from the overall population. Every unit y (o)N = y m N +1 (2)
rmN i 2 j
j=1 i=1
within a set has an equal probability of selection. The mem-
bers of each random set are then arranged in order based on
and their corresponding variances are given as
the characteristics of the auxiliary variable. Then, if the sam-
ple size m is even, identify ( m2 )th smallest ranked unit from ⎛ ⎞
1
r
mN
the first ( m2 ) sets, and select ( m+2
2 )th smallest ranked unit Var(x (o)N ) = Var ⎝ x m N +1 ⎠
from the other ( m2 ) sets. In case when the sample size m is rmN i 2 j
j=1 i=1
odd, identify ( m+1 2 )th smallest ranked unit from all sets for 1
actual quantification. Throughout the process, rm (= n) units = σ 2 (3)
r m N x m N2+1
have been acquired as this cycle is replicated r times. ⎛ ⎞
The method of NMRSS consists of selecting m N ∈ 1 r mN
size and NMRSSe for an even set size. For jth cycle Var(y (e)N )
⎛ ⎛m ⎞⎞
MRSSo is denoted by (X 1( m N +1 ) j , Y1[ m N +1 ] j ), (X 2( m N +1 ) j , N
2 2 2 ⎜ 1
r
⎜
2
mN
⎟⎟
Y2[ m N +1 ] j ), ..., (X m m N +1 , Ym m N +1 ); ( j = 1, 2, ..., r ). = Var ⎝ ⎝ y mN + y m N +2 ⎠⎠
2 N( 2 )j N[ 2 ]j rmN i 2 i j 2
j=1 i=1 mN
Let the sample means of X N and Y N are, respectively, as i= 2 +1
1
= σ 2 m N + σ 2 [m N +2] (8)
2r m N y 2 y 2
1
r mN
x (o)N = x m N +1 (1) Consider a neutrosophic random sample of size n N ∈
rmN i j
j=1 i=1
2 [n L , n U ] using NMRSS, which is acquired from a finite
123
International Journal of Computational Intelligence Systems (2024) 17:210 Page 5 of 15 210
population of N units (U1 , U2 , ..., U N ). The neutrosophic The MSE of the estimator y pN is given by
study and auxiliary variable are Y N ∈ [Y L , YU ] and
X N ∈ [X L , X U ]. Let y [n]N ∈ [y [n]L , y [n]U ] and x (n)N ∈ 2
MSE(y r N ) = Y N (Vyr N + Vxr N + 2Vyxr N ). (14)
[x (n)L , x (n)U ] be the sample means of the neutrosophic study
and auxiliary variables respectively, and also, let Y N ∈
The regression estimator under NMRSS for the population
[Y L , Y U ] and X N ∈ [X L , X U ] be the population means of
mean Y
the neutrosophic study and auxiliary variables, respectively.
The correlation coefficient between both neutrosophic study
y regN = y [n]N + β(X N − x (n)N ). (15)
and auxiliary variables is ρ yx N ∈ [ρ yx L , ρ yxU ].
Let the neutrosophic mean error terms are 0N ∈
The MSE of the estimator y regN is given by
[0 L , 0U ] and 1N ∈ [1 L , 1U ]. To obtain the bias and
MSE of the estimators, we write
y [n]N = Y N (1 + 0N ), x (n)N = X N (1 + 1N ) 2 Vyxr N 2
Var(y [n]N )
MSE(y regN ) = YN Vyr N − . (16)
2 )=
E(0N = Vyr N Vxr N
2
YN
2 )= Var(x (n)N )
E(1N 2 = Vxr N The exponential ratio-type estimator under NMRSS for the
XN
Cov(y [n]N ,x (n)N ) population mean Y is represented as
E(0N , 0N ) = = Vyxr N
YN XN
e0N ∈ [e0L
2 2 , e2 ];
0U e1N ∈ [e1 L , e1U
2 2 2 ]; e e
0N 1N ∈ [e0 L e1 L , X N − x (n)N
e0U e0U ]; y expr N = y [n]N exp . (17)
X N + x (n)N
Vyr N ∈ [Vyr L , VyrU ]; Vxr N ∈ [Vxr L , VxrU ]; Vyxr N ∈
[Vyxr L , VyxrU ].
The MSE of the estimator y expr N is given by
2 Vyr N
4 Existing Estimators MSE(y expN ) = Y Vyr N + − Vyxr N . (18)
4
The usual unbiased estimator for the population mean Y using
The exponential product type estimator under NMRSS for
NMRSS technique is given by
the population mean Y is represented as
1
nN
y [n]N = y[i]N . (9) x (n)N − X N
nN y exp pN = y [n]N exp . (19)
i=1 x (n)N + X N
The variance of the estimator y [n]N is given by The MSE of the estimator y exp pN is given by
2
var(y [n]N ) = Y N Vyr N . (10) 2 Vyr N
MSE(y exp pN ) = Y Vyr N + + Vyxr N . (20)
4
The ratio estimator under NMRSS for the population mean
Y
5 Proposed Estimators
XN
y r N = y [n]N . (11) No single estimator is universally effective in all situations.
x (n)N
Hence, it is preferable to employ estimators that yield min-
imal MSE and high precision. The objective of this section
The MSE of the estimator y r N is given by
is to formulate estimators that demonstrate effective perfor-
mance across a broader range of circumstances. In pursuit
2
MSE(y r N ) = Y N (Vyr N + Vxr N − 2Vyxr N ). (12) of this goal, we have embraced the [14] estimator within the
framework of NMRSS and introduced two novel estimators
The product estimator under NMRSS for the population for the mean of a finite population under NMRSS, utilizing
mean Y auxiliary variables.
x (n)N x (n)N
y pN = y [n]N . (13) (1)P1N = y [n]N (g1N + 1) + g2N log , (21)
XN XN
123
210 Page 6 of 15 International Journal of Computational Intelligence Systems (2024) 17:210
where the constants g1N and g2N ensure that the MSE of the ×(1 + log (1 + 1N ) ) (28)
estimators is minimized. P2N − Y N = (g3N − 1) Y N + g3N Y N 0N
When we express the estimator P1N from Eq. (21) into terms
1N 51N 2
of ’s, we obtain +g4N 1 + − (29)
2 8
P1N = Y N (1 + 01 ) 5
Bias(P2N ) = Y N (g3N − 1) + g4N 1 − Vxr N . (30)
X N (1 + 1N ) 8
(g1N + 1) + g2N log . (22)
XN
Case 1: If sum of weights is fixed (g3N + g4N = 1)
Taking expectations by focusing on first-order approxima- The MSE of the estimator P2N is shown as
tion, we obtain MSE,
2
MSE(P2N ) = Y N Vyr N + g4N
2
Vxr N − 2g4N Vyxr N . (31)
2
MSE (P1N ) = Y N Vyr N + g1N
2
A1N + g2N
2
B1N
+2g1N C1N + 2g2N D1N + 2g1N g2N E 1N , To find out the minimum value of MSE for P2N , we partially
(23) differentiate equation (31) w.r.t. g4N , and equating to zero
we get
where
Vyxr N
g4N ∗ = . (32)
2 Vxr N
A1N = Y N (1 + Vyr N )
B1N = Vxr N By putting the optimum value of g4N in Eq. (31), we can
2
C1N = Y N Vyr N determine the minimum MSE of P2N as
D1N = Y Vyxr N
Vyxr N 2
1 MinMSE = Y N
2
− .
E 1N = Y Vyxr N − Vxr N . Vyr N
Vxr N
(33)
2
To find out the minimum MSE for P1N , we partially differ- Case 2: If the sum of weights is adjustable (g3N + g4N =
entiate equation (23) w.r.t. g1N & g2N and equating to zero, 1)
we obtain
By putting the optimum value of g1N & g2N in the Eq. Squaring on both sides we get
(23), we can determine the minimum MSE of P1N as
2
(P2N − Y N )
MinMSE 2 2 2
2
B1N C1N + A1N D1N
2 − 2C = Y N + Y N g3N (1 + 01
2
) + g4N
2
1 − 1N
2
1N D1N E 1N
= C1N + (26)
E 1N − A1N B1N
2
2
2
51N
−2g3N Y N − 2g4N Y N 1 −
X N − x (n)N 8
(2)P2N = g3N y [n]N + g4N exp
X N + x (n)N 51N 2 0N 1N
+2g3N g4N Y N 1 − + . (35)
x (n)N 8 2
1 + log . (27)
XN
Taking expectations by focusing on first-order approxima-
When we express the estimator P2N from Eq. (27) into terms tion, we obtain MSE,
of ’s, we obtain
MSE (P2N ) = Y N + g3N
2 2
A2N + g4N
2
B2N − 2g3N C2N
−1N
P2N = g3N Y N (1 + 0N ) + g4N exp −2g4N D2N + 2g3N g4N E 2N , (36)
2 + 1N
123
International Journal of Computational Intelligence Systems (2024) 17:210 Page 7 of 15 210
Case 1: If sum of weights is fixed (g5N + g6N = 1) To find out the minimum MSE for P3N , we partially differ-
The MSE of the estimator P3N is shown as entiate equation (49) w.r.t. g5N & g6N and equating to zero,
we get
2
MSE(P3N ) = Y N Vyr N + g6N
2
Vxr N − 2g6N Vyxr N . B3N C3N − D3N E 3N
g5N ∗ = (50)
(44) A3N B3N − E 3N
2
123
210 Page 8 of 15 International Journal of Computational Intelligence Systems (2024) 17:210
123
International Journal of Computational Intelligence Systems (2024) 17:210 Page 9 of 15 210
Table 2 The MSE and PRE of the estimators for odd case Similarly, in Tables 4 and 5, the MSEs of the recommended
Estimators MSE PRE and existing estimators are given along with PRE through
a simulation study based on artificial neutrosophic data for
y [n]N [0.351545, 1.209508] [100, 100] different values of the correlation coefficient and different
yr N [0.110387, 0.200354] [318, 603] sample sizes. Like Tables 2 and 3, the superiority of the sug-
y pN [0.849801, 3.365292] [35, 41] gested NMRSS estimators over the existing ones is displayed
y regN [0.085726, 0.117522] [410, 1029] in Tables 4 and 5. We also see that the MSEs and PREs of the
y expr N [0.198829, 0.561602] [176, 215] recommended estimators are lesser and higher, respectively
y exp pN [0.568536, 2.144071] [56, 61] than those of other existing estimators. Hence, Tables 4 and
t p1 [0.085315, 0.115937] [412, 1043] 5 mirrored these findings, with the proposed estimators con-
t p2 [0.013749, 0.028534] [2556, 4238] tinuing to outshine existing ones, demonstrating lower MSEs
t p3 [0.020461, 0.029224] [1718, 4138] and higher PREs in the simulation study too.
Tables 4 and 5 show that with the increase in sample sizes
and correlation coefficients, the MSE decreases, and the PRE
Table 3 The MSE and PRE of the estimators for even case
increases for the recommended estimator. Therefore, under
Estimators MSE PRE NMRSS, the suggested estimators exhibit sensitivity similar
y [n]N [0.402361, 0.760615] [100, 100] to that of classical ranked set sampling.
yr N [0.13007, 0.203868] [309, 373]
Table 6 features PRE values of the proposed NMRSS esti-
mators compared to NSRS counterparts. We observe from
y pN [0.918725, 1.843483] [41, 43]
Table 6 that all PRE values exceeded 100 indicating that all
y regN [0.083821, 0.121898] [480, 623]
the NMRSS estimators are superior to corresponding estima-
y expr N [0.235706, 0.416477] [170, 182]
tors under NSRS, as RSS is the best replacement for SRS.
y exp pN [0.630034, 1.236284] [61, 63]
Table 7 features PRE values of the proposed NMRSS esti-
t p1 [0.083418, 0.121109] [482, 628] mators over NRSS counterparts. We see from Table 7 that all
t p2 [0.010757, 0.018951] [3740, 4013] PRE values exceeded 100, indicating that all the NMRSS
t p3 [0.017894, 0.027317] [2248, 2784] estimators are superior to corresponding estimators under
NRSS, as MRSS is the best replacement for RSS when data
do not follow the symmetric distribution.
123
210 Page 10 of 15 International Journal of Computational Intelligence Systems (2024) 17:210
123
International Journal of Computational Intelligence Systems (2024) 17:210 Page 11 of 15 210
Table 6 PREs of the NMRSS estimators over estimators under NSRS The comparison between classical MRSS and NMRSS
n = 12 ρ = 0.9 ρ = 0.8 ρ = 0.7 ρ = 0.6 using MSEs and PREs is provided in Tables 8 and 9. Tables 8
Estimators PRE PRE PRE PRE and 9 demonstrate that the MSEs of the suggested estimators
obtained through classical MRSS are between the lower and
y [n]N [119, 209] [122, 187] [118, 160] [119, 152]
higher value of MSE obtained using NMRSS, and the same
yr N [118, 118] [122, 130] [117, 125] [123, 140]
goes for the PREs, indicating that the latter method is more
y pN [121, 250] [120, 233] [121, 226] [120, 210]
effective than the former.
y regN [116, 119] [125, 126] [117, 118] [123, 127]
The study highlights that classical ranked set sampling
y expr N [117, 151] [124, 139] [116, 119] [120, 126] is ill-suited for dealing with vague or indeterminate data.
y exp pN [120, 237] [121, 218] [120, 201] [120, 186] NMRSS proves superior for estimating uncertain or interval
t p1 [115, 119] [125, 125] [117, 118] [123, 127] data. The tables present dependable results for neutrosophic
t p2 [124, 203] [120, 230] [124, 264] [125, 256] data compared to classical results.
t p3 [119, 120] [124, 132] [118, 130] [124, 144]
123
210 Page 12 of 15 International Journal of Computational Intelligence Systems (2024) 17:210
123
International Journal of Computational Intelligence Systems (2024) 17:210 Page 13 of 15 210
123
210 Page 14 of 15 International Journal of Computational Intelligence Systems (2024) 17:210
123
International Journal of Computational Intelligence Systems (2024) 17:210 Page 15 of 15 210
Cognitive intelligence with neutrosophic statistics in bioinformat- 35. Takahasi, K., Wakimoto, K.: On unbiased estimates of the popu-
ics 417–429 (2023) lation mean based on the sample stratified by means of ordering.
25. Sisodia, B., Dwivedi, V.K.: A modified ratio estimator using co- Ann. Inst. Stat. Math. 20, 1–31 (1968)
efficient of variation of auxiliary variable. J. Indian Soc. Agric. 36. Vishwakarma, G., Singh, A.: Computing the effect of measurement
Stat. 33(1), 13–18 (1981) errors on ranked set sampling estimators of the population mean.
26. Smarandache, F.: Neutrosophy: neutrosophic probability, set, and Concurr. Comput. Pract. Exp. 34(27), e7333 (2022)
logic: analytic synthesis & synthetic analysis. ProQuest Inf. Learn. 37. Vishwakarma, G.K., Singh, A.: Generalized estimator for compu-
105, 118–123 (1998) tation of population mean under neutrosophic ranked set technique:
27. Smarandache, F.: A unifying field in logics: neutrosophic logic. an application to solar energy data. Comput. Appl. Math. 41(4), 144
Philosophy. American Research Press, 1–141 (1999) (2022)
28. Smarandache, F.: A unifying field in logics: neutrosophic logic,
neutrosophic set, neutrosophic probability, and statistics (2001)
arXiv:math/0101228
Publisher’s Note Springer Nature remains neutral with regard to juris-
29. Smarandache, F.: Neutrosophic set a generalization of the intuition-
dictional claims in published maps and institutional affiliations.
istic fuzzy set. Int. J. Pure Appl. Math. 24(3), 287 (2005)
30. Smarandache, F.: Neutrosophic logic—a generalization of the
intuitionistic fuzzy logic. Multispace Multi Struct. Neutrosophic
Transdiscipl. 4, 396 (2010)
31. Smarandache, F.: Introduction to neutrosophic measure, neutro-
sophic integral, and neutrosophic probability. In: Infinite study
(2013)
32. Smarandache, F.: Introduction to neutrosophic statistics. In: Infinite
study (2014)
33. Stokes, L.: Ranked set sampling with concomitant variables. Com-
mun. Stat. Theory Methods 6, 1207–1211 (1977)
34. Tahir, Z., Khan, H., Aslam, M., Shabbir, J., Mahmood, Y., Smaran-
dache, F.: Neutrosophic ratio-type estimators for estimating the
population mean. Complex Intell. Syst. 7, 2991–3001 (2021)
123