Jackknife Empirical Likelihood Methods
Yongcheng Qi*
Department of Mathematics and Statistics, University of Minnesota Duluth, USA
Submission: February 15, 2018; Published: June 25, 2018
*Corresponding author: Yongcheng Qi, Department of Mathematics and Statistics, University of Minnesota Duluth, 1117 University Drive, Duluth, MN 55812, USA; Email: ptrnch@yahoo.co.uk
How to cite this article: Yongcheng Q. Jackknife Empirical Likelihood Methods. Biostat Biometrics Open Acc J. 2018; 7(2): 555708. DOI: 10.19080/BBOAJ.2018.07.555708
Abstract
Since Jing et al. [1] propose the jackknife empirical likelihood method for U-statistics, the method has been developed and applied to inference problems in statistical theory and other areas such as biostatistics, medical statistics and insurance. In this short review paper, we give an introduction on one sample and two-sample jackknife empirical methods and smoothed jackknife empirical likelihood methods and present a brief literature review on applications of these methods.
Keywords: Empirical likelihood; Jackknife empirical likelihood; Smoothed jackknife empirical; Likelihood; Wilks’ theorem; confidence interval; Medical statistics; Nonparametric; Probability; Normal-approximation; Computing; Nonlinear statistics; U-statistics; Independent random vectors; Chi-square distribution; Pseudo-sample; Density functions; Asymptotic covariances; Dimensional means; Heavy computational; Errors-in-variables; Lagrange multipliers
Abbrevations: ELM: Empirical Likelihood Method; JEL: Jackknife Empirical Likelihood; ROC: Receiver Operating Characteristic
Introduction
The study of empirical likelihood method (ELM) dates back to Thomas & Grunkemeier [2] who introduce a nonparametric likelihood ratio test statistic for constructing confidence intervals for the survival probability for right censored data. The method has drawn much attention and has been extensively investigated and widely used to construct confidence regions and to test hypotheses after the work by Owen [3,4] for the mean vectors and functionals. The empirical likelihood is a non-parametric method, but it possesses some good properties of the parametric likelihood method, and it has many advantages over some classical and modern methods, such as the normal-approximation-based method and the bootstrap method. Computing the empirical likelihood ratio involves optimizing the likelihood function over nparameters, where n is the sample size. For linear functionals or estimating functions, the optimization problem can be easily solved by using the method of Lagrange multipliers. The major problem of extending the ELM to nonlinear functionals or nonlinear statistics is the difficulty of nonlinear optimizations. The research in this direction has been greatly stimulated since Jing et al. [1] propose a jackknife empirical likelihood (JEL) method for U-statistics. The main idea of the JEL method is to apply the ELM to the jackknife samples, and the parameters of interest become the means of jackknife samples. For jackknife method, see, e.g., Shao & Tu [5].
Jackknife empirical likelihood method
Throughout, for each define X(-i)− as the (n-1)− dimensional sub- Vector of Xwhen the i−th component xi is removed from X for 1≤i≤n that is,
One-sample JEL. Suppose X1,....,Xn are independent random vectors with common distribution function F and a d-dimensional parameter θ of interest is associated with .F Suppose 1,,nXX are independent random vectors with common distribution function .F write X=(X1,....,Xn) If there exists a function Tn(X) such that
for some positive definite matrix .Σ The so-called jackknife pseudo-sample {t1,..,tn} is defined as
As in Tukey [6], one expects that its′ are approximately independent. This motivates us to apply the standard empirical likelihood method to the jackknife sample {t1,..,tn} for constructing empirical likelihood confidence regions for .θ The jackknife empirical likelihood function is defined as
It follows from the Lagrange multiplier technique that the above maximization is achieved at and the log empirical likelihood ratio is given by
Where λ satisfies
We say that Wilks’ theorem holds if l J(θ0) converges in distribution to a chi-square distribution
With d degrees of freedom, where θ0 is the true value of the parameter θ . Wilks’ theorem can be
used to construct 1−α confidence regions for θ
Where is the α level critical value for a chi-square distribution with d degrees of freedom, or
we can reject the null hypothesis
Two-sample JEL
Suppose that X1,....,Xn1 and Y1,....,Yn2 are two independent samples with distribution function F and G, respectively. Write X=X1,....,Xn1, Y =Y1,....,Yn2 , Assume that the parameter θ we are interested in can be estimated by Tn1n2 (X Y) and (2.1) holds for this estimate. The two-sample jackknife pseudo-sample is defined as
and
Then, based on the jackknife sample {t1,...,tn} we define the log empirical likelihood ratio l J (α ) as in (2.2). When the parameters of interest are functionals of one distribution function F or two distribution functions F and G, the estimates of the parameters can naturally defined as functionals of the empirical distributions for F and G.
Smoothed JEL
If the functionals of the empirical distributions are used to estimate the parameters of interest and the asymptotic covariance matrices in (2.1) depend on local properties such as the density functions of the underlying distributions, the sample covariance matrices based on resulting jackknife samples are usually not consistent estimates of the asymptotic covariances in the central limit theorem (2.1), and in this case, Wilks’ theorem will fail to hold. One should consider smoothed JEL methods. The so-called smoothed JEL uses the functionals of the smoothed empirical distributions for F and G to generate jackknife samples, which can overcome the problem of variance inconsistency.
Applications
In addition to the work by Jing et al. [1] for one- and twosample U-statistics, there are a few applications of the standard JEL methods in statistics. Wang et al. [7] propose JEL based test for equality of two high dimensional means, Zhang et al. [8] investigate the population mean with ranked set samples. JEL based confidence intervals for the mean absolute deviation and difference of two Gini indices are studied by Zhao et al. [9] and Wang & Zhao [10], respectively. The JEL methods have been applied in many problems in insurance and actuarial sciences. For instance, JEL-based confidence intervals for copulas are proosed by Peng et al. [11], and Wang et al. [12]. Wilks’ theorem for JEL methods for Spearmans rho and a class of risk measures are proved by Wang & Peng [13] and Peng, et al. [14], respectively. For the tail copulas and difference of two quantles, Wilks’ theorem are shown to be valid for smoothed JEL methods by Peng & Qi [15] and Yang & Zhao [16,17].
In diagnostic medicine, the accuracy of a diagnostic test in discriminating diseased patients from non-diseased ones is measured by the receiver operating characteristic (ROC) curve when the response of a test is continuous. Let F and G be the distribution functions of the diseased and non-diseased populations, respectively. Then the ROC curve can be written as θ =θ (t ) =1− F (G−1 (1− t )), 0 < t <1, which is a functional of two populations. Gong et al. [18] propose smoothed JEL method and construct the confidence intervals for θ (t ), and Yang and Zhao [19] extend the method under the setup of missing data. Adimari & Chiogna [20] and An & Zhao [21] employ JEL methods for confidence intervals for partial areas under ROC curves and the difference of two volumes under ROC surfaces, respectively. JEL based confidence regions for quantities in sensitivity and specificity for continuous-scale diagnostic tests are investigated in Wang & Qin [22]. To construct confidence regions when many nuisance paramters are present, a profile empirical likelihood method has to be employed, which is computationally costly in general. Li et al. [23] and Peng [24] propose JEL method for estimating equations to avoid heavy computational burden. Further, Zhang et al. [25] propose jackknife-blockwise empirical likelihood methods for data under dependence.
JEL methods have also been applied in regression models. JEL based confidence intervals for the regression parameters in accelerated failure time model with censored observations and in linear transformation models under right censoring are studied by Bouadoumou et al. [26] and Yang et al. [27], respectively. JEL based confidence intervals for the error variances in linear mode land in partially linear varying-coefficient errors-in-variables models are investigated by Lin et al. [28] and Liu and Liang [29]. The JEL based intervals of mean with regression imputation is considered by Zhong & Chen [30]. In summary, a common feature for these applications is that the JEL methods maintain the advantages of empirical likelihood methods over normal approximation methods and perform very well for small samples [31]. Computationally JEL methods are easy and straight forward even for complicated statistical problems.
References
- Jing BY, Yuan J, Zhou W (2009) Jackknife empirical likelihood. Journal of the American Statistical Association 104(487): 1224-1232.
- Thomas DR, Grunkemeier GL (1975) Confidence interval estimation of survival probabilities for censored data. Journal of the American Statistical Association 70: 865-871.
- Owen AB (1988) Empirical likelihood ratio confidence intervals for a single functional. Biometrika 75(2): 237-249.
- Owen AB (1990) Empirical likelihood ratio confidence regions. Annals of Statistics 18: 90-120.
- Shao J, Tu D (1995) The Jackknife and Bootstrap, Springer, UK.
- Tukey JW (1958) Bias and confidence in not-quite large samples. The Annals of Statistics 29: 614-623.
- Wang R, Peng L and Qi Y (2013) Jackknife empirical likelihood test for equality of two high dimensional means. Statistica Sinica 23: 667-690.
- Zhang Z, Liu T, Zhang B (2016) Jackknife empirical likelihood inferences for the population mean with ranked set samples. Statistics & Probability Letters 108: 16-22.
- Zhao Y, Meng X, Yang H (2015) Jackknife empirical likelihood inference for the mean absolute deviation. Computational Statistics & Data Analysis 91: 92-101.
- Wang D, Zhao Y (2016) Jackknife empirical likelihood for comparing two Gini indices. Canadian Journal of Statistics DOI: 10.1002/cjs.11275.
- Peng L, Qi Y, Van Keilegom I (2012) Jackknife empirical likelihood method for copulas. Test 21: 74-92.
- Wang R, Peng L, Yang J (2013) Jackknife empirical likelihood for parametric copulas. Scandinavian Actuarial Journal 1: 325-339.
- Wang R, Peng L (2011) Jackknife empirical likelihood intervals for Spearmans rho. North American Actuarial Journal 15: 475-486
- Peng L, Qi Y, Wang R, Yang J (2012) Jackknife empirical likelihood methods for some risk measures and related quantities. Insurance: Mathematics and Economics 51: 142-150.
- Peng L, Qi Y (2010) Smoothed jackknife empirical likelihood method for tail copulas. Test 19: 514-536.
- Yang H, Zhao Y (2017) Smoothed jackknife empirical likelihood for the difference of two quantiles. Ann Inst Statist Math 69: 1059-1073.
- Yang H, Zhao Y (2018) Smoothed jackknife empirical likelihood for the one-sample difference of quantiles. Computational Statistics & Data Analysis 120: 58-69.
- Gong Y, Peng L, Qi Y (2010) Smoothed jackknife empirical likelihood method for ROC curve. Journal of Multivariate Analysis 101: 520-1531.
- Yang H, Zhao Y (2015) Smoothed jackknife empirical likelihood inference for ROC curves with missing data. Journal of Multivariate Analysis 140: 123-138.
- Adimari G, Chiogna M (2012) Jackknife empirical likelihood based confidence intervals for partial areas under ROC curves. Statistica Sinica 22: 1457-1477.
- An Y, Zhao Y (2017) Jackknife empirical likelihood for the difference of two volumes under ROC surfaces. Annals of the Institute of Statistical Mathematics Pp. 1-8.
- Wang B, Qin G (2016) Jackknife empirical likelihood confidence regions for the evaluation of continuous-scale diagnostic tests with verification bias. Stat Methods Med Res 25(5): 2120-2137.
- Li M, Peng L, Qi Y (2011) Reduce computation in profile empirical likelihood method. The Canadian Journal of Statistics 38: 370-384.
- Peng L (2012) Approximate jackknife empirical likelihood method for estimating equations. Canadian Journal of Statistics 40: 110-123.
- Zhang R, Peng L, Qi Y (2012) Jackknife-blockwise empirical likelihood methods under dependence. Journal of Multivariate Analysis 104: 56- 72.
- Bouadoumou M, Zhao Y, Lu Y (2015) Jackknife empirical likelihood for the accelerated failure time model with censored data. Communications in Statistics-Simulation and Computation 44(7): 1818-1832.
- Yang H, Liu S, Zhao Y (2016) Jackknife empirical likelihood for linear transformation models with right censoring. Annals of the Institute of Statistical Mathematics 68(5): 1095-1109.
- Lin HL, Li Z, Wang D, Zhao Y (2017) Jackknife empirical likelihood for the error variance in linear models. Journal of Nonparametric Statistics 29(2): 151-166.
- Liu AA, Liang HY (2017) Jackknife empirical likelihood of error variance in partially linear varying-coefficient errors-in-variables models. Statistical Papers 58: 95-122.
- Zhong PS, Chen S (2014) Jackknife empirical likelihood inference with regression imputation and survey data. Journal of Multivariate Analysis 129: 193-205.
- Hall P, La Scala B (1990) Methodology and algorithms of empirical likelihood. International Statistical Review 58: 109-127.