Yongcheng Qi

doi:10.19080/BBOAJ.2018.07.555708

Mini Review

Jackknife Empirical Likelihood Methods

Yongcheng Qi*

Department of Mathematics and Statistics, University of Minnesota Duluth, USA

Submission: February 15, 2018; Published: June 25, 2018

*Corresponding author: Yongcheng Qi, Department of Mathematics and Statistics, University of Minnesota Duluth, 1117 University Drive, Duluth, MN 55812, USA; Email: ptrnch@yahoo.co.uk

How to cite this article: Yongcheng Q. Jackknife Empirical Likelihood Methods. Biostat Biometrics Open Acc J. 2018; 7(2): 555708. DOI: 10.19080/BBOAJ.2018.07.555708

Abstract

Since Jing et al. [1] propose the jackknife empirical likelihood method for U-statistics, the method has been developed and applied to inference problems in statistical theory and other areas such as biostatistics, medical statistics and insurance. In this short review paper, we give an introduction on one sample and two-sample jackknife empirical methods and smoothed jackknife empirical likelihood methods and present a brief literature review on applications of these methods.

Keywords: Empirical likelihood; Jackknife empirical likelihood; Smoothed jackknife empirical; Likelihood; Wilks’ theorem; confidence interval; Medical statistics; Nonparametric; Probability; Normal-approximation; Computing; Nonlinear statistics; U-statistics; Independent random vectors; Chi-square distribution; Pseudo-sample; Density functions; Asymptotic covariances; Dimensional means; Heavy computational; Errors-in-variables; Lagrange multipliers

Abbrevations: ELM: Empirical Likelihood Method; JEL: Jackknife Empirical Likelihood; ROC: Receiver Operating Characteristic

Introduction

The study of empirical likelihood method (ELM) dates back to Thomas & Grunkemeier [2] who introduce a nonparametric likelihood ratio test statistic for constructing confidence intervals for the survival probability for right censored data. The method has drawn much attention and has been extensively investigated and widely used to construct confidence regions and to test hypotheses after the work by Owen [3,4] for the mean vectors and functionals. The empirical likelihood is a non-parametric method, but it possesses some good properties of the parametric likelihood method, and it has many advantages over some classical and modern methods, such as the normal-approximation-based method and the bootstrap method. Computing the empirical likelihood ratio involves optimizing the likelihood function over nparameters, where n is the sample size. For linear functionals or estimating functions, the optimization problem can be easily solved by using the method of Lagrange multipliers. The major problem of extending the ELM to nonlinear functionals or nonlinear statistics is the difficulty of nonlinear optimizations. The research in this direction has been greatly stimulated since Jing et al. [1] propose a jackknife empirical likelihood (JEL) method for U-statistics. The main idea of the JEL method is to apply the ELM to the jackknife samples, and the parameters of interest become the means of jackknife samples. For jackknife method, see, e.g., Shao & Tu [5].

Jackknife empirical likelihood method

Throughout, for each define X_{(-i)− as the (n-1)− dimensional sub- Vector of Xwhen the i−th component x_i
is removed from X for 1≤i≤n that is,}

One-sample JEL. Suppose X₁,....,X_n are independent random vectors with common distribution function F and a d-dimensional parameter θ of interest is associated with .F Suppose 1,,nXX are independent random vectors with common distribution function .F write X=(X₁,....,X_n) If there exists a function T_n(X) such that

for some positive definite matrix .Σ The so-called jackknife pseudo-sample {t₁,..,t_n} is defined as

As in Tukey [6], one expects that its′ are approximately independent. This motivates us to apply the standard empirical likelihood method to the jackknife sample {t₁,..,t_n} for constructing empirical likelihood confidence regions for .θ The jackknife empirical likelihood function is defined as

It follows from the Lagrange multiplier technique that the above maximization is achieved at and the log empirical likelihood ratio is given by

Where λ satisfies

We say that Wilks’ theorem holds if l^J(θ₀) converges in distribution to a chi-square distribution

With d degrees of freedom, where θ₀ is the true value of the parameter θ . Wilks’ theorem can be

used to construct 1−α confidence regions for θ

Where is the α level critical value for a chi-square distribution with d degrees of freedom, or

we can reject the null hypothesis

Two-sample JEL

Suppose that X₁,....,X_n1 and Y₁,....,Y_n2 are two independent samples with distribution function F and G, respectively. Write X=X₁,....,X_n1, Y =Y₁,....,Y_n2 , Assume that the parameter θ we are interested in can be estimated by T_n1_n2 (X Y) and (2.1) holds for this estimate. The two-sample jackknife pseudo-sample is defined as

and

Then, based on the jackknife sample {t₁,...,t_n} we define the log empirical likelihood ratio l ^J (α ) as in (2.2). When the parameters of interest are functionals of one distribution function F or two distribution functions F and G, the estimates of the parameters can naturally defined as functionals of the empirical distributions for F and G.

Smoothed JEL

If the functionals of the empirical distributions are used to estimate the parameters of interest and the asymptotic covariance matrices in (2.1) depend on local properties such as the density functions of the underlying distributions, the sample covariance matrices based on resulting jackknife samples are usually not consistent estimates of the asymptotic covariances in the central limit theorem (2.1), and in this case, Wilks’ theorem will fail to hold. One should consider smoothed JEL methods. The so-called smoothed JEL uses the functionals of the smoothed empirical distributions for F and G to generate jackknife samples, which can overcome the problem of variance inconsistency.

Applications

In addition to the work by Jing et al. [1] for one- and twosample U-statistics, there are a few applications of the standard JEL methods in statistics. Wang et al. [7] propose JEL based test for equality of two high dimensional means, Zhang et al. [8] investigate the population mean with ranked set samples. JEL based confidence intervals for the mean absolute deviation and difference of two Gini indices are studied by Zhao et al. [9] and Wang & Zhao [10], respectively. The JEL methods have been applied in many problems in insurance and actuarial sciences. For instance, JEL-based confidence intervals for copulas are proosed by Peng et al. [11], and Wang et al. [12]. Wilks’ theorem for JEL methods for Spearmans rho and a class of risk measures are proved by Wang & Peng [13] and Peng, et al. [14], respectively. For the tail copulas and difference of two quantles, Wilks’ theorem are shown to be valid for smoothed JEL methods by Peng & Qi [15] and Yang & Zhao [16,17].

In diagnostic medicine, the accuracy of a diagnostic test in discriminating diseased patients from non-diseased ones is measured by the receiver operating characteristic (ROC) curve when the response of a test is continuous. Let F and G be the distribution functions of the diseased and non-diseased populations, respectively. Then the ROC curve can be written as θ =θ (t ) =1− F (G⁻¹ (1− t )), 0 < t <1, which is a functional of two populations. Gong et al. [18] propose smoothed JEL method and construct the confidence intervals for θ (t ), and Yang and Zhao [19] extend the method under the setup of missing data. Adimari & Chiogna [20] and An & Zhao [21] employ JEL methods for confidence intervals for partial areas under ROC curves and the difference of two volumes under ROC surfaces, respectively. JEL based confidence regions for quantities in sensitivity and specificity for continuous-scale diagnostic tests are investigated in Wang & Qin [22]. To construct confidence regions when many nuisance paramters are present, a profile empirical likelihood method has to be employed, which is computationally costly in general. Li et al. [23] and Peng [24] propose JEL method for estimating equations to avoid heavy computational burden. Further, Zhang et al. [25] propose jackknife-blockwise empirical likelihood methods for data under dependence.

JEL methods have also been applied in regression models. JEL based confidence intervals for the regression parameters in accelerated failure time model with censored observations and in linear transformation models under right censoring are studied by Bouadoumou et al. [26] and Yang et al. [27], respectively. JEL based confidence intervals for the error variances in linear mode land in partially linear varying-coefficient errors-in-variables models are investigated by Lin et al. [28] and Liu and Liang [29]. The JEL based intervals of mean with regression imputation is considered by Zhong & Chen [30]. In summary, a common feature for these applications is that the JEL methods maintain the advantages of empirical likelihood methods over normal approximation methods and perform very well for small samples [31]. Computationally JEL methods are easy and straight forward even for complicated statistical problems.

BBOAJ.MS.ID.555708

Our Media Partner

BBOAJ Menu

Useful Links

Downloads