Kachiashvili KJ

doi:10.19080/BBOAJ.2019.09.555759

Short Communication

Modern State of Statistical Hypotheses Testing and Perspectives of its Development

Kachiashvili KJ^1,2*

¹ Faculty of Informatics and Control Systems, Georgian Technical University, Georgia

² Vekua Institute of Applied Mathematics of the Tbilisi State University, Georgia

Submission: January 17, 2019; Published: March 12, 2019

*Corresponding author: K J Kachiashvili, Faculty of Informatics and Control Systems, Georgian Technical University, Georgia and Vekua Institute of Applied Mathematics of the Tbilisi State University, Tbilisi, Georgia

How to cite this article: Kachiashvili KJ. Modern State of Statistical Hypotheses Testing and Perspectives of its Development. Biostat Biometrics Open Acc J. 2019; 9(2): 555759. DOI: 10.19080/BBOAJ.2019.09.555759

Abstract

A statistical hypothesis is a formalized record of properties of the investigated phenomenon and relevant assumptions. The statistical hypotheses are set when random factors affect the investigated phenomena, i.e. when the observation results of the investigated phenomena are random. The properties of the investigated phenomenon are completely defined by its probability distribution law. Therefore, the statistical hypothesis is an assumption concerning this or that property of the probability distribution law of a random variable. Mathematical statistics is the set of the methods for studying the events caused by random variability and estimates the measures (the probabilities) of possibility of occurrence of these events. For this reason, it uses distribution laws as a rule. Practically all methods of mathematical statistics one way or another, in different doses, use hypotheses testing techniques. Therefore, it is very difficult to overestimate the meaning of the methods of statistical hypotheses testing in the theory and practice of mathematical statistics.

Introduction

A lot of investigations are dedicated to the statistical hypotheses testing theory and practice (see, for example, Berger [1], Berger et al. [2], Bernardo, et al. [3], Christensen, et al. [4], Hubbard, et al. [5]; Lehmann, et al. [6,7], Moreno, et al. [8], Wolpert, et al. [9]) and their number increase steadily. But, despite of this fact, there are only three following basic ideas (philosophies) of hypotheses testing at parallel experiments: the Fisher, [10], the Neyman-Pearson [11,12] and the Jeffreys et al. [13]. They use different ideas for testing hypotheses but all of them are identical in one aspect: they all necessarily accept one of stated hypotheses at making decision despite of existence or absence enough information for making decision with given reliability. The considered methods have well known positive and negative sides. All other existed methods are the particular cases of these approaches taking into account the peculiarities of the concrete problems and adapting to these specificities for increasing the reliability of the decision (see, for example, Berger, et al. [14]; Bernardo, et al. [15]; Delampady, et al. [16]; Kiefer [17]; Bansal, et al. [18]; Bansal, et al. [19]; Bansal et al. [20].

An attempt to reconcile the different points of view of noted philosophies was made in Berger [21], and as a result there was offered a new, compromise method of testing. The method uses the Fisher’s -value criterion for making a decision, the Neyman-Pearson’s statement (using basic and alternative hypotheses) and Jeffrey’s formulae for computing the Type I and Type II conditional error probabilities for every observation result on the basis of which the decision is made.

A new approach (philosophy) to the statistical hypotheses testing, called Constrained Bayesian Methods (CBM), was comparatively recently developed [22-34]. This method differs from the traditional Bayesian approach with a risk function split into two parts, reflecting risks for incorrect rejection and incorrect acceptance of hypotheses and stating the risk minimization problem as a constrained optimization problem when one of the risk components is restricted and the another one is minimized. It generates data-dependent measures of evidence with regard to the level of restriction. In spite of absolutely different motivations of introduction of and CBM, they lead to the hypotheses acceptance regions with identical properties in principle. Namely, in despite of the classical cases when the observation space is divided into two complementary sub-spaces for acceptance and rejection of tested hypotheses, here the observation space contains the regions for making the decision and the regions for no-making the decision (see, for example, Berger [21]; Kachiashvili et al. [35]; Kachiashvili et al. [31]; Kachiashvili, et al. [33]; Kachiashvili, [28,35]). Though, for CBM, the situation is more differentiated than for.

For CBM the regions for no-making the decision are divided into the regions of impossibility of making the decision and the regions of impossibility of making unique decision. In the first case, the impossibility of making the decision is equivalent to the impossibility of making the decision with given probability of the error for a given observation result, and it becomes possible when the probability of the error decreases. In the second case, it is impossible to make a unique decision when the probability of the error is required to be small, and it is unattainable for the given observation result. By increasing the error probability, it becomes possible to make a decision.

In our opinion these properties of and CBM are very interesting and useful. They bring the statistical hypotheses testing rule much close to the everyday decision-making rule when, at shortage of necessary information, acceptance of one of made suppositions is not compulsory.

The specific features of hypotheses testing regions of the Berger’s test and CBM, namely, the existence of the no-decision region in the test and the existence of regions of impossibility of making a unique or any decision in CBM give the opportunities to develop the sequential tests on their basis [2,36,26,28]. The sequential test was introduced by Wald in the middle of forty of last century [37,38]. Since Wald’s pioneer works, a lot of different investigations were dedicated to the sequential analysis problems (see, for example, Berger, et al. [39]; Ghosh, [40]; Ghosh, et al. [41]; Siegmund, [42]) and efforts to the development of this approach constantly increase as it has many important advantages in comparison with the parallel methods [43].

Application of CBM to different types of hypotheses (two and many simple, composite, directional and multiple hypotheses) with parallel and sequential experiments showed the advantage and uniqueness of the method in comparison with existing ones [24-29,44]. The advantage of the method is the optimality of made decisions with guaranteed reliability and minimality of necessary observations for given reliability. CBM uses not only loss functions and a priori probabilities for making decisions as the classical Bayesian rule does, but also a significance level as the frequentist method does. The combination of these opportunities improves the quality of made decisions in CBM in comparison with other methods. This fact is many times confirmed by application of CBM to the solution of different practical problems [45-47,32,44].

Finally, it must be noted that, the detailed investigation of different statements of CBM and the choice of optimal loss functions in the constrained statements of the Bayesian testing problem opens wide opportunities in statistical hypotheses testing with new, beforehand unknown and interesting properties. On the other hand, the statement of the Bayesian estimation problem as a constrained optimization problem gives new opportunities in finding optimal estimates with new, unknown beforehand properties, and it seems that these properties will advantageously differ from those of the approaches known today.

In our opinion, the proposed CBM are the ways for future, perspective investigations which will give researchers the opportunities for obtaining new perspective results in the theory and practice of statistical inferences and it completely corresponds to the thoughts of the well-known statistician B Efron [48]: “Broadly speaking, nineteenth century statistics was Bayesian, while the twentieth century was frequentist, at least from the point of view of most scientific practitioners. Here in the twenty-first century scientists are bringing statisticians much bigger problems to solve, often comprising millions of data points and thousands of parameters. Which statistical philosophy will dominate practice? My guess, backed up with some recent examples, is that a combination of Bayesian and frequentist ideas will be needed to deal with our increasingly intense scientific environment. This will be a challenging period for statisticians, both applied and theoretical, but it also opens the opportunity for a new golden age, rivaling that of Fisher, Neyman, and the other giants of the early 1900s.”

BBOAJ.MS.ID.555759

Our Media Partner

BBOAJ Menu

Useful Links

Downloads

Modern State of Statistical Hypotheses Testing and Perspectives of its Development

Kachiashvili KJ^1,2*

Abstract

Introduction

References

Member In:

BBOAJ.MS.ID.555759

Our Media Partner

BBOAJ Menu

Useful Links

Downloads

Modern State of Statistical Hypotheses Testing and Perspectives of its Development

Kachiashvili KJ1,2*

Abstract

Introduction

References

Member In:

Kachiashvili KJ^1,2*