Biostatistics and Biometrics Open Access Journal

Mini Review

The Reversible Diffusion in Genetic

Abdel-Rehim EA*

Department of Mathematics and Computer Science, Suez Canal University, Egypt

Submission: March 10, 2017; Published: May 08, 2017

*Corresponding author: Abdel-Rehim EA, Department of Mathematics and Computer Science, Suez Canal University, Egypt, Email: entsarabdelrehim@gmail.com

How to cite this article: Abdel-Rehim E. The Reversible Diffusion in Genetic. Biostat Biometrics Open Acc J. 2017;1(3): 555565. DOI: 10.19080/BBOAJ.2017.01.555565

Abstract

In this review article, I prove mathematically that the diffusion process in genetics is a stochastic process and is a reversible process. MSC 2010: Primary 26A33, Secondary 45K05, 60J60, 44A10, 42A38, 60G50, 65N06, 47G30, 80-99 Key Words and Phrases: exponential diffusion in genetic, stochastic process, Markov chain, reversibility property.

Introduction

Feller [1] was th first scientist introducing the still open problem "diffusion process in genetics". R. A. Fisher and Sewall Wright studied many problems in this field. Most of these problems were based on the simple branching process; see [2] and [3]. R. A. Fisher used this branching process to describe the simplest possible populations in which there is no attraction among the possible generations. There are many difficulties arise when there is interaction between individuals or the population consists of different types of individuals being provided by various breading. These simplest assumptions lead to Markov process.

In this review article, I am interested in give a survey about the relation between the diffusion in genetic process and Markov chain. I am interested also in proving that this stochastic process is reversible.

The Genetic Diffusion Processes

The branching process is a class of Markov chains. In this process, one considers a population which consists of individuals being able to produce offsprings of the same kind. By supposing that each individuals will be by the end of his lifetime produce j new offspring with 0,1,2,... with probabilities p₀, p₁, p₂,... independently of all the other offsprings. Naturally Let Z⁽ⁿ⁾ be a random variable of a Markov chain and suppose that p_jk be the transition probabilities to the next generation Z⁽ⁿ⁺¹⁾ = k. It is not surprise that it was difficult to find an explicit expression for p_jk. It is definitely that these transition probabilities satisfy the Chapman-Kolmogrov [4] equation

The simple random mating is a good example to this approach in which one has amount of 2N genes. This number of genes is formed in 2N independent trials. For example suppose you have a pair of genes a and A. Each individual belongs to one of the three genotypes (a,a), (a,A) and (A,A). If the parent population consists of j of a- genes then there are 2N- j for A-genes. At each trials the genes a or A happen with the probabilities , That leads to the a transition probability having the binomial distribution

This type of treatment is similar to transition of the weight and black balls between the two urns of the Ehrenfest model.

The Derivation of the Genetic Diffusion Equation

Let Y (t) is a random variable such as gene frequency or population size depending on the time variable t. As described above Y (t) is a stochastic process of a Markov type. In the case of a finite chain, the process depends on its initial value at t = 0. Define the function p(t; y, x) as the conditional probability density that Y (t₀+t) = x given that Y (t₀) = y at the fixed time t₀. Then the expected value of Y (t) at a later time t₀ + is

The average value of the increase (the so called mean of displacement) is

and its variance is

Taking the limit as ,one gets

Since the stochastic variable is of Markov type then where p is the rate of increasing. In other words the population increases exponentially. It was shown by Kolmogorov that the probability density u(x; t) of the stochastic variable Y (t) satisfies the diffusion equation

where the coefficients a(x) and b(x) depend on the choice of the genetic model. It worth to say that this equation is considered as a special form of the Fokker-Planck equation which has a huge applications on all various scientific fields.

The exponential genetic diffusion equation

According to W. Feller [1] the general exponential equation reads

where u(x, t) is the probability density of the gene frequency at time t with the initial condition u(x, 0) = and is subject to the boundary condition u(0, t) = u(1, t) = 0. T_nis the time of the n -th generation. Here a and β are constants. α is the diffusion constants and β is drift constant and takes the values {1,0,-1} according to the increasing, no change and decreasing population respectively. Suppose here α = 1 and β = 1 and apply the common finite difference rule to descretize equation (2). To do so, let t = n with 0 < << 1 and x = jh with 0 < h << 1, 0 < j < N where N is the number of diffused genes. The discrete difference scheme reads

Define the scaling relatrion as ,putand solve for,

to get

the only condition imposed on in this scheme is in order to have a stable scheme. This equation can be written in the following form

where p_jj,p_jj+1,p_jj-1 are the transition probabilities from the point x_j(t_n) to the points x_j-1(t_n+1), x_j+1(t_n+1)) respectively. As discrete transition probabilities

Constitute the column vector the equation (4) can be written in matrix form

the elements of the matrix P are the above transition probabilities. Then all the rows of the matrix P are summed to one and it really represents the transition matrix of a Markov chain. Then the matrix P is a stochastic matrix. According to theory of stochastic process, especially Ehrenfest model, this matrix has the stationary probability distribution vector nj whose its elements are the binomial j = 0, 1, 2,..,N. In what follows, I prove that the diffusion in genetics is reversible. Firstly, the definition of the reversible process as is stated at the book of Kelly [5] is as follows: A stochastic process Y (t) is said to be reversible if Y(t₁),Y(t₂),.....,Y(t_n) has the same distribution as y(T-t₁),y(T-t₁),....,y(T-t_n),for all t1, t2,..., tn, see [5]. The balance condition reads