128
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Brain MR image segmentation for tumor detection based on Riesz probability distributions

& ORCID Icon
Article: 2343711 | Received 23 Dec 2021, Accepted 11 Mar 2024, Published online: 03 May 2024

ABSTRACT

This research introduces a new approach using the Riesz mixture model for medical image segmentation, specifically for diagnosing and treating brain tumors. We developed a novel technique for pixel classification based on the Riesz distribution, which is generated using an extended Bartlett decomposition. Our work is pioneering, as there are no existing studies addressing this issue in the literature. We aim to demonstrate the effectiveness of this distribution for brain image segmentation. We used the Expectation-Maximization algorithm to estimate the mixture parameters. To validate our segmentation algorithm, we conducted a comparative study with a recent method based on the Wishart distribution using Matlab software. Experiments with the Riesz mixture model showed that our method produces more intuitive results with a recognition rate of 94.52%. These results confirm the reliability of our method in detecting tumors using both synthetic and real brain images.

1. Introduction

The medical image is a crucial part of the medical healthcare. It is the technique of creating visual representation of the interior of a body for clinical analysis and medical intervention. From this perspective, different medical imaging technologies are invested as radiography, computed tomography, ultrasound and magnetic resonance (MR) imaging.

To identify the affected area whether tumours, cyst, oedema and other abnormalities, we opted for medical image segmentation methods. Segmentation of image is an intrinsic research field of image processing and pattern recognition. It is a supervised or unsupervised method of partitioning an image into regions. It is a complex task that is affected by numerous aspects like noise. As a matter of fact, various image segmentation approaches have been proposed in literature as the Thresholding method which aims to discriminate pixels or voxels through using their grey levels. These thresholds can be either global or local. They are determined by using the histogram of an image. The segmentation of the image may be either carried out by the application of many individual thresholds or by utilising a multi-Thresholding technique. There are also region based techniques which are designed to locate homogeneous areas in the images. They create disjoint regions by combining neighbourhood pixels with homogeneous characteristics. The approaches contours are characterised with the presence of borders between regions. We have also pixel classification technique which classifies similar objects into a single cluster. It is created by using multiple supervised or unsupervised clusterings, like: Fuzzy C-means clustering and K-means algorithm (Zhou and Yang Citation2020). In addition, we mention the probabilistic method in which hidden Markov model is introduced to model the tissue. Moreover, mixture models are used to represent the data density of brain image (Bhagwat et al. Citation2021; Zitouni and Tounsi Citation2021; Tounsi and Zitouni Citation2021; Jiao et al. Citation2022). We can also use the finite mixture model for complex image segmentation problem which corresponds to a convex combination of two or more probability density functions (Zitouni et al. Citation2018, Citation2023). It is used in different fields such as finance, handwriting recognition, medicine, and insurance. Among finite mixture models, Gaussian mixture model (GMM) is one of the most used tool for image segmentation (Saygılı Citation2022; Huang and Gou Citation2023). It provides better classification in terms of Bayes risk than several popular methods. The paper highlights an unsupervised method (Panić et al. Citation2022; Pu et al. Citation2023) of image segmentation based on the Riesz mixture model and the Expectation-Maximization (EM) algorithm for the parameters estimation of the proposed model. Recently, The Riesz mixture model was developed by Kessentini et al. (Citation2020). It is a combination of two or more Riesz probability densities which is generated by extending the famous Bartlett decomposition of Wishart random matrices (Bartlett Citation1934) to the Bartlett decomposition of Riesz random matrices.

From 2001 until now Riesz distributions and its derivative probabilities have occupied a lofty position in terms of multivariate analysis. Numerous researchers were highly interested in this new family of probability distributions and they provided multiple theoretical aspects which allow to master, characterise and therefore to understand the usefulness of this new family subsequently in real applications. The Riesz probability distribution was introduced in 2001 by Hassairi and Lajmi (Hassairi and Lajmi Citation2001) on the modern framework of symmetric cones and simple Euclidean Jordan algebra. In 2008, Hassairi et al. (Citation2008) developed a characterisation of the Riesz probability distribution in the cone of positive definite symmetric matrices to make them accessible to readers. The Riesz probability distribution represents a significant generalisation of the Wishart distribution and therefore, of the usual gamma distribution in the real line. Multiple interesting properties of the latter are identified and discussed (Hassairi et al. Citation2007, Citation2017; Tounsi and Zine Citation2017). Within this framework, numerous other derived probability distributions are addressed such as the inverse Riesz probability distributions (Tounsi and Zine Citation2012), the Dirichlet-Riesz (Tounsi and Zine Citation2017) and the Extended matrix-variate Beta distributions (Tounsi Citation2020). However, the Riesz and the inverse Riesz probability distributions displays a prominent property namely the positivity set that is imposed on the covariance matrices of a multivariate normal distribution. For this reason, we can adopt these distributions for the estimation of covariance matrix of a multivariate normal distribution. It is in fact more advantageous to use these distributions than the Wishart and the inverse Wishart distributions because they depends on multivariate shape parameters thus offering a higher degree of freedom. Therefore, the Riesz and inverse Riesz probability distributions can be used in a wide variety of problems such as statistical analysis, economics and physics. Thus, the search for a statistical models and algorithms facilitating the investigation of the Riesz and inverse Riesz in real world problems remains a topic of high interest to statisticians. We recall that in this context, recently Kessentini et al. (Citation2020) have suggested an algorithm generating the Riesz and the inverse Riesz probability distributions based on an extension of the Bartlett decomposition (Bartlett Citation1934). Within the theoretical framework, many works about the Riesz probability properties are developed. Yet, in real-life applications no work has been recorded in terms of demonstrating the applicability of this distribution (Kessentini et al. Citation2020). Hence, our research is considered as the first scientific research that stands for the pioneering work aiming at proving the efficiency of the Riesz distribution in real life problems such as segmentation of medical imagery. The latter represents a challenging problem due to the complexity of the images, as well as the absence of models of the anatomy that fully capture the possible deformations in each structure. In addition, to estimate the parameters of the Riesz mixture model, Kessentini et al. (Citation2020) adopted the EM algorithm (Dempster et al. Citation1977). For real-life applications like segmentation of complex structure, parameters estimation is a basic step when implementing completely data. From this perspective, we suggest in this paper an iterative algorithm based on the EM algorithm. The proposed segmentation algorithm has been validated on synthetic data generated by using the Riesz distribution and has been tested on real medical images. The obtained results proved the high performance and the goodness of fit of the proposed method with Riesz distribution compared to a recent method of segmentation based on the Wishart distribution (Hidot and Saint-Jean Citation2010). Besides, the finite sample performance was evaluated by calculating such evaluation criteria as: Misclassification Rate (MCR) (Zitouni et al. Citation2023), Peak Signal to Noise Ratio (PSNR) (Saladi and Amutha Prabha Citation2017), Adjusted Rand Index (ARI) (Sundqvist et al. Citation2023), Rand Index (RI) (Fouad et al. Citation2017), Mirkin Metric (MI) (Correa-Morris et al. Citation2022) and Hubert Index (HI) (Somashekara and Manjunatha Citation2014). The different evaluation criteria values suggest the reliability of the proposed distribution. The remaining sections of this paper are organised as follows. In Section 2, some notations and definitions are provided within the general framework of positive definite symmetric matrices. Furthermore, we recalled some basic concepts of the Riesz mixture model and an estimation of the parameters using maximum likelihood method. In Section 2 image segmentation using Riesz mixture model is explored and discussed. Additionally a comparative study with respect to the Wishart distribution is carried out. The proposed algorithm was applied with synthetic and real medical images. The different experiments results demonstrated the high performance of the proposed unsupervised algorithm. Moreover, the importance of using Riesz models in real life applications is highlighted. The last section wraps up the conclusion and provides new perspectives for future works.

2. Notations and definitions

To clarify the results of this paper, we first recall some notations and review some characteristic properties concerning the Riesz distributions on the modern framework of positive definite symmetric matrices.

2.1. The Riesz probability distribution

Let Vr be the space of real symmetric r×r matrices equipped with the scalar product X,Y=tr(XY), where tr(XY) is the trace of matrices’ ordinary product. For X=xij1i,jr in Vr and 1lr, we provide the sub-matrices

Pl(X)=xij1i,jl,

and

Pl(X)=xijrl+1i,jr.

For convenience, Pl(X) and Pl(X) may be considered a matrices in Vr with zero-filled additional rows and columns.

We denote by Ωr the cone of positive definite elements of Vr and by Tr the set of r×r lower triangular matrices with strictly positive diagonal. Any element Y of Ωr admits the Cholesky decomposition Y=TT, where T is in Tr and T is its transpose. This decomposition is unique, and therefore we can define on Vr the automorphism

π(Y)(X)=TXT,

which may be regarded as the product of X by Y, and we define the ‘quotient’ of X by Y as

π1(Y)(X)=T1X(T1).

Definition 2.1. An r×r positive definite symmetric matrix X is said to have a matrix-variate Wishart distribution with shape parameter p>r12 and scale parameter Σ in Ωr, if its probability density function is defined by

(1) Wr(p,Σ)=(detΣ)p2rpΓΩr(p)e12Σ1,X(detX)pr+121Ωr(X),(1)

where ΓΩr(.) denotes the multivariate gamma function given by

(2) ΓΩr(p)=πr(r1)4j=1rΓ(pj12).(2)

The Riesz probability distribution was proposed by Hassairi and Lajmi (Citation2001) as an extension of the Wishart one. It is an interesting probability distribution with outstanding probabilistic properties like independence between blocks (Hassairi et al. Citation2007), invariance (Hassairi and Lajmi Citation2001), moments and constancy regression on the mean (Hassairi et al. Citation2017). Departing from the fact that the Riesz probability distribution verifies the property of set positivity that is imposed on the covariance matrices, the latter can be applied for the estimation of a covariance matrix of a multivariate normal distribution. The definition of the Riesz probability distribution is based on the notion of generalised power in the space of real symmetric r×r matrices, which reduces to the ordinary determinant in a particular situation.

Definition 2.2. For s=(s1,,sr)Rr, the generalised power of an element XΩr is expressed by

(3) Δs(X)=Δ1(X)s1s2Δ2(X)s2s3Δr(X)sr,(3)

where Δl(X) is the determinant of the l×l sub-matrix Pl(X) of X.

Note that Δs(X)=(detX)p if s=(p,,p) with pR. It is also easy to check that Δs+s (X)=Δs(X)Δs (X), where s =(s1,,sr)Rr. In particular, if mR and s+m=(s1+m,,sr+m), we have Δs+m(X)=Δs(X)\break(detX)m. Furthermore, Hassairi et al. (Citation2007) demonstrated that, for all X and Y in Ωr

(4) Δs(π(Y)(X))=Δs(Y)Δs(X),(4)

and

(5) Δs(π1(Y)(X))=Δs(Y)Δs(X).(5)

Definition 2.3. An r×r positive definite symmetric matrix X is said to have a Riesz probability distribution on Ωr with shape parameter s=(s1,,sr)Rr, such that sj>j12 for all 1jr and scale parameter Σ in Ωr, if its probability density function is indicated by

(6) Rr(s,Σ)=12j=1rsjΓΩr(s)Δs(Σ)e12Σ1,XΔsr+12(X)1Ωr(X),(6)

where

(7) ΓΩr(s)=πr(r1)4j=1rΓ(sjj12).(7)

The probability distribution Rr(s,Σ) reduces to the Wishart Wr(p,Σ), when s1=s2==sr=p.

In the following theorem, we recall a significant result concerning the expected value of a Riesz random matrix. For further details, please consult this reference (Hassairi et al. Citation2017).

Theorem 2.4.

Let X be a Riesz probability with parameters s=(s1,,sr)Rr such that s10 and ΣΩr. Then, the expected value of X is denoted by

(8) E(X)=l=1r(srlsrl+1)Pl(Σ)1.(8)

If X is a Riesz random matrix, then the distribution of Y=X1 is called the inverse Riesz probability distribution (Tounsi and Zine Citation2012), which can be regarded as a generalisation of the inverse Wishart.

Definition 2.5.

An r×r positive definite symmetric matrix Y is said to have an inverse Riesz probability distribution on Ωr with shape parameter s=(s1,,sr)Rr, such that sj>j12 for all 1jr and scale parameter Σ in Ωr, if its probability density function is determined by

(9) IRr(s,Σ)=12j=1rsjΓΩr(s)Δs(Σ)e12Σ1,Y1Δs+r+12(Y1)1Ωr(Y).(9)

When s1=s2==sr=p, the IR r(s,Σ) reduces to the inverse Wishart IWr(p,Σ) given as following.

Definition 2.6.

An r×r positive definite symmetric matrix Y is said to have a matrix-variate inverse Wishart distribution with shape parameter p>r12 and scale parameter Σ in Ωr, if its probability density function is specified by

(10) IWr(p,Σ)=(detΣ)p2rpΓΩr(p)e12Σ1,Y1(detY)pr+121Ωr(Y).(10)

In the following, we briefly review the simulation procedures for random Riesz and inverse Riesz matrices. According to Kessentini et al. (Citation2020), Rr(s,Σ) and IRr(s,Σ) are generated using an extension of the Bartlett decomposition (Bartlett Citation1934). Let T be in Tr with entries tij satisfying:

  1. tij is standard normal for all 1j<ir.

  2. (tii)1ir is γ(2sii+12,12), where γ(2sii+12,12) is the gamma distribution with shape parameter 2sii+12; si>i12 for all 1ir and scale parameter equal to 12.

  3. The (tij)1jir‘s are independent.

The matrix X=TT has a Riesz Rr(s,Ir) distribution, where s=(s1,,sr) and Ir is the identity matrix. Handling a matrix other than the identity matrix is easy. If the r×r symmetric matrix Σ has the Cholesky factorization Σ=LL, with LTr, then Z=LT(LT) has a Rr(s,Σ). In fact, according to (Tounsi and Zine Citation2012), the matrix Y=Z1 follows an IRr(s,Σ).

2.2. The mixture of the riesz probability distribution

In modern statistics, mixture models correspond to an important probabilistic model which are used for data classification providing efficient approaches for model-based clustering (Fraley and Raftery Citation2002). They have been developed so far by various authors like McLachlan et al. (Citation2002), Melnykov and Maitra (Citation2010) and Gormley et al. (Citation2023). Hence, a wide mixture models examples can be reported including mixture of gamma, mixture of exponential, mixture of Gaussian and mixture of the Riesz distributions (Kessentini et al. Citation2020; He and Chen Citation2022; Shafiq et al. Citation2022; Huang and Gou Citation2023).

In the following subsection, we exhibit a description of the mixture of the Riesz probability distribution which corresponds to a weighted sum of K1N component Riesz densities demonstrated given below.

Let X1,X2,,XN be N i.i.d. matrices with common density mixture expressed by

(11) f(Xi|Φ)=Kk=1πkfk(Xi|θk),(11)

where πk represents the mixing weights, 0<πk<1 such that k=1Kπk=1. The fk(.|θk) denotes the Riesz probability density function with parameters θk=(sk,Σk), 1kK (see EquationEquation (6)), where sk=(sk,1,sk,2,,sk,r). In addition, Φ=(π1,,πK,θ1,θK) denotes the vector of unknown parameters to be estimated by the EM algorithm. The standard reference on the EM algorithm and its convergence is Dempster et al. (Citation1977).

At this stage of analysis, we briefly recall the principle of the use of the EM algorithm. It is often used to estimate parameters of a mixture model, in which the exact component model from which a data point is generated is hidden from us. It starts with randomly assigning values to all the parameters to be estimated. It then iterately alternates between two steps, called the Expectation step (E-step) and the Maximization step (M-step), respectively. In the E-step, it computes the expected likelihood for the complete data (the so-called Q-function) where the expectation is taken for the computed conditional distribution of the latent variables given the current settings of parameters and our observed (incomplete) data. In the M-step, it re-estimates all the parameters by maximizing the Q-function. Once we have a new generation of parameter values, we can repeat the E-step and another M-step. This process continues until the likelihood converges, i.e. reaching a local maxima. Intuitively, what EM does is to iteratively increase the data by guessing the values of the hidden variables and to re-estimate the parameters by assuming that the guessed values are the true values. Recently, Kessentini et al. (Citation2020) developed an Expectation-Maximization algorithm to estimate the parameters of the Riesz mixture model. Therefore, the parameters estimation of πk,sk and Σk, 1kK are calculated, according to the authors Kessentini et al. (Citation2020), using these points.

  • The unclassified mixture X=(X1,,XN) is augmented by the hidden Ndimentional vector z=(z1,,zN) where zi=k means that Xi is issued from the kth distribution. The complete-data likelihood and log likelihood are formulated as

    (12) Lc(Φ)=Ni=1Kk=11{zi=k}πkfk(Xi|θk),(12)
    (13) logLc(Φ)=Ni=1Kk=11{zi=k}logπkfk(Xi|θk).(13)

  • The expectation step is equivalent to estimate at iteration (t+1) the conditional expectation QΦ|Φ(t) of the complete-data log likelihood given the previous value of parameters Φ(t)

    (14) QΦ|Φ(t)=EΦ(t)logLc(Φ)=Ni=1Kk=1EΦ(t)zi|Xilogπk(t)fkXi|θk(t)=Ni=1Kk=1p˜iklogπk(t)fkXi|θk(t),(14)
    where p˜ik is the posterior probability of the component membership PΦ(t)zi=k|Xi at iteration (t+1) and which is given by the Bayes rule
    (15) p˜ik=PΦ(t)zi=kfkXi|θk(t)f(Xi|Φ(t))=πk(t)fkXi|θk(t)Kl=1πl(t)flXi|θl(t).(15)

  • The maximisation step of the algorithm consists of estimating, at the (t+1)th iteration, the new Φˆ=Φ(t+1) that maximises the expectation subject to the constraint Kk=1πk=1

    (16) Φˆ=argmaxΦQΦ|Φ(t).(16)
    Given the concavity of the log likelihood, solving the first order condition is sufficient.

Therefore, the maximisation step gives a new estimate of the unknown parameters (πk,sk,Σk) for all 1kK as indicated below.

We have the new estimate of the mixing proportion πk is given by

(17) πˆk=1NNi=1p˜ik.(17)

The new estimate of the components sk,j of the vector sk is expressed by

(18) sˆk,j=ψ1log(2Δ1(Σk))+Ni=1p˜iklogΔ1(Xi)Ni=1p˜ik;j=1j12+ψ1logΔj1(Σk)2Δj(Σk)+Ni=1p˜iklogΔj(Xi)Δj1(Xi)Ni=1p˜ik;2jr,(18)

where the utilised function ψ is the digamma function defined as the logarithmic derivative of the gamma function. We have ψ=ΓΓ.

And, finally the new estimate of the scale matrix Σk of the Riesz distribution is determined by

(19) rj=1(sˆk,jsˆk,j+1)ΣkˆPj(Σkˆ)1Σkˆ=12Ni=1p˜ikNi=1p˜ikXi,(19)

where sˆk,r+1=sk,r+1=0.

Note that the latter equation leads to a system of linear equations that can be solved, as described on the work done by Kessentini et al. (Citation2020), to determine Σkˆ.

The EM algorithm for the Riesz mixture model is then summarised by Algorithm 1.

3. Application

This section is devoted to the application of the mixture of Riesz probability distributions for medical image segmentation. The significance of the latter refers to its ability to extract information about the human body, and more specifically about tissue characterisation, organs, anatomic structures, lesions and tumours. This information helps doctors delineate and track the progress of diseases. The brain is a particulary complex structure, and its segmentation is an important step for solving many problems. We are basically interested in brain MR image unsupervised segmentation. Our objective is therefore, to segment brain tumours by using the proposed Riesz mixture model for both simulated image and real brain MR image in order to identify two regions (tumour and non-tumour). The creation of a robust and a flexible algorithm to ensure the best image segmentation result based on Riesz distribution is hence necessary. In multiple classical works, the segmentation process rests on the use of the Gaussian distribution (Fenyi et al. Citation2020; Yang et al. Citation2022). However, in many fields like finance, handwriting recognition or image segmentation, mixture models were commonly extended from Gaussian mixture model to the multivariate one in order to fit vectors of unknown parameters. They were even extended to fit matrix samples as shown by Hidot and Saint-Jean using the Wishart probability distribution (Hidot and Saint-Jean Citation2010).

Until achieving the final segmentation result using Riesz mixture model and verifying the high performance of this algorithm over than the Wishart one, such stages must be followed (see below).

Figure 1. The flowchart of the proposed image segmentation method.

Figure 1. The flowchart of the proposed image segmentation method.

The proposed algorithm of segmentation relies upon eight steps, which are described as follows.

  • Step 1: Divided image into blocks and choice of the number of image regions (K=2) for each block.

  • Step 2: Input observed image in a vector of the matrix of pixels.

  • Step 3: Initialization of the vector of the parameters Φ=πk,sk,Σk, k=1,2 by using the Fuzzy C-means algorithm (Dunn Citation1973; Yu et al. Citation2023).

  • Step 4 (Expectation): Computing the posterior probabilities at iteration (t+1) by the Bayes rule

    p˜ik=πk(t)fkXi|sk(0),Σk(0)Kj=1πj(t)fjXi|sj(0),Σj(0),forall k=1,2.

  • Step 5 (Maximization): Maximizing the conditional expectation

    QΦ|Φ(t)=Ni=1Kk=1p˜iklogπk(t)fkXi|sk(t),Σk(t),
    with respect to sk,Σk and πk, ∀k=1,2. The update of the Riesz mixture parameters (the mixing weight πk0,1, the shape parameter of the Riesz distribution sk=(sk,1,sk,2,,sk,r)Rr, where sk,j>j12,1jr and its scale parameter ΣkΩr) is supplied by
    πˆk=1NNi=1p˜ik,
    sˆk,j=ψ1log(2Δ1(Σk))+Ni=1p˜iklogΔ1(Xi)Ni=1p˜ik;j=1j12+ψ1logΔj1(Σk)2Δj(Σk)+Ni=1p˜iklogΔj(Xi)Δj1(Xi)Ni=1p˜ik;2jr,
    In the case r=3, the rows of the symmetric matrix Σˆk can be computed in the order following
    Σˆk1j=Ni=1p˜ikXi1j2Ni=1p˜ik/sˆk,1,forj=1,2,3.
    then
    (20) Σˆk2j=Ni=1p˜ikXi2j2Ni=1p˜ik(sˆk,1sˆk,2)Σˆk12Σˆk1jΣˆk11/sˆk,2,forj=2,3.(20)
    and finally
    (21) Σˆk33=Ni=1p˜ikXi332Ni=1p˜ik(sˆk,1sˆk,2)(Σˆk13)2Σˆk11(sˆk,2sˆk,3)Σˆk22(Σˆk13)22Σˆk12Σˆk13Σˆk23+Σˆk11(Σˆk23)2Σˆk11Σˆk22Σˆk122/sˆk,3.a(21)
    More details are given by Kessentini et al. (Citation2020).

  • Step 6: Iterate steps 4 and 5 until the following constraint is obtained

    (22) ||Φ(t+1)Φ(t)||<ϵ.(22)
    That is the convergence of the EM algorithm for the Riesz mixture model (EMR) is assumed when the absolute difference between successive estimates is less than 103. Therefore, the stop criterion of the EMR is accomplished after 20 iterations.

  • Step 7: Classify all the matrix of pixels into one of the K classes by calculating the posterior probability denoted P(α;sk,Σk) of the matrix of pixels α for each class C1 and C2. The image pixels are then classified (labelled) based on the highest posterior probability. Hence, the matrix of pixels denoted α is said to be a tumour if

    (23) P(α;s1,Σ1)>P(α;s2,Σ2);(23)
    otherwise, it is considered as normal tissue.

  • Step 8: Construct labelled image corresponding of each block.

In order to prove further the effectiveness of the proposed algorithm and examine the quality of our segmentation results, some performance metrics are defined in the following subsection.

3.1. Performance metrics

Resting on the original image I and the segmented image Iˆ, six evaluation criteria can be calculated: PSNR, MCR, RI, ARI, MI and HI. These performance measure parameters can be found in (Saladi and Amutha Prabha Citation2017; Fouad et al. Citation2017; Correa-Morris et al. Citation2022; Zitouni et al. Citation2023; Sundqvist et al. Citation2023) and (Somashekara and Manjunatha Citation2014).

The Peak Signal to Noise Ratio (PSNR) in dB is most commonly used as a quality measure of reconstruction that is expressed by

(24) PSNR=10logImax2MSE,(24)

where, Imax corresponds to the maximum intensity of pixel value of the image and MSE represents the Mean Squared Error. Small MSE value means less errors, resulting in high PSNR. Acceptable PSNR range is >20 dB.

In order to measure the segmentation accuracy, we also need to define the Misclassification Ratio (MCR)

(25) MCR=thenumberofmisclassifiedpixelsthenumberofallpixelsfromtheoriginalimage.(25)

We have MCR[0,1]. The smaller MCR is the better the segmentation result becomes.

To further assess the performances of the proposed algorithm, we used other image segmentation evaluation metrics which are indicated as follows.

Rand Index (RI) is a measure of the similarity between two data clusterings. It has a value between 0 and 1, with 0 indicating that the two data clusterings do not agree on any pair of points and 1 indicating that the data clusterings are exactly the same.

The Adjusted Rand Index (ARI) is defined as the similarity between the actual labels and predicted labels, ignoring permutations. The ARI has a range of [1,1], negative values are considered as bad clusterings and the value one means perfect clustering. The ARI is denoted as

(26) ARI=RIE[RI]max(RI)E[RI],(26)

where E[RI] is the expected value of RI.

Further statistical tests for clustering evaluation are also depicted below.

The Mirkin Metric (MI) is a transformation of Rand Index. It is the probability of a disagreement expressed by

(27) MI=n(n1)(RI1),(27)

where n is the number of data items.

The Hubert index (HI) is the difference between the probability of an agreement and a disagreement.

These measures are helpful to evaluate the ability of the proposed segmentation algorithm and discriminate tissues especially tumours.

3.2. Experimental results

This section presents the experimental results of the proposed segmentation algorithm with synthetic and real brain MR image.

3.2.1. Synthetic image segmentation

At this stage of analysis, we shall address the obtained results, discuss them and evaluate them in order to assess the effectiveness and feasibility of our approach. The proposed method is evaluated first using synthetic image. A simulated image is generated by the use of the Riesz distributions. Using Matlab software, simulated image Iˆ is generated by a sample of total size 8000 of random matrices. Each matrix belongs to the cone of 3×3 symmetric positive definite matrices denoted by Ω3. Thus, we generate the pixels of these matrices based on the extension of the Bartlett decomposition for the Riesz distribution (as described in Section 2) with gamma on the diagonal and Gaussian off-diagonal. The simulated image Iˆ is composed of K=2 classes as described below.

We consider a mixture of two Riesz probability distributions (Riesz mixture model 1) generated using the extended Bartlett decomposition of Riesz random matrices (Kessentini et al. Citation2020) on Ω3. From this perspective, let

  • M1R3(s1,Σ1), where s1=(2,5,6) and Σ1=2.20.20.20.22.20.20.20.22.2.

  • M2R3(s2,Σ2), where s2=(3,4,7) and Σ2=4.30.30.30.34.30.30.30.34.3.

Recall that, to generate the two random matrices M1 and M2, we set X=T1T1 and Y=T2T2, where T1 and T2 are in Tr with independent elements such that Tk=(tkij)1jir for all k=1,2. Thus, we generate the diagonal elements tkii2 with the usual real gamma distribution γ(2sk,ii+12,12); sk,i>i12 for all 1ir and we generate the off-diagonal elements tkij with the Gaussian distribution N(0,1); 1j<ir. Therefore, we get XR3(s1,I3) and YR3(s2,I3). Now, by using the Cholesky decomposition of Σk=LkLk, where LkTr for all k=1,2, we construct the random matrices Mk=LkTkLkTk which follow the Riesz distribution R3(sk,Σk).

Besides, for further statistical analysis and simulation studies, we introduce other mixture of Riesz probability distributions with other parameters which are given in below. Note that we adopt the same scale matrices as in Ref (Hidot and Saint-Jean Citation2010). for the third Riesz mixture model.

Table 1. Other riesz mixture models.

To this extent, we generate a sample X1,,XN of total size N=8000 belonging to one of the two Riesz probability distributions for each Riesz mixture model, with mixing weights for the first class C1 and the second class C2 that are indicated respectively as π1=0.4 and π2=0.6. To check whether we have generated matrix of Riesz distribution, comparative studies between the empirical mean and the expected value of the Riesz random matrix (see EquationEquation (8))are carried out. The parameter estimates are obtained by the EM algorithm and the convergence is assumed when the absolute differences between successive estimates are less than 103. So, after 20 iterations, the task is accomplished.

In order to prove the efficiency and the superiority of the EM algorithm for the Riesz mixture models (EMR) over the EM algorithm for the Wishart mixture models (EMW), and examine the quality of our estimation results, we can adopt the following two metrics.

The Euclidean distance d(sk,skˆ)=||skskˆ||2 for the shape parameter sk, and the Hilbert-Schmidt distance given by

(28) dHS(Σk,Σkˆ)=tr(ΣkΣkˆ)(ΣkΣkˆ),(28)

for the scale matrix Σk.

The shape parameters and the scale matrices estimates for the different Riesz mixture models and their distances are plotted in below.

Table 2. The resulting estimated parameters using the EMR and their distances toward the actual parameters.

In the following (), we shall compare the EMR with the EMW.

Table 3. The resulting estimated parameters using the EMW and their distances toward the actual parameters.

Referring to , we assume that the lower Euclidean distance and dHS are reached by the EMR algorithm. Hence, it is inferred that the EMR performs better than the EMW. This shows that the Riesz probability distributions would even have more applications than the Wishart ones as they have more parameters (the shape parameter has a distant components) and then offer a higher degree of freedom. As we described above, simulation is used to generate synthetic images (See below).

Figure 2. The original synthetic image.

Figure 2. The original synthetic image.

Therefore, in order to validate the proposed EMR algorithm, the latter was applied with synthetic images. The synthetic images associated to the three mixture Riesz models consists of two strongly noisy regions (K=2). It is portrayed in .

Figure 3. Synthetic noisy images for the different riesz mixture models.

Figure 3. Synthetic noisy images for the different riesz mixture models.

Various kinds of statistical methods have been elaborated for image segmentation. Among them, we mention those that apply Markov random field (MRF) model (Mouna et al. Citation2020). For instance, the region-based classification of polarimetric Synthetic Aperture Radar (SAR) images which uses Wishart MRF (Yin et al. Citation2020). The segmentation strategy based on the Wishart distribution is implemented for comparison. Owing to the best form of the Riesz distribution, more reliable results would be obtained. The corresponding segmentation results are illustrated in below.

Figure 4. Simulated segmented images by the mixture model 1.

Figure 4. Simulated segmented images by the mixture model 1.

Figure 5. Simulated segmented images by the mixture model 2.

Figure 5. Simulated segmented images by the mixture model 2.

Figure 6. Simulated segmented images by the mixture model 3.

Figure 6. Simulated segmented images by the mixture model 3.

Although there is much noise, we may record a great possibility of identifying the class of each pixel using the proposed Riesz probability distribution rather than the classical method based on Wishart probability distribution. Therefore, good image segmentation results are obtained using the new strategy through the first step estimation of the parameters. The estimators are very close to the true parameters values. Note that the estimated shape parameter by EMW is almost the average value of the associated vector sk. Referring to the segmented images (), the Riesz distribution can better characterise the class in heterogeneous areas and distinguish the regions with different textural information.

The table presented below () outlines the quantitative results of the synthetic images.

Table 4. Different evaluation criteria values with Riesz and Wishart distributions.

helps us derive the average values of MCR, PSNR,ARI,RI,MI and HI. The lowest MCR and the highest PSNR values illustrate the good performance of the proposed clustering method, which implies less errors and higher quality of the segmented image. The ARI and the RI values obtained of the Riesz distributions are higher than those obtained with the Wishart distributions. The lowest MI and HI values also indicate that the proposed algorithm based on the Riesz is more suitable than the Wishart one, which corroborates the effectiveness as well as reliability of the proposed algorithm. Between Riesz and Wishart distributions, there exists a tiny variation but as the different metrics values are more accepted with the new strategy, we can consider that the latter is much better to classify tissues than the classical method. The RI values are, respectively, 0.6927, 0.8577 and 0.8986 using the Riesz distributions. There are close to 1 and are higher than the ones obtained by the Wishart distributions. From this perspective, the most correct and plausible segmentation result is provided by the proposed technique. The experiments reveal that these settings work well and can achieve satisfactory results.

3.2.2. Brain image segmentation

In this subsection, we highlight and we further discuss the robustness of the proposed method for settling brain image segmentation problem.

Firstly, we assume that the pixels intensities are independent samples from a mixture of the Riesz probability distributions. In order to illustrate the proposed distribution and the algorithm of segmentation, we consider the original real brain MR image and its blocks (see ). The image is of size (183×183) given from the Magnetic Resonance Imaging (MRI) Center of Sfax, Road of El-An, Sfax-Tunisia. In order to segment the brain image, we have divided the image into blocks.

Figure 7. Real brain MR image and its blocks.

Figure 7. Real brain MR image and its blocks.

We modelled each block of the brain image by a Riesz mixture model. We applied the proposed method to each block. Therefore, we suppose that each block of real brain MR image can be divided into two classes (K=2): class of tumor and class of normal tissue. Each block is associated with mixing weights πk, which should satisfy k=12πk=1. To classify the matrix of pixels, we shall first estimate the mixture model parameters by using EM algorithm where initialisation of parameters are satisfied using the Fuzzy C-means algorithm (Kannan et al. Citation2010; Chaira Citation2011). Secondly, the classifier labels a test sample with the class that has the highest likelihood value. The classification result of pixels matrix for each block proved to be accurate and yielded satisfactory results.

In order to visualise the difference between the proposed segmentation algorithm with Riesz distribution and the proposed segmentation algorithm with Wishart distribution, the proposed technique has been applied on various brain images volume (See ).

Figure 8. Segmentation results of the brain image. (a) the segmented image by the riesz distribution, (b) the segmented image by the Wishart distribution.

Figure 8. Segmentation results of the brain image. (a) the segmented image by the riesz distribution, (b) the segmented image by the Wishart distribution.

Figure 9. Segmentation results of the brain image. (c) the segmented image by the riesz distribution, (d) the segmented image by the Wishart distribution.

Figure 9. Segmentation results of the brain image. (c) the segmented image by the riesz distribution, (d) the segmented image by the Wishart distribution.

The segmentation results are plotted in using the Riesz and the Wishart distributions, respectively. It is worth noting that the selection of the exact tumour region is intrinsic for a successful surgical intervention.

Referring to the segmented images by the Riesz ( with respect to the segmented image by the Wishart (, we infer that the proposed clustering algorithms stand for the most reliable method for brain image segmentation. For real brain MR image, we classified the image pixels into two classes using the proposed algorithm with Riesz distribution: tumour tissue and healthy tissue. The dark black area corresponds to the tumour with a few pixels are classified wrongly. Obtained experimental results from brain real image indicate excellent agreement and accuracy with Riesz distribution.

This confirmed further by the quantitative segmentation results in terms of the evaluation criteria which are portrayed in below.

Table 5. The MCR and PSNR values by varying mixture with brain images.

Resting upon , it is clearly noticed that statistical metrics values of the proposed segmentation approach are much closer to the ideal segmented image. The smallest MCR values as well as the highest PSNR values specify the high quality of segmented image using proposed algorithm with mixture Riesz.

For this reason, we may assert that our approach fulfils a good interpretation and a high accuracy.

Remark 1. As for the Riesz probability distribution, we suggest using a mixture of the inverse Riesz distribution (Tounsi and Zine Citation2012) for real life applications like image segmentation. Thus, a classification method based on the Expectation-Maximization algorithm within a maximum likelihood framework needs to be developed.

4. Conclusion and discussion

In the current work, we elaborated an iterative algorithm based on the Riesz mixture model and the EM algorithm for settling image segmentation problem. It is an extension to previous research which was confined uniquely to theoretical results of the Riesz distribution without real application. Moreover, our research corroborates the efficiency of the matrix-variate Riesz probability distributions on symmetric matrices in different fields such as image processing. Synthetic and real images are invested to demonstrate the effectiveness of the proposed method, using both visual presentation and numerical evaluation as Misclassification Rate, Peak Signal to Noise Ratio, Adjusted Rand Index, Rand index, Mirkin Metric and Hubert Index. Riesz distribution proved to better characterise and distinguish the regions with different textural information than the Wishart one. Compared to the classical iterative method based on the Wishart distribution, the proposed strategy can effectively improve the computation efficiency as well as segmentation accuracy. With real brain images, we get two classes: class of brain tumour and class of normal tissue. According to experts, the Riesz distribution has a good characterisation which can be invested to characterise the classes in heterogeneous areas. Hence, the proposed method can obtain more accurate and visually pleasing results than the classical statistical methods. The initialisation of the parameters stands for the first step in the proposed segmentation method and represents the most significant stage which is extremely influential in terms of the final estimation results. The unsatisfactory segmentation results can be obtained when the initial parameters are not well established. As a final note, we would assert that in future research, more diligent efforts would be performed in this area so as to overcome this problem.

Acknowledgements

The authors would like to express their gratitude to the reviewers for the time taken to review the manuscript and their valuable comments and suggestions, which have undoubtably improved the quality of the manuscript. The authors express their hearty gratitude to the Radiologist Dr. Moez Bradai (Magnetic Resonance Imaging Center, Road of El-An Km 0.5, 3000 Sfax, Tunisia) for providing useful brain MR images.

Disclosure statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Funding

The authors declare that there was no funding received for this research.

Notes on contributors

Mariem Tounsi

Mariem Tounsi is an Assistant Professor in Sfax National Engineering School, University of Sfax, Tunisia. She received Ph.D. degree in Mathematics from Faculty of Sciences of Sfax, University of Sfax, Tunisia, in 2013. She also received M.S. degree in Applied Mathematics from Faculty of Sciences of Sfax, in 2009. Her research interests are related to multivariate analysis, random matrices, statistical inference and theory of probability.

Mouna Zitouni

Mouna Zitouni is an Assistant Professor in Higher Institute of Computer Sciences and Multimedia of Gabes, University of Gabes, Tunisia. She received the Ph.D. degree in Mathematics at Faculty of Sciences of Sfax, University of Sfax, Tunisia, in 2017 and the M.S. degree in Applied Mathematics at Faculty of Sciences of Sfax, in 2013. Her research interests include mixture models and image processing, mainly segmentation and classification techniques.

References

  • Bartlett MS. 1934. XX.—on the theory of statistical regression. Proc R Soc Edinb. 53:260–13. doi: 10.1017/S0370164600015637.
  • Bhagwat M. 2021. Statistical evaluation of application specific image segmentation algorithms. Turk J Comput Math Educ. 12(10):4468–4479. doi:10.17762/turcomat.v12i10.5450.
  • Chaira T. 2011. A novel intuitionistic fuzzy c means clustering algorithm and its application to medical images. Appl Soft Comput. 11(2):1711–1717. doi: 10.1016/j.asoc.2010.05.005.
  • Correa-Morris J, Urra-Yglesias A, Reyes E, Martínez J, Gonzalez B. 2022. Comparing partitions: metric characterizations, mean partition, and distortion. Intell Comput Proce 2021 Comput Conf. 1:857–875. Springer.
  • Dempster AP, Laird NM, Rubin DB. 1977. Maximum likelihood from incomplete data via the em algorithm. J R Stat Soc Series B Stat Methodol. 39(1):1–22. doi: 10.1111/j.2517-6161.1977.tb01600.x.
  • Dunn JC. 1973. A fuzzy relative of the isodata process and its use in detecting compact well-separated clusters. J Cybern. 3(3):32–57. doi:10.1080/01969727308546046.
  • Fenyi A, Nkansah E, Eghan E. 2020. Gaussian based image segmentation algorithm. Int J Comput Appl. 175(14):10–16. doi: 10.5120/ijca2020920615.
  • Fouad S, Randell D, Galton A, Mehanna H, Landini G, Tang D. 2017. Unsupervised morphological segmentation of tissue compartments in histopathological images. PLOS ONE. 12(11):e0188717. doi: 10.1371/journal.pone.0188717.
  • Fraley C, Raftery AE. 2002. Model-based clustering, discriminant analysis, and density estimation. J Am Stat Assoc. 97(458):611–631. doi: 10.1198/016214502760047131.
  • Gormley IC, Murphy TB, Raftery AE. 2023. Model-based clustering. Annu Rev Stat Appl. 10(1):573–595. doi: 10.1146/annurev-statistics-033121-115326.
  • Hassairi A, Lajmi S. 2001. Riesz exponential families on symmetric cones. J Theor Probab. 14(4):927–948. doi: 10.1023/A:1012592618872.
  • Hassairi A, Lajmi S, Zine R. 2007. Riesz inverse gaussian distributions. J Stat Plan Inference. 137(6):2024–2033. doi: 10.1016/j.jspi.2005.10.007.
  • Hassairi A, Lajmi S, Zine R. 2008. A characterization of the riesz probability distribution. J Theor Probab. 21(4):773–790. doi: 10.1007/s10959-008-0142-1.
  • Hassairi A, Lajmi S, Zine R. 2017. Some new properties of the riesz probability distribution. Math Methods Appl Sci. 40(16):5946–5958. doi: 10.1002/mma.4445.
  • He M, Chen J. 2022. Consistency of the mle under a two-parameter gamma mixture model with a structural shape parameter. Metrika. 85(8):951–975. doi: 10.1007/s00184-021-00856-9.
  • Hidot S, Saint-Jean C. 2010. An expectation–maximization algorithm for the wishart mixture model: application to movement clustering. Pattern Recognit Lett. 31(14):2318–2324. doi: 10.1016/j.patrec.2010.07.002.
  • Huang Z, Gou Z. 2023. Gaussian mixture model based pattern recognition for understanding the long-term impact of COVID-19 on energy consumption of public buildings. J Build Eng. 72:106653. doi: 10.1016/j.jobe.2023.106653.
  • Jiao L, Denœux T, Liu ZG, Pan Q. 2022. Egmm: an evidential version of the gaussian mixture model for clustering. Appl Soft Comput. 129:109619. doi: 10.1016/j.asoc.2022.109619.
  • Kannan S, Ramathilagam S, Sathya A, Pandiyarajan R. 2010. Effective fuzzy c-means based kernel function in segmenting medical images. computers in biology and medicine. Comput Biol Med. 40(6):572–579. doi: 10.1016/j.compbiomed.2010.04.001.
  • Kessentini S, Tounsi M, Zine R. 2020. The riesz probability distribution: generation and em algorithm. Commun Stat Simul Comput. 49(8):2114–2133. doi: 10.1080/03610918.2018.1513139.
  • McLachlan GJ, Bean RW, Peel D. 2002. A mixture model-based approach to the clustering of microarray expression data. Bioinformatics. 18(3):413–422. doi: 10.1093/bioinformatics/18.3.413.
  • Melnykov V, Maitra R. 2010. Finite mixture models and model-based clustering. Stat Surv. 4:80–116. doi: 10.1214/09-SS053.
  • Mouna Z, Mourad Z, Afif M 2020. Unsupervised image segmentation using thmrf model. In: International Conference on Hybrid Intelligent Systems. Springer. p. 41–48.
  • Panić B, Nagode M, Klemenc J, Oman S. 2022. On methods for merging mixture model components suitable for unsupervised image segmentation tasks. Mathematics. 10(22):4301. doi: 10.3390/math10224301.
  • Pu Y, Sun J, Tang N, Xu Z. 2023. Deep expectation-maximization network for unsupervised image segmentation and clustering. Image Vis Comput. 135:104717. doi: 10.1016/j.imavis.2023.104717.
  • Saladi S, Amutha Prabha N. 2017. Analysis of denoising filters on mri brain images. Int J Imaging Syst Technol. 27(3):201–208. doi: 10.1002/ima.22225.
  • Saygılı A. 2022. Computer-aided detection of COVID-19 from ct images based on gaussian mixture model and kernel support vector machines classifier. Arab J Sci Eng. 47(2):2435–2453. doi: 10.1007/s13369-021-06240-z.
  • Shafiq A, C¸ Olak AB, Lone SA, Sindhu TN, Muhammad T. 2022. Reliability modeling and analysis of mixture of exponential distributions using artificial neural network. Wiley Online Library. doi: 10.1002/mma.8178.
  • Somashekara M, Manjunatha D. 2014. Performance evaluation of spectral clustering algorithm using various clustering validity indices. Inter J Elect Com Comp Eng. 5(6):1274–1276.
  • Sundqvist M, Chiquet J, Rigaill G. 2023. Adjusting the adjusted rand index: a multinomial story. Comput Stat. 38(1):327–347. doi: 10.1007/s00180-022-01230-7.
  • Tounsi M. 2020. The extended matrix-variate beta probability distribution on symmetric matrices. Methodol Comput Appl Probab. 22(2):647–676. doi: 10.1007/s11009-019-09725-5.
  • Tounsi M, Zine R. 2012. The inverse riesz probability distribution on symmetric matrices. J Multivar Anal. 111:174–182. doi: 10.1016/j.jmva.2012.05.013.
  • Tounsi M, Zine R. 2017. Wilks’ factorization of the matrix-variate dirichlet–riesz distributions. Commun Stat Theory Methods. 46(9):4494–4509. doi: 10.1080/03610926.2015.1085565.
  • Tounsi M, Zitouni M. 2021. Matrix-variate lindley distributions and its applications. Braz J Probab Stat. 35(4):721–748. doi: 10.1214/21-BJPS504.
  • Yang Y, Ren H, Hou X. 2022. Level set framework based on local scalable gaussian distribution and adaptive-scale operator for accurate image segmentation and correction. Signal Process Image Commun. 104:116653. doi: 10.1016/j.image.2022.116653.
  • Yin J, Liu X, Yang J, Chu CY, Chang YL. 2020. Polsar image classification based on statistical distribution and mrf. Remote Sens. 12(6):1027. doi: 10.3390/rs12061027.
  • Yu H, Jiang L, Fan J, Xie S, Lan R. 2023. A feature-weighted suppressed possibilistic fuzzy c- means clustering algorithm and its application on color image segmentation. Expert Syst Appl. 241:122270. doi: 10.1016/j.eswa.2023.122270.
  • Zhou K, Yang S. 2020. Effect of cluster size distribution on clustering: a comparative study of k-means and fuzzy c-means clustering. Pattern Anal Appl. 23(1):455–466. doi: 10.1007/s10044-019-00783-6.
  • Zitouni M, Afif M, Zribi M. 2023. Tweedie hidden markov random field and the expectation- method of moments and maximisation algorithm for brain mr image segmentation. Comput Meth Biomech Biomed Eng Imaging Vis. 11(1):67–79. doi: 10.1080/21681163.2022.2045225.
  • Zitouni M, Tounsi M. 2021. Image segmentation using matrix-variate lindley distributions. The 21st International Conference on Intelligent Systems Design and Applications (ISDA 2021):389–398. doi: 10.1007/978-3-030-96308-8_36.
  • Zitouni M, Zribi M, Masmoudi A. 2018. Asymptotic properties of the estimator for a finite mixture of exponential dispersion models. Filomat. 32(19):6575–6598. doi: 10.2298/FIL1819575Z.