An implementation of normal distribution based segmentation and entropy controlled features selection for skin lesion detection and classification

Khan, M. Attique; Akram, Tallha; Sharif, Muhammad; Shahzad, Aamir; Aurangzeb, Khursheed; Alhussein, Musaed; Haider, Syed Irtaza; Altamrah, Abdualziz

doi:10.1186/s12885-018-4465-8

Research article
Open access
Published: 05 June 2018

An implementation of normal distribution based segmentation and entropy controlled features selection for skin lesion detection and classification

M. Attique Khan¹,
Tallha Akram²,
Muhammad Sharif¹,
Aamir Shahzad ORCID: orcid.org/0000-0003-4480-226X³,
Khursheed Aurangzeb^4,5,
Musaed Alhussein⁴,
Syed Irtaza Haider⁴ &
…
Abdualziz Altamrah⁴

BMC Cancer volume 18, Article number: 638 (2018) Cite this article

4803 Accesses
87 Citations
Metrics details

Abstract

Background

Melanoma is the deadliest type of skin cancer with highest mortality rate. However, the annihilation in its early stage implies a high survival rate therefore, it demands early diagnosis. The accustomed diagnosis methods are costly and cumbersome due to the involvement of experienced experts as well as the requirements for the highly equipped environment. The recent advancements in computerized solutions for this diagnosis are highly promising with improved accuracy and efficiency.

Methods

In this article, a method for the identification and classification of the lesion based on probabilistic distribution and best features selection is proposed. The probabilistic distribution such as normal distribution and uniform distribution are implemented for segmentation of lesion in the dermoscopic images. Then multi-level features are extracted and parallel strategy is performed for fusion. A novel entropy-based method with the combination of Bhattacharyya distance and variance are calculated for the selection of best features. Only selected features are classified using multi-class support vector machine, which is selected as a base classifier.

Results

The proposed method is validated on three publicly available datasets such as PH2, ISIC (i.e. ISIC MSK-2 and ISIC UDA), and Combined (ISBI 2016 and ISBI 2017), including multi-resolution RGB images and achieved accuracy of 97.5%, 97.75%, and 93.2%, respectively.

Conclusion

The base classifier performs significantly better on proposed features fusion and selection method as compared to other methods in terms of sensitivity, specificity, and accuracy. Furthermore, the presented method achieved satisfactory segmentation results on selected datasets.

Peer Review reports

Background

Skin cancer is reported to be one of the most rapidly spreading cancer amongst other types. It is broadly classified into two primary classes; Melanoma and Benign. The Melanoma is the deadliest type of cancer with highest mortality rate worldwide [1]. In the US alone, an astonishing mortality rate of 75% is reported due to melanoma compared to other types of skin cancers [2]. The occurrence of melanoma reported to be doubled (increases 2 to 3% per year) in the last two decades, faster than any other types of cancer. American Cancer Society (ACS) has estimated, 87,110 new cases of melanoma will be diagnosed and 9,730 people will die in the US only in 2017 [3]. Malignant melanoma can be cured if detected at its early stages, e.g., if diagnosed at stage I, the possible survival rate is 96%, compared to 5% at its stage IV [4, 5]. However, early detection is strenuous due to its high resemblance with benign cancer, even an expert dermatologist can diagnose it wrongly. A specialized technique of dermatoscopy is mostly followed by dermatologist to diagnose melanoma. In a clinical examination, most commonly adopted methods of visual features inspection are; Menzies method [6], ABCD rule [7], and 7-point checklist [8]. The most commonly used methods are the ABCD (atypical, border, color, diameter) rules and pattern analysis. It is reported that this traditional dermoscopy method can increase the detection rate 10 to 27% [9]. These methods distinctly increases the detection rate compared to conventional methods but still dependent on dermatologist’s skills and training [10]. To facilitate experts numerous computerized analysis systems have been proposed recently [11, 12] which are referred to as pattern analysis/ computerized dermoscopic analysis systems. These methods are non-invasive and image analysis based technique to diagnose the melanoma.

In the last decade, several non-invasive methods were introduced for the diagnosis of melanoma including optical imaging system (OIS) [13], optical coherence tomography (OCT) [14], light scattering (LS) [15], spectropolarimetric imaging system (SIM) [16, 17], fourier polarimetry (FP) [18], polarimetric imaging [19], reectance confocal microscopy (RCM) [20, 21], photo-acoustic microscopy [22], optical transfer diagnosis (OTD) [23], etc. All these above mentioned methods have enough potential to diagnose the skin lesions and also accurate enough to distinguish the melanoma and benign. The optical methods are mostly utilized during a clinal tests to evaluate the presurgical boundaries of the basal cell carcinoma. It can help in drawing boundaries around the region of interest (ROI) in the dermoscopic images. LS skin methods give the information about the micro-architecture, which is represented with small pieces of pigskin and mineral element and helps to determine the extent of various types of skin cancers. The SIM method correctly evaluates the polarimetric contrast of the region of interest or infectious region such as melanoma, compared to the background or healthy region. However, in FP method human skins is observed with laser scattering and difference is identified using optical method for the diagnostic test for differentiating melanoma and benign.

Problem statement

It is proved that malignant melanoma is a lethal skin cancer that is extra dominant between the 15 and above aged people [24]. The recent research shows high rate of failure to detect and diagnose this type of cancer at the early stages [25]. Generally, it consists of four major steps: preprocessing, which consists of hair removal, contrast enhancement, segmentation, feature extraction, and finally classification. The most challenging task in dermoscopy is an accurate detection of lesion’s boundary because of different artifacts such as hairs, illumination effects, low lesion contrast, asymmetrical and irregular border, nicked edges, etc. Therefore, for an early detection of melanoma, shape analysis is more important. In features extraction step, several types of features are extracted such as shape, color, texture, local etc. But, we have no clear knowledge about salient features for classification.

Contribution

In this article, we propose a new method of lesion detection and classification by implementing probabilistic distribution based segmentation method and conditional entropy controlled features selection. The proposed technique is an amalgamation of five major steps: a) contrast stretching; b) lesion extraction; c) multi-level features extraction; d) features selection and e) classification of malignant and benign. The results are tested on three publicly available datasets which are PH2, ISIC (i.e. ISIC MSK-2 and ISIC UDA), and Combined (ISBI 2016 and ISBI 2017), containing RGB images of different resolutions, which are later normalized in our proposed technique. Our main contributions are enumerated below:

1
Enhanced the contrast of a lesion area by implementing a novel contrast stretching technique, in which we first calculated the global minima and maxima from the input image and then utilized low and high threshold values to enhance the lesion.
2
Implemented a novel segmentation method based on normal and uniform distribution. Mean of the uniform distribution is calculated from the enhanced image and the value is added in an activation function, which is introduced for segmentation. Similarly, mean deviation of the normal distribution is calculated using enhanced image and also inserted their values in an activation function for segmentation.
3
A fusion of segmented images is implemented by utilizing additive law of probability.
4
Implemented a novel feature selection method, which initially calculate the Euclidean distance between fused feature vector by implementing an Entropy-variance method. Only most discriminant features are later utilized by multi-class support vector machine for classification.

Paper organization

The chronological order of this article is as follows: The related work of skin cancer detection and classification is described in “Related work” section. “Methods” section explains the proposed method, which consists of several sub steps including contrast stretching, segmentation, features extraction, features fusion, classification etc. The experimental results and conclusion of this article are described in “Results” and “Discussion” sections.

Related work

In the last few decades, advance techniques in different domains of medical image processing, machine learning, etc., have introduced tremendous improvements in computer aided diagnostic systems. Similarly, improvements in dermatological examination tools have led the revolutions in the prognostic and diagnostic practices. The computerized features extractions of cutaneous lesion images and features analysis by machine learning techniques have potential to enroute the conventional surgical excision diagnostic methods towards CAD systems.

In literature several methods are implemented for automated detection and classification of skin cancer from the dermoscopic images. Omer et al. [26] introduced an automated system for an early detection of skin lesion. They utilized color features prior to global thresholding for lesion’s segmentation. The enhanced image was later subjected to 2D Discrete Fourier Transform (DCT) and 2D Fast Fourier Transform (FFT) for features extraction prior to the classification step. The results were tested on a publicly available dataset PH2. Barata et al. [27] described the importance of color features for detection of skin lesion. The color sampling method is utilized with Harris detector and compared their performance with grayscale sampling. Also, compared the color-SIFT (scale invariant feature transform) and SIFT features and conclude that color-SIFT features performs good as compare to SIFT. Yanyang et al. [28] introduced an novel method for melanoma detection based on Mahalanobis distance learning and graph regularized non-negative matrix factorization. The introduced method treated as a supervised learning method and reduced the dimensionality of extracted set of features and improves the classification rate. The method is evaluated on PH2 dataset and achieved improved performance. Catarina et al. [29] described the strategy of combination of global and local features. The local features (BagOf Features) and global features (shape and geometric) are extracted from original image and fused these features based of early fusion and late fusion. The author claim the late fusion is never been utilized in this context and it gives better results as compared to early fusion.

Ebtihal et al. [30] introduced an hybrid method for lesion classification using color and texture features. Four moments such as mean standard deviation, degree of asymmetry and variance is calculated against each channel, which are treated as a features. The local binary pattern (LBP) and gray level co-occurrences matrices (GLCM) were extracted as a texture features. Finally, the combined features were classified using support vector machine (SVM). Agn et al. [31] introduced a saliency detection technique for accurate lesion detection. The introduced method resolve the problems when the lesion borders are vague and the contrast between the lesion and inundating skin is low. The saliency method is reproduced with the sparse representaion method. Further, a Bayesian network is introduced that better explains the shape and boundary of the lesion. Euijoon et al. [38] introduced a saliency based segmentation technique where the background of original image was detected by spatial layout which includes boundaries and color information. They implemented Bayesian framework to minimize the detection errors. Similarly, Lei et al. [32] introduced a new method of lesion detection and classification based on multi-scale lesion biased representation (MLR). This proposed method has the advantage of detecting the lesion using different rotations and scales, compared to conventional methods of single rotation.

From above recent studies, we noticed that the colour information and contrast stretching is an important factor for accurately detection of lesion from dermoscopic images. Since the contrast stretching methods improves the visual quality of lesion area and improves the segmentation accuracy. Additionally, for improved classification, several features are utilized in literature but according to best our knowledge, serial based features fusion is not yet utilized. However, in our case only salient features are utilized which are later subjected to fusion for improved classification.

Methods

A new method is proposed for lesion detection and classification using probabilistic distribution based segmentation method and conditional entropy controlled features selection. The proposed method is consists of two major steps: a) lesion identification; b) lesion classification. For lesion identification, we first enhance the contrast of input image and then segment the lesion by implementation of novel probabilistic distribution (uniform distribution, normal distribution). The lesion classification is done based of multiple features extraction and entropy controlled most prominent features selection. The detailed flow diagram of proposed method is shown in Fig. 1.

Contrast stretching

There are numerous contrast stretching or normalization techniques [34], which attempt to improve the image contrast by stretching pixels’ specific intensity range to a different level. Most of the available options take gray image as an input and generate an improved output gray image. In our research work, the primary objective is to acquire a three channel RGB image having dimensions m×n×3. Although, the proposed technique can only work on a single channel of size m×n, therefore, in proposed algorithm we separately processed red, green and blue channel.

In RGB dermoscopic images, mostly the available contents are visually distinguishable into foreground which is infected region and the background. This distinctness is also evident in each and every gray channel, as shown in Fig. 2.

Considering the fact [35], details are always high with higher gradient regions which is foreground and details are low with the background due to low gradient values. We firstly divide the image into equal sized blocks and the compute weights for all regions and for each channel. For a single channel information, details are given below.

1
Gray channel is preprocessed using Sobel edge filter to compute gradients where kernel size is selected to be 3×3.
2
Gradient calculation for each equal sized block and rearranging in an ascending order. For each block the weights are assigned according to the gradient magnitude.
$$ \Gamma\zeta(x,y) = \left\{\begin{array}{ll} \varsigma_{w}^{b1} & if \upsilon_{c}(x,y)\leq T_{1}; \\ \varsigma_{w}^{b2} & T_{1} < \upsilon_{c}(x,y)\leq T_{2}; \\ \varsigma_{w}^{b3} & T_{1} < \upsilon_{c}(x,y) \leq T_{3}; \\ \varsigma_{w}^{b4} & otherwise \\ \end{array}\right. $$
(1)

where $\varsigma _{w}^{bi}~(i\leq 4)$ are statistical weight coefficient and T_i is gradient intervals threshold.
3
Cumulative weighted gray value is calculated for each block using:
$$ N_{g}(z)=\sum\limits_{i=1}^{4}\varsigma_{w}^{bi} n_{i}(z) $$
(2)

where n_i(z) represents cumulative number of gray level pixels for each block i.
4
Concatenate red, green and blue channel to produce enhanced RGB image.

For each channel, three basic conditions are considered for optimized solution: I) extraction of regions with maximum information; II) selection of a block size; III) an improved weighting criteria. In most of the dermoscopic images, maximum informative regions are with in the range of 25−75%. Therefore, considering the minimum value of 25%, the number of blocks are selected to be 12 as an optimal number, with an aspect ratio of 8.3%. These blocks are later selected according to the criteria of maximal information retained (cumulative number of pixels for each block). Laplacian of Gaussian method (LOG) [36] is used with sigma value of two for edge detection. Weights are assigned according to the number of edge points, E_pi for each block:

$$ B_{wi}=\frac{E_{pi}}{E^{b}_{max}} $$

(3)

where $E^{b}_{max}$ is the block with maximum edges. Finally, adjust the intensity levels of enhance image and perform log operation to improved lesion region as compare to original.

$$ \varphi(AI)=\zeta (B_{wi}) $$

(4)

$$ \varphi(t)=C \times log(\beta + \varphi(AI)) $$

(5)

Where β is a constant value, (β≤10), which is selected to be 3 for producing most optimal results. ζ denotes the adjust intensity operation, φ(AI) is enhance image after ζ operation and φ(t) is final enhance image. The final contrast stretching results are shown in Fig. 3.

Lesion segmentation

Segmentation of skin lesion is an important task in the analysis of skin lesions due to several problems such as color variation, presence of hairs, irregularity of lesion in the image and necked edges. Accurate segmentation provides important cues for accurate border detection. In this article, a novel method is implemented based of probabilistic distribution. The probabilistic distribution is consists of two major steps: a) uniform distribution based mean segmentation; b) normal distribution based segmentation.

Mean segmentation

The uniform distribution of mean segmentation is calculated from enhanced image φ(t) and then perform threshold function for lesion extraction. The detailed description of mean segmentation is defined below: Let t denotes the enhanced dermoscopic image and f(t) denotes the function of uniform distribution, which is determined as $f(t)=\frac {1}{y-x}$. Where y and x denotes the maximum and minimum pixels values of φ(t). Then the mean value is calculated as follows:

$$ \mu = \int_{x}^{y}t \ f(t)\ dt $$

(6)

$$ \quad=\int_{x}^{y}t \ \frac{1}{y-x} \ dt $$

(7)

$$ \quad=\frac{1}{y-x}\left [ \frac{t^{2}}{2} \right ]^{y}_{x} $$

(8)

$$ \quad=\frac{1}{2(y-x)}\left [(y+x)(y-x) \right ] $$

(9)

$$ \mu=\frac{1}{2}\left [(y+x) \right] $$

(10)

Then perform an activation function, which is define as follows:

$$ A(\mu)=\frac{1}{\left (1+\left (\frac{\mu}{\varphi(t)} \right) \right)^{\alpha}}+\frac{1}{2\mu}+ C $$

(11)

$$ F(\mu)=\left\{\begin{array}{ll} 1 & if\ A(\mu)\geq \delta_{thresh}\\ 0 & if\ A(\mu)<\delta_{thresh} \end{array}\right. $$

(12)

where δ_thresh is Otus’s threshold, α is a scaling factor which controls the lesion area and its value is selected on the basis of simulations performed, α≤10, and finally got α=7 to be most optimal number. C is a constant value which is randomly initialized within the range of 0 to 1. The segmentation results are shown in Fig. 4.

Mean deviation based segmentation

The mean deviation (M.D) of normal distribution is is calculated from φ(t) having parameter μ and σ. The value of M.D is utilized by activation function for extraction of lesion from the dermoscopic images. Let t denotes the enhanced dermoscopic image and f(t) denotes the normalized function, which determined as $f(t)=\frac {1}{\sqrt {2\pi }\sigma }e^{-\frac {1}{2}(\frac {t-\mu }{\sigma })^{2}}$. Then initialize the M.D as:

$$ M.D=\int_{-\infty}^{+\infty}\left | t-\mu \right |f(t) $$

(13)

$$ \qquad=\int_{-\infty}^{+\infty}\left | t-\mu \right | \frac{1}{\sqrt{2\pi}\sigma}e^{-\frac{1}{2}\left(\frac{t-\mu}{\sigma}\right)^{2}} dt $$

(14)

Then put $g=\frac {t- \mu }{\sigma }$ in Eq. 14.

$$ M.D=\frac{1}{\sqrt{2\pi}\sigma}\int_{-\infty}^{+\infty}\left | \sigma g \right | e^{\frac{-g^{2}}{2}} dg $$

(15)

$$ \qquad=\frac{\sigma}{\sqrt{2\pi}}\left [ \int_{0}^{\infty}g \ e^{\frac{-g^{2}}{2}} dg + \int_{0}^{\infty}g \ e^{\frac{-g^{2}}{2}} dg \right ] $$

(16)

$$ M.D=\frac{2\sigma}{\sqrt{2\pi}} \int_{0}^{\infty}g \ e^{\frac{-g^{2}}{2}} dg $$

(17)

Put $\frac {g^{2}}{2}=l$ in Eq. 17 and it becomes:

$$ M.D=\frac{2\sigma}{\sqrt{2\pi}} \int_{0}^{\infty}\sqrt{2l} \ e^{-l} \ \frac{dl}{\sqrt{2l}} $$

(18)

$$ \qquad=\frac{2\sigma}{\sqrt{2\pi}} \int_{0}^{\infty} e^{-l} \ dl $$

(19)

$$ \qquad=\sqrt{\frac{2}{\pi}}\sigma \left [ \frac{e^{-l}}{-1} \right ]^{\infty}_{0} $$

(20)

$$ \qquad=-\sqrt{\frac{2}{\pi}}\sigma \left [ \frac{1}{e^{l}} \right ]^{\infty}_{0} $$

(21)

$$ \qquad=-\sqrt{\frac{2}{\pi}}\sigma (-1) $$

(22)

Hence

$$ M.D=0.7979 \sigma $$

(23)

Then perform an activation function to utilize M.D as:

$$ AC(M.D)=\frac{1}{\left (1+\left (\frac{M.D}{\varphi(t)} \right) \right)^{\alpha}}+\frac{1}{2 \ M.D}+ C $$

(24)

$$ F(M.D)=\left\{\begin{array}{ll} 1 & if\ AC(M.D)\geq \delta_{thresh}\\ 0 & if\ AC(M.D)< \delta_{thresh} \end{array}\right. $$

(25)

The segmentation results of M.D is shown in Fig. 5.

Image fusion

The term image fusion mean to combine the information of two or more than two images in one resultant image, which contains better information as compare to any individual image or source. The image fusion reduces the redundancy between two or more images and increase the clinical applicability for diagnosis. In this work, we implemented a union based fusion of two segmented images into one image. The resultant image is more accurate and having much information as compare to individual. Suppose N denotes the sample space, which contains 200 dermoscopic images. Let X₁∈F(μ) which is mean segmented image. Let X₂∈F(M.D) which M.D based segmented image. Let i denotes the X₁ pixels values and j denotes the X₂ pixels values and S denotes the both i and j pixels which pixels values are 1. It mean all 1 value pixels fall in S. Then X₁∪X₂ written as:

$$ X_{1}\cup X_{2}=(X_{1} \cup X_{2})\cap \phi $$

(26)

$$ P(X_{1}\cup X_{2})=P((X_{1} \cup X_{2}))\cap P(\phi) $$

(27)

$$ =\left\{\begin{array}{lll} \xi((X_{1}, X_{2})==1) & if &(i,j) \in z_{1} \\ \xi((X_{1}, X_{2})==0) & if & (i,j) \in z_{2} \end{array}\right. $$

(28)

Where z₁ represented as ground truth Table 1.

Table 1 Ground truth table for z₁

An implementation of normal distribution based segmentation and entropy controlled features selection for skin lesion detection and classification

Abstract

Background

Methods

Results

Conclusion

Background

Problem statement

Contribution

Paper organization

Related work

Methods

Contrast stretching

Lesion segmentation

Mean segmentation

Mean deviation based segmentation

Image fusion

Analysis

Image representation

HOG features

Harlick features

Color features

Features fusion

Features selection

Results

Evaluation protocol

Datasets & results

PH2 Dataset

ISIC dataset

ISBI - 2016 & 17

Discussion

Conclusion

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Competing interests

Publisher’s Note

Additional information

Availability of data and materials

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Cancer

Contact us