Postępy Dermatologii i Alergologii

1/2026 vol. 43
Original paper

Hexa classification of erythemato-squamous disease using deep dual features-based neural network

  1. Department of Information Technology, Mahendra Engineering College (Autonomous), Mahendhirapuri, Namakkal, India
  2. Department of Electrical and Electronics Engineering, Mahendra Engineering College (Autonomous), Mahendhirapuri, Namakkal, India
Adv Dermatol Allergol 2026; XLIII (1): 70-85
Data publikacji online: 2026/02/10
Article file
Hexa.pdf

Introduction

Skin disease detection has become a crucial area of research and technological advancement, driven by the growing significance of early diagnosis and treatment in dermatology [1]. Skin disorders are a broad category that includes anything from prevalent issues including acne and eczema to more serious illnesses like malignancy and psoriasis. Timely and precise analysis is vital for effective medical intervention and patient care [2, 3]. Some skin disorders are not fatal but can significantly impact quality of life by causing chronic symptoms, and severe pain. It is crucial to make a precise analysis and timely, appropriate management of dermatological conditions due to their variability in appearance and progression, even among highly qualified medical professionals [4].

The erythemato-squamous disease (ESD) condition is one of the most common skin conditions in dermatology. A reddening of the epidermis causes damage to and loss of the epidermis [5, 6]. Precise and appropriate detection of ESD is vital for current medical intervention and treatment planning. A dermatologist with an extensive and relevant understanding of these disorders is required for the patient [7]. During thorough observation, the ESD may present differently in various patients and may have distinct clinical symptoms in various body parts [8, 9]. The assessment of the ESD in dermatology is difficult. Patients are initially clinically assessed according to 12 features which are crucial indicators in the classification of ESDs [10].

In recent days, machine learning (ML) [11] and deep learning (DL) [12] have become significant approaches for diagnosing diverse ailments. The use of advanced imaging methods, ML algorithms, and computer-aided diagnostic tools has shown promise in enhancing the efficiency and precision of detection, allowing healthcare professionals to offer tailored and prompt care to individuals affected by these skin disorders [13, 14]. Dermatologists find it difficult to diagnose ESD in dermatology since these conditions share some similarities while still exhibiting subtle differences. These illnesses share all clinical and histological characteristics except for a few minor exceptions. The major contributions of this research are described as follows:

  • A novel Hexa-ESD framework is introduced for efficiently classifying clinical skin images into various skin diseases.

  • The clinical skin images are gathered and pre-processed using the Contrast stretching Adaptive Histogram Equalization (CSAHE) technique for eliminating the noise artifacts.

  • The deep learning-based Duo network, which is an integration of DarkNet and ShuffleNet is employed to retrieve the spatial features from the pre-processed images.

  • Then, these relevant features are fed into the walrus optimization (WalO) algorithm by dealing with complex non-linear problems for selecting the best features.

  • These selected features are fused for classification using deep belief network (DBN) to detect the hexa ESD cases namely lichen planus, seborrheic dermatitis, psoriasis, pityriasis rosea, chronic dermatitis, and pityriasis rubra pilaris.

The rest of the work was arranged as follows. The literature works are reviews the related literature in the part Literature reviews. Material and methods presents a comprehensive description of the proposed Hexa-ESD framework for classifying hexa ESD cases using clinical skin images. Results and discussion covers the experimental results and their interpretation. Finally, Conclusions concludes the study and outlines future research directions.

Literature review

Numerous researchers have used digital image processing and classification techniques to discover skin illnesses. A wide range of works contain recent developments in ML and DL methodologies.

In 2021 Shastri et al. [15] presented the Grading-AdaBoost (GB) ensemble framework for analysing and predicting ESD. The first stage used distinct classifiers without any ensemble techniques, while the second stage used dynamic meta-classifiers and dynamic base-classifiers for model generation. The best classifier from phase 1 was associated with the finest GB ensemble set from the second phase to determine the optimal method for ESD identification. The suggested system utilizing the GB ensemble model outperformed all previous studies using this dataset by 99.45%.

In 2022, Elsayad et al. [16] designed a Bayesian Optimization – Support Vector Machine (BO-SVM) model to differentiate the ESD dataset from the University of California Irvine machine learning (UCI ML) repository. This dataset includes the fallouts of medical and histological tests for various 6 skin disorders. The recommended BO was employed by the Gaussian process (GP) method with the Matérn covariance filter. The fallouts demonstrate the benefit of multiple classes BO-SVM for testing classification accuracy of 99.07%.

In 2022, Alotaibi [17] integrated feature optimization and a prediction strategy to develop a hybrid model. This proposed hybrid methodology comprised two stages: initially, the ReliefF Algorithm was employed to identify optimal features for ESD, followed by the k-nearest neighbour (KNN) technique for predicting the best features selected. The experimentation utilized a benchmark dataset for ESD. Additionally, a KNN technique was utilized to assess the hybrid model, yielding a classification accuracy of 94.5%.

In 2020, Putatunda [18] devised the Derm2Vec hybrid deep learning method for diagnosing ESD. Derm2Vec is a combined deep learning model integrating Deep Neural Networks and Autoencoders. Derm2Vec and Deep neural network (DNN) were utilized alongside other traditional machine learning techniques utilizing a practical dermatology dataset. The results show that Derm2Vec is the most successful strategy, with DNN and Extreme Gradient Boosting (XGBoost). These techniques achieve mean scores of 95.70%, 96.55%, and 96.72%, respectively.

In 2021, Rajashekar [19] had an Enhanced Pipeline Feature Selection technique specifically designed for ESD recognition. This system selects pertinent features and crucial attributes to minimize dimensionality. The combination of a recursive deletion feature and an enhanced pipeline technique aims to identify optimal features in multi-class problems, offering an integrated approach to resilient system design. The algorithms categorized and sequenced five distinct classifier types including Random Forest, Classification and Regression Tree (CART), Multi-Layered Perceptron, Logistic Regression, and Gradient Boost technique.

In 2020, Igodan et al. [20] employed ensemble feature selection methods to assess the effectiveness of methods resulting from ML frameworks. To identify unique feature subsets within the clinical skin dataset, various filter-based feature selection approaches such as gain ratio, information gain, χ2, and reliefF were applied, along with embedded feature selection techniques.

Studies mentioned above indicate that skin ESD diseases have been identified using a variety of methods in recent years. Researchers used methods such as pre-processing the images, segmenting the disease sections, and classifying skin diseases using some training models to identify and categorize ESD diseases. However, the existing works do not focus on the feature extraction stage for early detection of ESD diseases, which effectively diagnose ailments in clinical skin images. Our research work aims to design a DL approach for the early recognition of skin diseases in terms of their feature variations, facilitating accurate classification of ESD cases and simplifying disease analysis in various situations.

Material and methods

This section introduced a novel Hexa-ESD framework to classify the ESD cases from the available clinical skin dataset. In this model, a Duo-Net is introduced for extracting the spatial features using DarkNet and ShuffleNet. The features are selected using the WalO algorithm to classify the hexa cases of skin diseases. The schematic workflow is presented in Figure 1.

Figure 1

Schematic depiction of the proposed Hexa-ESD Model

/f/fulltexts/PDIA/57527/PDIA-43-1-57527-g001_min.jpg

Data description

In this research work, we have created a self-prepared dataset by gathering different clinical skin images from available sources (https://www.kaggle.com/datasets/shubhamgoel27/dermnet). For this preparation, we have collected these images based on the information in the UCI repository [21] (https://archive.ics.uci.edu/dataset/33/dermatology) gathered by Ubeyli and Guler to classify the ESD types. Among the 33 features and predictions, 12 clinical features (e.g., erythema, scaling, itching, Koebner phenomenon, different limits, polygonal papules, and (g) follicular papules) are included. Other factors include (h) involvement of the oral mucosa, (i) involvement of the knees and elbows, (j) involvement of the scalp, (k) family history, and (l) age. There are 366 observations in this dataset that include eight missing values for the “Age” variable [2226].

In our self-prepared ESD_DATASET_348, the clinical skin images are manually relabelled from the Kaggle dataset based on the organized classification provided by the UCI repository. This method enhanced the quality of the dataset and preserved uniformity in class definitions. The sample images from our self-prepared dataset are displayed in Figure 2 with hexa different classes of ESD. All the hexa classes are signified due to the 155 images provided by UCI and the 193 images provided by Kaggle as illustrated in Table 1. The distribution shows the class-specific totals by maintaining data sources stable. The target variables are hexa classes and each class includes psoriasis (111 images), seborrheic dermatitis (52 images), lichen planus (71 images), pityriasis rubra pilaris (20 images), chronic dermatitis (46 images), and rosea (48 images).

Table 1

Distribution of images from Kaggle and UCI Datasets

ClassKaggle imagesUCI imagesTotal images
Psoriasis6150111
Seborrheic dermatitis262652
Lichen planus403171
Pityriasis rubra pilaris101020
Chronic dermatitis262046
Pityriasis rosea301848
Total193155348
Figure 2

Sample clinical skin images of our self-prepared dataset

/f/fulltexts/PDIA/57527/PDIA-43-1-57527-g002_min.jpg

Data pre-processing

In this denoising phase, every original intensity value is replaced, and the comparison of histograms is conducted through a locally modified contrast-stretching adjustment. The CSAHE is a combination of contrast stretching [27] and adaptive histogram equalization [28] to remove the noise from skin images. The new range is allocated to each pixel by employing a flexible transfer-function derived from the characteristics of the input images. The linear filters that are formed by merging the pixel intensity values of adjacent inputs yield the output pixel values from the input images. This method pursues to increase the contrast between various structures, improve image visualization, and aid in the identification of various ESD cases by applying CSAHE on images. The original level is delivered to every pixel by use of a configurable transfer function that is generated from the attributes of the input images. Assuming that that refers to the input image and that the range of input intensity is calculated.

(1)
rg=ILmaxLminI

Where Lmax and Lmin are represented as maximum and minimum intensity values of the input clinical skin image respectively. Now an additional intensity is included for each pixel stated in equation (2)

(2)
Pr={Liqi   ifLi=LmaxLi+qi        ifLi=Lmin

where qi denotes the adaptive intensity offset added to each pixel, computed based on the local intensity distribution to enhance contrast between structures. The pixel values are modified using the formulas described previously. These values depend on the local image characteristics in which qi is calculated based on the local intensity distribution of the image to enhance contrast between structures.

(3)
fi=Lmin+rj|(QnQmin)/(QmaxQmin)|

Here, fj is the adjusted pixel intensity after transformation, Li defines the intensity value of the ith pixel, rj is the scaling factor for the jth pixel, and Qmin and Qmax denote the minimum and maximum gray-level values within the local image region. These pixels are targeted for adjustment according to the conditions in (3). The result is an enhanced image feature and reduced noise in the image. Image manipulation uses histogram equalization to re-distribute pixel intensities for improved contrast. The final resultant image is generated by combining all the sub-images.

Data augmentation

In this phase, various augmentation methods like rotating, zooming, flipping, and rescaling are utilized for increasing the input samples. The sample size was increased using standard augmentation methods such as scaling, flipping, and zooming, introducing more variation to the training set. This allows the model to train on a varied range of clinical skin images, reducing overfitting and improving sensitivity and predictive performance. Through these changes, the training set contains a diverse group of images. The major variations can be incorporated into the training process due to the variety of clinical skin images included in the dataset. Thus, we can lessen overfitting and enhance the sensitivity and predictive abilities of our proposed model. We scaled each image to a fixed 64 × 64 pixel size and extracted RGB values from each image to use as features. The description of the self-prepared dataset before and after augmentation is shown in Table 2. The self-prepared ESD_DATASET_348 is divided into 1120 images for training (80%) and 620 images for testing (20%). The seven different angles of a clinical skin image are shown in Figure 3.

Table 2

Description of self-prepared ESD_DATASET_348

ESD classesBefore augmentationAfter augmentationTotal images
Class 0111444555
Class 152208260
Class 271284355
Class 346184230
Class 448192240
Class 52080100
Total34813921740
Figure 3

Augmentation output of a sample clinical skin image

/f/fulltexts/PDIA/57527/PDIA-43-1-57527-g003_min.jpg

Duo-Net for feature extraction

The proposed Duo-feature extraction network is a hybridization of DarkNet and ShuffleNet for extracting the relevant features. The structure of the Duo-feature extraction network is shown in Figure 4.

Figure 4

Architecture of the proposed Duo Feature Extraction Net

/f/fulltexts/PDIA/57527/PDIA-43-1-57527-g004_min.jpg

DarkNet-53

In this phase, DarkNet-53 (DN-53) [26] is employed for extracting relevant features from the input clinical skin images. DN-53 splits the feature maps into two parts and processes each half independently via separate network branches. This network is used in various computer vision tasks for feature extraction. DN-53 consists of 53 convolutional layers organized into multiple blocks. The DN-53 uses a DBL module that comprises a convolution layer of DarkNet (DNconv), batch normalization (BN), and a Leaky-ReLU activation function. The architecture employs skip connections, which allow information to bypass one or more layers and flow through the network more efficiently. This leads to improved data exchange and enhanced feature reuse across different stages of the network. In DN-53, convolution is performed in the convolutional layers, and the architecture has been optimized to merge feature maps from the early stages of the network to the last layers. The convolutional layer is applied to retrieve the vital characteristics from the input data using convolutional filters. Let the input feature map be represented as Iw× Ih× Ic, where Iw, Ih, and Ic denote the width, height, and number of channels respectively.

This correlated portion is made up of “n” residual units (Res-unit) and the residual block (Resblock). A residual unit is then created by connecting two DBL parts. The branch of the first dimension convolves and up-samples the feature map to create new features. The third residual block of DarkNet-53 is then fused with the fourth residual block to enhance feature representation. Finally, the convolutional layer and DBL layer are employed to obtain the feature map with the result. To create new features, the dimension of the feature map is convolved, up-sampled by the second dimension’s branch, and spliced and fused with the third residual block of DN-53. After that, a convolutional layer and a DBL layer are used to generate the final result.

The model applies max-pooling multiple times, adjusting the size of the input windows to build the feature set while using convolutional layers to extract feature maps from images. Initially, input feature maps of varying sizes undergo three convolution steps, followed by max-pooling with various kernel sizes. This approach allows the model to detect and analyse features at different scales and sizes. Max-pooling layers downsampled the spatial dimensions to capture features more effectively. The resulting feature maps represent hierarchical characteristics from the input image, and the final DN-53 model balances speed and accuracy, improving network performance with high efficiency.

ShuffleNet

ShuffleNet [29] is utilized in feature extraction that removes redundant features from the clinical skin images. ShuffleNet downsampling introduces two main changes. Rather than appending the data, first a 3x3 average pooling layer is created with a stride of 2, then use channel cascaded to increase the channel dimension at a low cost. The input is provided as follows:

(4)
I=Nc×Hc×Wc

where I defines input, and Nc indicates the number of channels. Hc and Wc defines the height and width of feature maps. The flops required for ShuffleNet are derived in (5),

(5)
Fsh=HcWc(2CM/(g+9M))

where g signifies the number of groups which indicates the number of channels and bottleneck channels M. In equation (4), CM is the product of input channels (C) and bottleneck channels (M) indicating the computation cost of grouped convolutions. The channels were reduced and processed channel-by-channel during ShuffleNet operations. GAP is used in the shuffling phase that the surface features generated by channel-wise data. The spatial dimensions of the input feature map, represented by Hc and Wc are key factors in determining the global average pooled feature vector ZRNc, which uses global average pooling as its basic aggregation method.

(6)
Z=Fsh(u)=1/(Hc×Wc)Σi=1HcΣj=1Wcu(i,j)

First, a 3x3 convolutional kernel-corresponding single-channel feature maps are generated for each channel. There is no difference in the results of pointwise convolution and ordinary convolution, but significantly less processing is required. The output is passed to leaky ReLU to reduce the non-linearity and gives the value for the threshold that cannot neglect the small changes in the input. The activation function S is determined as,

(7)
S=F(Z,Wc)=σ(g,(Z,Wc))=σ(Wjδ(WiZ))

where δ denotes the Leaky ReLU activation function, and W i, Wj ∈ R represent the learnable weight parameters of the feature extraction layers. Leaky ReLU is used to overcome the difficulties of dying ReLU problems. After the process of concatenation, this network takes feature maps as inputs, which reduces computational costs and increases training capacity.

Feature selection

Feature selection reduces the dimensions of data by choosing the optimal feature group from a high-dimension feature space according to predefined criteria. A feature space also contains redundant, irrelevant, and relevant features. Decreasing the number of features extracted from images and choosing a subset from the available features. A feature selection technique is widely used for both classifying previously unknown important characteristics and removing redundant features that do not contribute to categorization. The process of feature selection is frequently used to eradicate unrelated features to categorize previously unknown traits. The WalO algorithm [30] is a feature selection procedure that chooses the significant features from ShuffleNet and DarkNet based on the natural behaviour of walruses.

In this WalO algorithm, the initial input population is set as 50 walruses with each individual representing a unique feature set. Each solution is defined by 20 variables, corresponding to different features derived from the input. The algorithm iterates through 100 generations, where it refines these solutions by evaluating and selecting the best features. These parameters are typical for balancing exploration and computational cost in feature selection tasks to accurately classify the hexa ESD cases. To determine which features are most pertinent, the retrieved features are fed into the WalO algorithm. The WalO algorithm procedure uses the retrieved features to select the best features. The searcher members of the population used by the population-based metaheuristic WalO algorithm are walruses. Every walrus in the WalO algorithm signifies a possible solution to the optimization problem. The stopping criterion is set as a maximum of 100 generations when the change in the objective function (OF) falls below the consecutive generations which indicates the convergence rate towards an optimal solution. Based on the walrus’s natural activities, the procedure of updating its position in the WalO algorithm is divided into three phases as shown in Figure 5.

Figure 5

Flow chart of the Walrus optimization algorithm

/f/fulltexts/PDIA/57527/PDIA-43-1-57527-g005_min.jpg

Phase 1. Feeding strategy: Walruses consume about 60 different class of marine animals. It is the strongest walrus in the group with the strongest tusks who lead the others in their hunt for food. There is a correlation between the walrus’ tusk length and the calibre of the OF values of the potential solutions. Thus, the strong walrus in the group is the solution with the highest OF value. In the global search, the WalO algorithm is enhanced by the numerous scanning zones of the search space produced by the walruses.

The exploration strength of the WalO algorithm in global search is enhanced by the walruses’ habitual scanning patterns, which result in diverse scanning areas of the search area. A statistical simulation of walrus position updates is done using (8) and (9), with the most important member of the group serving as guidance. This operation generates a new site for walrus based on equation (8). In equation (9), the new location takes the place of the preceding one if it increases the OF.

(8)
Wi,jpl=Wi,j+ri,j(Sj(fi,j×Wi,j))
(9)
Wi={Wip1,   Fip1<FiWi,else

where Wip1 denotes the newly created position of the ith walrus, and wi,jp1 represents its jth dimensional component. The fitness of this new position is evaluated using the objective function Fip1. The variable ri,j is a random number uniformly distributed in the range [0,1], while Sj indicates the position of the strongest (best) walrus in the jth dimension. The index fi,j∈ {1,2} refers to a randomly selected feature, and β = 2 is the exploration control parameter of the WaLO algorithm. It results in important and extensive variations to the positions of walruses related to 1, which represents the typical state of this relocation.

Phase 2. Migration strategy. Walruses migrate to stony beaches or outcrops when the air warms in the late summer. In the WalO algorithm, migration is used to guide the walruses to areas that are suitable for discovery. Walruses travel to different destinations in distinct regions of the search space according to this model. This results in the generated new place being generated with equation (10). If the OF increases, then this new location takes the position of the walrus.

(10)
Wi,jp2={Wi,j+ri,j(Wk,j(fi,j×Wi,j)),  Fk<FiWi,j+ri,j(Wi,jWk,j),else
(11)
Wi={Wip2,Fip2<FiWi, else)

where Wip2 denotes the newly created position of the ith walrus after migration with wi,jp2 represents its jth dimension. The fitness corresponding to this position is expressed as Fip2. Here wk,j refers to the jth dimensional component of a randomly selected walrus, where the index k ∈ {1,2,3,…,N} is chosen randomly from the population.

Phase 3. Escaping strategy. Walruses are continually attacked by killer whales and polar bears. Through their strategies to escape and protect themselves from these predators, walruses adopt different postures around their sites. Best-escaping walrus signified the finest feature, and the worst-escaping walrus signified the worst feature. The finest part of the trip is avoiding polar bears and killer whales. There is nothing worse than when killer whales or polar bears kill people. During the feature selection, the worst feature was removed. The following procedure briefly explains this feature selection method. The WalO method simulates this behaviour by creating a new position within the vicinity of each walrus, using (12) and (13).

In this case, equation (14) states that the new location takes over the preceding site if the goal function value is increased.

(12)
Wi,jp3=Wi,j+(lblocal,jc+(ublocal,jcr×lblocal,jc))
(13)
Local bounds:{lblocal,jc=lbj/cublocal,jc=ubj/c
(14)
Wi={(Wip3,Fip3<FiWi,  else

where Wip3 defines the newly created position of the ith walrus in the third phase, wi,jp3 refers to its jth dimension, F denotes the objective function, t is the iteration index, lbj and ubj are the lower and upper bounds of the jth variable respectively. The parameters lblocal,jc and ublocal,jc represent the localized lower and upper bounds used to simulate the search area within potential solutions.

Classification

A random set of features is added after multiplying the input by the feature vectors. DBN with additional categorization is derived from the WalO algorithm, which uses the features acquired from attacking and migrating operations. DBN is a probabilistic generation module made up of several Restricted Boltzmann machines (RBM). The visible and hidden units of the RBMs are connected but not to one another internally, and the hidden units can receive input from the higher-order relationship of the visible units. Energy-based approaches characterize the joint distribution of RBMs as,

(15)
P(v,h;θ)exp(E(v,h;θ))/Z

where z = ΣvΣh exp (–E(v, h; θ)), the probability of the visible unit being active is

(16)
P(v;θ)=(Σhexp(E(v,h;θ))/Z

The energy function for RBMs, featuring binary units in both the visible and hidden layers is expressed as follows:

(17)
E(v,h;θ)=Σi=1MΣj=1NWijvihjΣi=1MbiviΣj=1Najhj

Among the visible and hidden units, wij represents the connection weight between visible unit vi and hidden unit hj, while bi and aj denote the bias terms of the visible and hidden layers respectively. The variables M and N indicate the number of visible and hidden units. The DBN uses the cross-entropy loss function that estimates the variance between the detected prospect distribution and the actual distribution in ESD classification. It assigns higher penalties to incorrect predictions and guides the DBN to improve its accuracy. The cross-entropy loss is calculated as,

(18)
CEloss=Σi=1Nyilog(yj^)

where yi is the true class label, yj^ is the predicted probability of the ith class and N denotes the total number of classes. This network categorizes the input image into hexa classes of ESD based on the relevant features of the WalO algorithm.

Table 3 illustrates the hyperparameters of the DBN which controls the learning process and overall performance. The learning rate (0.001) defines the step size during gradient updates for ensuring steady learning. Weight decay (0.0001) regularizes the network by reducing overfitting. A batch size of 64 ensures efficient updates by processing data in manageable chunks, while 200 epochs allow sufficient training without overfitting. The dropout rate (0.3) prevents overfitting by randomly deactivating neurons, and weight decay (0.0001) regularizes the DBN to avoid large weight updates. Hidden layers and neurons with 512, 256, and 128 units were chosen to capture relevant features while reducing complexity. During the hyperparameter tuning process, 10% of the training data was reserved for validation. As a result, the efficiency of the proposed Heax/Hexa-ESD was optimized before being tested on the test set. The cross-entropy loss function minimizes the error among detected and actual outputs by guiding the learning process of the DBN in the ESD classification task.

Table 3

Hyperparameter setting of DBN

HyperparametersValues
Learning rate0.001
Number of hidden layers3
Batch size64
Epochs200
Dropout rate0.3
Weight decay0.0001
Loss functionCross entropy

Results and discussion

The experimental arrangement was executed with Matlab-2020b, and the capability of the proposed Hexa-ESD model was evaluated with various efficiency metrics.

Clinical skin images for the input are gathered from the accessible dataset and evaluated using different metrics. In addition to the capabilities of the proposed Hexa-ESD, the comparison provides a thorough explanation and scrutiny of the overall accuracy rate. Additionally, the proposed network is compared with existing DL networks.

Figure 6 portrays the fallouts of the proposed Hexa-ESD framework with the self-prepared images to classify the ESD cases. Besides, the skin images from the collected dataset were denoised by the CSAHE technique to eradicate the unwanted artifacts and augment these images to increase the dataset. The next method was carried out in two steps by segmenting the images by retrieving the relevant features from the denoised images with Duo-feature extraction networks. DBN is applied to categorize the ESD into 6 different cases in terms of the best features.

Figure 6

Experimental fallouts of the proposed Hexa-ESD model

/f/fulltexts/PDIA/57527/PDIA-43-1-57527-g006_min.jpg

Competence analysis

The competence of the Hexa-ESD Model was assessed with the network metrics viz., precision (P), F1 score (F1), specificity (S), accuracy (A), and recall (R).

(19)
A=((TP+TN))/((TP+TN+FN+FP))
(20)
P=TP/(TP+FP)
(21)
S=TN/(TN+FP)
(22)
R=TP/(TP+FN)
(23)
F1=2((P*R)/(P+R))

where TP and TN denote the true positives and true negatives of the sample images, respectively, while FP and FN represent the false positives and false negatives of the input images. For the experimental setup, the hexa classes of ESD are defined as “class-0” for psoriasis, “class-1” for seborrheic dermatitis, “class-2” for lichen planus, “class-3” for pityriasis rosea, “class-4” for chronic dermatitis, and “class-5” for pityriasis rubra pilaris. The competence of the Hexa-ESD model for classifying several forms of ESD is tabulated in Table 4 and it is visually shown in Figure 6.

Table 4

Competence valuation of the proposed Hexa-ESD framework

ClassesClass nameASPRF1
Class-0Psoriasis98.2597.1897.2197.1498.62
Class-1Seborrheic dermatitis97.1596.2396.1897.3298.25
Class-2Lichen planus98.0597.0297.2298.0898.43
Class-3Pityriasis rosea97.1296.1497.4298.1598.52
Class-4Chronic dermatitis97.0895.0696.2596.4298.28
Class-5Pityriasis rubra pilaris98.5298.1798.1198.1498.02

The Hexa-ESD framework classified the six different ESD classes from class-0 to class-5 as shown in Figure 7. The proposed Hexa-ESD is measured in terms of A, P, R, S, and F1 respectively. The proposed Hexa-ESD model reaches A of 97.69%. Also, the proposed Hexa-ESD method displays an overall S of 96.63%, P of 97.06%, R of 97.54%, and an F1 of 98.35% respectively.

Figure 7

Classification performance scrutiny for Hexa classes

/f/fulltexts/PDIA/57527/PDIA-43-1-57527-g007_min.jpg

Figure 8 depicts the accuracy curve, showcasing the accuracy on the x-axis against the number of epochs on the y-axis. The accuracy of the Hexa-ESD increases with more training epochs are illustrated in Figure 9, where the loss decreases as the epochs progress. In this study, we aim to determine how many training epochs are best for obtaining high testing accuracy. The results indicate that after 200 epochs, the Hexa-ESD framework attains a notable classification accuracy of 97.69%, with a low error rate.

Figure 8

Accuracy graph of the proposed Hexa-ESD model

/f/fulltexts/PDIA/57527/PDIA-43-1-57527-g008_min.jpg
Figure 9

Loss graph of the proposed Hexa-ESD model

/f/fulltexts/PDIA/57527/PDIA-43-1-57527-g009_min.jpg

Moreover, we evaluated the performance of the proposed Hexa-ESD using 3-fold and 5-fold cross-validation to ensure the robustness and reliability of the Hexa-ESD framework. The dataset was split into respective folds, and the results were averaged across all runs with two different cross-validations (CV) as illustrated in Table 5.

Table 5

Cross-validation outcomes of the proposed model

Metric3-fold CV5-fold CV
Accuracy (A)96.8%97.1%
Precision (P)96.4%96.8%
Recall (R)96.6%96.9%
F1-score (F1)96.5%97.0%
Specificity (S)96.3%96.7%

In addition, the 3-fold and 5-fold CV results support the robustness of the Hexa-ESD framework. In the 3-fold and 5-fold CV, the dataset is separated into 577 and 328 images for testing, 1173 and 1412 images for training. The fallouts of a 5-fold CV indicate that more folds can lower bias and variability and produce more reliable performance results. Those findings confirm that the suggested framework generalizes effectively across different data partitions, which is consistent with the previously reported accuracy of 97.69% based on a single 80-20 split of the dataset.

Comparative analysis

The predicted performance of each DL network was measured to confirm that the proposed Hexa-ESD framework achieves a high level of accuracy. Competency evaluations compared the Hexa-ESD framework to four DL networks, including CNN and RNN methods. The Hexa-ESD framework demonstrated superior accuracy, achieving a rate of 97.69%, outperforming traditional DL networks. Various parameters were used to evaluate the efficacy of each DL network during the assessment.

To maximize classification accuracy for ESD instances, a comparison of various conventional models was performed, as detailed in Table 6. Figure 10 presents a visual analysis between the Hexa-ESD framework and these conventional networks. Notably, models like AlexNet, DenseNet, and LeNet demonstrate lower accuracy compared to ShuffleNet and DarkNet. The proposed ShuffleNet increases the accuracy by 6.48%, 5.35%, and 2.21% better than AlexNet, DenseNet and LeNet respectively. Furthermore, the proposed DarkNet increases the accuracy by 10.08%, 9.0%, and 5.96% better than AlexNet, DenseNet and LeNet respectively. From this analysis, the proposed Hexa-ESD framework achieves a better accuracy of 97.69% due to the combination of ShuffleNet and DarkNet models.

Table 6

Comparison among conventional networks

Conventional networksSPRF1A
AlexNet [22]88.5287.1987.1688.2188.15
DenseNet [23]86.1785.4285.3287.0389.21
LeNet [24]90.0489.1488.0589.1492.17
ShuffleNet [25]98.1193.0594.8294.5194.26
DarkNet [26]97.2897.0496.8096.2898.04
Figure 10

Visual comparison of conventional networks and Duo-Net

/f/fulltexts/PDIA/57527/PDIA-43-1-57527-g010_min.jpg

Table 7 compares the parameters of different neural networks for ESD detection. The proposed DBN model achieves the highest accuracy (97.69%) and specificity (96.63%), outperforming DNN, CNN, RNN, and ANN across all metrics. The DBN also attains high precision (97.06%), recall (97.54%), and F1 score (98.35%) for indicating superior classification performance compared to the other models. CNN shows strong performance with 97.15% accuracy, while ANN has the lowest accuracy at 93.54%. The DBN increases the average accuracy range by 1.65%, 0.55%, 2.27% and 4.24% better than DNN, CNN, RNN and ANN respectively. The DBN model demonstrates a significant improvement in detecting ESD cases compared to other existing classifiers.

Table 7

Comparison of existing classification networks

NetworksAPRSF1
DNN96.0795.2595.2896.0395.23
CNN97.1595.0495.6395.2596.35
RNN95.4794.2894.8795.0495.04
ANN93.5492.1293.2492.2593.18
DBN97.6997.0697.5496.6398.35

As shown in Table 8, the testing process measured the time taken to process an input signal from the dataset to assess the accuracy of various approaches. The performance of previous methods was compared using specific criteria to ensure correct classification accuracy. The proposed Hexa-ESD framework increases the overall accuracy of 3.26%, 1.93% and 18.21% for ReliefF Algorithm [17], Extreme Gradient Boosting [18] and Enhanced Pipeline Feature Selection (EPFS) Algorithm [19] respectively. However, the prior techniques have not reached the best accuracy range when compared to the Hexa-ESD framework.

Table 8

Accuracy evaluation of existing methods and proposed method

AuthorsMethodsClassification accuracy
Alotaibi et al. [17]ReliefF Algorithm94.5%
Putatunda [18]Extreme Gradient Boosting95.8%
Rajashekar [19]EPFS algorithm79.9%
ProposedHexa-ESD framework97.69%

Conclusions

This work introduces a Hexa-ESD model to efficiently categorize the clinical skin images into various skin diseases. The DL-based DarkNet and ShuffleNet were employed to retrieve the spatial features from the pre-processed images. Then, these retrieved features were fed into the WalO algorithm by dealing with complex non-linear problems for selecting the best features. These selected features were fused for classification using a fully connected layer to detect the hexa ESD cases. The efficiency of the Hexa-ESD was assessed with some metrics like S, P, R, A and F1. From this analysis, the Hexa-ESD framework achieves a better accuracy of 97.69% due to the combination of ShuffleNet and DarkNet models. The proposed ShuffleNet rises the accurateness/accuracy in 6.48%, 2.21% and 5.35% better than AlexNet, LeNet and DenseNet respectively. Furthermore, the proposed DarkNet increases the accuracy by 10.08%, 9.0%, and 5.96% better than AlexNet, LeNet and DenseNet respectively. The Hexa-ESD framework increases the overall accuracy by 3.26%, 1.93% and 18.21% for ReliefF Algorithm, XGBoosting and EPFS algorithm respectively. The result of the Hexa-ESD framework attains precise results compared with the existing networks. The inclusion of hair in rare cases and conditions deduces the categorization accuracy of the proposed Hexa-ESD to recognize a wide range of skin disorders. In the future, we will develop a model that can provide accurate diagnosis in the presence of hairs through advanced DL models. Additionally, we plan to collaborate with domain experts to construct or use more rigorously labelled datasets to mitigate dataset inconsistencies.

Acknowledgments

The author would like to express his heartfelt gratitude to the supervisor for his guidance and unwavering support during this research for his guidance and support.

Dataset availability

Self-prepared ESD_DATASET_348: https://researchpoint-portfolio.github.io/latha/

Ethical approval

My research guide reviewed and ethically approved this manuscript for publishing in this journal.

Conflict of interest

The authors declare no conflict of interest.

References

1 

Bukhari SNH, Masoodi F, Dar MA, et al. Prediction of erythemato-squamous diseases using machine learning. In: Applications of Machine Learning and Deep Learning on Biological Data. Masoodi F, Quasim M, Bukhari S, et al. Auerbach Publications 2023; 87.

2 

Ahmed A, Ahmad H, Khurshid M, et al. Classification of Skin Disease using Machine Learning. VFAST Trans Software Engineering 2023; 11: 109-22.

3 

Afrid V, Reddy BMK. An efficient feature reduction approach for dermatology disease detection utilizing neural network approach. Int J Computing Artif Intel 2021; 2: 6-11.

4 

Kaushik B, Vijayvargiya A, Uppal J, et al. Comparative analysis of machine learning approaches for classifying erythemato-squamous skin diseases. In: International Conference on Mining Intelligence and Knowledge Exploration 2023; 67-77. https://doi.org/10.1007/978-3-031-44084-7_7

5 

Ashvanth Louison A, Sujitha B. A review on melanoma skin cancer classification using deep learning techniques. Int J Curr Biomed Engineering 2025; 3: 26-4.

6 

Pandya DD, Degadwala S, Vyas D, et al. Advancing erythemato-squamous disease classification with multi-class machine learning. 2023 7th International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC). 2023; 542-547. https://doi.org/10.1109/I-SMAC58438.2023.10290599.

7 

Akarajarasroj T, Wattanapermpool O, Sapphaphab P, et al. Feature selection in the classification of erythemato-squamous diseases using machine learning models and principal component analysis. 2023 15th Biomedical Engineering International Conference (BMEiCON) 2023; 1-5. https://doi.org/10.1109/BMEiCON60347.2023.10322034.

8 

Singh SK, Sinha A, Yadav S. Performance analysis of machine learning algorithms for erythemato-squamous diseases classification. 2022 IEEE International Conference on Distributed Computing and Electrical Circuits and Electronics (ICDCECE) 2022; 1-6. https://doi.org/10.1109/ICDCECE53908.2022.9793000.

9 

Bozok MN, Çalhan A. Diagnosis of erythemato-squamous skin diseases with machine learning algorithms. J Clin Exp Dermatol Res 2022; 13: 615.

10 

Sharma S, Sharma V. Comparison of machine learning techniques in the diagnosis of erythematous squamous disease. J Sci Res Technol 2023; 1: 1-9. https://doi.org/10.5281/zenodo.8218863.

11 

Surendiran R, Thangamani M, Narmatha C, et al. Effective autism spectrum disorder prediction to improve the clinical traits using machine learning techniques. Int J Engin Trends Technol 2022: 70: 434-59.

12 

Thanjaivadivel T, Jeeva S, Ahilan A. Real time violence detection framework for football stadium comprising of big data analysis and deep learning through bidirectional LSTM. Computer Networks 2019; 151: 191-200.

13 

Prasanth A, Muthukumaran N. Primary open-angle glaucoma severity prediction using deep learning technique. Int J Curr Biomed Engineering 2023; 1: 30-7.

14 

Nithya A, Appathurai A, Venkatadri N, et al. Kidney disease detection and segmentation using artificial neural network and multi-kernel k-means clustering for ultrasound images. Measurement 2020; 149: 106952.

15 

Shastri S, Kour P, Kumar S, et al. GBoost: a novel Grading-AdaBoost ensemble approach for automatic identification of erythemato-squamous disease. Int J Inf Technol 2021; 13: 959-71.

16 

Elsayad AM, Nassef AM, Al-Dhaifallah M. Bayesian optimization of multiclass SVM for efficient diagnosis of erythemato-squamous diseases. Biomed Signal Process Control 2022; 71: 103223.

17 

Alotaibi AS. Hybrid model based on ReliefF algorithm and K-nearest neighbour for erythemato-squamous diseases forecasting. Arabian J Sci Eng 2022; 47: 1299-307.

18 

Putatunda S. A hybrid deep learning approach for diagnosis of the erythemato-squamous disease. 2020 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT) 2020; 1-6. https://doi.org/10.1109/CONECCT50063.2020.9198447.

19 

Deva RS. EPFSESD: An Enhanced Pipeline Feature Selection Algorithm for Erythemato-Squamous Disease Detection. Des Eng 2021; 2373-88.

20 

Igodan EC, Obe OO, Thompson AFB, et al. Prediction of erythemato squamous-disease using ensemble learning framework. In: Explainable Artificial Intelligence in Medical Decision Support Systems. Imoize AL, Hemanth J, Do DT, Sur SN (eds.). 2022; 197-228.

21 

Badrinath N, Gopinath G, Ravichandran KS, et al. Classification and prediction of erythemato-squamous diseases through tensor-based learning. Proc Natl Acad Sci India A Phys Sci 2020; 90: 327-35.

22 

Chen HC, Widodo AM, Wisnujati A, et al. AlexNet convolutional neural network for disease detection and classification of tomato leaf. Electron 2022; 11: 951.

23 

Verma R, Singh V. Leaf disease identification using DenseNet. In: International conference on artificial intelligence and speech technology 2021; 500-11. https://doi.org/10.1007/978-3-030-95711-7_42.

24 

Mahmoud S, Gaber M, Farouk G, et al. Heart disease prediction using modified version of LeNet-5 model. Int J Intell Syst Appl 2022; 14: 1-12.

25 

Dakshina DS, Della Reasa Valiaveetil, Bindhu A. Alzheimer disease detection via deep learning-based shuffle network. Int J Current Biomed Engineering 2023; 1: 9-15.

26 

Rajkumar R, Gopalakrishnan S, Praveena K, et al. Darknet-53 convolutional neural network-based image processing for breast cancer detection. Mesopotam J Artif Intel Healthcare 2024; 2024: 59-68.

27 

Abedini Z, Jamzad M. Weight-based colour constancy using contrast stretching. IET Image Processing 2021; 15: 2424-40.

28 

Somal S. Image enhancement using local and global histogram equalization technique and their comparison. In: First International Conference on Sustainable Technologies for Computational Intelligence: Proceedings of ICTSCI 2019 2020; 739-753.

29 

Tesfai H, Saleh H, Al-Qutayri M, et al. Lightweight shufflenet based cnn for arrhythmia classification. IEEE Access 2022; 10: 111842-54.

30 

Trojovský P, Dehghani M. A new bio-inspired metaheuristic algorithm for solving optimization problems based on walruses behavior. Sci Rep 2023; 13: 8775.

Copyright: © 2026 Termedia Sp. z o. o. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) License (http://creativecommons.org/licenses/by-nc-sa/4.0/), allowing third parties to copy and redistribute the material in any medium or format and to remix, transform, and build upon the material, provided the original work is properly cited and states its license.
Share
without publication fees