Improving Lung and Colon Cancer Detection Using Ensemble Method Approach

Proceedings of the 18th INDIACom; INDIACom-2024; IEEE Conference ID: 61295
2024 11 International Conference on “Computing for Sustainable Global Development”, 28th Feb. – 1st Mar., 2024
th
Bharati Vidyapeeth's Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA)
Improving Lung and Colon Cancer Detection using

Ensemble Method Approach
Jessica Singh Syal Achin Jain Arun Kumar Dubey
Dept. of Information Technology Dept. of Information Technology Dept. of Information Technology
Bharati Vidyapeeth’s College of Bharati Vidyapeeth’s College of Bharati Vidyapeeth’s College of
Engineering Engineering Engineering
New Delhi, India New Delhi, India New Delhi, India
jessicasinghsyalwork@gmail.com achin.mails@gmail.com arudubey@gmail.com
Vanita Jain
Department of Electronic Science
University of Delhi
New Delhi, India
vjain@electronics.du.ac.in
Abstract— Cancer is recognised to represent an extremely I. INTRODUCTION

high risk of mortality, despite enormous developments having
Cancer stands to be the second most prevalent contributor
been made in science and medicine. Characterized by widespread
metastases, malignant cells spread rapidly and evade drugs, to mortality across the globe. In the year 2020 alone, a
making it a fatal disease with little treatment success. Cancer cells staggering 19 million [1] new cases were registered, and an
have a heterogeneous nature that makes them resistant to unfortunate albeit huge number of 9.95 million fatalities
chemotherapy and other forms of radiation. Across the globe, incurred overall. The root cause of this formidable disease is
cancer stands to be the second most leading cause of death. the unrestricted growth and aggressive replication of the
Among the many types, lung and colon cancer are the most damaged cells, thereby resulting in the formation of tumors,
common and have the highest mortality rate. Early and accurate which may manifest as either malignant or benign. One of the
detection of tumor cells in lung and colon cancer patients can help major factors that amplifies the general susceptibility to cancer
the medical industry increase patient survival statistics. This is genetic inheritance and underestimating the importance of
study focuses on improving the current state of technology regular and consistent medical checkups to facilitate early
assisted lung and colon cancer detection. A large dataset of 25,000 detection.
histopathological photographs of lung and colon tissues is
analyzed to build a Deep-learning model using the Ensemble However, the burgeoning challenge in the battle against
Method approach for accurate and reliable cancer detection. To this disease lies in the mounting high costs associated with
increase efficiency, the photos are divided into a total of five detection systems, a situation which is even more staggering
different classes. The methodology underlying the study aims to when it comes to the low and middle-income countries, which
increase the detection accuracy by building a model which learns is the sector where the majority of cancer-related deaths occur.
from pre-existing models in the field; thus displaying superiority It is in these socio-economic strata where the majority of the
in terms of predictive power. The core concept of transfer fatalities are reported due to lung and colon cancer [2], further
learning is used to leverage the knowledge of pre-trained models accentuating the need for accessible as well as cost-efficient
and create better and improved ensemble models. The study screening models to reduce the devastating impact of this
includes comprehensive data preprocessing, augmentation, model ailment.
training, validation and testing, and model performance
evaluation. With a high accuracy of 0.96, this model achieved high To address this daunting issue, modern-day technologies
reliability in detecting cancer cells. This effort holds the potential like Machine Learning (ML) and Deep Learning (DL) have
to improve cancer diagnosis through efficient and accurate been applied in pathology [3] for the primary purpose of
classification of medical images. Using pre-trained models is an diagnoses and detection of diseases as well as the development
efficient and effective approach to reduce the time and resources of a smart and reliable prescription system. DL models [4],
required to develop high-accuracy models. including pre-trained architectures [5] like EfficientNetB7,
PretrainedModel2, InceptionV3, DenseNet201, and ResNet50,
Keywords— Classification Models, Lung Disease, Colon cancer, are used in this research to classify biomedical images. To
Lung and Colon Disease, Histopathological Images, Deep enhance the productivity of the project, we employed an
Learning, Transfer Learning, Machine Learning. ensemble model [6] to improve the accuracy of an initially
Copyright © INDIACom-2024; ISBN: 978-93-80544-51-9 1770

th
low-performing model by using the averaging method; we colon cancer nucleus detection. Shein et al. [11] developed an
combined the predictions of multiple individual models into a architecture based on ML without segmentation or feature
single, more robust predictive model. This ensemble approach extraction methods, achieving 87.14% accuracy for lung
helped mitigate the limitations of the individual models and nodule classification. Three deep structured algorithms,
leveraged their collective knowledge to achieve higher including CNN, achieved 89% accuracy for lung nodule
accuracy. The resulting ensemble model demonstrated feature extraction [12]. Selvanambi et al. [13] achieved 98%
improved performance and enhanced the overall accuracy of accuracy for lung cancer detection using RNN with DLS and
our classification system for cancer detection. glowworm swarm optimization. Filho et al. [14] made use of
segmented imaging and CNN for processing the lung nodule
The study comprises a meticulous and comprehensive CT scans and achieved 92.6% accuracy for classification.
approach to data preprocessing, involving the fine and Yuan et al. [15] applied CNN with preprocessing techniques
complex task of refinement as well as the organization of vast such as edge detection and intensity adjustments, obtaining
amounts of data to extract the pertinent features by deploying 91.4% accuracy for polyp detection in colonoscopy videos. In
appropriate feature engineering techniques. This foundational CT images, benign and malignant lung nodules were classified
phase lays the base for the development of the subsequent with 92.8% accuracy using RestNet50 and SVM with RBF
stages, while at the same time ensuring the quality, accuracy,
kernel [16].
and relevance of the information which is fed into the system.
This research dives into the intricate depths of model training, The DFCNet model, which is based on deep CNN, was
where a wide array of sophisticated algorithms and models are proposed by Masood et al. [17] and achieved 84.5% accuracy
employed to allow the system to be able to discern and in pulmonary nodule classification. Faster R-CNN was
differentiate between the complex patterns and relationships employed by Mo et al. [18] to detect polyps in colonoscopy
within the data. recordings, with an average accuracy of 98.5% across four
datasets. With 96.4% accuracy, Urban et al. [19] created deep
Model Validation is an indispensable part of the process of CNNs for polyp identification in colonoscopy pictures.
development. It involves putting the trained model through a Binarized weights were applied to reduce network size [20],
rigorous and well-rounded assessment against different achieving 90.28% accuracy for colonoscopy frame
datasets. This is a crucial step that provides the necessary classification. A combined learning comprehensive neural
guarantee related to the accuracy and efficiency of the system. network optimized with AdaBoost obtained 98.42% accuracy
The evaluation metrics employed in the research ensure in recognising normal and abnormal lung shapes after wolf
precision and provide the model with a nuanced understanding heuristic features [21] were chosen to minimize
of the different features of the disease in the human body. dimensionality.
The integration of Deep Learning (DL) and Machine An eight-layer CNN architecture was suggested by Suresh
Learning (ML) [7] methods and the careful examination and and Mohan [22] to categorize CT images of lung lesions into
evaluation of pre-trained models are two of this study's most three categories. They used generative adversarial networks for
unique aspects. This methodology allows us to identify and both data augmentation and picture segmentation. With 93.9%
evade the loopholes and cons of an existing model and only classification accuracy, the model performed well. Masud et al.
include the positives of a given model under consideration. [23] introduced a lightweight CNN approach for pulmonary
The project involves rigorous data preprocessing, model nodule detection. Their model, consisting of four convolutional
training, validation, and testing, with evaluation based on layers, demonstrated a high accuracy of 97.9% and was
accuracy, precision, and loss curves. By leveraging DL and suitable for real-time CT image analysis. A pre-processing
pre-trained models, this research aims to enhance cancer technique [24] preserving image brightness and reducing noise
diagnosis [8] and treatment. This model successfully was used for lung cancer CT scans. An improved neural
revolutionizes the detection of lung and colon cancer, assisting network performed region segmentation and feature extraction,
in the process of early diagnosis, identification, and subsequent followed by an ensemble classifier for classification, achieving
treatment. Our work establishes a solid foundation for the use an accuracy of 96.2%. Pre-trained CNNs (ResNet-50, ResNet-
of DL and ML [9] models in the treatment of lung and colon 34, and ResNet-18) were used by Bukhari et al. [25] to assess
cancer in the healthcare sector.
colonic cancer histopathology pictures. Their ensemble
II. RELATED WORK approach achieved an accuracy of 96.4%.With a 97.8%
classification accuracy, Mangal et al. [26] used a shallow
Studies on malicious Cancer detection by making use of neural network to classify digital pathology pictures of lung
the signature sets have been put under thorough investigation and colon squamous cell carcinoma and adenocarcinoma. In
and utilized for a lengthy time in the past. Most of this research order to classify lung cancer [27]histopathology pictures,
often uses lists of recognized malicious Cancers. As soon as Hatuwal and Thapa [28] used a CNN [29], achieving accuracy
the module encounters a new cancer, a database query is of 96.11% in training and 97.2% in validation.
initiated. If the cancer is found to be blacklisted, it is then
regarded as malicious, and a warning is generated. Yamini et al.[30] created a unique machine learning model
that analyzes lung cancer datasets with the use of Gradient
Sirinukunwattana et al. [10] proposed a spatially Boosting, KNN, LR, DT, RF, SVM, and XGB classifier
constrained neural network achieving 97.1% accuracy for models. An automated system-based classifier was explored

Improving Lung and Colon Cancer Detection using Ensemble Method Approach
and developed by Bishnoi et al. [31] which further elaborated models. Augmentation techniques were applied to increase the
upon a different approach to malignancy detection. In a recent dataset size and enhance model generalization. A sample of
study conducted by Gayap et al. [32] Deep Learning was the dataset used to train the model is shown in Table 1,
applied to diagnose lung cancer which demonstrated high which lists the main characteristics of the tissue under
potential and precision. Numerous approaches have been analysis and its classification into five distinct categories.
investigated by researchers, such as convolutional neural
networks with preprocessing techniques, recurrent neural
networks with optimization algorithms, and spatially restricted
neural networks. Impressive classification accuracies have
been achieved by these methods for identifying malignant
tumors in medical imaging data, including CT scans and
pathology pictures.
In summary, the combined findings demonstrate the critical
role that DL and ML play in the development of cancer
detection methods. By harnessing the power of computer
algorithms, researchers are making enormous strides towards
more accurate, efficient, and readily available diagnostic tools
for the battle against cancer.
III. PROPOSED SYSTEM
The primary objective is the seamless integration and
assimilation of pre-trained architectural frameworks, combined
with an efficient and meticulous data pre-processing pipeline,
specialized model training, validation, and exhaustive
protocols to perform testing. The augmentation that acts as the
pivot to this comprehensive framework lies in the strategic
incorporation of an ensemble model, which is an advanced
amalgamation of different individual models, designed Fig. 1. Flowchart of deployed Methodology
particularly to increase the accuracy and reliability of the
cancer detection system. The core of this endeavor revolves TABLE I. DESCRIPTION OF THE EMPLOYED DATASET
around the prediction of malignant cells in the lungs and colon, Image Type Class ID Class Title Total Images
distinguishing between them with a high level of accuracy.
Colon Adenocarcinoma 0 Colon_aca 5000
The progression through the following structured steps is what
Colon Benign 1 Colon_n 5000
forms the backbone of the approach followed in this research:
Lung Adenocarcinoma 2 Lung_aca 5000
 Data Preprocessing Lung Benign 3 Lung_n 5000
 Model Selection and training. Lung Squamous Cell
4 Lung_scc 5000
Carcinoma
 Validation and Hyper-parameter Tuning
The histopathological pictures, of the distinct tissue types
 Performance Evaluation that the model analyzes to classify the cancer, are shown in
Fig. 2.
 Ensemble Model
 Testing and Evaluation
The methodological approach used to create the ensemble
model for this research project is depicted in Fig. 1.
A. Data Preprocessing
1) Data cleaning, normalization, and augmentation:
Inconsistencies in data including noise, outliers etc, were
handled efficiently to ensure a well-rounded data cleaning
process, which formed the backbone of our model. Data
normalization was performed to ensure consistency in the data,
which then led to improvement in the performance of the
model. A meticulous procedure was employed to conduct the
refinement and structuring of the raw data to retrieve the Fig. 2. Histopathological images [33] from dataset: (a) Lung
essential features. This process ensured the integrity of the Adenocarcinoma. (b) Lung benign. (c) Lung squamous cell. (d) Colon
dataset for model training being conducted in subsequent adenocarcinoma. (e) Colon benign

th
B. Model Selection and Training

1) Pre-trained architectures:
EfficientNetB7, PretrainedModel2, InceptionV3,
DenseNet201, and ResNet50 were chosen as the base models
due to their proven effectiveness in image classification tasks.
These models have been pre-trained on large-scale datasets and
have learned rich representations of various features.
2) Transfer learning:
The pre-trained models were used as the starting point, and
their weights were fine-tuned specifically for the task of cancer
detection. This approach enabled the developed ensemble
model to leverage the knowledge gained from the pre-training,
which led to faster convergence and improved performance.
C. Validation & Hyperparameter Tuning
The dataset was divided into subgroups for the purpose of
training and validation. The training subset was where the
models were trained, and the validation subset was where the
models were assessed and improved iteratively.
The learning rate, batch size, and regularization strength
were among the hyperparameters that were modified to find
the ideal configuration that maximized the models'
performance on the validation set. Model generalization was Fig. 3. Architecture of Ensemble Model
improved by this process, which prevented overfitting as well
as underfitting. IV. EXPERIMENTATION AND RESULTS
D. Performance Evaluation Evaluation Metrics that have been inculcated in this study
A variety of metrics, including loss curves, accuracy, and are as follows: Accuracy, Recall, F-1 Score.
precision, were used to assess how well the trained model Leveraging the collective intelligence extracted from
functioned. Loss curves showed the convergence and stability earlier models, the ensemble model was methodically created.
of the training process, accuracy gauged the overall correctness With a well-chosen dataset of twenty-five thousand carefully
of the predictions, and precision evaluated the model's capacity selected photos, the model was rigorously trained. The dataset
to accurately identify cancer patients. was carefully cleaned and preprocessed before training to
E. Ensemble Model guarantee that it was of the best quality possible for the model
to absorb. By using data augmentation approaches, the
Ensemble construction methods such as averaging or preparation phase's sophistication was increased and the
voting, were employed to combine the predictions of dataset’s efficiency was improved. After the training process
individual models. These methods helped to mitigate the concluded, the ensemble model demonstrated high recall and
Limitations of individual models and make more robust F-1 scores of 0.94 and 0.95, respectively, and excellent
predictions. Fig. 3 represents the structure of the Ensemble accuracy of 0.96. These measurements, however, were
model with distinct input layers that were put under analysis to representative of the model's performance in its unaltered
improve model performance. condition. Even more encouraging outcomes from further
F. Testing and Evaluation augmentation operations were obtained, with final accuracy,
recall, and F-1 scores coming in at 0.98, 0.97, and 0.96,
1) Independent Dataset respectively.
The developed models were tested on a separate and
independent dataset that was not used during training or Fig.4 depicts a flowchart that represents the foundational
validation. This ensured the models’ ability to generalize new development of the ensemble model using the existing, pre-
and unseen data. trained architectures.
The Assessment of the models involved evaluating their
2) Evaluation Metrics
performance on the testing dataset, which is a critical phase of
The models' performance on the testing dataset was development where their efficiency was scrutinized using
evaluated using various metrics, such as accuracy, precision, wide-ranging metrics like accuracy, precision, and other
and other relevant measures. These evaluations provided pertinent measures. The application of a Wide variety of
insights into the models' accuracy and reliability in detecting metrics to assess the quality of the model proved to be a
cancerous patterns in real-world scenarios. beneficial and successful technique to build an advantageous
system that allowed it to make crucial decisions with a small

number of false positives, specifically in an industry as

intricate as healthcare and medicine. Such evaluations turned
out to be indispensable in affirming the theoretical capabilities
of the system as well as in analyzing their hands-on practical
utility and reliability. These measures were taken to ensure that
the model can provide real-world solutions to the rising
problem of cancer detection and treatment with a high degree
of sophistication and precision in the outcome.
Fig. 5. Comparative validation accuracy curve
Fig. 4. Flowchart representing development of Ensemble Model
TABLE II. COMPARATIVE ANALYSIS OF MODELS

Augmentation
Model Accuracy Recall F1-Score
Parameter
Raw 0.97 0.96 0.95
EfficientNet
Augmented 0.98 0.97 0.98
Raw 0.95 0.93 0.94
VGG16
Augmented 0.96 0.95 0.95
Raw 0.94 0.93 0.93
Inception V3
Augmented 0.96 0.95 0.95
Raw 0.95 0.93 0.94
DenseNet20
Augmented 0.96 0.95 0.95
Raw 0.95 0.94 0.94
ResNet50
Augmented 0.96 0.95 0.95
Ensemble Raw 0.96 0.94 0.95
Model Fig. 6. Comparative validation losses curve
Augmented 0.98 0.97 0.96
Table II shows the ensemble model and other pre-trained Fig. 6 is a graph representing the validation loss curve of
architectures' comparative performance metrics. Each model's all the models considered in the study along with the curve of
efficacy is displayed in the table using its accuracy, recall, and the developed Ensemble Model. The loss curve depicts the
F-1 score. It is noteworthy that the ensemble model performs evolution of the validation loss incurred over successive
better than the individual architectures in every category, with iterations of the model.
improved metrics. These are the respective comparative validation accuracy
Fig. 5 represents the Comparative Validation Accuracy and loss curves which help assess the ability of the model to
curve representing the higher accuracy of the Ensemble Model generalize and make accurate predictions on unseen data; and
which was developed in this study, as compared to the other to provide insights into the effectiveness of the model's
pre-existing models. learning and optimization.

th
This empirical progression highlights the ensemble model's [2] Sloan, A. Frank and Hellen Gelband. "The cancer burden in low-and
clear advantage. Outlining the experimental method that served middle-income countries and how it is measured," In Cancer control
opportunities in low-and middle-income countries. National Academies
as the foundation for the examination of certain model Press (US), 2007.
performance indicators and the augmentation tactics that [3] S. Das, S. Biswas, A. Paul and A. Dey, "AI Doctor: An intelligent
followed is crucial. Every component model was carefully approach for medical diagnosis" in Industry Interactive Innovations in
assessed, examining its innate advantages and disadvantages. Science Engineering and Technology, Singapore:Springer, pp. 173-183,
Making use of the collective knowledge gained from the 2018.
various models, the final ensemble model was carefully [4] M. Dildar, S. Akram, M. Irfan et al. "Skin cancer detection: a review
constructed, combining the advantageous aspects of its using deep learning techniques," International journal of environmental
research and public health 18, no. 10 (2021): 5479.
predecessors. Consequently, a thorough process of empirical
[5] S. Garg and S. Garg “Prediction of lung and colon cancer through
research, iterative refinement, and strategic combination of analysis of histopathological images by utilizing Pre-trained CNN
model features supported the ensemble model's developmental models with visualization of class activation and saliency maps,” In
trajectory in this work and resulted in a strong and superior Proceedings of the 3rd Artificial Intelligence and Cloud Computing
model paradigm. Conference, 2020, pp. 38-45.
[6] O. Singh and K. K. Singh, "An approach to classify lung and colon
V. CONCLUSION cancer of histopathology images using deep feature extraction and an
ensemble method," International Journal of Information Technology 15,
The main aim of this study was to detect lung and colon no. 8 (2023): 4149-4160.
cancer accurately, and efficiently using a reliable and robust [7] R. S. Yadav, “Data analysis of COVID-2019 epidemic using machine
model that has been trained using a vast dataset and provides learning methods: a case study of India,” International Journal of
precise results on real-world data. Transfer learning was Information Technology 12, no. 4, pp. 1321-1330, 2020.
employed to fulfill this purpose and detect cancer in the [8] T. Babu, D. Gupta, T. Singh and S. Hameed, "Colon cancer prediction on
patients by training the model on a dataset of 25,000 different magnified colon biopsy images", Proc. 10th Int. Conf. Adv.
histopathology images of lung and colon cancer tissues. The Comput. (ICoAC), pp. 277-280, Dec. 2018.
project involved a comprehensive comparative analysis of the [9] M. Masud, N. Sikder, A.-A. Nahid, A. K. Bairagi and M. A. AlZain, "A
machine learning approach to diagnosing lung and colon cancer using a
various cancer detection models using an ensemble approach deep learning-based classification framework", Sensors, vol. 21, no. 3,
and pre-trained architectures, namely EfficientNetB7, pp. 748, Jan. 2021.
PretrainedModel2, InceptionV3, DenseNet201, and ResNet50. [10] K. Sirinukunwattana, S. E. A. Raza, Y.-W. Tsang, D. R. J. Snead, I. A.
The objective was to enhance the overall performance and Cree and N. M. Rajpoot, "Locality sensitive deep learning for detection
improve the metrics of model testing which were deployed to and classification of nuclei in routine colon cancer histology images",
ensure a sophisticated and highly precise system of cancer IEEE Trans. Med. Imag., vol. 35, no. 5, pp. 1196-1206, May 2016.
detection. We leveraged the qualities and collective knowledge [11] S. Mehmood, T. M. Ghazal, M. A. Khan, M. Zubair, M. T. Naseem, T.
Faiz and Munir Ahmad. "Malignancy detection in lung and colon
of multiple models that have been developed in the past for the histopathology images using transfer learning with class selective image
same or similar purposes. After doing thorough testing and processing," IEEE Access, pp. 25657-25668, 2020.
analysis, we found that the ensemble model performed better [12] Y. Su, D. Li and X. Chen, "Lung nodule detection based on faster R-
than the individual model. The ensemble approach allowed us CNN framework," Computer Methods and Programs in Biomedicine,
to leverage the strengths of each pre-trained architecture and 2021
mitigate their limitations, resulting in enhanced accuracy and [13] S. Ramani, J. Natarajan, M. Karuppiah, S. K. H. Islam, M. M. Hassan,
robust predictions. The developed Ensemble Model had a high and G. Fortino, "Lung cancer prediction using higher-order recurrent
accuracy of 0.98 which was superior to the other models neural network based on glowworm swarm optimization," Neural
Computing and Applications, vol. 32, pp. 4373-4386, 2020.
assessed. A highly precise result was obtained for the real-time
[14] N. Da, R. V. Medeiros, S. A. Peixoto, S. P. P. da Silva and P. P. R. Filho,
data fed into the model with a recall value of 0.96. Our "Lung nodule classification via deep transfer learning in CT lung
comparative analysis involved various metrics, including images," In 2018 IEEE 31st International Symposium on Computer-
accuracy and loss curves, to evaluate the performance of the based Medical Systems (CBMS), 2018, pp. 244-249.
models. These metrics provided valuable insights into the [15] S.-B. Zhao, W. Yang, S.-L. Wang et al., "Establishment and validation of
effectiveness of each model and helped us make informed a computer-assisted colonic polyp localization system based on deep
decisions during the ensemble model construction. learning," World Journal of Gastroenterology, vol. 27, no. 31, 2021.
[16] G. Zhang, Z. Yang, L. Gong, S. Jiang and L. Wang, "Classification of
The conclusions and discoveries made in this research benign and malignant lung nodules from CT images based on hybrid
initiative enhance machine-learning approaches in pathology features," Physics in Medicine & Biology, vol. 64, no. 12, 2019.
and have the potential to improve illness diagnostics and [17] A. Masood, B. Sheng, P. Li, X. Hou, X. Wei, J. Qin and D. Feng.
intelligent prescribing programs. The research route in "Computer-assisted decision support system in pulmonary cancer
detection and stage classification on CT images," Journal of biomedical
question presents significant potential for enhancing the informatics, vol. 79, pp. 117-128, 2018.
model's outlier tolerance and delving deeper into the [18] M. Xi, K. Tao, Q. Wang and G. Wang, "An efficient approach for polyps
Convolutional Neural Network (CNN) as a potential solution. detection in endoscopic videos based on faster R-CNN," In 2018 24th
international conference on pattern recognition (ICPR), 2018, pp. 3929-
REFERENCES 3934.
[1] International Agency for Research on Cancer, Oct. 2021, [online] [19] U. Gregor, P. Tripathi, T. Alkayali, M. Mittal, F. Jalali, W. Karnes and P.
Available: https://gco.iarc.fr/today/data/factsheets/ populations/ 900- Baldi, “Deep learning localizes and identifies polyps in real time with
world-fact-sheets.pdf. 96% accuracy in screening colonoscopy,” Gastroenterology, vol. 155, no.
4, pp. 1069-1078, 2018.

[20] A. Mojtaba, M. Mohrekesh, S. Rafiei, S. M. R. Soroushmehr, N. Karimi,

S. Samavi and K. Najarian, "Classification of informative frames in
colonoscopy videos using convolutional neural networks with binarized
weights." In 2018 40th annual international conference of the IEEE
engineering in medicine and biology society (EMBC), 2018, pp. 65-68.
IEEE.
[21] S. P. Mohamed, A. Tolba, Z. Al-Makhadmeh and M. M. Jaber,
"Automatic detection of lung cancer from biomedical data set using
discrete AdaBoost optimized ensemble learning generalized neural
networks," Neural Computing and Applications, vol. 32, pp. 777-790,
2020.
[22] Suresh, Supriya and S. Mohan, “NROI based feature learning for
automated tumor stage classification of pulmonary lung nodules using
deep convolutional neural networks," Journal of King Saud University-
Computer and Information Sciences, vol. 34, no. 5, pp. 1706-1717,
2022.
[23] M. Mehedi,"A light-weight convolutional Neural Network Architecture
for classification of COVID-19 chest X-Ray images," Multimedia
Systems, vol. 28, no. 4, pp. 1165-1174, 2022.
[24] M. Suren, P. W. C. Prasad, A. Alsadoon, A. K. Singh and A. Elchouemi,
"Lung cancer detection using CT scan images," Procedia Computer
Science, vol. 125, pp. 107-114, 2018.
[25] S. U. K. Bukhari and S. S. K. Bukhari, A. Syed and S. S. H. Shah, "The
diagnostic evaluation of Convolutional Neural Network (CNN) for the
assessment of chest X-ray of patients infected with COVID-19,"
MedRxiv, 2020.
[26] S. Mangal, A. Chaurasia and A. Khajanchi, "Convolution neural
networks for diagnosing colon and lung cancer histopathological
images." arXiv preprint arXiv:2009.03878, 2020.
[27] N. Coudray, P. S. Ocampo, T. Sakellaropoulos et al., "Classification and
mutation prediction from non–small cell lung cancer histopathology
images using deep learning," Nature Medicine, vol. 24, no. 10, pp. 1559-
1567, 2018.
[28] B. K. Hatuwal and H. C. Thapa, "Lung cancer detection using
convolutional neural networks on histopathological images," Int. J.
Comput. Trends Technol, vol. 68, no. 10, pp. 21-24, 2020.
[29] G. Ahmed, A. A. Lawaye, “CNN-based speech segments endpoints
detection framework using short-time signal energy features,” Int. j. inf.
Tecnol., vol. 15, pp. 4179–4191, 2023.
[30] B. Yamini, K. Sudha, M. Nalini, G. Kavitha, R. S. Subramanian and R.
Sugumar, "Predictive Modelling for Lung Cancer Detection using
Machine Learning Techniques," 2023 8th International Conference on
Communication and Electronics Systems (ICCES), Coimbatore, India,
2023, pp. 1220-1226.
[31] V. Bishnoi, N. Goel and A. Tayal, "Automated system-based
classification of lung cancer using machine learning," International
Journal of Medical Engineering and Informatics, vol. 15, no. 5, pp. 403-
415, 2023.
[32] H. T. Gayap and M. A. Akhloufi, "Deep Machine Learning for Medical
Diagnosis, Application to Lung Cancer Detection: A Review,"
BioMedInformatics, vol. 4, no. 1, pp. 236-284, 2014.
[33] M. Šarić, M. Russo, M. Stella and M. Sikora, "CNN-based method for
lung cancer detection in whole slide histopathology images," In 2019 4th
International Conference on Smart and Sustainable Technologies
(SpliTech), 2019, pp. 1-4.

Improving Lung and Colon Cancer Detection Using Ensemble Method Approach

Uploaded by

Copyright:

Available Formats

Improving Lung and Colon Cancer Detection Using Ensemble Method Approach

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Improving Lung and Colon Cancer Detection Using Ensemble Method Approach

Uploaded by

Copyright:

Available Formats

Proceedings of the 18th INDIACom; INDIACom-2024; IEEE Conference ID: 61295

Improving Lung and Colon Cancer Detection using

Abstract— Cancer is recognised to represent an extremely I. INTRODUCTION

Copyright © INDIACom-2024; ISBN: 978-93-80544-51-9 1770

Copyright © INDIACom-2024; ISBN: 978-93-80544-51-9 1771

Copyright © INDIACom-2024; ISBN: 978-93-80544-51-9 1772

B. Model Selection and Training

Copyright © INDIACom-2024; ISBN: 978-93-80544-51-9 1773

number of false positives, specifically in an industry as

Fig. 5. Comparative validation accuracy curve

Fig. 4. Flowchart representing development of Ensemble Model

TABLE II. COMPARATIVE ANALYSIS OF MODELS

Copyright © INDIACom-2024; ISBN: 978-93-80544-51-9 1774

Copyright © INDIACom-2024; ISBN: 978-93-80544-51-9 1775

[20] A. Mojtaba, M. Mohrekesh, S. Rafiei, S. M. R. Soroushmehr, N. Karimi,

Copyright © INDIACom-2024; ISBN: 978-93-80544-51-9 1776

You might also like