ARCHIVES
Cancer Patient Identification using Machine Learning and Clustering
¹ ² Department of Computer Science & Engineering, Birla Institute of Technology, Patna Campus, Bihar, India.
Published Online: March-April 2026
Pages: 274-279
Cite this article
↗ https://www.doi.org/10.59256/ijire.20260702033Abstract
View PDFSince cancer is one of the main causes of mortality worldwide, risk assessment and early detection are crucial. A machine learning-based method for identifying cancer patients using both clustering and classification techniques is presented in this paper. For analysis, a sizable dataset comprising more than 50,000 patient records with clinical, lifestyle, and demographic characteristics was used. Managing missing values, encoding categorical variables, and getting the dataset ready for modeling were all part of the data preprocessing step. Patients were divided into low, medium, and high risk groups using K-Means clustering. The models' prediction power was significantly improved by using these risk groupings. Several machine learning techniques were developed and assessed, such as Support Vector Machine (SVM), Decision Tree, Random Forest, and XGBoost. The models were evaluated using ROC analysis, confusion matrix, and typical assessment measures like precision, recall, and F1-score. Additionally, a prediction system was created that enables users to enter patient information and obtain probability estimation and cancer risk. The suggested method shows how well clustering and classification techniques can be combined for healthcare prediction and decision assistance.
Related Articles
2026
AI-Based Stomach Cancer Detection Using Biomarkers, Medical Images, and Voice Analysis
2026
Hydrogen-Efficient Eco-Driving and Route Planning for Fuel-Cell Electric Vehicles Using Multi-Objective Optimization Under Traffic and Terrain Uncertainty
2026
A Data-Driven Machine Learning Framework for Assessing Patent Commercial Value and Technological Significance
2026
Soft Computing Approaches for Robust Analysis of Imbalanced and Noisy Data
2026
Smart Attendance System Using Face Recognition and Gaze-Based Attention Monitoring
2026
Analyzing Customer Review Sentiments using Machine Learning
2026
Agentic Artificial Intelligence as a Strategic HR Partner: Redefining Decision-Making Authority and Strategic Roles
2026
Solid Waste Management Rules, 2026 (India): Regulatory Design Review and Environmental Benefits for Urban Sustainability
2026
Optimizing Hospital Resource Utilization Using Power BI Analytics
2026
Contribution of Machine and Deep Learning methodologies in the identification of counterfeit currency notes


