top of page
Clouds in the Sky
1663318891663.jpg

Yash Karnani

Jr. Research Engineer

Machine learning approaches to identify Parkinson's disease using voice signal features

Published on: July 03, 2025

Parkinson’s Disease (PD) is a progressive neurodegenerative disorder characterized by motor and non-motor symptoms, including subtle changes in speech. Traditionally, PD diagnosis relies on motor symptom observation or imaging techniques, which often occur after significant neuronal damage and can be expensive or inaccessible. Voice alterations, such as reduced pitch variation and vocal tremor, tend to appear earlier and are measurable through acoustic features. This study investigates the potential of using machine learning models trained on voice signal features to classify PD and distinguish it from healthy controls, with the goal of developing a non-invasive, cost-effective screening tool. Methodology The authors used a publicly available dataset from the UCI Machine Learning Repository consisting of voice recordings from 31 individuals (23 with PD and 8 healthy controls). Each participant produced a 36-second sustained vowel, from which 24 acoustic features were extracted. The data was cleaned and normalized using StandardScaler. Initially, feature selection via SelectKBest was considered, but all features were ultimately retained due to minimal redundancy. The dataset was split 70/30 for training and testing. To address class imbalance, SMOTE (Synthetic Minority Oversampling Technique) was applied to the training set. Five machine learning models—K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and Multi-layer Perceptron (MLP)—were trained and optimized using GridSearchCV. Results Without SMOTE, models showed moderate accuracy and poor balance in precision/recall, especially for the minority healthy class. After applying SMOTE, all models demonstrated improved performance. KNN and SVM achieved accuracies of 88% and 95%, respectively, while MLP performed best with 98.3% accuracy and 100% precision. The MLP used a simple architecture: one hidden layer with 16 neurons, ReLU activation, and the LBFGS optimizer. These results validated the hypothesis that class balancing and hyperparameter tuning significantly enhance classification accuracy. Conclusion The study concludes that voice signal features can be effectively used for PD classification through machine learning. By addressing dataset imbalance and optimizing model parameters, the authors achieved state-of-the-art performance with a simple neural network. The findings emphasize the feasibility of non-invasive, voice-based diagnostic tools for early PD detection. Impact of Research This research offers a promising pathway toward accessible and scalable PD screening solutions. Its implications extend to remote and underserved regions where traditional diagnostics are limited. The use of open datasets and interpretable models enhances reproducibility and future validation. Looking forward, integrating voice analysis with other modalities—such as handwriting, gait, and facial movement—could lead to robust multimodal diagnostic platforms powered by AI. This work represents an important step toward digital biomarkers in neurology.

Predicting postoperative gastric cancer prognosis based on inflammatory factors and machine learning technology

Published on: June 19, 2025

This study developed machine learning-based models to predict postoperative mortality in patients with gastric cancer, utilizing clinical and inflammatory markers. Six algorithms were tested, with Random Forest and Logistic Regression showing the best performance. Results highlighted that Neutrophil-Lymphocyte Ratio (NLR) and Platelet-Lymphocyte Ratio (PLR) were stronger predictors than tumor size or stage. These findings emphasize the prognostic power of inflammatory markers. The models provide a cost-effective and scalable approach to enhance personalized treatment and facilitate real-time clinical decision-making.

Lifestyle and occupational risks assessment of bladder cancer using machine learning-based prediction models

Published on: June 05, 2025

This study applied machine learning to assess lifestyle and occupational risk factors for bladder cancer. A balanced case-control study of 692 bladder cancer patients and 692 healthy controls was conducted using data from the Iranian Cancer Registry. Random Forest achieved the highest predictive accuracy. Recurrent urinary tract infections, bladder stones, and lifestyle factors such as smoking and diet were key predictors. The study provides a valuable tool for early screening, risk assessment, and the development of preventive strategies in public health.

Deep learning integrates histopathology and proteogenomics at a pan-cancer level

Published on: May 08, 2025

Machine learning and artificial intelligence are transforming cancer research and precision medicine. Computational pathology now uses deep learning models like convolutional neural networks (CNNs) to analyze histopathological images and predict molecular alterations in cancer. These models have shown success in predicting gene mutations, molecular subtypes, and biomarkers like microsatellite instability and somatic mutations such as TP53 and PTEN. Novel architectures like Panoptes classify complex cancers like endometrial cancer from image data. However, most existing studies rely heavily on genomic and transcriptomic data, often neglecting the tumor proteome and lacking pathology expert input in model design. This limits the biological interpretability and real-world clinical application of these models. To address this, the study uses data from the Clinical Proteomic Tumor Analysis Consortium (CPTAC), offering comprehensive genomics-to-proteomics profiles linked with matched histopathology images and patient outcomes.

Background_edited_edited.jpg

Get the latest updates

Join Cell talk!

Subscribe to us today and receive the latest updates in your mailbox!

bottom of page