Skip to Main Content (Press Enter)

Logo UNICH
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze

UNI-FIND
Logo UNICH

|

UNI-FIND

unich.it
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze
  1. Pubblicazioni

Machine Learning-Assisted Speech Analysis for Early Detection of Parkinson’s Disease: A Study on Speaker Diarization and Classification Techniques

Articolo
Data di Pubblicazione:
2024
Abstract:
Parkinson’s disease (PD) is a neurodegenerative disorder characterized by a range of motor and non-motor symptoms. One of the notable non-motor symptoms of PD is the presence of vocal disorders, attributed to the underlying pathophysiological changes in the neural control of the laryngeal and vocal tract musculature. From this perspective, the integration of machine learning (ML) techniques in the analysis of speech signals has significantly contributed to the detection and diagnosis of PD. Particularly, MEL Frequency Cepstral Coefficients (MFCCs) and Gammatone Frequency Cepstral Coefficients (GTCCs) are both feature extraction techniques commonly used in the field of speech and audio signal processing that could exhibit great potential for vocal disorder identification. This study presents a novel approach to the early detection of PD through ML applied to speech analysis, leveraging both MFCCs and GTCCs. The recordings contained in the Mobile Device Voice Recordings at King’s College London (MDVR-KCL) dataset were used. These recordings were collected from healthy individuals and PD patients while they read a passage and during a spontaneous conversation on the phone. Particularly, the speech data regarding the spontaneous dialogue task were processed through speaker diarization, a technique that partitions an audio stream into homogeneous segments according to speaker identity. The ML applied to MFCCS and GTCCs allowed us to classify PD patients with a test accuracy of 92.3%. This research further demonstrates the potential to employ mobile phones as a non-invasive, cost-effective tool for the early detection of PD, significantly improving patient prognosis and quality of life.
Tipologia CRIS:
1.1 Articolo in rivista
Keywords:
machine learning; Parkinson’s disease; speaker diarization; speech analysis
Elenco autori:
Di Cesare, M. G.; Perpetuini, D.; Cardone, D.; Merla, A.
Autori di Ateneo:
CARDONE DANIELA
MERLA Arcangelo
PERPETUINI DAVID
Link alla scheda completa:
https://ricerca.unich.it/handle/11564/826931
Link al Full Text:
https://ricerca.unich.it//retrieve/handle/11564/826931/420342/sensors-24-01499-v2.pdf
Pubblicato in:
SENSORS
Journal
  • Dati Generali

Dati Generali

URL

https://www.mdpi.com/1424-8220/24/5/1499
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.2.0