Options
TripletCough: Cougher Identification and Verification From Contact-Free Smartphone-Based Audio Recordings Using Metric Learning
Journal
IEEE Journal of Biomedical and Health Informatics
ISSN-Digital
2168-2208
Type
journal article
Date Issued
2022-06
Author(s)
Jokic, Stefan
Cleres, David
Rassouli, Frank
Steurer-Stey, Claudia
Puhan, Milo A.
Brutsche, Martin
Barata, Filipe
Abstract (De)
Cough, a symptom associated with many prevalent respiratory diseases, can serve as a potential biomarker for diagnosis and disease progression. Consequently, the development of cough monitoring systems and, in particular, automatic cough detection algorithms have been studied since the early 2000s. Recently, there has been an increased focus on the efficiency of such algorithms, as implementation on consumer-centric devices such as smartphones would provide a scalable and affordable solution for monitoring cough with contact-free sensors. Current algorithms, however, are incapable of discerning between coughs of different individuals and, thus, cannot function reliably in situations where potentially multiple individuals have to be monitored in shared environments. Therefore, we propose a weakly supervised metric learning approach for cougher recognition based on smartphone audio recordings of coughs. Our approach involves a triplet network architecture, which employs convolutional neural networks (CNNs). The CNNs of the triplet network learn an embedding function, which maps Mel spectrograms of cough recordings to an embedding space where they are more easily distinguishable. Using audio recordings of nocturnal coughs from asthmatic patients captured with a smartphone, our approach achieved a mean accuracy of 88% (10% SD) on two-way identification tests with 12 enrollment samples and accuracy of 80% and an equal error rate (EER) of 20% on verification tests. Furthermore, our approach outperformed human raters with regard to verification tests on average by 8% in accuracy, 4% in false acceptance rate (FAR), and 12% in false rejection rate (FRR). Our code and models are publicly available.
Language
English
Keywords
Cough monitoring
metric learning
mobile sensing
remote patient monitoring
speaker identification
speaker verification
triplet network
HSG Classification
contribution to scientific community
HSG Profile Area
None
Refereed
Yes
Publisher
IEEE
Volume
26
Number
6
Start page
2746
End page
2757
Subject(s)
Division(s)
Eprints ID
267656
File(s)
Loading...
open access
Name
TripletCough_IEEE_JBHI3152944_06_2022.pdf
Size
1.55 MB
Format
Adobe PDF
Checksum (MD5)
689a43866ee1bb6bcdd0c59a840df145