Claim Missing Document
Check
Articles

Lung sound classification using multiresolution Higuchi fractal dimension measurement Achmad Rizal; Risanuri Hidayat; Hanung Adi Nugroho; Willy Anugrah Cahyadi
International Journal of Electrical and Computer Engineering (IJECE) Vol 13, No 5: October 2023
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijece.v13i5.pp5091-5100

Abstract

Lung sound is one indicator of abnormalities in the lungs and respiratory tract. Research for automatic lung sound classification has become one of the interests for researchers because lung disease is one of the diseases with the most sufferers in the world. The use of lung sounds as a source of information because of the ease in data acquisition and auscultation is a standard method in examining pulmonary function. This study simulated the potential use of Higuchi fractal dimension (HFD) as a feature extraction method for lung sound classification. HFD calculations were run on a series of k values to generate some HFD values as features. According to the simulation results, the proposed method could produce an accuracy of up to 97.98% for five classes of lung sound data. The results also suggested that the shift in HFD values over the selection of a time interval k can be used for lung sound classification.
Frequency domain analysis of MFCC feature extraction in children’s speech recognition system Risanuri Hidayat
JURNAL INFOTEL Vol 14 No 1 (2022): February 2022
Publisher : LPPM INSTITUT TEKNOLOGI TELKOM PURWOKERTO

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.20895/infotel.v14i1.740

Abstract

Abstract —The research on speech recognition systems currently focuses on the analysis of robust speech recognition systems. When the speech signals are combined with noise, the recognition system becomes distracted, struggling to identify the speech sounds. Therefore, the development of a robust speech recognition system continues to be carried out. The principle of a robust speech recognition system is to eliminate noise from the speech signals and restore the original information signals. In this paper, researchers conducted a frequency domain analysis on one stage of the Mel Frequency Cepstral Coefficients (MFCC) process, the Fast Fourier Transform (FFT), in children's speech recognition system. The FTT analysis in the feature extraction process determined the effect of frequency value characteristics utilized in the FFT output on the noise disruption. The analysis method was designed into three scenarios based on the value of the employed FFT points. The differences between scenarios were based on the number of shared FFT points. All FFT points were divided into four, three, and two parts in the first, second, and third scenarios, respectively. This study utilized children's speech data from the isolated TIDIGIT English digit corpus. As comparative data, the noise was added manually to simulate real-world conditions. The results showed that using a particular frequency portion following the scenario designed on MFCC affected the recognition system performance, which was relatively significant on the noisy speech data. The designed method in the scenario 3 (C1) version generated the highest accuracy, exceeded the accuracy of the conventional MFCC method. The average accuracy in the scenario 3 (C1) method increased by 1% more than all the tested noise types. Using various noise intensity values (SNR), the testing process indicates that scenario 3 (C1) generates a higher accuracy than conventional MFCC in all tested SNR values. It proves that the selection of specific frequency utilized in MFCC feature extraction significantly affects the recognition accuracy in a noisy speech.