Márcio Holsbach Costa

h-index: 14

Papers: 5

Citations: 30

Novelty: 46%

AI Score: 22

Ranked #178,155 of 194,257 authors (top 92%); #1,321 in AS (top 91%)

5 Papers

2.7 | IV | Apr 7, 2022

A Pathology-Based Machine Learning Method to Assist in Epithelial Dysplasia Diagnosis

Karoline da Rocha, José C. M. Bermudez, Elena R. C. Rivero et al.

Epithelial dysplasia (ED) is a tissue alteration commonly present in lesions preceding oral cancer, and its presence is one of the most important factors in the progression toward carcinoma. This study proposes a method for designing a low-computational-cost classification system to support the detection of dysplastic epithelia, helping to reduce the variability of pathologist assessments. We employ a multilayer perceptron artificial neural network (MLP-ANN) and define the regions of the epithelium to be assessed based on the knowledge of the pathologist. The performance of the proposed solution was statistically evaluated. The implemented MLP-ANN presented an average accuracy of 87%, with a variability much lower than that obtained from three trained evaluators. Moreover, the proposed solution led to results very close to those obtained using a convolutional neural network (CNN) implemented by transfer learning, with 100 times less computational complexity. In conclusion, our results show that a simple neural network structure can achieve a performance equivalent to that of the much more complex structures routinely used in the literature.

1.2 | AS | Apr 19, 2021

Robust parameter design for Wiener-based binaural noise reduction methods in hearing aids

Diego M. Carmo, Ricardo Borsoi, Márcio H. Costa

This work presents a method for designing the weighting parameter required by Wiener-based binaural noise reduction methods. This parameter establishes the desired tradeoff between noise reduction and binaural cue preservation in hearing aid applications. The proposed strategy was specifically derived for the preservation of the interaural level difference, interaural time difference and interaural coherence binaural cues. It is defined as a function of the average input noise power at the microphones, providing robustness against joint changes in noise and speech power (Lombard effect), as well as against signal-to-noise ratio (SNR) variations. A theoretical framework, based on the mathematical definition of the homogeneity degree, is presented and applied to a generic augmented Wiener-based cost function. The theoretical insights obtained are supported by computational simulations and psychoacoustic experiments using the multichannel Wiener filter with interaural transfer function preservation technique (MWF-ITF) as a case study. Statistical analysis indicates that the proposed dynamic structure for the weighting parameter and the design method of its fixed part provide significant robustness against changes in the original binaural cues of both speech and residual noise, at the cost of a small decrease in noise reduction performance compared to the use of a purely fixed weighting parameter.

1.2 | AS | Nov 9, 2019

Speech Dereverberation and Noise Reduction for both diffusive noise field and point noise source in Binaural Hearing Aids: Preliminary Version

Johnny Werner, Marcio H. Costa

The multichannel Wiener filter (MWF) and its variations have been extensively applied to binaural hearing aids. However, their major drawback is the distortion of the binaural cues of the residual noise, which changes the original acoustic scenario, whose preservation is of paramount importance for hearing-impaired people. The MWF-IC method was previously proposed for joint speech dereverberation and noise reduction, preserving the interaural coherence (IC) of diffuse noise fields. In this work, we propose a new variation of the MWF-IC for both speech dereverberation and noise reduction, which preserves the original spatial characteristics of the residual noise for either diffuse fields or point sources. Objective measures and preliminary psychoacoustic experiments indicate that the proposed method is capable of perceptually preserving the original spatialization of both types of noise, without significant performance loss in either speech dereverberation or noise reduction.

1.2 | AS | Sep 19, 2018

New insights on the optimality of parameterized Wiener filters for speech enhancement applications

Rafael Attili Chiea, Márcio Holsbach Costa, Guillaume Barrault

This work presents a unified framework for defining a family of noise reduction techniques for speech enhancement applications. The proposed approach provides a unique theoretical foundation for some widely applied soft and hard time-frequency masks, encompassing the well-known Wiener filter and the heuristically designed binary mask. These techniques can now be considered optimal solutions of the same minimization problem. The proposed cost function is defined by two design parameters that not only establish a desired trade-off between noise reduction and speech distortion, but also provide an insightful relationship with the mask morphology. This characteristic may be useful for applications that require online adaptation of the suppression function according to variations of the acoustic scenario. Simulation examples indicate that the derived conformable suppression mask has approximately the same quality and intelligibility performance as the classical heuristically defined parametric Wiener filter. The proposed approach may be of special interest for real-time embedded speech enhancement applications such as hearing aids and cochlear implants.
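For orientation, the two classical masks named in the abstract can be sketched as below. This is a minimal illustration of the textbook parametric Wiener gain and a thresholded binary mask, not the cost function or the conformable mask derived in the paper. The function names, the parameters `mu` and `beta`, and the threshold value are assumptions chosen for the example.

```python
import numpy as np

def parametric_wiener_gain(snr, mu=1.0, beta=1.0):
    """Classical parametric Wiener gain per time-frequency bin.

    snr  : a priori SNR (linear scale, non-negative)
    mu   : noise over-estimation factor (assumed name)
    beta : exponent shaping the suppression curve (assumed name)
    With mu = 1 and beta = 1 this reduces to the standard Wiener gain.
    """
    snr = np.asarray(snr, dtype=float)
    return (snr / (snr + mu)) ** beta

def binary_mask(snr, threshold=1.0):
    """Hard mask: keep a bin only if its SNR exceeds the threshold."""
    snr = np.asarray(snr, dtype=float)
    return (snr > threshold).astype(float)

# Example: gains for a few SNR values (linear scale).
snr_values = np.array([0.1, 1.0, 10.0])
soft = parametric_wiener_gain(snr_values)           # smooth suppression
steep = parametric_wiener_gain(snr_values, beta=4)  # sharper suppression
hard = binary_mask(snr_values)                      # all-or-nothing
```

Raising `beta` steepens the soft mask, which gives an informal sense of how a soft mask can move toward a hard one as its parameters change.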

4.3 | AS | Jun 24, 2018

Perceptually Relevant Preservation of Interaural Time Differences in Binaural Hearing Aids

Fábio P. Itturriet, Márcio H. Costa

This work presents a noise reduction method with perceptually relevant preservation of the interaural time difference (ITD) of the residual noise in binaural hearing aids. The interaural coherence (IC) concept, previously applied to the Multichannel Wiener Filter (MWF) for preservation of the spatial subjective sensation of diffuse noise fields, is proposed here to both preserve and emphasize the ITD binaural cues of a directional acoustic noise source. It is demonstrated that the previously developed MWF-ITD technique may decrease the original IC magnitude of the processed noise, consequently increasing the variance of the interaural phase difference (IPD) of the output signals. It is shown that the MWF-IC technique concomitantly minimizes a nonlinear function of the difference between input and output IPD, which is strictly related to ITD, and preserves the natural coherence of the directional noise captured by the reference microphones. Objective measures and psychoacoustic experiments corroborate the theoretical findings, showing the MWF-IC technique provides relevant noise reduction, while preserving the original ITD subjective perception and original lateralization for a directional noise source. These results are especially relevant for hearing aid designers, since they indicate the MWF-IC as a noise reduction technique that provides residual noise spatial preservation for both diffuse and directional noise sources in frequencies below 1.5 kHz.
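The IC and IPD quantities discussed above can be estimated from left and right signals as in the sketch below. It uses the standard definitions (normalized cross-correlation for IC, and its phase for IPD) applied to complex samples of a single frequency bin. It is not the MWF-IC or MWF-ITD algorithm, and the function names are assumptions made for illustration.

```python
import numpy as np

def interaural_coherence(left, right):
    """Complex interaural coherence of two complex-valued signals.

    left, right : complex samples of one frequency bin across frames.
    Returns the normalized cross-correlation; its magnitude lies in [0, 1].
    """
    left = np.asarray(left, dtype=complex)
    right = np.asarray(right, dtype=complex)
    cross = np.mean(left * np.conj(right))
    power_left = np.mean(np.abs(left) ** 2)
    power_right = np.mean(np.abs(right) ** 2)
    return cross / np.sqrt(power_left * power_right)

def interaural_phase_difference(left, right):
    """IPD in radians: the phase of the left-right cross-correlation."""
    left = np.asarray(left, dtype=complex)
    right = np.asarray(right, dtype=complex)
    return float(np.angle(np.mean(left * np.conj(right))))

# Example: a directional source modeled as a pure phase shift between ears.
rng = np.random.default_rng(0)
left = rng.standard_normal(256) + 1j * rng.standard_normal(256)
right = left * np.exp(-1j * 0.3)   # right channel lags by 0.3 rad
ic = interaural_coherence(left, right)
ipd = interaural_phase_difference(left, right)
```

For a single directional source the coherence magnitude is close to one and the IPD equals the imposed phase shift. Adding independent noise to each channel lowers the coherence magnitude and makes the IPD estimate less stable, which is the link between IC magnitude and IPD variance that the abstract refers to.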