Microphone Speaker Analysis: Audio Segmentation and Frequency Insights

Taisia-Maria COCONU¹, Costin-Alexandru DEONISE¹, Constantin ANGHEL², Cătălin NEGRU¹ Florin POP^1,3,4

Abstract. Audio segmentation represents a technical process used for separating a stream of audio recordings, which frequently contain multiple speakers, into uniform sections. This paper explores the implementation of voice-dialing and recognition algorithms to examine and analyze the technology’s capability to accurately identify and differentiate speakers in intricate environments. It aims to enhance our understanding of the technology’s functionality, including its ability to discern speakers’ emotions and gender. Additionally, a hardware simulation is conducted using a two-way microphone and an Arduino board. It seeks to emphasize precision in speaker recognition and diarization, along with the accurate transcription of speeches, by achieving optimal parameters and enhancing existing market models. It also explores the applicability of this technology in various fields by creating applications that mainly use Speech Diarization and Speech Recognition.

Keywords: Emotion Detection, Gender Detection, Voice Recognition Hardware System.

More …

DOI 10.56082/annalsarsciinfo.2024.1.5

¹National University of Science and Technology Politehnica Bucharest, Romania

² National Institute of Research and Development in Mechatronics and Measurement Technique, Bucharest, Romania

³ National Institute for Research and Development in Informatics (ICI), Bucharest, Romania

⁴ Academy of Romanian Scientists, Bucharest, Romania

PUBLISHED in Annals of the Academy of Romanian Scientists Series on Science and Technology of Information, Volume 17, No1

ISSN PRINT2066 – 2742 ISSN ONLINE 2066-8562

Description | Editorial Board | Instructions for authors | Template | Archive