A Proposed CNN Model for Audio Recognition on Embedded Device

Authors

DOI:

https://doi.org/10.3991/ijim.v18i08.45917

Keywords:

Autonomous car, convolutional neural network, vehicle classification, audio recognition

Abstract


An audio detection system enables autonomous cars to recognize their surroundings from the noise produced by moving vehicles. This paper proposes a machine learning model based on convolutional neural networks (CNN) that runs on an embedded system consisting of a specialized microphone and a main processor. The microphone delivers an accurate analog signal to the main processor, which analyzes the recorded signal and returns a prediction. Designing adequate hardware is crucial because it directly affects the predictive capability of the system, but it is equally important to train a CNN model with high accuracy. To this end, a dataset of more than 3000 WAV files, each up to 5 seconds long and covering four classes, was obtained from open-source research. The dataset is divided into training, validation, and testing sets. The training data is converted into spectrogram images before the CNN is trained. Finally, the trained model is evaluated on the testing set, where it reaches an accuracy of 77.54%.
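As a rough illustration of the spectrogram step mentioned in the abstract, the sketch below turns a one-dimensional audio signal into a time-frequency magnitude grid of the kind that can be rendered as an image for a CNN. It is not the authors' implementation: the frame length, hop size, Hann window, naive DFT, and the synthetic 1 kHz test tone are all assumptions chosen to keep the example small and dependency-free.

```python
import cmath
import math


def hann(n):
    """Symmetric Hann window of length n (assumed window choice)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def spectrogram(samples, frame_len=64, hop=32):
    """Return one row per frame; each row holds DFT magnitudes for bins 0..frame_len//2."""
    window = hann(frame_len)
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        # Apply the window to the current frame to reduce spectral leakage.
        chunk = [samples[start + i] * window[i] for i in range(frame_len)]
        mags = []
        for k in range(frame_len // 2 + 1):
            # Naive DFT for bin k; a real pipeline would use an FFT library.
            acc = sum(
                chunk[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                for n in range(frame_len)
            )
            mags.append(abs(acc))
        frames.append(mags)
    return frames


# Hypothetical input: a synthetic 1 kHz tone sampled at 8 kHz (not from the paper's dataset).
sample_rate = 8000
tone_hz = 1000
signal = [math.sin(2 * math.pi * tone_hz * t / sample_rate) for t in range(512)]

spec = spectrogram(signal)
# Index of the strongest frequency bin in the first frame.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * sample_rate / 64
print(len(spec), len(spec[0]), peak_hz)
```

Each row of `spec` corresponds to one time frame and each column to one frequency bin, so the grid can be log-scaled and saved as an image. With recorded WAV files, the samples would first be read from disk, for example with Python's standard `wave` module.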

Author Biographies

Minh Pham Ngoc, Institute of Information Technology - Vietnam Academy of Science and Technology

Minh Pham Ngoc was born in 1976. He studied Control Engineering at Hanoi University of Science and Technology from 1994 to 1999, graduating with a Bachelor's degree. He obtained a Doctorate in Technical Engineering, specializing in Control and Automation, at the Vietnam Academy of Science and Technology. He has led and served as the secretary for numerous research projects at the state and ministry levels at the Institute of Information Technology. Currently, he is the Head of the Control Engineering and Embedded Systems Department at the Institute of Information Technology. His main research interests include Embedded Systems, Process Control, Industrial Communication Networks, Broadband Wireless Networks, Robot Control, and Image Processing.

Tan Ngo Duy, Space Technology Institute - Vietnam Academy of Science and Technology

He was a researcher in the Department of Remote Sensing Technology, Institute of Physics and Electronics (IOP) – Vietnam Academy of Science and Technology (VAST) from 2001 to 2006. From 2006 to 2011, he worked in Research and Development of Satellite Technology at the Space Technology Institute (STI). Since 2011, he has worked for the Center for Small Satellite Control and Data Exploitation.

He received his Master of Electronics and Telecommunication from Hanoi University of Science and Technology in 2007 and a PhD degree in Control and Automation Engineering from the Graduate University of Science and Technology (GUST) – Vietnam Academy of Science and Technology in 2019.
He is also a member of the Scientific Council of the Space Technology Institute.

His key research topics of interest are:
- Satellite technology: attitude estimation and control, mission analysis, on-board processing, antenna tracking, RF signal reception.
- Remote sensing data processing: application of artificial intelligence to satellite data processing and object recognition.
- Robotics and the Internet of Things.

Hoan Huynh Duc, Quy Nhon University, Binh Dinh, Vietnam

Hoan Huynh Duc, born in 1972, earned his Ph.D. in Electrical and Electronic Equipment in 2009 at Hanoi University of Science and Technology. In 2017, he was recognized as an Associate Professor. Currently, he is a senior lecturer at Quy Nhon University. His primary research interests include Embedded Systems, Control Systems, Electrical System Protection, and Electric Drive Control.

Kiet Tran Anh, Space Technology Institute, Vietnam Academy of Science and Technology

Tran Anh Kiet graduated from Purdue University in Mechanical Engineering. He is currently a research intern at the Space Technology Institute, Vietnam Academy of Science and Technology. His research interests include autonomous systems and aerospace technology.

Published

2024-04-23

How to Cite

Pham Ngoc, M., Ngo Duy, T., Huynh Duc, H., & Tran Anh, K. (2024). A Proposed CNN Model for Audio Recognition on Embedded Device. International Journal of Interactive Mobile Technologies (iJIM), 18(08), pp. 116–126. https://doi.org/10.3991/ijim.v18i08.45917

Section

Papers