Research on Audio Processing Method Based on 3D Technology

Li, Kai; Tang, Yaping; Ouyang, Yuanling

doi:10.1007/978-3-031-38651-0_4

Part of the book series: Learning and Analytics in Intelligent Systems (LAIS, volume 33)

Included in the following conference series:

International Conference on Computational Finance and Business Analytics

Abstract

The perception of sound by the human auditory system includes not only subjective attributes such as loudness, pitch and timbre, but also the spatial attributes of sound. The 3D sound effect is an acoustic concept characterized by a broad sound stage and a strong sense of sound localization, and it can offer users a richer listening experience. Analyzing an audio signal first requires preprocessing: filtering out the noise and extracting the useful signal components. To address problems that arise in 3D audio signal processing, this article proposes an improved algorithm that determines the threshold from the decomposition scale; the optimal decomposition scale is found by comparing the high-frequency coefficient plots of adjacent scales. The improved algorithm better preserves the characteristics of the signal. Audio processing with this method reaches an accuracy of 95.69%, which is 5.98% and 9.53% higher than the two comparison models, respectively. The results show that the proposed method has clear advantages in audio processing: it effectively removes noise interference, enhances the stereo effect of 3D audio, and achieves a clearly better signal-to-noise ratio than the original algorithm.
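
The abstract does not name the transform, the wavelet, or the threshold formula, so the following is only a rough sketch of what scale-dependent threshold denoising can look like, not the authors' algorithm. It assumes a multi-level Haar decomposition (chosen for brevity), a noise estimate taken from the finest high-frequency coefficients, and a per-scale threshold of the form sigma * sqrt(2 ln N) / ln(j + 1), a variant found in the wavelet denoising literature. The number of decomposition levels is passed in by hand rather than selected by comparing adjacent coefficient plots. All function names are hypothetical.

```python
import math
import random
import statistics


def haar_dwt(signal):
    """One level of the orthonormal Haar transform (even-length input)."""
    s = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt: interleave the reconstructed sample pairs."""
    s = 1.0 / math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) * s)
        out.append((a - d) * s)
    return out


def soft_threshold(coeffs, thr):
    """Shrink each coefficient toward zero by thr; small ones become zero."""
    return [math.copysign(max(abs(c) - thr, 0.0), c) for c in coeffs]


def denoise(signal, levels):
    """Decompose, threshold each detail band by its scale, reconstruct."""
    n = len(signal)
    if n % (2 ** levels) != 0:
        raise ValueError("signal length must be divisible by 2**levels")

    approx = list(signal)
    details = []  # details[0] is the finest scale (j = 1)
    for _ in range(levels):
        approx, d = haar_dwt(approx)
        details.append(d)

    # Assumption: noise level estimated from the finest detail band
    # with the median absolute deviation.
    sigma = statistics.median(abs(c) for c in details[0]) / 0.6745
    base = sigma * math.sqrt(2.0 * math.log(n))

    # Assumed scale-dependent rule: the threshold shrinks as scale j grows.
    for j in range(levels, 0, -1):
        thr = base / math.log(j + 1)
        approx = haar_idwt(approx, soft_threshold(details[j - 1], thr))
    return approx


def snr_db(clean, estimate):
    """Signal-to-noise ratio of estimate against a known clean signal, in dB."""
    signal_power = sum(c * c for c in clean)
    error_power = sum((c - e) ** 2 for c, e in zip(clean, estimate))
    return 10.0 * math.log10(signal_power / error_power)


def demo(n=1024, levels=4, noise_std=0.3, seed=0):
    """Synthetic test: a sine wave with additive Gaussian noise."""
    rng = random.Random(seed)
    clean = [math.sin(2.0 * math.pi * i / 256) for i in range(n)]
    noisy = [c + rng.gauss(0.0, noise_std) for c in clean]
    denoised = denoise(noisy, levels)
    return snr_db(clean, noisy), snr_db(clean, denoised)
```

Calling `demo()` returns the signal-to-noise ratio before and after denoising on a synthetic sine wave. The test signal and parameter values are illustrative and are not taken from the chapter.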

Author information

Authors and Affiliations

College of Music and Dance, Hunan University of Humanities, Science and Technology, Loudi, 417000, China
Kai Li & Yaping Tang
Changsha Human Resources Public Service Center, Changsha, 410000, China
Yuanling Ouyang

Corresponding author

Correspondence to Yaping Tang.

Editor information

Editors and Affiliations

Department of Informatics, University of Piraeus, Piraeus, Greece
George A. Tsihrintzis
Institute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, Krasnoyarsk, Russia
Margarita N. Favorskaya
Department of Radio Communications, University of Sofia, Sofia, Bulgaria
Roumen Kountchev
Professor, Director, Interscience Institute of Management and Technology, Bhubaneswar, Odisha, India
Srikanta Patnaik

Cite this paper

Li, K., Tang, Y., Ouyang, Y. (2023). Research on Audio Processing Method Based on 3D Technology. In: Tsihrintzis, G.A., Favorskaya, M.N., Kountchev, R., Patnaik, S. (eds) Advances in Computational Vision and Robotics. ICCVR 2023. Learning and Analytics in Intelligent Systems, vol 33. Springer, Cham. https://doi.org/10.1007/978-3-031-38651-0_4

DOI: https://doi.org/10.1007/978-3-031-38651-0_4
Published: 13 October 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-38650-3
Online ISBN: 978-3-031-38651-0
eBook Packages: Intelligent Technologies and Robotics (R0)
