+

WO2005106849A1 - Compression/decompression audio numerique a codage/decodage de coefficients de predicteur linaire a complexite reduite - Google Patents

Compression/decompression audio numerique a codage/decodage de coefficients de predicteur linaire a complexite reduite Download PDF

Info

Publication number
WO2005106849A1
WO2005106849A1 PCT/US2005/012738 US2005012738W WO2005106849A1 WO 2005106849 A1 WO2005106849 A1 WO 2005106849A1 US 2005012738 W US2005012738 W US 2005012738W WO 2005106849 A1 WO2005106849 A1 WO 2005106849A1
Authority
WO
WIPO (PCT)
Prior art keywords
partial correlation
companded
correlation coefficient
quantized
coefficients
Prior art date
Application number
PCT/US2005/012738
Other languages
English (en)
Inventor
Yuriy A. Reznik
Original Assignee
Realnetworks, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Realnetworks, Inc. filed Critical Realnetworks, Inc.
Publication of WO2005106849A1 publication Critical patent/WO2005106849A1/fr

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients

Definitions

  • the present invention relates to the field of signal processing. More specifically, the present invention relates to compression and decompression of audio signal data.
  • PCM audio has a number of advantages over analog audio.
  • pulse code modulation (PCM) audio has a number of advantages over other audio formats.
  • PCM audio offers freedom to interchange audio data without generation loss between media.
  • PCM audio is not only being offered in mediums like compact disc (CD), it is also widely employed in broadcast programming, through air waves or cable, or in streamed content, through private and/or public networks, such as the Internet.
  • CD compact disc
  • bandwidth availability/consumption remains a significant challenge.
  • digital audio data is often compressed.
  • linear prediction filtering is often employed as part of the compression process.
  • Figure 1 illustrates an overview of an encoder for compressing audio signal data, in accordance with some embodiments of the present invention
  • Figure 2 illustrates an overview of a decoder for decompressing audio signal data, in accordance with some embodiments of the present invention
  • Figure 3 illustrates a system, including an encoder and a decoder of Fig. 1-2, respective, in accordance with some embodiments.
  • Illustrative embodiments of the present invention include, but are not limited to, a method to compress digital audio data (in particular, PCM audio data), encoders and/or decoders adapted to practice all or portions of the method, and systems having the encoders/decoders.
  • digital audio data in particular, PCM audio data
  • encoders and/or decoders adapted to practice all or portions of the method
  • systems having the encoders/decoders Various aspects of embodiments of the present invention will be described. However, various embodiments may be practiced with only some or all of the described aspects. For purposes of explanation, specific numbers, materials and configurations are set forth in order to provide a thorough understanding of the embodiments being described. In alternate embodiments, the described aspects may be practiced without explaining the specific details of operation.
  • Examples of general purpose “computing devices/systems” may include, but are not limited to, handheld computing devices (palm sized, tablet sized and so forth), laptop computing devices, desktop computing devices, servers, and so forth.
  • Examples of special purpose “computing device/system” may include, but are not limited to, processor based wireless mobile phones, handheld digital music players, set-top boxes, game boxes/consoles, CD/DVD players, digital cameras, digital CAMCORDERS, and so forth.
  • DVD - Digital Versatile Disk Various operations will be described as multiple discrete operations in turn, in a manner that is most helpful in understanding the various embodiments of the present invention, however, the order of description should not be construed as to imply that these operations are necessarily order dependent.
  • Encoder 100 includes Buffer 102, Linear Predictive Coding (LPC) Coefficient Estimator 104, Predictor 106, Quantization Unit 108, Subtractor 109, Entropy Encoder 110, and Multiplexor 112, coupled with each other as shown.
  • LPC Linear Predictive Coding
  • Encoder 100 includes LPC Coefficient Converter 114, Companding and Quantization Unit 116, and Partial Correlation (ParCor) Encoder 118 coupled with each other and the earlier enumerated elements, as shown.
  • Buffer 102 is adapted to temporally and successively store portions of an incoming audio stream to be encoded.
  • LPC Coefficient Estimator 104 is adapted to provide a number of LPC coefficients, based on the incoming audio stream, for use to apply linear prediction filtering on the incoming audio stream, to generate residuals for the audio stream.
  • Quantization Unit 108 is adapted to quantize the linear prediction coefficients.
  • Predictor 106 and Subtractor 109 are used in conjunction to generate the residuals of the audio data.
  • Entropy Encoder 110 is adapted to encode the residuals, generated encoded residuals and associated code indices.
  • LPC Coefficients Converter 114 is adapted to convert the linear prediction coefficients to partial correlation coefficients.
  • Companding and quantication unit 116 is adapted to compand and quantize the partial correlation coefficients to generate companded and quantized partial correlation coefficients.
  • ParCor Encoder 118 is adapted to encode the companded and quantized partial correlation coefficients.
  • Multiplexor is adapted to multiplex the encoded residuals of the audio data, the associated code indices, and the encoded partial correlation coefficients to form the compressed audio data bitstream. Resultantly, compressed audio data bitstream is typically more compact than prior art compressed audio data bitstream.
  • ParCor Encoder 118 is adapted to encode the companded and quantized partial correlation coefficients as follows: For each companded and quantized partial correlation coefficient, a residual of the companded and quantized partial correlation coefficient is computed by subtracting an offset from the companded and quantized partial correlation coefficient to generate an intermediate value; then multiplying the intermediate value by 2 to obtain the residual, if the intermediate value is positive, or multiplying the intermediate value by -2 to obtain another intermediate value, and adding 1 to the other intermediate value to obtain the residual, if the intermediate value is negative. In turn, the residual of the companded and quantized partial correlation coefficient is encoded. In various embodiments, the encoding may be performed, for example, using Golomb-Rice codes.
  • the encoding may be performed, using Huffman codes, Arithmetic codes, Fixed-rate codes, codes of the like or combination of them (including combination with Golomb-Rice codes).
  • the offsets and Rice parameters for the various coefficients are as follows:
  • Encoder 100 may be implemented in hardware and/or software, in whole or in part. For examples, for software implementations, it may be implemented in C or other system programming languages; for hardware implementations, the logic may be implemented using programmable gate arrays or Application Specific Integrated Circuits (ASIC).
  • ASIC Application Specific Integrated Circuits
  • Decoder 200 includes De- Multiplexor 212, Entropy Decoder 210, Adder 209, De-Quantization Unit 208, Predictor 206, and Buffer 202, coupled with each other as shown. Further, Decoder 200 includes Partial Correlation Decoder 218, De-Quantization and Inverse Companding Unit 216, and LPC Coefficient Recovery Unit 214, coupled with each other and the earlier enumerated elements, as shown.
  • De-Multiplexor 212 is adapted to de-multiplex an audio bitstream to recover the encoded residuals of the audio data, the associated code indices and the encoded partial correlation coefficients.
  • Entropy Decoder 210 is adapted to decode the encoded residuals, to recover unencoded residuals.
  • Predictor 206 and Adder 209 are used in conjunction to recover the audio data.
  • Buffer 202 is adapted to temporally and successively store portions of recovered/decompressed audio data.
  • ParCor Decoder 218 is adapted to decode the encoded companded and quantized partial correlation coefficients.
  • De-Quantication and Inverse Companding Unit 216 is adapted to de-quantize and inverse compand the companded and quantized partial correlation coefficients to recover the original partial correlation coefficients.
  • LPC Coefficients Recovery Unit 214 is adapted to recover the linear prediction coefficients from the partial correlation coefficients.
  • Quantization Unit 208 is adapted to quantize the recovered linear prediction coefficients for earlier described Predictor 206. In various embodiments, the above described elements are performed under the control of a controller (not shown). Still referring to Figure 2, in various embodiments, where the residuals of companded and quantized partial correlation coefficients are encoded (e.g.
  • ParCor Decoder 218 is adapted to decode the encoded residuals as follows: For each residual of a companded and quantized partial correlation coefficient, the companded and quantized partial correlation coefficient is computed by subtracting 1 from the residual to obtain a difference, dividing the difference by -2 to obtain a first quotient, determine whether the first quotient is negative, adding an offset to the first quotient to obtain the companded and quantized partial correlation coefficient, if the first quotient is negative, dividing the residual by 2 to obtain a second quotient, if the first quotient is not negative, and adding an offset to the second quotient to obtain the companded and quantized partial correlation coefficient, if the second quotient is computed.
  • decoder 200 may be implemented in hardware and/or software, in whole or in part. Referring now to Figure 3, wherein a system incorporated with the encoder and decoder of Fig 1-2 respectively, in accordance with various embodiments. As illustrated, for the embodiments, system 300 includes audio source 302 and audio renderer 304 operatively coupled to each other via "connection" 306.
  • Audio source 302 and audio renderer 304 includes audio encoder 100 and audio decoder 200 of Fig. 1 and 2, respectively.
  • audio source 302 and audio renderer 304 may be components of an audio application (e.g. a speech related application).
  • audio source 302 and audio renderer 304 may be components of an audio/video application (e.g. an audio/video streaming application).
  • audio source 302 and audio renderer 304 may be components of the same electronic device, coupled 306 to each other via e.g. an internal system bus of the electronic device.
  • audio source 302 and audio renderer 304 may be disposed on separate electronic devices, coupled 306 to each other via e.g. a cable or a networked connection.
  • Audio source 302 may e.g. be a server, set- top unit, a desktop computer, and so forth.
  • Audio renderer 304 may be a speaker, a media player, a desktop computer, and so forth.
  • the networked connection may be a local area network connection or a wide area network connection, wired or wireless.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

Dans une application de filtrage de prédiction linéaire pour la compression d'un ensemble de données audio, des coefficients de prédiction linéaire sont codés par codage de leurs coefficients de corrélation partielle correspondants. Pendant la décompression, ces coefficients de prédiction linéaire sont récupérés à partir des coefficients de corrélation partielle récupérés.
PCT/US2005/012738 2004-04-14 2005-04-14 Compression/decompression audio numerique a codage/decodage de coefficients de predicteur linaire a complexite reduite WO2005106849A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US56266904P 2004-04-14 2004-04-14
US60/562,669 2004-04-14

Publications (1)

Publication Number Publication Date
WO2005106849A1 true WO2005106849A1 (fr) 2005-11-10

Family

ID=35241897

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/012738 WO2005106849A1 (fr) 2004-04-14 2005-04-14 Compression/decompression audio numerique a codage/decodage de coefficients de predicteur linaire a complexite reduite

Country Status (1)

Country Link
WO (1) WO2005106849A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100718416B1 (ko) 2006-06-28 2007-05-14 주식회사 대우일렉트로닉스 예측필터를 이용한 채널간 스테레오 오디오 코딩 방법

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4390747A (en) * 1979-09-28 1983-06-28 Hitachi, Ltd. Speech analyzer
US4516259A (en) * 1981-05-11 1985-05-07 Kokusai Denshin Denwa Co., Ltd. Speech analysis-synthesis system
US5822732A (en) * 1995-05-12 1998-10-13 Mitsubishi Denki Kabushiki Kaisha Filter for speech modification or enhancement, and various apparatus, systems and method using same

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4390747A (en) * 1979-09-28 1983-06-28 Hitachi, Ltd. Speech analyzer
US4516259A (en) * 1981-05-11 1985-05-07 Kokusai Denshin Denwa Co., Ltd. Speech analysis-synthesis system
US5822732A (en) * 1995-05-12 1998-10-13 Mitsubishi Denki Kabushiki Kaisha Filter for speech modification or enhancement, and various apparatus, systems and method using same

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100718416B1 (ko) 2006-06-28 2007-05-14 주식회사 대우일렉트로닉스 예측필터를 이용한 채널간 스테레오 오디오 코딩 방법

Similar Documents

Publication Publication Date Title
KR100608062B1 (ko) 오디오 데이터의 고주파수 복원 방법 및 그 장치
US8386271B2 (en) Lossless and near lossless scalable audio codec
US20070043575A1 (en) Apparatus and method for encoding audio data, and apparatus and method for decoding audio data
USRE46082E1 (en) Method and apparatus for low bit rate encoding and decoding
US20090228284A1 (en) Method and apparatus for encoding/decoding multi-channel audio signal by using a plurality of variable length code tables
JP2002135131A (ja) 可変サイズベクトルエントロピ符号化を実行するための方法および装置
US20100017196A1 (en) Method, system, and apparatus for compression or decompression of digital signals
EP1847022B1 (fr) Codeur, decodeur, procede de codage/decodage, supports lisibles par ordinateur et elements de programme informatique
US9065547B2 (en) Digital audio signal compression method and apparatus
JP2003015694A (ja) ビットレート変換装置およびビットレート変換方法
KR101363206B1 (ko) 인터채널과 시간적 중복감소를 이용한 오디오 신호 인코딩
EP1175030A2 (fr) Méthode et système pour le codage perceptuel de signaux audiophoniques multicanal par transformation en cosinus discrète et cosinus discrète modifiée à cascades
KR20120048694A (ko) 오디오 인코딩에서 주파수 대역 신호 에너지를 기초로 한 주파수 대역 스케일 팩터 결정
JP3406275B2 (ja) ディジタル信号符号化方法、ディジタル信号復号化方法、これらの装置及びその各プログラム記録媒体
Yang et al. A lossless audio compression scheme with random access property
AU2003208795A1 (en) Video compression using analytical entropy coding
Moriya et al. A design of lossy and lossless scalable audio coding
US6549147B1 (en) Methods, apparatuses and recorded medium for reversible encoding and decoding
EP3577649B1 (fr) Codeur de signal audio stéréo
WO2005106849A1 (fr) Compression/decompression audio numerique a codage/decodage de coefficients de predicteur linaire a complexite reduite
US20040208169A1 (en) Digital audio signal compression method and apparatus
KR100975522B1 (ko) 스케일러블 오디오 복/부호화 방법 및 장치
Gournay et al. Backward linear prediction for lossless coding of stereo audio
Harada Lossles Compression of Speech and Audio Signals and Its Applications
JPH0255431A (ja) 情報伝送装置

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Country of ref document: DE

122 Ep: pct application non-entry in european phase
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载