+

US20010005173A1 - Method and apparatus for sample rate pre-and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding - Google Patents

Method and apparatus for sample rate pre-and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding Download PDF

Info

Publication number
US20010005173A1
US20010005173A1 US09/773,492 US77349201A US2001005173A1 US 20010005173 A1 US20010005173 A1 US 20010005173A1 US 77349201 A US77349201 A US 77349201A US 2001005173 A1 US2001005173 A1 US 2001005173A1
Authority
US
United States
Prior art keywords
signal
sampling rate
audio
downsampled
rate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US09/773,492
Other versions
US6384759B2 (en
Inventor
James Snyder
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
AT&T Properties LLC
Original Assignee
AT&T Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by AT&T Corp filed Critical AT&T Corp
Priority to US09/773,492 priority Critical patent/US6384759B2/en
Publication of US20010005173A1 publication Critical patent/US20010005173A1/en
Application granted granted Critical
Publication of US6384759B2 publication Critical patent/US6384759B2/en
Assigned to AT&T PROPERTIES, LLC reassignment AT&T PROPERTIES, LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: AT&T CORP.
Assigned to AT&T INTELLECTUAL PROPERTY II, L.P. reassignment AT&T INTELLECTUAL PROPERTY II, L.P. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: AT&T PROPERTIES, LLC
Assigned to AT&T CORP. reassignment AT&T CORP. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SNYDER, JAMES H.
Assigned to FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V. reassignment FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: AT&T INTELLECTUAL PROPERTY II, L.P.
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation

Definitions

  • the invention relates to audio signal transmission, and more particularly to varying the sample-rate to improve coding gain for audio signals.
  • Most high-quality audio algorithms such as MPEG AAC (MPEG Advanced Audio Coder), PAC (Perceptual Audio Coder), MPEG layer3, Dolby AC3 (Advanced Coder 3), and NTT's TwinVQ, encode a fixed number of samples into each frame which then represent a unit of time for a particular algorithm.
  • Each audio frame carries side information. The number of bits needed to encode the side information per frame is roughly constant. This side information imposes a per-frame overhead.
  • the frame frequency (i.e., the number of frames per second) used by an audio algorithm is proportional to the sampling rate because each frame encodes a constant number of samples.
  • the statistical properties of music indicate that an optimal frame duration is about 40 ms.
  • the frame duration is about 23 ms; at 22050 sps, the frame duration is 46 ms.
  • AAC AAC
  • the invention relates to a method and apparatus for achieving maximal coding gain for audio coding and reproduction. More particularly, at a chosen sample rate and frequency range value, an audio input signal is transduced, sampled, downsampled to the encoding sample rate, encoded and transmitted at a given bit rate. At the receiving end, the downsampled signal is decoded and upsampled to the original or other suitable sample rate. The upsampled signal is then audibly output.
  • Resampling using “small-integer” ratios is computationally more efficient than using arbitrary resampling ratios.
  • This method and apparatus support both arbitrary and small-integer ratio resampling.
  • the use of small-integer resampling frequently implies the use of non-standard sampling rates in the transmitted channel, for example 32073 sps rather than 32000 sps.
  • FIG. 1 is an exemplary diagram of an audio transmission system
  • FIG. 2 is a block diagram of a generic audio encoding/decoding system
  • FIG. 3 is a block diagram of a generic frame-based audio encoding/decoding which operates at a bit rate too low to support the full audio bandwidth implied by the sampling rate (thru Nyquist);
  • FIG. 4 is a block diagram of a generic frame-based audio encoding/decoding system using a low-pass filter
  • FIG. 5 is a block diagram of a generic frame-based audio encoder/decoder that discards spectral coefficients
  • FIG. 6 is a generic frame-based audio encoding/decoding system that downsamples the audio input
  • FIG. 7 is a block diagram of a frame-based audio encoding/decoding system according to the invention.
  • FIG. 8 is a block diagram of a frame-based audio encoding/decoding system of the invention utilizing a non-standard downsampling ratio
  • FIG. 9 is a flowchart of the encoding portion of the invention.
  • FIG. 10 is a flowchart of the decoding portion of the invention.
  • FIG. 1 is an exemplary block diagram of an audio transmission system 100 of the invention.
  • An encoding terminal 110 that downsamples and encodes audio signals is connected to a multimedia communications network 140 through modem 120 and local exchange carrier 130 .
  • a decoding terminal 170 that receives, decodes and upsamples the audio signals is also connected to the multimedia communications network 140 through modem 160 and local exchange carrier 150 .
  • the encoding terminal 110 and decoding terminal 170 include memory units 180 and 190 , respectively, for intermediate storage of the compressed audio signal either prior to transmission or after reception of the audio signals, for example.
  • the multimedia communications network 140 represents any combination of existing communications networks, such as a telephone network, Internet, intranet, etc.
  • the modem devices 120 , 160 may be ethernet interfaces, cable modems, ISDN modems, ADSL modems, or any other interface circuit intended to connect two networks or a network and a digital computing apparatus.
  • the modem devices 120 , 160 may contain a conventional RJ-11 outlet for connection to computer modem, facsimiles, printers or other equipment.
  • the modem devices 120 and 160 may also be equipped with universal serial bus (USB), integrated system digital network (ISDN) or other standard data interfaces, as will be appreciated by the person skilled in the art. However, other similar devices may be used to permit sharing of large bandwidths over media already installed.
  • USB universal serial bus
  • ISDN integrated system digital network
  • Encoding terminal 110 and decoding terminal 170 may be any pair of devices that receive and send audio signals according to the invention through the multimedia communications network 140 via modems 120 and 160 .
  • the encoding terminal 110 and decoding terminal 170 may represent such devices as a personal computer (PC), telephone, television, facsimile, or any other device capable of sending and receiving audio signals. It may be appreciated that the encoding terminal 110 and decoding terminal 170 may include software and/or hardware for performing the encoding and decoding functions, and further that the encoding and decoding terminals may be different types of devices.
  • the compressed audio signal may be intermediately stored in one or more other intermediate storage devices located throughout the audio transmission system 100 , such as between the modem 120 , 160 and the local exchange carrier 130 , 150 , or in the multi-media communications network 140 .
  • FIGS. 2 - 6 In providing a more detailed discussion of the encoding and decoding of audio signals, a discussion of conventional systems is set forth in FIGS. 2 - 6 to better to explain the features and advantages of the present invention.
  • FIG. 2 shows a generic audio encoding/decoding system 200 operating at a bit rate which is sufficient to encode all of the frequencies in the input signal.
  • frequency range fin typically spanning the range of 20 Hz-20 KHz
  • the input signal may either be analog or digital. If the input signal is analog, the encoder 210 will include an analog-to-digital conversion apparatus. However, the input signal may already be digitized, such as stored signals retrieved from an audio compact disc, for example.
  • a decoder 220 located within another PC for example, receives and decodes the transmitted audio signal to produce an audio output f out which is less than fin and less than f s /2.
  • the encoder/decoder system 200 in this example has no other specified bandwidth limit and the distortion level is unspecified. If the bit rate b ch and the sample rate f s are high enough (for the encoding algorithm) then the reproduced audio will be indistinguishable from the original. If either is too low, then the audio will be perceived as degraded.
  • FIG. 3 shows a generic frame-based audio encoding/decoding system 300 operating at a high sampling rate, such as 44100 sps.
  • the audio encoder/decoder system of FIG. 3 is similar to that of FIG. 2, but the sampling rate of 44100 sps used for encoding is too high to permit transparent audio reproduction of the full human-audible frequency range (20 Hz-20 KHz) at the specified bit rate of 96 Kbps, so a degradation in audio signal quality is perceived.
  • the encoder is operating at 96 Kbps and 44100 sps, although the same principles apply at other sampling rates and other bit rates.
  • One way to improve reproduced audio signal quality when the bit rate is too low to support the full frequency range of the input is to encode less than the full frequency range.
  • AAC codec best reproduced signal quality at 96 Kbps and 44100 sps occurs for a signal bandwidth of about 13 KHz.
  • FIGS. 4 - 6 show various ways to decrease the audio frequency range.
  • FIG. 4 shows a generic frame-based audio encoding/decoding system 400 operating at a high sampling rate that uses a low pass filter 410 to limit the frequency range that is encoded. In many cases, a lower sampling rate would allow a wider frequency range or alternatively a higher quality audio signal (because of frame overhead and music statistics). Consequently, the system in FIG. 4 is sub-optimal.
  • FIG. 5 shows a generic frame-based audio encoding/decoding system 500 that operates at a high sampling rate (44100 sps) that discards spectral coefficients in the input signal to limit the frequency range that is encoded and transmitted. This operation is similar but not identical to that of the low pass filter 410 discussed above.
  • the audio input signal is input to the Modified Discrete Cosine Transform (MDCT) 510 (or other time-to-frequency domain transform) and the spectral coefficients are discarded by the spectral coefficient discard unit 520 .
  • MDCT Modified Discrete Cosine Transform
  • the signal is then input to a noise allocation unit 530 (which computes the masking thresholds for the audio frame and quantizes the spectral coefficients according to the thresholds) which emits the compressed signal.
  • the compressed signal is then transmitted to the decoder 220 of another computing unit (for example, another PC, or a portable audio device similar to the Diamond Rio MP3 player) for decoding and output.
  • another computing unit for example, another PC, or a portable audio device similar to the Diamond Rio MP3 player
  • FIG. 6 shows a generic frame-based audio encoding/decoding system 600 that downsamples the audio input signal to limit the frequency range that is encoded and transmitted.
  • the audio input signal is downsampled by the downsampler 610 at a 2:1 ratio and is then input into encoder 210 for encoding.
  • the signal is then transmitted across a communication channel to the decoder 220 at the receiving PC that plays out the audio signal at the downsampled rate.
  • This will generally be suboptimal because the decoder 220 must operate at a submultiple of 44100 sps. In this example, the suboptimal would be 2:1 to 22050, which is not the rate that provides optimal frequency response.
  • FIG. 7 shows the encoding/decoding system 700 of the invention.
  • the audio encoding/decoding system 700 includes an optimal triplet of sample rate f s0 (in this case 32 Ksps), bit rate 96 Kbps, and the maximum supportable frequency range f 0 which at 96 Kbps/32 Ksps is about 13 kHz.
  • the optimal triplet could be determined in a number of ways, e.g. algorithmically or by searching a table.
  • the analog signal (or a digitized version of the analog signal) is input to the encoding unit 710 of a PC, for example, where the signal is downsampled by downsampler 730 from 44100 to 32000 and encoded by the audio encoder 740 .
  • the encoded audio signal is then transmitted across a communications channel, through a modem, for example, at a given bit rate of 96 Kbps to another PC for output.
  • the received signal is input to a decoding unit 720 , where a bit stream decoder 750 decodes the downsampled signal.
  • the decoded signal is then input to the upsampler 760 which upsamples the signal to the original or other suitable sample rate.
  • An audio output is then produced with a frequency range f out of about 13 kHz. Note that in the example of FIG. 7, 44100 sps and 32000 sps are standard AAC rates.
  • the encoding unit 710 and the decoding unit 720 may include memory units for intermediate storage of the compressed audio signal either prior to transmission or after reception of the audio signals, for example.
  • codec for example, AAC
  • f s0 does not match one of these standard rates.
  • codecs such as AAC
  • many codecs can be modified to run at an arbitrary sample rate, and although the resulting encoding unit 710 will generate AAC bit streams that will not reproduce audio accurately unless the decoding unit 720 incorporates this invention, the perceived quality of the reproduced audio signal will be better for the bit stream that uses the non-standard rate than for a bit stream that uses any standard rate.
  • the downsampling process used in FIG. 7 may be more computationally efficient when the downsampling factor is the ratio of two small numbers.
  • the downsampling factor is the ratio of two small numbers.
  • 32073 sps is used as the intermediate sampling frequency.
  • 32073 sps is sufficiently close to an AAC standard rate that audio signals can be encoded using the parameters for a standard AAC rate.
  • the bit stream header which generally carries information about the sampling rate at which the audio was encoded, can indicate the nearby standard rate.
  • a conventional decoder i.e. one which does not incorporate the current invention
  • the invention is still useful when the resulting sampling rate is not close to a standard rate, as long as it is possible to modify the audio encoding unit 710 so that it supports the non-standard rate. For example, with a downsample ratio of 9:8 one obtains a sampling rate of 39200 sps, which with a production AAC codec would support a frequency range as high as 15-17 KHz at a bit rate of 112 Kbps at an acceptable level of distortion. Since the downsample factor is again the ratio of two small numbers, the resampling process would again be computationally efficient.
  • the decoding unit 720 It may be advantageous to indicate to the decoding unit 720 what resampling ratio has been used to encode the audio, since otherwise the codec system (FIGS. 7 & 8) must operate at a fixed resampling ratio.
  • the resampling ratio is incorporated into the bit stream within a reserved bit field of the standard header.
  • the resampling ratio can be incorporated as side channel information.
  • AAC permits “data packets” to be incorporated in the bit stream. These data packets are ignored by a standard AAC codec.
  • the resampling ratio can be specified in a data packet, possibly along with other information.
  • FIG. 9 is a flowchart of the encoding process according to the invention.
  • Process begins at step 1000 and proceeds to step 1010 where the sample rate f s0 and maximum frequency range f 0 are determined as an optimal pair either algorithmically or by searching a table, for example.
  • step 1020 an input signal is received by the encoding unit 710 and is downsampled by downsampler 730 to f s0 .
  • step 1030 the signal is encoded by the audio encoder 740 .
  • the process then proceeds to step 1040 where the signal (along with a header, data packet, etc. that includes the downsampling information), is transmitted at a given bit rate from a modem across a communication channel.
  • the encoding process then goes to step 1050 and ends.
  • FIG. 10 is a flowchart of the decoding process.
  • Process begins at step 1100 and proceeds to step 1110 where the downsampled signal (along with a header, data packet, etc. that includes the downsampling information) is received by another PC's (for example) decoding unit 720 .
  • the process proceeds to step 1120 where the downsampled signal is decoded by the bit stream decoder 750 and then upsampled at step 1130 by the upsampler 760 at a ratio corresponding to the downsampling ratio included with the received downsampled signal, for example.
  • the upsampled signal is then output in step 1140 .
  • the process then goes to step 1150 and ends.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention relates to a method and apparatus for achieving maximal coding gain for audio transmission. More particularly, at a chosen sample rate and frequency range value, an audio input signal is downsampled to the sample rate, encoded and transmitted at a given bit rate. At the receiving end, the downsampled signal is decoded and upsampled to the original or other suitable sample rate. The upsampled signal is then audibly output. Since resampling ratios using “small” numbers prove to be more computationally efficient, this method and apparatus supports resampling ratios which imply both standard and non-standard sampling ratios in the codec.

Description

  • This non-provisional application claims the benefit of U.S. Provisional Application 60/114,719, filed Dec. 30, 1998, the subject matter of which is incorporated herein by reference. [0001]
  • BACKGROUND OF THE INVENTION
  • 1. Field of Invention [0002]
  • The invention relates to audio signal transmission, and more particularly to varying the sample-rate to improve coding gain for audio signals. [0003]
  • 2. Description of Related Art [0004]
  • There are a number of decisions which must be made in setting up an audio compression system. Among the most important variables that affect audio quality during encoding are the sampling rate, bit rate, and the frequencies that will be encoded, such as 20 Hz-20 KHz or some lesser range, for example. For a given level of distortion and a given algorithm, more bits are required to transmit more signal frequencies. Therefore, there is a optimal match between bit rate and frequency range such that if the bit rate is specified, distortion will increase if more frequencies are encoded then is optimal for that bit rate. [0005]
  • Most high-quality audio algorithms, such as MPEG AAC (MPEG Advanced Audio Coder), PAC (Perceptual Audio Coder), MPEG layer3, Dolby AC3 (Advanced Coder 3), and NTT's TwinVQ, encode a fixed number of samples into each frame which then represent a unit of time for a particular algorithm. Each audio frame carries side information. The number of bits needed to encode the side information per frame is roughly constant. This side information imposes a per-frame overhead. [0006]
  • The frame frequency (i.e., the number of frames per second) used by an audio algorithm is proportional to the sampling rate because each frame encodes a constant number of samples. [0007]
  • Decreasing the sampling rate decreases the number of frames-per-second, which in turn decreases the number of bits diverted for overhead, allowing more bits to be used for audio coding. Thus, lowering the sampling rate results in more bits being available for audio coding which results in a higher quality signal as long as sufficient frequency range is preserved. [0008]
  • To a similar end, the statistical properties of music indicate that an optimal frame duration is about 40 ms. For AAC and PAC at sampling rates of 44100 sps (samples per second) (i.e., the CD sample rate) the frame duration is about 23 ms; at 22050 sps, the frame duration is 46 ms. [0009]
  • The lower the sampling rate, the lower the frequency range that can be transmitted, as described by the Nyquist rule, which limits the maximum frequency range to half of the sampling rate. In practical implementations a “guard band” is needed which further lowers the achievable maximum frequency range. For example, for any algorithm (e.g. AAC), at a sampling rate of 22050 sps, the maximum frequency range is 8 to 10 KHz. [0010]
  • Thus, for a given algorithm, and for a given bit rate b[0011] 0 that is not sufficient for encoding the entire human-audible frequency range in a transparent manner without audible distortion, and for a specified acceptable level of distortion, there is a maximum frequency range f0 that one can encode, and that maximum will be associated with a sample rate fs0.
  • If there were no outside constraints, then one would use f[0012] s0 as the sampling rate. However, several outside constraints exist. For example, PCs and Macintoshes work mostly at 44100, 22050 and 11025 sps. Some PCs work at one or more of the rates 48000, 32000, 24000, 16000 and 8000 sps, but very few PCs will work at all of these sample rates. In fact, Macintosh audio hardware will not work at all at these latter sample rates, so a user is constrained to a small set of sample rates if he or she want to interact with PCs and an even smaller set of sample rates if one wants to interact transparently with Macs without involving potentially inferior resampling in the PC or Mac.
  • SUMMARY OF THE INVENTION
  • The invention relates to a method and apparatus for achieving maximal coding gain for audio coding and reproduction. More particularly, at a chosen sample rate and frequency range value, an audio input signal is transduced, sampled, downsampled to the encoding sample rate, encoded and transmitted at a given bit rate. At the receiving end, the downsampled signal is decoded and upsampled to the original or other suitable sample rate. The upsampled signal is then audibly output. [0013]
  • Resampling using “small-integer” ratios (e.g. 11:8) is computationally more efficient than using arbitrary resampling ratios. This method and apparatus support both arbitrary and small-integer ratio resampling. The use of small-integer resampling frequently implies the use of non-standard sampling rates in the transmitted channel, for example 32073 sps rather than 32000 sps. [0014]
  • These and other features and advantages of this invention are described in or are apparent from the following detailed description of the preferred embodiments. [0015]
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The invention will be described with reference to the accompanying drawings, in which like elements are referenced with like numbers, and in which: [0016]
  • FIG. 1 is an exemplary diagram of an audio transmission system; [0017]
  • FIG. 2 is a block diagram of a generic audio encoding/decoding system; [0018]
  • FIG. 3 is a block diagram of a generic frame-based audio encoding/decoding which operates at a bit rate too low to support the full audio bandwidth implied by the sampling rate (thru Nyquist); [0019]
  • FIG. 4 is a block diagram of a generic frame-based audio encoding/decoding system using a low-pass filter; [0020]
  • FIG. 5 is a block diagram of a generic frame-based audio encoder/decoder that discards spectral coefficients; [0021]
  • FIG. 6 is a generic frame-based audio encoding/decoding system that downsamples the audio input; [0022]
  • FIG. 7 is a block diagram of a frame-based audio encoding/decoding system according to the invention; [0023]
  • FIG. 8 is a block diagram of a frame-based audio encoding/decoding system of the invention utilizing a non-standard downsampling ratio; [0024]
  • FIG. 9 is a flowchart of the encoding portion of the invention; and [0025]
  • FIG. 10 is a flowchart of the decoding portion of the invention. [0026]
  • DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
  • FIG. 1 is an exemplary block diagram of an [0027] audio transmission system 100 of the invention. An encoding terminal 110 that downsamples and encodes audio signals is connected to a multimedia communications network 140 through modem 120 and local exchange carrier 130. A decoding terminal 170 that receives, decodes and upsamples the audio signals is also connected to the multimedia communications network 140 through modem 160 and local exchange carrier 150. The encoding terminal 110 and decoding terminal 170 include memory units 180 and 190, respectively, for intermediate storage of the compressed audio signal either prior to transmission or after reception of the audio signals, for example.
  • The [0028] multimedia communications network 140 represents any combination of existing communications networks, such as a telephone network, Internet, intranet, etc.
  • The [0029] modem devices 120, 160 may be ethernet interfaces, cable modems, ISDN modems, ADSL modems, or any other interface circuit intended to connect two networks or a network and a digital computing apparatus. The modem devices 120, 160 may contain a conventional RJ-11 outlet for connection to computer modem, facsimiles, printers or other equipment. The modem devices 120 and 160 may also be equipped with universal serial bus (USB), integrated system digital network (ISDN) or other standard data interfaces, as will be appreciated by the person skilled in the art. However, other similar devices may be used to permit sharing of large bandwidths over media already installed.
  • Encoding [0030] terminal 110 and decoding terminal 170 may be any pair of devices that receive and send audio signals according to the invention through the multimedia communications network 140 via modems 120 and 160. The encoding terminal 110 and decoding terminal 170 may represent such devices as a personal computer (PC), telephone, television, facsimile, or any other device capable of sending and receiving audio signals. It may be appreciated that the encoding terminal 110 and decoding terminal 170 may include software and/or hardware for performing the encoding and decoding functions, and further that the encoding and decoding terminals may be different types of devices.
  • It may further be appreciated that while the [0031] encoding terminal 110 and the decoding terminal 170 include memory units 180 and 190, respectively, for intermediate storage of the compressed audio signal, the compressed audio signal may be intermediately stored in one or more other intermediate storage devices located throughout the audio transmission system 100, such as between the modem 120,160 and the local exchange carrier 130,150, or in the multi-media communications network 140.
  • In providing a more detailed discussion of the encoding and decoding of audio signals, a discussion of conventional systems is set forth in FIGS. [0032] 2-6 to better to explain the features and advantages of the present invention.
  • FIG. 2 shows a generic audio encoding/[0033] decoding system 200 operating at a bit rate which is sufficient to encode all of the frequencies in the input signal. An encoder 210 located within a computing unit, for example a PC, receives an audio input signal with frequency range fin (typically spanning the range of 20 Hz-20 KHz) and encodes the signal for transmission across a communications channel.
  • The input signal may either be analog or digital. If the input signal is analog, the [0034] encoder 210 will include an analog-to-digital conversion apparatus. However, the input signal may already be digitized, such as stored signals retrieved from an audio compact disc, for example.
  • A [0035] decoder 220, located within another PC for example, receives and decodes the transmitted audio signal to produce an audio output fout which is less than fin and less than fs/2. The encoder/decoder system 200 in this example has no other specified bandwidth limit and the distortion level is unspecified. If the bit rate bch and the sample rate fs are high enough (for the encoding algorithm) then the reproduced audio will be indistinguishable from the original. If either is too low, then the audio will be perceived as degraded.
  • FIG. 3 shows a generic frame-based audio encoding/[0036] decoding system 300 operating at a high sampling rate, such as 44100 sps. The audio encoder/decoder system of FIG. 3 is similar to that of FIG. 2, but the sampling rate of 44100 sps used for encoding is too high to permit transparent audio reproduction of the full human-audible frequency range (20 Hz-20 KHz) at the specified bit rate of 96 Kbps, so a degradation in audio signal quality is perceived. In this example, as well as in the examples in FIGS. 4-6, the encoder is operating at 96 Kbps and 44100 sps, although the same principles apply at other sampling rates and other bit rates.
  • One way to improve reproduced audio signal quality when the bit rate is too low to support the full frequency range of the input is to encode less than the full frequency range. By way of reference, for a production quality AAC codec, best reproduced signal quality at 96 Kbps and 44100 sps occurs for a signal bandwidth of about 13 KHz. FIGS. [0037] 4-6 show various ways to decrease the audio frequency range.
  • FIG. 4 shows a generic frame-based audio encoding/[0038] decoding system 400 operating at a high sampling rate that uses a low pass filter 410 to limit the frequency range that is encoded. In many cases, a lower sampling rate would allow a wider frequency range or alternatively a higher quality audio signal (because of frame overhead and music statistics). Consequently, the system in FIG. 4 is sub-optimal.
  • FIG. 5 shows a generic frame-based audio encoding/[0039] decoding system 500 that operates at a high sampling rate (44100 sps) that discards spectral coefficients in the input signal to limit the frequency range that is encoded and transmitted. This operation is similar but not identical to that of the low pass filter 410 discussed above.
  • The audio input signal is input to the Modified Discrete Cosine Transform (MDCT) [0040] 510 (or other time-to-frequency domain transform) and the spectral coefficients are discarded by the spectral coefficient discard unit 520. The signal is then input to a noise allocation unit 530 (which computes the masking thresholds for the audio frame and quantizes the spectral coefficients according to the thresholds) which emits the compressed signal. The compressed signal is then transmitted to the decoder 220 of another computing unit (for example, another PC, or a portable audio device similar to the Diamond Rio MP3 player) for decoding and output.
  • FIG. 6 shows a generic frame-based audio encoding/[0041] decoding system 600 that downsamples the audio input signal to limit the frequency range that is encoded and transmitted. (Resamplers typically incorporate frequency-limiting filters.) The audio input signal is downsampled by the downsampler 610 at a 2:1 ratio and is then input into encoder 210 for encoding. The signal is then transmitted across a communication channel to the decoder 220 at the receiving PC that plays out the audio signal at the downsampled rate. This will generally be suboptimal because the decoder 220 must operate at a submultiple of 44100 sps. In this example, the suboptimal would be 2:1 to 22050, which is not the rate that provides optimal frequency response.
  • FIG. 7 shows the encoding/[0042] decoding system 700 of the invention. The audio encoding/decoding system 700 includes an optimal triplet of sample rate fs0 (in this case 32 Ksps), bit rate 96 Kbps, and the maximum supportable frequency range f0 which at 96 Kbps/32 Ksps is about 13 kHz. The optimal triplet could be determined in a number of ways, e.g. algorithmically or by searching a table. The analog signal (or a digitized version of the analog signal) is input to the encoding unit 710 of a PC, for example, where the signal is downsampled by downsampler 730 from 44100 to 32000 and encoded by the audio encoder 740. The encoded audio signal is then transmitted across a communications channel, through a modem, for example, at a given bit rate of 96 Kbps to another PC for output.
  • At the receiving PC, the received signal is input to a [0043] decoding unit 720, where a bit stream decoder 750 decodes the downsampled signal. The decoded signal is then input to the upsampler 760 which upsamples the signal to the original or other suitable sample rate. An audio output is then produced with a frequency range fout of about 13 kHz. Note that in the example of FIG. 7, 44100 sps and 32000 sps are standard AAC rates.
  • As discussed above in reference to FIG. 1, the [0044] encoding unit 710 and the decoding unit 720 may include memory units for intermediate storage of the compressed audio signal either prior to transmission or after reception of the audio signals, for example.
  • It may be the case that the codec (for example, AAC) is specified at a set of standard rates; and that f[0045] s0 does not match one of these standard rates. However many codecs (such as AAC) can be modified to run at an arbitrary sample rate, and although the resulting encoding unit 710 will generate AAC bit streams that will not reproduce audio accurately unless the decoding unit 720 incorporates this invention, the perceived quality of the reproduced audio signal will be better for the bit stream that uses the non-standard rate than for a bit stream that uses any standard rate.
  • For example, as shown in FIG. 8, the downsampling process used in FIG. 7 may be more computationally efficient when the downsampling factor is the ratio of two small numbers. Consider the case where it is desired to downsample from the standard rate of 44100 sps to the standard rate of 32000 sps. Neither [0046] 441 nor 320 (the smallest integers which preserve the 44100:32000 ratio) qualify as a small integer in this context. If a ratio of 11:8 is used, which is equivalent to the ratio of 44000:32000, we can downsample to a comparable intermediate sample rate (32073 sps) in a computationally efficient way, without degrading significantly either frequency response or distortion levels from the optimal sample rate of 32000 sps.
  • Accordingly, as shown in FIG. 8, the process is the same as that in FIG. 7 but 32073 sps is used as the intermediate sampling frequency. 32073 sps is sufficiently close to an AAC standard rate that audio signals can be encoded using the parameters for a standard AAC rate. [0047]
  • When the intermediate sampling rate is close to a codec standard rate, the bit stream header, which generally carries information about the sampling rate at which the audio was encoded, can indicate the nearby standard rate. This is generally advantageous because it allows a conventional decoder (i.e. one which does not incorporate the current invention) to decode the bit stream and reproduce the audio, even though the audio reproduction strictly speaking is not accurate. In this case (32073 sps sampling rate rather than the 32000 sps indicated in the bit stream header), there will be a pitch shift in the audio reproduced by the conventional decoder. This may be acceptable for some applications but not for others. [0048]
  • However, the invention is still useful when the resulting sampling rate is not close to a standard rate, as long as it is possible to modify the [0049] audio encoding unit 710 so that it supports the non-standard rate. For example, with a downsample ratio of 9:8 one obtains a sampling rate of 39200 sps, which with a production AAC codec would support a frequency range as high as 15-17 KHz at a bit rate of 112 Kbps at an acceptable level of distortion. Since the downsample factor is again the ratio of two small numbers, the resampling process would again be computationally efficient.
  • It may be advantageous to indicate to the [0050] decoding unit 720 what resampling ratio has been used to encode the audio, since otherwise the codec system (FIGS. 7 & 8) must operate at a fixed resampling ratio. As a particular embodiment of the method and apparatus of this invention, the resampling ratio is incorporated into the bit stream within a reserved bit field of the standard header. As an alternative embodiment, the resampling ratio can be incorporated as side channel information. In a specific example, AAC permits “data packets” to be incorporated in the bit stream. These data packets are ignored by a standard AAC codec. The resampling ratio can be specified in a data packet, possibly along with other information.
  • While the invention above has been discussed from the point of view of supporting the maximum frequency range for a given bit rate and level of distortion, there are two alternative ways of looking at this problem. Rather than support maximum frequency at a given bit rate, a frequency range and a given distortion level at a minimum bit rate may be supported. Alternatively, a given frequency range at a given bit rate may be supported to achieve the lowest distortion levels. That is, there are three interrelated variables: bit rate, distortion level, and frequency support. One can fix any two variables and use the above embodiment to achieve the best possible results for the remaining variable. [0051]
  • FIG. 9 is a flowchart of the encoding process according to the invention. Process begins at [0052] step 1000 and proceeds to step 1010 where the sample rate fs0 and maximum frequency range f0 are determined as an optimal pair either algorithmically or by searching a table, for example. In step 1020, an input signal is received by the encoding unit 710 and is downsampled by downsampler 730 to fs0. The process proceeds to step 1030 where the signal is encoded by the audio encoder 740. The process then proceeds to step 1040 where the signal (along with a header, data packet, etc. that includes the downsampling information), is transmitted at a given bit rate from a modem across a communication channel. The encoding process then goes to step 1050 and ends.
  • FIG. 10 is a flowchart of the decoding process. Process begins at [0053] step 1100 and proceeds to step 1110 where the downsampled signal (along with a header, data packet, etc. that includes the downsampling information) is received by another PC's (for example) decoding unit 720. The process proceeds to step 1120 where the downsampled signal is decoded by the bit stream decoder 750 and then upsampled at step 1130 by the upsampler 760 at a ratio corresponding to the downsampling ratio included with the received downsampled signal, for example. The upsampled signal is then output in step 1140. The process then goes to step 1150 and ends.
  • While this invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications, and variations will be apparent to those skilled in the art. Accordingly, preferred embodiments of the invention is set forth herein are intended to be illustrative, not limiting. Various changes may be made without departing from the spirit and scope of the invention. [0054]

Claims (18)

What is claimed is:
1. The method of transmitting and receiving audio signals in a multimedia communications network, comprising:
downsampling an input audio signal from an original sampling rate to a predetermined sampling rate at a first communications device;
encoding the downsampled signal;
transmitting the encoded signal from the first communication device to a second communications device;
decoding the encoded signal at the second communications device;
upsampling the decoded signal to the original sampling rate;
audibly outputting the upsampled signal.
2. The method of
claim 1
, further comprising:
storing the encoded signal.
3. The method of
claim 1
, wherein the signal is downsampled to a standard sampling rate.
4. The method of
claim 1
, wherein the signal is downsampled to a non-standard sampling rate.
5. The method of
claim 1
, wherein the signal is upsampled to a standard sampling rate.
6. The method of
claim 1
, wherein the signal is upsampled to a non-standard sampling rate.
7. The method of
claim 1
, wherein the sampling rate and a maximum frequency range are determined algorithmically or according to a table.
8. The method of
claim 1
, wherein at least one of the given bit rate, a frequency range, and a desired distortion level, are predetermined.
9. The method of
claim 1
, further comprising:
creating a header for the encoded signal that includes a downsampling ratio;
transmitting the header with the encoded signal to the second communications device.
10. An apparatus for transmitting and receiving audio signals in a multimedia communications network, comprising:
a downsampler that downsamples an input audio signal from an original sampling rate to a predetermined sampling rate;
an encoder that encodes the downsampled signal;
a transmitter that transmits the signal at a given bit rate to another communication device;
a decoder that decodes a received downsampled signal;
a upsampler that upsamples the decoded signal to the original sampling rate;
an output device that outputs the upsampled signal.
11. The apparatus of
claim 10
, further comprising:
a memory for storing the encoded signal.
12. The apparatus of
claim 10
, wherein the signal is downsampled to a standard sampling rate.
13. The apparatus of
claim 10
, wherein the signal is downsampled to anon-standard sampling rate.
14. The apparatus of
claim 10
, wherein the signal is upsampled to a standard sampling rate.
15. The apparatus of
claim 10
, wherein the signal is upsampled to anon-standard sampling rate.
16. The apparatus of
claim 10
, wherein the sampling rate and a maximum frequency range are determined algorithmically or according to a table.
17. The apparatus of
claim 10
, wherein at least one of the given bit rate, a frequency range, and a desired distortion level are predetermined.
18. The apparatus of
claim 10
, wherein the encoder creates a header for the encoded signal that includes a downsampling ratio, and the transmitter transmits the header with the encoded signal to the another communications device.
US09/773,492 1998-12-30 2001-02-02 Method and apparatus for sample rate pre-and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding Expired - Lifetime US6384759B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US09/773,492 US6384759B2 (en) 1998-12-30 2001-02-02 Method and apparatus for sample rate pre-and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US11471998P 1998-12-30 1998-12-30
US09/265,880 US6208276B1 (en) 1998-12-30 1999-03-11 Method and apparatus for sample rate pre- and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding
US09/773,492 US6384759B2 (en) 1998-12-30 2001-02-02 Method and apparatus for sample rate pre-and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US09/265,880 Continuation US6208276B1 (en) 1998-12-30 1999-03-11 Method and apparatus for sample rate pre- and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding

Publications (2)

Publication Number Publication Date
US20010005173A1 true US20010005173A1 (en) 2001-06-28
US6384759B2 US6384759B2 (en) 2002-05-07

Family

ID=26812494

Family Applications (2)

Application Number Title Priority Date Filing Date
US09/265,880 Expired - Lifetime US6208276B1 (en) 1998-12-30 1999-03-11 Method and apparatus for sample rate pre- and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding
US09/773,492 Expired - Lifetime US6384759B2 (en) 1998-12-30 2001-02-02 Method and apparatus for sample rate pre-and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US09/265,880 Expired - Lifetime US6208276B1 (en) 1998-12-30 1999-03-11 Method and apparatus for sample rate pre- and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding

Country Status (1)

Country Link
US (2) US6208276B1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050129109A1 (en) * 2003-11-26 2005-06-16 Samsung Electronics Co., Ltd Method and apparatus for encoding/decoding MPEG-4 bsac audio bitstream having ancillary information
US20130226570A1 (en) * 2010-10-06 2013-08-29 Voiceage Corporation Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (usac)
CN112786062A (en) * 2021-01-15 2021-05-11 深圳市东微智能科技股份有限公司 Bandwidth adaptive network audio transmission method, device, medium and system
US11138984B2 (en) * 2016-12-05 2021-10-05 Sony Corporation Information processing apparatus and information processing method for generating and processing a file including speech waveform data and vibration waveform data
US20240029749A1 (en) * 2014-06-10 2024-01-25 Mqa Limited Digital encapsulation of audio signals

Families Citing this family (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100324746B1 (en) * 1999-04-24 2002-02-20 구자홍 Digital data player capable of voice hearing
US6195766B1 (en) * 1999-05-10 2001-02-27 Conexant Systems, Inc. System and method for providing soft audio and soft modem copy protection for hardware interfaces and software code
US6687663B1 (en) * 1999-06-25 2004-02-03 Lake Technology Limited Audio processing method and apparatus
US7076053B1 (en) * 2001-01-12 2006-07-11 3Com Corporation System for the processing of audio data used for music on hold and paging in a private branch exchange
CN1326415C (en) * 2001-06-26 2007-07-11 诺基亚公司 Method for conducting code conversion to audio-frequency signals code converter, network unit, wivefree communication network and communication system
US7751804B2 (en) * 2004-07-23 2010-07-06 Wideorbit, Inc. Dynamic creation, selection, and scheduling of radio frequency communications
US7312729B2 (en) * 2004-08-17 2007-12-25 Motorola, Inc. Universal sampling rate converter in electronic devices and methods
JP5461835B2 (en) * 2005-05-26 2014-04-02 エルジー エレクトロニクス インコーポレイティド Audio signal encoding / decoding method and encoding / decoding device
JP5227794B2 (en) * 2005-06-30 2013-07-03 エルジー エレクトロニクス インコーポレイティド Apparatus and method for encoding and decoding audio signals
EP1913577B1 (en) * 2005-06-30 2021-05-05 Lg Electronics Inc. Apparatus for encoding an audio signal and method thereof
AU2006266579B2 (en) * 2005-06-30 2009-10-22 Lg Electronics Inc. Method and apparatus for encoding and decoding an audio signal
KR101169280B1 (en) * 2005-08-30 2012-08-02 엘지전자 주식회사 Method and apparatus for decoding an audio signal
JP4859925B2 (en) * 2005-08-30 2012-01-25 エルジー エレクトロニクス インコーポレイティド Audio signal decoding method and apparatus
EP1938662B1 (en) * 2005-08-30 2016-09-28 LG Electronics Inc. Method, apparatus, computer-readable medium for decoding an audio signal
US7788107B2 (en) * 2005-08-30 2010-08-31 Lg Electronics Inc. Method for decoding an audio signal
CN101278331B (en) * 2005-10-03 2013-03-13 夏普株式会社 Display
US7672379B2 (en) * 2005-10-05 2010-03-02 Lg Electronics Inc. Audio signal processing, encoding, and decoding
KR100857112B1 (en) 2005-10-05 2008-09-05 엘지전자 주식회사 Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7751485B2 (en) * 2005-10-05 2010-07-06 Lg Electronics Inc. Signal processing using pilot based coding
US7646319B2 (en) * 2005-10-05 2010-01-12 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7696907B2 (en) * 2005-10-05 2010-04-13 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7743016B2 (en) * 2005-10-05 2010-06-22 Lg Electronics Inc. Method and apparatus for data processing and encoding and decoding method, and apparatus therefor
US8068569B2 (en) * 2005-10-05 2011-11-29 Lg Electronics, Inc. Method and apparatus for signal processing and encoding and decoding
US7716043B2 (en) * 2005-10-24 2010-05-11 Lg Electronics Inc. Removing time delays in signal paths
US8473298B2 (en) * 2005-11-01 2013-06-25 Apple Inc. Pre-resampling to achieve continuously variable analysis time/frequency resolution
US7610195B2 (en) * 2006-06-01 2009-10-27 Nokia Corporation Decoding of predictively coded data using buffer adaptation
US7907579B2 (en) * 2006-08-15 2011-03-15 Cisco Technology, Inc. WiFi geolocation from carrier-managed system geolocation of a dual mode device
JP2010503881A (en) * 2006-09-13 2010-02-04 テレフオンアクチーボラゲット エル エム エリクソン(パブル) Method and apparatus for voice / acoustic transmitter and receiver
US20080133249A1 (en) * 2006-11-30 2008-06-05 Hashiguchi Kohei Audio data transmitting device and audio data receiving device
US9942593B2 (en) * 2011-02-10 2018-04-10 Intel Corporation Producing decoded audio at graphics engine of host processing platform
TWI557727B (en) 2013-04-05 2016-11-11 杜比國際公司 Audio processing system, multimedia processing system, method for processing audio bit stream, and computer program product
KR102244612B1 (en) * 2014-04-21 2021-04-26 삼성전자주식회사 Appratus and method for transmitting and receiving voice data in wireless communication system
KR102306537B1 (en) * 2014-12-04 2021-09-29 삼성전자주식회사 Method and device for processing sound signal
CN115206352A (en) * 2022-07-01 2022-10-18 哲库科技(上海)有限公司 Audio processing method, device, chip, electronic device and storage medium

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9209052D0 (en) * 1992-04-27 1992-06-10 British Broadcasting Corp Video signal coding
GB2267192B (en) * 1992-05-21 1995-09-27 Sony Broadcast & Communication Sampling frequency conversion
CA2106440C (en) * 1992-11-30 1997-11-18 Jelena Kovacevic Method and apparatus for reducing correlated errors in subband coding systems with quantizers
US5748786A (en) * 1994-09-21 1998-05-05 Ricoh Company, Ltd. Apparatus for compression using reversible embedded wavelets
US5926791A (en) * 1995-10-26 1999-07-20 Sony Corporation Recursively splitting the low-frequency band with successively fewer filter taps in methods and apparatuses for sub-band encoding, decoding, and encoding and decoding
US5841473A (en) * 1996-07-26 1998-11-24 Software For Image Compression, N.V. Image sequence compression and decompression
US6182031B1 (en) * 1998-09-15 2001-01-30 Intel Corp. Scalable audio coding system

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050129109A1 (en) * 2003-11-26 2005-06-16 Samsung Electronics Co., Ltd Method and apparatus for encoding/decoding MPEG-4 bsac audio bitstream having ancillary information
US7974840B2 (en) * 2003-11-26 2011-07-05 Samsung Electronics Co., Ltd. Method and apparatus for encoding/decoding MPEG-4 BSAC audio bitstream having ancillary information
US20130226570A1 (en) * 2010-10-06 2013-08-29 Voiceage Corporation Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (usac)
US9552822B2 (en) * 2010-10-06 2017-01-24 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (USAC)
US20240029749A1 (en) * 2014-06-10 2024-01-25 Mqa Limited Digital encapsulation of audio signals
US12183356B2 (en) * 2014-06-10 2024-12-31 Lenbrook Industries Limited Digital encapsulation of audio signals
US11138984B2 (en) * 2016-12-05 2021-10-05 Sony Corporation Information processing apparatus and information processing method for generating and processing a file including speech waveform data and vibration waveform data
CN112786062A (en) * 2021-01-15 2021-05-11 深圳市东微智能科技股份有限公司 Bandwidth adaptive network audio transmission method, device, medium and system

Also Published As

Publication number Publication date
US6208276B1 (en) 2001-03-27
US6384759B2 (en) 2002-05-07

Similar Documents

Publication Publication Date Title
US6208276B1 (en) Method and apparatus for sample rate pre- and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding
US5570363A (en) Transform based scalable audio compression algorithms and low cost audio multi-point conferencing systems
EP2402939B1 (en) Full-band scalable audio codec
CN101165778B (en) Dual-transform coding of audio signals method and device
CN101165777B (en) Fast lattice vector quantization
US8428959B2 (en) Audio packet loss concealment by transform interpolation
EP2118891B1 (en) Embedded silence and background noise compression
KR100711989B1 (en) Efficiently Improved Scalable Audio Coding
CN102741831B (en) Scalable audio frequency in multidrop environment
US8340959B2 (en) Method and apparatus for transmitting wideband speech signals
JP2004518346A (en) Broadband signal transmission system
JP2002517023A (en) System and method for masking quantization noise in audio signals
Kovesi et al. A scalable speech and audio coding scheme with continuous bitrate flexibility
Ding Wideband audio over narrowband low-resolution media
JP2011512564A (en) Background noise information decoding method and background noise information decoding means
JP4530567B2 (en) Digital audio decoding device

Legal Events

Date Code Title Description
STCF Information on status: patent grant

Free format text: PATENTED CASE

FPAY Fee payment

Year of fee payment: 4

FPAY Fee payment

Year of fee payment: 8

FPAY Fee payment

Year of fee payment: 12

AS Assignment

Owner name: AT&T INTELLECTUAL PROPERTY II, L.P., GEORGIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:AT&T PROPERTIES, LLC;REEL/FRAME:040588/0629

Effective date: 20161205

Owner name: AT&T PROPERTIES, LLC, NEVADA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:AT&T CORP.;REEL/FRAME:040588/0469

Effective date: 20161205

AS Assignment

Owner name: AT&T CORP., NEW YORK

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SNYDER, JAMES H.;REEL/FRAME:040706/0977

Effective date: 19990311

AS Assignment

Owner name: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:AT&T INTELLECTUAL PROPERTY II, L.P.;REEL/FRAME:041149/0133

Effective date: 20161212

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载