US20120102538A1 - Methods and apparatus for decoding multiple independent audio streams using a single audio decoder - Google Patents
Methods and apparatus for decoding multiple independent audio streams using a single audio decoder Download PDFInfo
- Publication number
- US20120102538A1 US20120102538A1 US12/910,432 US91043210A US2012102538A1 US 20120102538 A1 US20120102538 A1 US 20120102538A1 US 91043210 A US91043210 A US 91043210A US 2012102538 A1 US2012102538 A1 US 2012102538A1
- Authority
- US
- United States
- Prior art keywords
- audio
- frames
- stream
- encoded
- decoded
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 32
- 238000009877 rendering Methods 0.000 claims description 9
- 230000003362 replicative effect Effects 0.000 claims description 9
- 230000005236 sound signal Effects 0.000 claims description 7
- 238000007781 pre-processing Methods 0.000 claims description 4
- 238000001914 filtration Methods 0.000 claims 1
- 239000000284 extract Substances 0.000 abstract description 2
- 238000010586 diagram Methods 0.000 description 10
- 230000008569 process Effects 0.000 description 7
- 101000969688 Homo sapiens Macrophage-expressed gene 1 protein Proteins 0.000 description 5
- 102100021285 Macrophage-expressed gene 1 protein Human genes 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 3
- 239000000872 buffer Substances 0.000 description 2
- 230000009977 dual effect Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000003139 buffering effect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/434—Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
- H04N21/4341—Demultiplexing of audio and video streams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/434—Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
- H04N21/4344—Remultiplexing of multiplex streams, e.g. by modifying time stamps or remapping the packet identifiers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/81—Monomedia components thereof
- H04N21/8106—Monomedia components thereof involving special audio data, e.g. different tracks for different languages
Definitions
- An embodiment of the present invention relates to an audio signal processing system, and more specifically to a decoder for decoding multiple independent audio streams.
- Audio signal processing has become increasingly important in today's multimedia environment. Audio data processing is an important requirement for many multimedia applications, such as gaming and in telecommunications.
- One of the key elements in many digital audio information processing systems is the audio decoder. Generally, the audio decoder receives data in a compressed form and converts that data into a decompressed digital form.
- FIG. 1 illustrates an approach for decoding audio signals in an audio-video playback system.
- multiplexed audio-video streams are fed into the system for processing.
- the multiplexed streams are de-multiplexed and then fed into the audio and video subsystems.
- the multiplexed streams are of various formats such as MPEG2, PES, Windows Media files, AVI etc.
- Examples of audio-video playback system are Set Top Box, DVD player etc.
- the audio playback includes three stages: audio parsing, audio decoding, and audio rendering.
- the encoded data is extracted from its container (PES, File containers etc.) and constructed into a stream of audio frames. Each of these frames is an independent decodable unit.
- some metadata like PTS, etc is attached to these frames.
- audio decoding stage audio frames are fed into a hardware/software decoder. This decoder decodes the compressed data and outputs PCM samples. Metadata could be used by the decoding stage or forwarded to the next stage.
- some additional parameters such as sampling frequency, number of channels and bits per sample are attached to the decoded data by the decoding stage.
- PCM data is formatted (according to the requirements of the system's output interfaces) and output to the external interfaces exposed by the system.
- U.S. Pat. No. 6,081,783 which is incorporated herein by reference, discloses an audio decoder operating on a received compressed audio data stream.
- the audio data stream has been compressed using an algorithm utilizing transform encoding and a bit allocation routine.
- the audio decoder includes a first digital signal processor and a second digital signal processor.
- the first digital signal processor executes a first set of operations on the received compressed audio data stream including parsing the compressed audio data stream, recovering data fields within the compressed audio data stream, calculating bit allocation, and passing frequency domain coefficients to shared memory.
- the second digital signal processor executes a second set of operations on the data provided by the first digital signal processor including performing inverse transform operations on the data passed from the first digital signal processor.
- U.S. Pat. No. 6,233,562 which is incorporated herein by reference, discloses an audio decoding device, which decodes coded audio information with multiple channels.
- the audio decoding device includes a coded information memory section, an information transmission section, and an audio decoding section.
- the coded information memory section stores the coded audio information.
- the information transmission section reads the coded audio information stored at an arbitrary position in the coded information memory section.
- the audio decoding section decodes the coded audio information read by the information transmission section and outputs the resultant audio information in accordance with a time parameter.
- One embodiment of the present disclosure describes a system for decoding multiple independent encoded audio streams comprising a parser coupled to each input encoded audio stream for extracting individual audio frames, a preprocessor coupled to the outputs of all parsers for combining the outputs of said parsers into a single audio frame stream, an audio decoder coupled to the output of said preprocessor for decoding the single audio frame stream, and a renderer coupled to the output of said audio decoder for generating the individual output decoded audio streams from the single decoded stream.
- This disclosure also describes an embodiment of a set-top-box comprising audio and video subsystems, said audio subsystem comprising a parser coupled to each input encoded audio stream for extracting individual audio frames, a preprocessor coupled to the outputs of all parsers for combining the outputs of said parsers into a single audio frame stream, an audio decoder coupled to the output of said preprocessor for decoding the single audio frame stream, and a renderer coupled to the output of said audio decoder for generating the individual output decoded audio streams from the single decoded stream.
- This disclosure further describes an embodiment of a DVD player comprising audio and video subsystems, said audio subsystem comprising a parser coupled to each input encoded audio stream for extracting individual audio frames, a preprocessor coupled to the outputs of all parsers for combining the outputs of said parsers into a single audio frame stream, an audio decoder coupled to the output of said preprocessor for decoding the single audio frame stream, and a renderer coupled to the output of said audio decoder for generating the individual output decoded audio streams from the single decoded stream.
- a parser coupled to each input encoded audio stream for extracting individual audio frames
- a preprocessor coupled to the outputs of all parsers for combining the outputs of said parsers into a single audio frame stream
- an audio decoder coupled to the output of said preprocessor for decoding the single audio frame stream
- a renderer coupled to the output of said audio decoder for generating the individual output decoded audio streams from the single decoded stream.
- This disclosure also describes an embodiment of an audio processor capable of decoding multiple independent audio streams, said audio processor comprising a parser coupled to each input encoded audio stream for extracting individual audio frames, a preprocessor coupled to the outputs of all parsers for combining the outputs of said parsers into a single audio frame stream, an audio decoder coupled to the output of said preprocessor for decoding the single audio frame stream, and a renderer coupled to the output of said audio decoder for generating the individual output decoded audio streams from the single decoded stream.
- This disclosure further describes an embodiment of a method for decoding multiple independent encoded audio streams comprising parsing each independent encoded audio stream for extracting individual audio frames, preprocessing the extracted individual audio frames for combining into a single encoded audio frame stream, allocating a shared audio decoder resource to decode said encoded audio frame stream when said decoder resource is available, producing a single decoded audio stream, and rendering the individual output decoded audio streams from said decoded audio stream.
- FIG. 1 illustrates a conventional approach for decoding audio signals in an audio-video playback system.
- FIG. 2 illustrates a block diagram of a system for decoding multiple independent encoded audio streams according to an embodiment of the present invention.
- FIG. 3 illustrates the block diagram of the preprocessor of FIG. 2 according to an embodiment of the present invention.
- FIG. 4 illustrates the block diagram of the renderer of FIG. 2 according to an embodiment of the present invention.
- FIG. 5 illustrates a mechanism for decoding multiple independent encoded audio streams according to an embodiment of the present invention.
- FIG. 6 illustrates a mechanism for decoding multiple independent encoded audio streams according to another embodiment of the present invention.
- FIG. 7 illustrates a mechanism for decoding multiple independent encoded audio streams according to yet another embodiment of the present invention.
- FIG. 8 illustrates a block diagram that discloses an application for a system, which decodes multiple independent encoded audio streams according to an embodiment of the present invention.
- FIG. 9 illustrates a flow diagram of a method for decoding multiple independent encoded audio streams according to an embodiment of the present invention.
- FIG. 2 illustrates the block diagram of a system 200 for decoding multiple independent encoded audio streams according to an embodiment of the present invention.
- the system 200 includes one or more parsers 201 , a preprocessor 202 , an audio decoder 203 , and a renderer 204 .
- Each parser 201 extracts individual encoded audio frames from the audio stream.
- Preprocessor 202 which is coupled to the outputs of all the parsers receives the extracted audio frames and combines them into a single pseudo interleaved audio frame stream.
- Audio decoder 203 decodes the received interleaved audio frame stream and provides a single pseudo-interleaved decoded audio stream.
- Renderer 204 receives the single pseudo-interleaved decoded audio stream, discards redundant decoded frames and segregates the remaining decoded audio frames into multiple decoded audio frame streams.
- the preprocessor 202 is software.
- preprocessor and “audio driver” are used interchangeably in the description below.
- FIG. 3 illustrates the block diagram of the preprocessor 202 according to an embodiment of the present invention.
- Preprocessor 202 includes a replicator 301 and an interleaver 302 .
- Replicator 301 receives extracted individual audio frames from the output of the parser and repeats selected audio frames depending on a windowing filter. Replicator 301 may repeat all frames except the first and the last frame of the input audio stream. The number of replicas is determined by the number of samples required for the window filter to converge to the point where proper filtered samples are available.
- Interleaver 302 receives the repeated audio frames and combines them into a single pseudo-interleaved audio frame stream. After decoding, the pseudo-interleaved audio frame stream is applied to the window filter (which is an intrinsic part of the decoder 203 ) for providing filtered audio frames.
- Decoding of a stream by the audio decoder 203 through the audio driver is initiated when two conditions are met: firstly, the audio driver has collected number of frames for decoding and secondly, the audio decoder is available for decoding.
- the value of “K” is determined by the number of samples taken by a particular audio type for its windowing filter to converge to the value from where it starts giving proper output and number of PCM samples generated per frame.
- n 0 (where n is an integer)
- the audio driver waits. As the above said conditions are met, the following steps are taken to decode until the I th frame where I is last of the collected “K” frames
- FIG. 4 illustrates a block diagram of the renderer 204 according to an embodiment of the present invention.
- Renderer 204 includes a demultiplexer 401 and a frame selector 402 .
- Demultiplexer 401 segregates the decoded audio frames into multiple decoded audio frame streams depending on the input.
- Frame selector 402 discards redundant decoded frames from the decoded audio frame stream.
- the audio driver has already collected K number of frames on any stream, out of which only X number of frames are useful. Hence, in the delivery process, Y numbers of frames are dropped to get the right frames.
- the audio decoder produces all the decoded frames, i.e. 1, 2 . . . K. Out of these the first Y frames are dropped and X frames are sent for further processing.
- FIG. 5 illustrates a mechanism for decoding multiple independent encoded audio streams according to an embodiment of the present invention.
- Each independent encoded audio stream is parsed for extracting individual audio frames from their containers (PES, File containers etc). Then the metadata (PTS, etc.) is extracted from the streams. Selected extracted audio frames are replicated to form audio streams 503 and 504 .
- Audio decoder 505 is time multiplexed between audio stream 503 and audio stream 504 (which thereby function as a single pseudo-interleaved audio stream fed to the audio decoder).
- Two decoded audio streams 508 , 509 are generated from the demultiplexed decoded audio streams 506 and 507 respectively by dropping a selected number of frames at the output of audio decoder 505 .
- FIG. 6 illustrates a method for decoding multiple independent encoded audio streams according to another embodiment of the present invention.
- the embodiment is related to decoding MPEG audio signals such as MPEG1 layer 2/3 or MPEG2.
- MPEG audio signals such as MPEG1 layer 2/3 or MPEG2.
- the two streams are interleaved for creating a pseudo-interleaved stream 603 by replicating selected audio frames.
- Two consecutive frames of each independent stream 601 , 602 are copied alternatively to create the pseudo-interleaved stream 603 .
- Pseudo-interleaved stream 603 is provided to an MPEG decoder 604 .
- MPEG decoder 604 decodes the pseudo-interleaved stream 603 and producing a pseudo-interleaved decoded audio stream 605 . Then selected numbers of frames are dropped from the pseudo-interleaved decoded audio stream 605 for forming a first reconstructed audio stream 606 and a second reconstructed audio stream 607 . The selected numbers of frames refer to redundant frames, which are to be dropped.
- the MPEG1 Layer 2 frame size is 1152 samples.
- the audio decoder 604 needs 480 samples of previous frames to produce the output without any distortion.
- Processing requirements for any audio type which may be decoded with the method described above may be calculated as follows:
- FIG. 7 illustrates a method for decoding multiple independent encoded audio streams according to yet another embodiment of the present invention.
- the embodiment demonstrates the applicability of decoding process for MPEG1 Layer 1.
- the process of decoding is almost similar to the process explained in FIG. 6 but in this embodiment, four consecutive frames of each independent stream 701 , 702 are copied alternatively to create the pseudo interleaved stream 703 instead of two consecutive frames in the previous embodiment ( FIG. 6 ).
- the MPEG1 Layer1 frame size is 384 samples.
- the audio decoder needs 480 samples of previous frames to produce the output without any distortion.
- processing power required for decoding a single stream using this algorithm will depend on the number of frames generated from a decoding sequence. Therefore if we increase the number of frames generated per decoding sequence then processing power requirement will come down. Therefore
- Processing requirements for a stream may be reduced at the cost of memory (buffering) and system delay, in delivering decoded audio data.
- FIG. 8 illustrates a block diagram that discloses an application for a system 200 , decoding multiple independent encoded audio streams, according to an embodiment of the present invention used in a set-top-box 800 .
- the set-top-box 800 includes audio and video subsystems.
- the audio subsystem 200 includes a parser, a preprocessor, an audio decoder, and a renderer. According to another embodiment of the present invention, the system 200 is used in a DVD player.
- FIG. 9 illustrates a flow diagram of a method for decoding multiple independent encoded audio streams according to an embodiment of the present invention.
- each independent encoded audio stream is parsed for extracting individual audio frames.
- the extracted individual audio frames are preprocessed for combining into a single encoded audio frame stream.
- a shared audio decoder resource is allocated to decode the encoded audio frame stream when the decoder resource is available.
- a single decoded audio stream is produced.
- the individual output decoded audio streams are rendered from the decoded audio stream.
- An embodiment of the present invention is related to a system for decoding multiple independent encoded audio streams and can be used in various applications, such as dual decoding set top boxes, dual decoding DVD players, set top boxes with audio description support, and multi site video conferencing systems.
- an embodiment of the present invention offers several advantages. Firstly, an embodiment utilizes a single decoder for decoding multiple independent audio streams. Secondly, an embodiment is applicable to both hardware and software decoders. Thirdly, an embodiment supports multiplexed stream of various formats such as MPEG2, PES, Windows Media files, AVI etc.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
- An embodiment of the present invention relates to an audio signal processing system, and more specifically to a decoder for decoding multiple independent audio streams.
- Audio signal processing has become increasingly important in today's multimedia environment. Audio data processing is an important requirement for many multimedia applications, such as gaming and in telecommunications. One of the key elements in many digital audio information processing systems is the audio decoder. Generally, the audio decoder receives data in a compressed form and converts that data into a decompressed digital form.
-
FIG. 1 illustrates an approach for decoding audio signals in an audio-video playback system. In an audio-video playback system, multiplexed audio-video streams are fed into the system for processing. The multiplexed streams are de-multiplexed and then fed into the audio and video subsystems. The multiplexed streams are of various formats such as MPEG2, PES, Windows Media files, AVI etc. Examples of audio-video playback system are Set Top Box, DVD player etc. - In an audio subsystem, the audio playback includes three stages: audio parsing, audio decoding, and audio rendering. In the audio parsing stage, the encoded data is extracted from its container (PES, File containers etc.) and constructed into a stream of audio frames. Each of these frames is an independent decodable unit. In addition to the compressed data, some metadata like PTS, etc is attached to these frames. In the audio decoding stage, audio frames are fed into a hardware/software decoder. This decoder decodes the compressed data and outputs PCM samples. Metadata could be used by the decoding stage or forwarded to the next stage. In addition to the original metadata some additional parameters such as sampling frequency, number of channels and bits per sample are attached to the decoded data by the decoding stage. In the audio rendering phase, PCM data is formatted (according to the requirements of the system's output interfaces) and output to the external interfaces exposed by the system.
- U.S. Pat. No. 6,081,783, which is incorporated herein by reference, discloses an audio decoder operating on a received compressed audio data stream. The audio data stream has been compressed using an algorithm utilizing transform encoding and a bit allocation routine. The audio decoder includes a first digital signal processor and a second digital signal processor. The first digital signal processor executes a first set of operations on the received compressed audio data stream including parsing the compressed audio data stream, recovering data fields within the compressed audio data stream, calculating bit allocation, and passing frequency domain coefficients to shared memory. The second digital signal processor executes a second set of operations on the data provided by the first digital signal processor including performing inverse transform operations on the data passed from the first digital signal processor.
- U.S. Pat. No. 6,233,562, which is incorporated herein by reference, discloses an audio decoding device, which decodes coded audio information with multiple channels. The audio decoding device includes a coded information memory section, an information transmission section, and an audio decoding section. The coded information memory section stores the coded audio information. The information transmission section reads the coded audio information stored at an arbitrary position in the coded information memory section. The audio decoding section decodes the coded audio information read by the information transmission section and outputs the resultant audio information in accordance with a time parameter.
- The above-mentioned patents are known for decoding two independent audio streams. The limitation with said patents is that they need two independent audio decoders for decoding two independent audio streams simultaneously and this result in significant overhead and resource requirements.
- Therefore, there is a need of a system for decoding multiple independent audio streams using a single decoder.
- One embodiment of the present disclosure describes a system for decoding multiple independent encoded audio streams comprising a parser coupled to each input encoded audio stream for extracting individual audio frames, a preprocessor coupled to the outputs of all parsers for combining the outputs of said parsers into a single audio frame stream, an audio decoder coupled to the output of said preprocessor for decoding the single audio frame stream, and a renderer coupled to the output of said audio decoder for generating the individual output decoded audio streams from the single decoded stream.
- This disclosure also describes an embodiment of a set-top-box comprising audio and video subsystems, said audio subsystem comprising a parser coupled to each input encoded audio stream for extracting individual audio frames, a preprocessor coupled to the outputs of all parsers for combining the outputs of said parsers into a single audio frame stream, an audio decoder coupled to the output of said preprocessor for decoding the single audio frame stream, and a renderer coupled to the output of said audio decoder for generating the individual output decoded audio streams from the single decoded stream.
- This disclosure further describes an embodiment of a DVD player comprising audio and video subsystems, said audio subsystem comprising a parser coupled to each input encoded audio stream for extracting individual audio frames, a preprocessor coupled to the outputs of all parsers for combining the outputs of said parsers into a single audio frame stream, an audio decoder coupled to the output of said preprocessor for decoding the single audio frame stream, and a renderer coupled to the output of said audio decoder for generating the individual output decoded audio streams from the single decoded stream.
- This disclosure also describes an embodiment of an audio processor capable of decoding multiple independent audio streams, said audio processor comprising a parser coupled to each input encoded audio stream for extracting individual audio frames, a preprocessor coupled to the outputs of all parsers for combining the outputs of said parsers into a single audio frame stream, an audio decoder coupled to the output of said preprocessor for decoding the single audio frame stream, and a renderer coupled to the output of said audio decoder for generating the individual output decoded audio streams from the single decoded stream.
- This disclosure further describes an embodiment of a method for decoding multiple independent encoded audio streams comprising parsing each independent encoded audio stream for extracting individual audio frames, preprocessing the extracted individual audio frames for combining into a single encoded audio frame stream, allocating a shared audio decoder resource to decode said encoded audio frame stream when said decoder resource is available, producing a single decoded audio stream, and rendering the individual output decoded audio streams from said decoded audio stream.
- One or more embodiments of the present invention will be explained in the following description, taken in conjunction with the accompanying drawings, wherein:
-
FIG. 1 illustrates a conventional approach for decoding audio signals in an audio-video playback system. -
FIG. 2 illustrates a block diagram of a system for decoding multiple independent encoded audio streams according to an embodiment of the present invention. -
FIG. 3 illustrates the block diagram of the preprocessor ofFIG. 2 according to an embodiment of the present invention. -
FIG. 4 illustrates the block diagram of the renderer ofFIG. 2 according to an embodiment of the present invention. -
FIG. 5 illustrates a mechanism for decoding multiple independent encoded audio streams according to an embodiment of the present invention. -
FIG. 6 illustrates a mechanism for decoding multiple independent encoded audio streams according to another embodiment of the present invention. -
FIG. 7 illustrates a mechanism for decoding multiple independent encoded audio streams according to yet another embodiment of the present invention. -
FIG. 8 illustrates a block diagram that discloses an application for a system, which decodes multiple independent encoded audio streams according to an embodiment of the present invention. -
FIG. 9 illustrates a flow diagram of a method for decoding multiple independent encoded audio streams according to an embodiment of the present invention. - One or more embodiments of the present invention will be described in detail with reference to the accompanying drawings. However, the present invention is not limited to these embodiments. The present invention can be modified in various forms. In the accompanying drawings, like reference numerals are used to indicate like components.
-
FIG. 2 illustrates the block diagram of asystem 200 for decoding multiple independent encoded audio streams according to an embodiment of the present invention. Thesystem 200 includes one ormore parsers 201, apreprocessor 202, anaudio decoder 203, and arenderer 204. Eachparser 201 extracts individual encoded audio frames from the audio stream.Preprocessor 202, which is coupled to the outputs of all the parsers receives the extracted audio frames and combines them into a single pseudo interleaved audio frame stream.Audio decoder 203 decodes the received interleaved audio frame stream and provides a single pseudo-interleaved decoded audio stream. Renderer 204 receives the single pseudo-interleaved decoded audio stream, discards redundant decoded frames and segregates the remaining decoded audio frames into multiple decoded audio frame streams. In one embodiment of the present invention, thepreprocessor 202 is software. The terms “preprocessor” and “audio driver” are used interchangeably in the description below. -
FIG. 3 illustrates the block diagram of thepreprocessor 202 according to an embodiment of the present invention.Preprocessor 202 includes areplicator 301 and aninterleaver 302.Replicator 301 receives extracted individual audio frames from the output of the parser and repeats selected audio frames depending on a windowing filter.Replicator 301 may repeat all frames except the first and the last frame of the input audio stream. The number of replicas is determined by the number of samples required for the window filter to converge to the point where proper filtered samples are available.Interleaver 302 receives the repeated audio frames and combines them into a single pseudo-interleaved audio frame stream. After decoding, the pseudo-interleaved audio frame stream is applied to the window filter (which is an intrinsic part of the decoder 203) for providing filtered audio frames. - Decoding of a stream by the
audio decoder 203 through the audio driver is initiated when two conditions are met: firstly, the audio driver has collected number of frames for decoding and secondly, the audio decoder is available for decoding. - The value of “K” is determined by the number of samples taken by a particular audio type for its windowing filter to converge to the value from where it starts giving proper output and number of PCM samples generated per frame.
- So if,
-
- “K”=Number of continuous compressed frames to be collected
- “S”=Number of Samples per frame
- “N”=Number of Samples taken by windowing filter to converge
- Then K will be given by the equation
-
K=┌N/S┐+n Eq. 1 - n>0 (where n is an integer)
- Until both of the above conditions are met (in the same order), the audio driver waits. As the above said conditions are met, the following steps are taken to decode until the Ith frame where I is last of the collected “K” frames
-
- 1. All the collected frames i.e. (I−K+1), (I−K+2), . . . , I are fed to the audio decoder in the same order as they were received.
- 2. After the decoding process is completed, the audio decoder is released so that it can be used by the other instance of audio driver running in the system.
- 3. The selected frames are I, I-1, . . . I-
n+ 1 - 4. All the metadata received with the selected frames are attached to the current output buffers.
- 5. Output buffers along with their associated metadata can be delivered to the next stage for further processing or playback.
-
FIG. 4 illustrates a block diagram of therenderer 204 according to an embodiment of the present invention.Renderer 204 includes ademultiplexer 401 and aframe selector 402.Demultiplexer 401 segregates the decoded audio frames into multiple decoded audio frame streams depending on the input.Frame selector 402 discards redundant decoded frames from the decoded audio frame stream. - The audio driver has already collected K number of frames on any stream, out of which only X number of frames are useful. Hence, in the delivery process, Y numbers of frames are dropped to get the right frames.
-
- “K”=Number of Frames collected at an input to the decoder.
- “Y”=Number of Frames to be dropped at an output of the decoder.
- “S”=Number of Samples per frame
- “N”=Number of Samples taken by windowing filter to converge
- “X”=Number of frames to be delivered at the rendering stage.
-
Y=┌N/S┐ Eq. 2 - Now the Number of frames sent for post-rendering processing will be given by
-
X=K−Y Eq. 3 - The audio decoder produces all the decoded frames, i.e. 1, 2 . . . K. Out of these the first Y frames are dropped and X frames are sent for further processing.
-
FIG. 5 illustrates a mechanism for decoding multiple independent encoded audio streams according to an embodiment of the present invention. Consider two independent compressed encoded audio streams 501, 502. Each independent encoded audio stream is parsed for extracting individual audio frames from their containers (PES, File containers etc). Then the metadata (PTS, etc.) is extracted from the streams. Selected extracted audio frames are replicated to formaudio streams Audio decoder 505 is time multiplexed betweenaudio stream 503 and audio stream 504 (which thereby function as a single pseudo-interleaved audio stream fed to the audio decoder). Two decodedaudio streams audio streams audio decoder 505. -
FIG. 6 illustrates a method for decoding multiple independent encoded audio streams according to another embodiment of the present invention. The embodiment is related to decoding MPEG audio signals such asMPEG1 layer 2/3 or MPEG2. Consider a first parsed audioelementary stream 601 and a second parsed audioelementary stream 602. The two streams are interleaved for creating apseudo-interleaved stream 603 by replicating selected audio frames. Two consecutive frames of eachindependent stream pseudo-interleaved stream 603.Pseudo-interleaved stream 603 is provided to anMPEG decoder 604.MPEG decoder 604 decodes thepseudo-interleaved stream 603 and producing a pseudo-interleaved decodedaudio stream 605. Then selected numbers of frames are dropped from the pseudo-interleaved decodedaudio stream 605 for forming a firstreconstructed audio stream 606 and a secondreconstructed audio stream 607. The selected numbers of frames refer to redundant frames, which are to be dropped. TheMPEG1 Layer 2 frame size is 1152 samples. Theaudio decoder 604 needs 480 samples of previous frames to produce the output without any distortion. -
- S=Number of Samples per frame=1152
- N=Number of Samples taken by windowing filter to converge=480
- n=1
- Therefore as per
equation 1 value of K (number of frames) will be -
K=┌480/1152┐+1=2 -
- Number of frames to be dropped: Y=┌480/1152┐=1
- Number of frames to be sent for post-rendering processing:
-
- Processing requirements for any audio type which may be decoded with the method described above may be calculated as follows:
- P=MIPS required to decode a single stream of a particular audio type
- K=Number of continuous compressed frames to be collected as derived from
equation 1 - P′=Total Processing requirement (MIPS)
-
P′=P*K Eq. 4 - If the processing requirement to decode a
single MPEG 1layer 2/3 or MPEG2 stream is P MIPS, then as per theequation 4 stream, the worst case processing requirement P′ will become -
P′=2*P MIPS -
FIG. 7 illustrates a method for decoding multiple independent encoded audio streams according to yet another embodiment of the present invention. The embodiment demonstrates the applicability of decoding process forMPEG1 Layer 1. The process of decoding is almost similar to the process explained inFIG. 6 but in this embodiment, four consecutive frames of eachindependent stream stream 703 instead of two consecutive frames in the previous embodiment (FIG. 6 ). - The MPEG1 Layer1 frame size is 384 samples. The audio decoder needs 480 samples of previous frames to produce the output without any distortion.
-
- S=Number of Samples per frame=384
- N=Number of Samples taken by windowing filter to converge=480
- n=1
- Therefore as per
equation 1 value of K will be -
- If processing requirement to decode a
single MPEG1 layer 1 stream is P MIPS then as perequation 2 stream worst case processing requirement P′ will become -
P′=3*P MIPS - As calculated in
equation 2, processing power required for decoding a single stream using this algorithm will depend on the number of frames generated from a decoding sequence. Therefore if we increase the number of frames generated per decoding sequence then processing power requirement will come down. Therefore -
P′ opt=(P*(K+M))/(M+1) Eq. 5 - where
-
- P=MIPS required to decode a single stream of a particular audio type
- K=Minimum number of continuous compressed frames to be collected
- M=Number of valid additional decoded frames generated per decoding sequence
- Processing requirements for a stream may be reduced at the cost of memory (buffering) and system delay, in delivering decoded audio data.
- Therefore as per
equation 3, if we consider M=1 and K=3, -
- Thus,
FIG. 7 describes the method of reducing the processing requirement forMPEG 1Layer 1 from 3*P to 2*P, by using M=1 anddecoding 4 frames at a time instead of minimum 3 frames and getting 2 valid decoded frames. -
FIG. 8 illustrates a block diagram that discloses an application for asystem 200, decoding multiple independent encoded audio streams, according to an embodiment of the present invention used in a set-top-box 800. The set-top-box 800 includes audio and video subsystems. Theaudio subsystem 200 includes a parser, a preprocessor, an audio decoder, and a renderer. According to another embodiment of the present invention, thesystem 200 is used in a DVD player. -
FIG. 9 illustrates a flow diagram of a method for decoding multiple independent encoded audio streams according to an embodiment of the present invention. Atstep 901, each independent encoded audio stream is parsed for extracting individual audio frames. Atstep 902, the extracted individual audio frames are preprocessed for combining into a single encoded audio frame stream. Atstep 903, a shared audio decoder resource is allocated to decode the encoded audio frame stream when the decoder resource is available. Atstep 904, a single decoded audio stream is produced. Atstep 905, the individual output decoded audio streams are rendered from the decoded audio stream. - An embodiment of the present invention is related to a system for decoding multiple independent encoded audio streams and can be used in various applications, such as dual decoding set top boxes, dual decoding DVD players, set top boxes with audio description support, and multi site video conferencing systems.
- An embodiment of the present invention offers several advantages. Firstly, an embodiment utilizes a single decoder for decoding multiple independent audio streams. Secondly, an embodiment is applicable to both hardware and software decoders. Thirdly, an embodiment supports multiplexed stream of various formats such as MPEG2, PES, Windows Media files, AVI etc.
- Although the disclosure of a system has been described in connection with an embodiment of the present invention illustrated in the accompanying drawings, it is not limited thereto. It will be apparent to those skilled in the art that various substitutions, modifications and changes may be made thereto without departing from the scope and spirit of the disclosure.
- From the foregoing it will be appreciated that, although specific embodiments have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the disclosure. Furthermore, where an alternative is disclosed for a particular embodiment, this alternative may also apply to other embodiments even if not specifically stated.
Claims (46)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/910,432 US8613038B2 (en) | 2010-10-22 | 2010-10-22 | Methods and apparatus for decoding multiple independent audio streams using a single audio decoder |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/910,432 US8613038B2 (en) | 2010-10-22 | 2010-10-22 | Methods and apparatus for decoding multiple independent audio streams using a single audio decoder |
Publications (2)
Publication Number | Publication Date |
---|---|
US20120102538A1 true US20120102538A1 (en) | 2012-04-26 |
US8613038B2 US8613038B2 (en) | 2013-12-17 |
Family
ID=45974122
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/910,432 Expired - Fee Related US8613038B2 (en) | 2010-10-22 | 2010-10-22 | Methods and apparatus for decoding multiple independent audio streams using a single audio decoder |
Country Status (1)
Country | Link |
---|---|
US (1) | US8613038B2 (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014088752A1 (en) * | 2012-12-07 | 2014-06-12 | Lockheed Martin Corporation | Harmonic decimation of real time data for unique word (or sync cell) acquisition in a limited processing environment |
US20140284003A1 (en) * | 2013-03-20 | 2014-09-25 | Daekyeong Triple Co., Ltd. | Roman shade curtain and roman shade using the same |
US11227612B2 (en) * | 2016-10-31 | 2022-01-18 | Tencent Technology (Shenzhen) Company Limited | Audio frame loss and recovery with redundant frames |
US20220262379A1 (en) * | 2017-01-10 | 2022-08-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio decoder, audio encoder, method for providing a decoded audio signal, method for providing an encoded audio signal, audio stream, audio stream provider and computer program using a stream identifier |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9942593B2 (en) * | 2011-02-10 | 2018-04-10 | Intel Corporation | Producing decoded audio at graphics engine of host processing platform |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030018796A1 (en) * | 2001-05-11 | 2003-01-23 | Jim Chou | Transcoding multimedia information within a network communication system |
US6583821B1 (en) * | 1999-07-16 | 2003-06-24 | Thomson Licensing S.A. | Synchronizing apparatus for a compressed audio/video signal receiver |
US20060023706A1 (en) * | 2004-07-29 | 2006-02-02 | Microsoft Corporation | Packet multiplexing multi-channel audio |
US20060167682A1 (en) * | 2002-10-21 | 2006-07-27 | Medialive | Adaptive and progressive audio stream descrambling |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6233562B1 (en) | 1996-12-09 | 2001-05-15 | Matsushita Electric Industrial Co., Ltd. | Audio decoding device and signal processing device for decoding multi-channel signals with reduced memory requirements |
US6081783A (en) | 1997-11-14 | 2000-06-27 | Cirrus Logic, Inc. | Dual processor digital audio decoder with shared memory data transfer and task partitioning for decompressing compressed audio data, and systems and methods using the same |
-
2010
- 2010-10-22 US US12/910,432 patent/US8613038B2/en not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6583821B1 (en) * | 1999-07-16 | 2003-06-24 | Thomson Licensing S.A. | Synchronizing apparatus for a compressed audio/video signal receiver |
US20030018796A1 (en) * | 2001-05-11 | 2003-01-23 | Jim Chou | Transcoding multimedia information within a network communication system |
US20060167682A1 (en) * | 2002-10-21 | 2006-07-27 | Medialive | Adaptive and progressive audio stream descrambling |
US20060023706A1 (en) * | 2004-07-29 | 2006-02-02 | Microsoft Corporation | Packet multiplexing multi-channel audio |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014088752A1 (en) * | 2012-12-07 | 2014-06-12 | Lockheed Martin Corporation | Harmonic decimation of real time data for unique word (or sync cell) acquisition in a limited processing environment |
US8880758B2 (en) * | 2012-12-07 | 2014-11-04 | Lockheed Martin Corporation | Harmonic decimation of real time data for unique word (or sync cell) acquisition in a limited processing environment |
US20140284003A1 (en) * | 2013-03-20 | 2014-09-25 | Daekyeong Triple Co., Ltd. | Roman shade curtain and roman shade using the same |
US11227612B2 (en) * | 2016-10-31 | 2022-01-18 | Tencent Technology (Shenzhen) Company Limited | Audio frame loss and recovery with redundant frames |
US20220262379A1 (en) * | 2017-01-10 | 2022-08-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio decoder, audio encoder, method for providing a decoded audio signal, method for providing an encoded audio signal, audio stream, audio stream provider and computer program using a stream identifier |
US11837247B2 (en) * | 2017-01-10 | 2023-12-05 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio decoder, audio encoder, method for providing a decoded audio signal, method for providing an encoded audio signal, audio stream, audio stream provider and computer program using a stream identifier |
US12142286B2 (en) | 2017-01-10 | 2024-11-12 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio decoder, audio encoder, method for providing a decoded audio signal, method for providing an encoded audio signal, audio stream, audio stream provider and computer program using a stream identifier |
Also Published As
Publication number | Publication date |
---|---|
US8613038B2 (en) | 2013-12-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6108584A (en) | Multichannel digital audio decoding method and apparatus | |
EP1895511B1 (en) | Audio encoding apparatus, audio decoding apparatus and audio encoding information transmitting apparatus | |
KR100192700B1 (en) | Signal encoding and decoding system allowing adding of signals in a form of frequency sample sequence upon decoding | |
EP3561810B1 (en) | Method of encoding left and right audio input signals, corresponding encoder, decoder and computer program product | |
US7359006B1 (en) | Audio module supporting audio signature | |
JP7413418B2 (en) | Audio decoder for interleaving signals | |
CN101288309B (en) | Method and device for processing/sending and receiving/processing bitstream | |
US8613038B2 (en) | Methods and apparatus for decoding multiple independent audio streams using a single audio decoder | |
EP2289245B1 (en) | Method and apparatus for generating or cutting or changing a frame based bit stream format file including at least one header section, and a corresponding data structure | |
WO2003007112A2 (en) | System and method for seamless switching of compressed audio streams | |
RU2708942C2 (en) | Audio signals encoding and decoding | |
CN107134280B (en) | Encoding of multi-channel audio content | |
EP1624448B1 (en) | Packet multiplexing multi-channel audio | |
US6487528B1 (en) | Method and apparatus for encoding or decoding audio or video frame data | |
KR100917844B1 (en) | Apparatus and method for transmitting or replaying multi-channel audio signal | |
CN102414744B (en) | Digital signal regeneration apparatus and digital signal compression apparatus | |
EP1020998A2 (en) | Method and apparatus for encoding or decoding audio or video frame data | |
JPH08162963A (en) | Data encoder and decoder | |
JP2003233396A (en) | Decoding device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: STMICROELECTRONICS PVT. LTD, INDIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BANSAL, RAHUL;MONNIER, PHILIPPE;SINGH, SHIV KUMAR;AND OTHERS;SIGNING DATES FROM 20101203 TO 20101210;REEL/FRAME:025614/0899 Owner name: STMICROELECTRONICS (GRENOBLE) SAS, FRANCE Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BANSAL, RAHUL;MONNIER, PHILIPPE;SINGH, SHIV KUMAR;AND OTHERS;SIGNING DATES FROM 20101203 TO 20101210;REEL/FRAME:025614/0899 |
|
AS | Assignment |
Owner name: STMICROELECTRONICS INTERNATIONAL N.V., NETHERLANDS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:STMICROELECTRONICS PVT. LTD.;REEL/FRAME:031575/0358 Effective date: 20131111 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
CC | Certificate of correction | ||
AS | Assignment |
Owner name: STMICROELECTRONICS (GRENOBLE 2) SAS, FRANCE Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:STMICROELECTRONICS (GRENOBLE) SAS;REEL/FRAME:036082/0929 Effective date: 20140528 |
|
FPAY | Fee payment |
Year of fee payment: 4 |
|
FEPP | Fee payment procedure |
Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
LAPS | Lapse for failure to pay maintenance fees |
Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20211217 |