WO2013061113A1 - Procédé et appareil de codage audio qui utilisent des informations dépendant du contexte - Google Patents
Procédé et appareil de codage audio qui utilisent des informations dépendant du contexte Download PDFInfo
- Publication number
- WO2013061113A1 WO2013061113A1 PCT/IB2011/054759 IB2011054759W WO2013061113A1 WO 2013061113 A1 WO2013061113 A1 WO 2013061113A1 IB 2011054759 W IB2011054759 W IB 2011054759W WO 2013061113 A1 WO2013061113 A1 WO 2013061113A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- mobile device
- geographical location
- dependent data
- contextual
- communication channel
- Prior art date
Links
- 230000001419 dependent effect Effects 0.000 title claims abstract description 148
- 238000000034 method Methods 0.000 title claims abstract description 34
- 238000004891 communication Methods 0.000 claims abstract description 96
- 230000005236 sound signal Effects 0.000 claims abstract description 38
- 238000012545 processing Methods 0.000 claims abstract description 26
- 230000006870 function Effects 0.000 claims description 39
- 230000005540 biological transmission Effects 0.000 claims description 20
- 238000012937 correction Methods 0.000 claims description 15
- 238000004590 computer program Methods 0.000 claims description 3
- 238000013461 design Methods 0.000 description 8
- 239000004065 semiconductor Substances 0.000 description 6
- 230000008569 process Effects 0.000 description 5
- 230000003044 adaptive effect Effects 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000004020 conductor Substances 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B17/00—Monitoring; Testing
- H04B17/30—Monitoring; Testing of propagation channels
- H04B17/391—Modelling the propagation channel
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L1/00—Arrangements for detecting or preventing errors in the information received
- H04L1/0001—Systems modifying transmission characteristics according to link quality, e.g. power backoff
- H04L1/0009—Systems modifying transmission characteristics according to link quality, e.g. power backoff by adapting the channel coding
- H04L1/0011—Systems modifying transmission characteristics according to link quality, e.g. power backoff by adapting the channel coding applied to payload information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L1/00—Arrangements for detecting or preventing errors in the information received
- H04L1/0001—Systems modifying transmission characteristics according to link quality, e.g. power backoff
- H04L1/0014—Systems modifying transmission characteristics according to link quality, e.g. power backoff by adapting the source coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L1/00—Arrangements for detecting or preventing errors in the information received
- H04L1/20—Arrangements for detecting or preventing errors in the information received using signal quality detector
- H04L1/203—Details of error rate determination, e.g. BER, FER or WER
Definitions
- the present application relates to coding, and in particular, but not exclusively to speech or audio coding.
- Audio signals like speech, are encoded for example to enable efficient transmission or storage of the audio signals.
- Speech and audio codecs can be arranged to encode speech signals for efficient transmission over both circuit switched mobile radio systems such as the Global System for Mobile Communications (GSM) and the like, and packet switched networks such as those deploying the internet protocol (IP) and the like.
- GSM Global System for Mobile Communications
- IP internet protocol
- Some speech and audio codecs can support a number of different coding rates. These speech and audio codecs are typically known as multi rate speech and audio codecs.
- Multi rate codecs can be combined with a forward error correction (FEC) scheme operating at any one of a number of different FEC coding rates.
- FEC forward error correction
- the source coding rate and error control coding rate can be adjusted to suit the error conditions of a fixed bandwidth communication channel.
- the Third Generation Partnership Project (3GPP) speech coding standard TS 26.090 Adaptive Multi-rate (AMR) Speech Codec is capable of supporting up to eight operating bit rates varying between 4.75 and 12.2 kbps.
- the corresponding FEC coding rate can vary from 3.65 kbps to 10.6 kbps.
- Each FEC coding rate can be tied to a particular source coding rate, so that when a particular mode of operation is determined the source codding rate and error control rate can be selected as a pair. This has the advantage of maintaining a constant bandwidth across different operating conditions of the transmission channel.
- a method comprising determining a first geographical location; processing contextual dependent data to generate a priori information indicative of the quality of a communication channel associated with the first geographical location, wherein the contextual dependent data comprises at least one communication channel quality measure relating to the first geographical location; determining a codec function signal dependent on the a priori information indicative of the quality of the communication channel associated with the first geographical location; and encoding an audio signal according to the determined codec function signal.
- the codec function signal may be indicative of at least one of: a source coding rate; a source coding mode; and a forward error correction coding rate.
- the method may further comprise at least one of: receiving the contextual dependent data from a contextual dependent data server; and receiving the contextual dependent data from a mobile device.
- the method may further comprise: generating contextual dependent data for the first geographical location by measuring the error of the at least one communication channel to determine the at least one communication channel quality measure relating to the first geographic location.
- the at least one communication channel quality measure may be a bit error rate of the at least one communication channel.
- the method may further comprise: transmitting the contextual dependent data to at least one of: a mobile device; and the contextual dependent data server.
- the coding function signal may indicate a source coding mode the audio signal is encoded using a multimode audio codec.
- the coding function signal may indicate a source coding rate the audio signal is encoded using a multi rate audio codec.
- the first geographical location may be in the form of global positioning system coordinates.
- the method may further comprise; receiving data from the further mobile device, wherein the received data indicates that the further mobile device is in the geographical location.
- the geographical location may be in the form of global positioning system coordinates.
- an apparatus comprising at least one processor and at least one memory including computer code for one or more programs, the at least one memory and the computer code configured to with the at least one processor cause the apparatus to at least perform: determining a first geographical location; processing contextual dependent data to generate a priori information indicative of the quality of a communication channel associated with the first geographical location, wherein the contextual dependent data comprises at least one communication channel quality measure relating to the first geographical location; determining a codec function signal dependent on the a priori information indicative of the quality of the communication channel associated with the first geographical location; and encoding an audio signal according to the determined codec function signal.
- the codec function signal may be indicative of at least one of: a source coding rate; a source coding mode; and a forward error correction coding rate.
- the at least one memory and the computer code configured with the at least one processor may cause the apparatus to at least further perform at least one of: receiving the contextual dependent data from a contextual dependent data server; and receiving the contextual dependent data from a mobile device.
- the apparatus may be further caused to perform at least one of: generating contextual dependent data for the first geographical location by measuring the error of the at least one communication channel to determine the at least one communication channel quality measure relating to the first geographic location.
- the at least one communication channel quality measure may be a bit error rate of the at least one communication channel.
- the at least one communication channel quality measure may be a frame error rate of the at least one communication channel.
- the apparatus may be further caused to perform: transmitting the contextual dependent data to at least one of: a mobile device; and the contextual dependent data server.
- the coding function signal may indicate a source coding mode the audio signal is encoded using a multimode audio codec.
- the coding function signal may indicate a source coding rate the audio signal is encoded using a multi rate audio codec.
- an apparatus comprising at least one processor and at least one memory including computer code for one or more programs, the at least one memory and the computer code configured to with the at least one processor cause the apparatus to at least perform: receiving contextual dependent data from at least one mobile device, wherein the contextual dependent data comprises at least one communication channel quality measure relating to a geographical location of the at least one mobile device and information relating to the geographical location of the at least one mobile device; processing contextual dependent data from the at least one mobile device to determine an averaged contextual dependent data for the geographical location; and transmitting the averaged contextual dependent data from the at least one mobile device to a further mobile device, wherein the transmission of the averaged contextual dependent data is dependent on the further mobile device being in the geographical location.
- the geographical location may be in the form of global positioning system coordinates.
- an apparatus comprising means for determining a first geographical location; means for processing contextual dependent data to generate a priori information indicative of the quality of a communication channel associated with the first geographical location, wherein the contextual dependent data comprises at least one communication channel quality measure relating to the first geographical location; means for determining a codec function signal dependent on the a priori information indicative of the quality of the communication channel associated with the first geographical location; and means for encoding an audio signal according to the determined codec function signal.
- the codec function signal may be indicative of at least one of: a source coding rate; a source coding mode; and a forward error correction coding rate.
- the apparatus may further comprise at least one of: means for receiving the contextual dependent data from a contextual dependent data server; and means for receiving the contextual dependent data from a mobile device.
- the apparatus may further comprise: means for generating contextual dependent data for the first geographical location by measuring the error of the at least one communication channel to determine the at least one communication channel quality measure relating to the geographic location.
- the at least one communication channel quality measure may be a bit error rate of the at least one communication channel.
- the at least one communication channel quality measure may be a frame error rate of the at least one communication channel.
- the coding function signal may indicate a source coding mode the audio signal is encoded using a multimode audio codec.
- the coding function signal may indicate a source coding rate the audio signal is encoded using a multi rate audio codec.
- the first geographical location may be in the form of global positioning system coordinates.
- an apparatus comprising means for receiving contextual dependent data from at least one mobile device, wherein the contextual dependent data comprises at least one communication channel quality measure relating to a geographical location of the at least one mobile device and information relating to the geographical location of the at least one mobile device; means for processing contextual dependent data from the at least one mobile device to determine an averaged contextual dependent data for the geographical location; and means for transmitting the averaged contextual dependent data from the at least one mobile device to a further mobile device, wherein the transmission of the averaged contextual dependent data is dependent on the further mobile device being in the geographical location.
- the apparatus may further comprise: means for receiving data from the further mobile device, wherein the received data indicates that the further mobile device is in the geographical location.
- the geographical location may be in the form of global positioning system coordinates.
- a computer program product stored on a medium may cause an apparatus to perform the method as discussed herein.
- An electronic device may comprise apparatus as discussed herein.
- a chipset may comprise apparatus as discussed herein.
- Figure 1 shows schematically an apparatus suitable for employing some embodiments
- Figure 2 shows schematically a multi rate speech/audio codec system suitable for employing some embodiments
- Figure 3 shows schematically a deployment of the contextual data server and mobile devices according to some embodiments
- Figure 4 shows schematically a mobile device according to some embodiments.
- the apparatus 10 in some embodiments comprises a microphone 1 1 , which is linked via an analogue-to-digital converter (ADC) 14 to a processor 21 .
- the processor 21 is further linked via a digital-to-analogue (DAC) converter 32 to loudspeakers 33.
- the processor 21 is further linked to a transceiver (RX/TX) 13, to a user interface (Ul) 15 and to a memory 22.
- the processor 21 may be configured to execute various program codes.
- the implemented program codes 23 in some embodiments comprise a multi rate speech codec and a FEC coder which are configured to operate together over a plethora of different source and FEC coding rates.
- the implemented program codec 23 may comprise a multimode speech/audio codec which may be configured to operate over a series of different signal types.
- the implemented program codes 23 can also comprise coding rate determining code which facilitates the selection of the source coding rate of the multi rate speech and audio codec and the FEC coding rate of the FEC coder. Further, the implemented program codes 23 may also comprise coding mode determining code which can enable the selection of a particular mode of coding of a multimode speech/audio codec.
- the components of the multi rate speech and audio codec, FEC coder and coding rate determining code can be implemented in embodiments in either hardware or firmware.
- the user interface 15 enables a user to input commands to the apparatus 10, for example via a keypad, and/or to obtain information from the apparatus 10, for example via a display.
- a touch screen may provide both input and output functions for the user interface.
- the apparatus 10 in some embodiments comprises a transceiver 13 suitable for enabling communication with other apparatus, for example via a wireless communication network.
- the analogue-to-digital converter (ADC) 14 in some embodiments converts the input analogue audio signal into a digital audio signal and provides the digital audio signal to the processor 21 .
- the microphone 1 1 can comprise an integrated microphone and ADC function and provide digital audio signals directly to the processor for processing.
- the processor 21 in such embodiments then can process the digital audio signal in the same way as described with reference to Figures 4 and 5.
- the resulting bit stream can in some embodiments be provided to the transceiver 13 for transmission to another apparatus.
- the coded audio data in some embodiments can be stored in the data section 24 of the memory 22, for instance for a later transmission or for a later presentation by the same apparatus 10.
- the apparatus 10 in some embodiments can also receive a bit stream with correspondingly encoded data from another apparatus via the transceiver 13.
- the processor 21 may execute the decoding program code stored in the memory 22.
- the processor 21 in such embodiments decodes the received data, and provides the decoded data to a digital-to-analogue converter 32.
- the digital-to- analogue converter 32 converts the digital decoded data into analogue audio data and can in some embodiments output the analogue audio via the loudspeakers 33.
- Execution of the decoding program code in some embodiments can be triggered as well by an application called by the user via the user interface 15.
- the received encoded data in some embodiments can also be stored instead of an immediate presentation via the loudspeakers 33 in the data section 24 of the memory 22, for instance for later decoding and presentation or decoding and forwarding to still another apparatus. It would be appreciated that the schematic structures described in Figures 2 to 4 and the method steps shown in Figure 5 represent only a part of the operation of embodiments as exemplarily shown implemented in the apparatus shown in Figure 1 .
- FIG. 2 The general operation of embodiments of the application is shown in Figure 2. There is illustrated in Figure 2 a system 102 with a multi rate speech/audio encoder 1 04, an FEC encoder 105, a bit rate selector 103, and a transmission channel 106. It would be understood that as described above some embodiments of the apparatus 10 can comprise or implement any or all of the source encoder 104, FEC encoder 105 and bit rate selector 103.
- the encoder 104 compresses an input audio or speech signal 1 10 producing a bit stream 1 12, which in some embodiments can be passed to an FEC encoder 105 for the addition of forward error control (FEC) bits to the bit stream 1 12.
- FEC forward error control
- the FEC protected bit stream 1 13 may then be transmitted through the transmission channel 106.
- embodiments may be generally depicted as having a multi rate audio/speech encoder 104 and a coding mode determiner 103. These embodiments may also deploy FEC coding 105.
- a multimode speech/audio codec may refer to a speech/audio codec which can be capable of deploying different modes of source coding whilst coding an audio/speech signal.
- a multimode speech/audio codec can comprise a plurality of different source coding algorithms which may each be deployed during the course of encoding the audio/speech signal.
- each mode of operation may be realised by a particular source coding algorithm.
- the deployment of a mode of operation of a multimode source codec may be performed on a frame by frame basis. In other words the mode of operation may change from one frame to the next.
- an algorithm corresponding to a particular mode of operation may be tailored to suit a particular source signal type. For example a particular algorithm may be tailored to coding a music signal, where as another algorithm may be more particularly suited to coding a speech like signal.
- multimode source coding algorithms may include algorithms more particularly suited for communication channel conditions in which the channel error rate may be determined to be high. These algorithms may be more robust to communication channel errors.
- multimode source coding algorithms may include comfort noise generation during regions of silence whilst the multimode source codec operates in a discontinuous transmission (DTX) mode of operation.
- DTX discontinuous transmission
- FIG. 3 shows schematically a mobile device 302 according to some embodiments.
- the mobile device 302 may be arranged in some embodiments to have the form of the user apparatus 10.
- the mobile device 302 is depicted as being arranged to receive an input from a contextual data server 320.
- the contextual data server 320 may be located remotely from the mobile device 302 as depicted in Figure 3.
- the mobile device may communicate with the contextual data server 320 via a bi directional wireless data connection, such as for example a GSM GPRS data link, a 3GPP data link or a WLAN link or the like.
- the wireless data connection between the contextual data server 320 and the mobile device 302 is depicted as the link 310 in Figure 3.
- the contextual data server 320 provides contextual dependent data to the mobile device 302.
- the contextual dependent data may assist the mobile device 302 in selecting an appropriate source and FEC coding rates for the forthcoming communication channel.
- contextual dependent data as provided by the contextual data server 320 may also assist the mobile device 302 in selecting the appropriate coding mode for the forthcoming communication channel.
- the contextual data server 320 may be aware of the location of the mobile device 302. In such embodiments contextual dependent data provided to the mobile device 302 may be tailored for its geographical location.
- the contextual data server 320 may obtain geographical knowledge of the mobile device 302 by either the mobile device 302 sending Global Positioning System (GPS) coordinate data to the contextual data server 320 over the wireless data link 310, or by the contextual data server 320 obtaining cell location data from the cell within which the mobile device 302 is currently active.
- GPS Global Positioning System
- the contextual dependent data provided by the contextual data server 320 may comprise error information such as frame error rates, bit error rates, or other parametric indicators which indicate the noise or quality of a communication channel.
- the context data server 320 may provide contextual data relevant for the geographical location or cell location in which the mobile device 302 is currently active.
- the contextual dependent data may then be used by the mobile device 302 to select a particular combination of source and FEC coding rates suited for the communication channel conditions over which the mobile device 302 is about to communicate. Additionally, the mobile device 302 may use the contextual dependent data to select a particular coding mode of operation of the speech/audio codec. In which a coding mode is selected according to the forthcoming communication channel conditions.
- the contextual data server 320 may receive geographical data that the mobile device 302 has entered into a geographic location in which it is already known that will be significant channel errors. For instance, such locations may include known areas of weak signal reception such as when the mobile device 302 enters a tunnel, or when the mobile device 302 receives a weak signal with many paths of propagation due to being in a built up area.
- the contextual dependent data from the contextual data server 320 may be used by the mobile device 302 to select a mode of operation whereby the FEC coding rate would be at a higher rate than if the mobile device was operating over a low error channel.
- the mobile device 302 would select a lower source coding rate in order to compensate for the higher FEC coding rate.
- the mobile device 302 may use a priori information relating to forthcoming channel conditions to determine the optimum source and FEC coding rates, or the optimum source coding mode for the said channel.
- the a priori information may be geographically driven, in other words the a priori information provided by the contextual data server 320 to the mobile device 302 may be tailored to the geographical location of the mobile device 302.
- locally collated contextual dependent data for the mobile device 302 may also comprise measured bit or frame error rates of the wireless channel over which the mobile device 320 is currently communicating.
- the further mobile devices 302a to 302e may also be connected to the contextual data server 320 via a bidirectional wireless data link such as that described above.
- the further mobile devices 302a to 302e may be arranged to provide contextual dependent data regarding the geographical location or mobile cell they are currently in.
- the contextual dependent data generated by the further mobile device 302a may comprise measured frame error rates or bit error rates of the communication channel currently in use. This information may be coupled with location based information such as GPS coordinates, and passed to the contextual data server 320.
- the contextual data server 320 may analyse the contextual dependent data from a further mobile device 302a in order to obtain information on the quality of a communication channel with respect to its geographic location. In other words the contextual data server 320 may analyse the contextual dependent data from each mobile device 302a to 302e in order to build up a database of the quality of communication links for different geographical locations.
- the contextual data server 320 may in embodiments be arranged to process the contextual dependent data in order to identify geographical locations where the communication link may consistently exhibit higher error rates. For example when a mobile device enters into a tunnel, or when a mobile device enters into a location where there is no longer a line of sight with a base station.
- the contextual data server 320 may be arranged to process the contextual dependent data provided by the mobile devices 302a to 302e in order to identify geographical locations where the communication link tend to exhibit relatively low error rates.
- the contextual data server 320 may be arranged to process all the contextual dependent data obtained for a specific geographical location. In some embodiments the processing by the contextual data server 320 may result in an averaged value for the contextual dependent data for a specific geographical location.
- the contextual data server 320 may be provided with means for processing contextual dependent data from each mobile device 302a to 302e in order to determine an averaged contextual dependent data for the geographical location of the mobile device 320.
- the contextual data server 320 may be provided with means for transmitting the averaged contextual dependent data to the mobile device 302, wherein the transmission of the averaged contextual dependent data is dependent on the mobile device 302 being in the same geographical location of at least one of the mobile devices 302a to 302e.
- a mobile device may not have access to a contextual data server such as the one depicted as 320 in Figure 3.
- the mobile device may use contextual dependent data associated with geographical map data stored locally on the device. For instance, the mobile device may measure the bit or frame error rates for a communication channel, and then couple the results of the said measurement with map data in a navigation application.
- the contextual data server 320 may be part of the mobile device.
- the contextual data server may also be arranged to perform processing functionality as described above in order to analyse the data gathered from other mobile devices.
- the mobile device 302 comprises a receiver 418 of suitable means for receiving.
- the mobile device 302 is depicted in Figure 4 as being arranged to receive a signal on the input 401 via the receiver 418.
- the output of the receiver 418 is shown as being arranged to be connected to the input of the contextual data analyser 412 via the connection 403.
- the step of receiving contextual dependent data by the mobile device 302 is shown as processing step 501 in Figure 5.
- connection 403 may convey contextual dependent data from a remotely located contextual data server 320.
- the contextual dependent data as provided by the contextual data server 320 can be passed to the mobile device 302 via the radio signal input connection 401 and the receiver 418.
- connection 403 may convey contextual dependent data from other mobile devices rather than from a contextual data server 320.
- the mobile device 302 comprises a contextual analyser 412 or suitable means for analysing contextual data.
- the contextual data analyser 412 may have the functionality to formulate an overview of the quality of communication links in relation to their geographic location by collating contextual dependent data from other mobile devices 302a to 392e over a period of time.
- the contextual data analyser 412 may be arranged to receive an input signal from a GPS receiver 410 or other suitable means for determining the mobile device's geographical location. It is to be understood that the GPS receiver 410 may be part of the mobile device 302 as depicted in Figure 4. In other words the mobile device may have means for determining the current geographical location.
- the GPS receiver 410 may be arranged to furnish the contextual data analyser 412 with data relating to the current location, the speed of travel and the direction of travel of the mobile device 302. Parameters of this kind may then be used by the context data analyser 412 in conjunction with the contextual dependent data from either other mobile devices, or a contextual data server 320 in order to assist in the determination of optimum source and FEC coding rates, or an optimum source coding mode.
- GPS related data may be conveyed from the GPS receiver to the contextual data analyser 412 along the connection 405.
- the output of the contextual data analyser 412 may be considered as a priori information relating to the forth coming communication channel.
- processing step 503 The step of analysing contextual dependent data in order to provide a priori information for the determination of the optimum source and FEC coding rates, or optimum source coding mode is depicted as processing step 503 in Figure 5.
- the mobile device 302 comprises a coding rate/mode selector 414 or suitable means for selecting coding rates and/or coding modes.
- the contextual data analyser 412 may be arranged to have a further output which may be connected to a transmitter 420 in the mobile device 302.
- connection 409 The further output from the contextual data analyser 412 is shown as connection 409 in Figure 4.
- the mobile device 302 comprises a transmitter 420 suitable means for transmitting.
- the connection 409 from the context data analyser 412 may be used to convey contextual dependent data relating to the mobile device 302 to the transmitter 420.
- the contextual dependent data conveyed to the transmitter 420 may comprise components such as the current geographical location of the mobile device 302 and channel frame or bit error rates of the communication link currently used by the mobile device.
- the mobile device 302 may provide means for generating contextual dependent data for the current geographical location be measuring the error of the communication channel to determine the communication channel quality measure relating to the geographic location.
- the contextual dependent data relating to the mobile device 302 may then be transmitted via the transmitter 420 to any number of other mobile devices such as devices 302a to 302e and/or to a central contextual data server 320.
- the coding rate/mode selector 414 may either determine the optimum source and FEC coding rates, or the optimum source coding mode for the forthcoming communication link by using the a priori information furnished by the contextual data analyser 412. In other words the coding rate/mode selector 414 may use the a priori information provided by the contextual data analyser 412 to determine either the source coding rate and FEC coding rate, or the optimum source coding mode.
- the a priori information may be based at least in part on the geographic location of the mobile device 302.
- the coding rate/mode selector 414 may also combine link quality information from the receiver of a receiving mobile device. This information may be sent as an in band signal to the mobile device 320 where it may be used to determine a link quality factor which can provide an indication of the current quality of the communication link. The link quality factor may be combined with the a priori information to provide a coding rate selection which is dependent on both current communication link conditions and forth coming communication link conditions.
- the output of the coding rate/mode selector 414 can be a codec rate request signal.
- the codec rate request signal may be used to request a particular source coding rate in a multi rate speech/audio codec and also a particular FEC coder rate in a FEC coder.
- the output of the coding rate/mode selector 414 can be a codec mode request signal.
- the codec mode request signal can be used to select a particular source coding mode of a multimode audio/speech codec.
- the codec mode request signal may indicate that the multimode audio/ speech codec uses a coding mode which is robust to channel errors. This may be the case when the contextual data analyser 412 has determined that the forth coming channel conditions are likely to have a high rate of channel errors.
- the mobile device 302 comprises a multirate speech/audio encoder 418 or suitable means for encoding multirate speech/audio signals.
- the output of the coding rate selector 414 may be connected to both the input of a multi rate speech /audio encoder 418 and to the input of the FEC coder 416.
- the output of the coding rate selector 414 may be connected to the input of a multimode speech/audio encoder.
- the step of determining the source and FEC coding rates or the source coding mode is depicted as the processing step 505 in Figure 5.
- the multi rate speech /audio encoder 418 takes as an input a digitised speech or audio signal.
- Figure 4 depicts the digitised signal input as the input connection 4 3.
- 418 may be a multimode speech/audio encoder or suitable multimode encoding means.
- the digitised speech or audio signal may be derived from components such as the microphone 11 and analogue to digital converter 14 shown in Figure 1 or any suitable means for capturing audio signals.
- the digitized signal passed along the input connection 413 may be encoded using a multi rate speech encoder.
- the input digitized speech signal may be encoded on a frame by frame basis by any one of a number of possible source coding rates supported by the multi rate speech encoder 418.
- the coding rate selected for encoding the input speech/audio signal is determined by the coding rate/mode selector 414, and in some embodiments the coding rate selected may vary on a coding frame by coding frame basis.
- the multi rate speech/audio encoder may be implemented by incorporating encoding functionality form the 3GPP standard TS.26.090 Adaptive Multi-rate coder (AMR).
- the digitized signal passed along the input connection 413 may be encoded using a multimode speech/audio encoder.
- the input digitized speech or audio signal may be encoded on a frame by frame basis by any one of a number of possible source coding modes supported by the multimode speech/audio encoder 418.
- the codec function signal is indicative of at least one of: a source coding rate; a source coding mode; and a forward error correction coding rate.
- the output of the multi rate speech/audio encoder 418 may be a bit stream comprising a parametric representation of the digitised input speech/audio signal.
- the mobile device 302 comprises a forward error correction (FEC) coder 416 or suitable means for forward error correction coding.
- FEC forward error correction
- the FEC coder 416 takes as an input the speech or audio signal encoded bit stream from the multi rate speech/audio encoder 418. Further, the FEC coder 416 can take as an additional input the coding mode request signal from the coding rate selector 414.
- the FEC coder may apply FEC coding to each frame of encoded speech/audio signal which is passed from the multi rate speech/audio encoder 416 to the FEC coder 418.
- the FEC coder 418 may be arranged to code into the bit stream forward error correction bits.
- the FEC coder 418 can support a number of different FEC coding rates.
- the rate at which forward error correction is applied to the source encoded bit stream may be determined by the value of the codec mode request signal from the coding rate selector 414.
- the FEC coder 418 may apply forward error correction coding to the source encoded bit stream according to the technique of convolutional coding.
- the output of the forward error correction coder may then be further processed into a format suitable to transmission over a wireless radio link.
- the processing required may comprise functionality to encapsulate the encoded data into further frames formats and scheduling the transmission through a modulating device.
- the functionality and technology required to transmit the encoded data to a particular radio interface standard may be generally depicted in Figure 4 as the transmitter 420.
- the invention as described above may implement any multi rate audio (or speech) codec, or a multimode audio (or speech) codec including any variable rate/adaptive rate audio (or speech) codec.
- user equipment may comprise a multi rate speech/audio codec or a multimode speech/audio codec such as those described in embodiments of the invention above.
- a geographical location may be referenced by the geographical area of a mobile cell, or the geographical area of part of a part of a mobile cell.
- a geographical location may be determined by a satellite navigation system such as GPS, or Galileo Global Satellite navigation system and the like, whereby a geographical location relates to the coordinates of the satellite navigation system.
- a satellite navigation system such as GPS, or Galileo Global Satellite navigation system and the like, whereby a geographical location relates to the coordinates of the satellite navigation system.
- the geographical location may be referenced by the area determined by the GPS coordinates to a defined accuracy.
- user equipment is intended to cover any suitable type of wireless user equipment, such as mobile telephones, portable data processing devices or portable web browsers.
- PLMN public land mobile network
- elements of a public land mobile network may also comprise audio codecs as described above.
- the various embodiments of the invention may be implemented in hardware or special purpose circuits, software, logic or any combination thereof.
- some aspects may be implemented in hardware, while other aspects may be implemented in firmware or software which may be executed by a controller, microprocessor or other computing device, although the invention is not limited thereto.
- firmware or software which may be executed by a controller, microprocessor or other computing device, although the invention is not limited thereto.
- While various aspects of the invention may be illustrated and described as block diagrams, flow charts, or using some other pictorial representation, it is well understood that these blocks, apparatus, systems, techniques or methods described herein may be implemented in, as non-limiting examples, hardware, software, firmware, special purpose circuits or logic, general purpose hardware or controller or other computing devices, or some combination thereof.
- the memory may be of any type suitable to the local technical environment and may be implemented using any suitable data storage technology, such as semiconductor-based memory devices, magnetic memory devices and systems, optical memory devices and systems, fixed memory and removable memory.
- the data processors may be of any type suitable to the local technical environment, and may include one or more of general purpose computers, special purpose computers, microprocessors, digital signal processors (DSPs), application specific integrated circuits (ASIC), gate level circuits and processors based on multi-core processor architecture, as non-limiting examples.
- Embodiments of the inventions may be practiced in various components such as integrated circuit modules.
- the design of integrated circuits is by and large a highly automated process. Complex and powerful software tools are available for converting a logic level design into a semiconductor circuit design ready to be etched and formed on a semiconductor substrate. Programs, such as those provided by Synopsys, Inc. of Mountain View, California and Cadence Design, of San Jose, California automatically route conductors and locate components on a semiconductor chip using well established rules of design as well as libraries of pre-stored design modules.
- the resultant design in a standardized electronic format (e.g., Opus, GDSII, or the like) may be transmitted to a semiconductor fabrication facility or "fab" for fabrication.
- a standardized electronic format e.g., Opus, GDSII, or the like
- circuitry refers to all of the following:
- circuits and software and/or firmware
- combinations of circuits and software such as: (i) to a combination of processor(s) or (ii) to portions of processor(s)/software (including digital signal processor(s)), software, and memory(ies) that work together to cause an apparatus, such as a mobile phone or server, to perform various functions and
- circuits such as a microprocessor(s) or a portion of a microprocessor(s), that require software or firmware for operation, even if the software or firmware is not physically present.
- circuitry' applies to all uses of this term in this application, including any claims.
- the term 'circuitry' would also cover an implementation of merely a processor (or multiple processors) or portion of a processor and its (or their) accompanying software and/or firmware.
- the term 'circuitry' would also cover, for example and if applicable to the particular claim element, a baseband integrated circuit or applications processor integrated circuit for a mobile phone or similar integrated circuit in server, a cellular network device, or other network device.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Electromagnetism (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
La présente invention concerne un procédé et un appareil de codage audio qui utilisent des informations dépendant du contexte. Un procédé comprend les étapes suivantes : la détermination d'un premier site géographique ; le traitement de données dépendant du contexte de manière à générer une information a priori, indicative de la qualité d'un canal de communication associé au premier site géographique, les données dépendant du contexte comprenant au moins une mesure de qualité du canal de communication relative au premier site géographique ; la détermination d'un signal de fonction de codec qui dépend de l'information a priori indicative de la qualité du canal de communication associé au premier site géographique ; et l'encodage d'un signal audio en fonction du signal de fonction de codec déterminé.
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201180074381.2A CN103890842A (zh) | 2011-10-25 | 2011-10-25 | 使用上下文相关信息进行音频编码的方法和装置 |
US14/352,273 US20140257826A1 (en) | 2011-10-25 | 2011-10-25 | Method and apparatus for audio coding using context dependent information |
EP11874853.2A EP2771995A4 (fr) | 2011-10-25 | 2011-10-25 | Procédé et appareil de codage audio qui utilisent des informations dépendant du contexte |
PCT/IB2011/054759 WO2013061113A1 (fr) | 2011-10-25 | 2011-10-25 | Procédé et appareil de codage audio qui utilisent des informations dépendant du contexte |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/IB2011/054759 WO2013061113A1 (fr) | 2011-10-25 | 2011-10-25 | Procédé et appareil de codage audio qui utilisent des informations dépendant du contexte |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013061113A1 true WO2013061113A1 (fr) | 2013-05-02 |
Family
ID=48167179
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2011/054759 WO2013061113A1 (fr) | 2011-10-25 | 2011-10-25 | Procédé et appareil de codage audio qui utilisent des informations dépendant du contexte |
Country Status (4)
Country | Link |
---|---|
US (1) | US20140257826A1 (fr) |
EP (1) | EP2771995A4 (fr) |
CN (1) | CN103890842A (fr) |
WO (1) | WO2013061113A1 (fr) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9703841B1 (en) | 2016-10-28 | 2017-07-11 | International Business Machines Corporation | Context-based notifications in multi-application based systems |
CN110148401B (zh) * | 2019-07-02 | 2023-12-15 | 腾讯科技(深圳)有限公司 | 语音识别方法、装置、计算机设备及存储介质 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997041549A1 (fr) * | 1996-04-26 | 1997-11-06 | Telefonaktiebolaget Lm Ericsson | Procede de commande de mode de codage et appareil de determination de mode de decodage |
WO2002061954A2 (fr) * | 2001-02-02 | 2002-08-08 | Telefonaktiebolaget Lm Ericsson (Publ) | Procede et dispositif servant a regler la puissance d'emission selon le recepteur dans un systeme de communication cellulaire |
US20030060192A1 (en) * | 2000-03-20 | 2003-03-27 | Stefan Oestreich | Method for monitoring the transmission quality in a cellular radio communications system |
WO2003084091A1 (fr) * | 2002-04-03 | 2003-10-09 | Telefonaktiebolaget L M Ericsson (Publ) | Regulation de la puissance dans des systemes de communication mobiles |
WO2004040830A1 (fr) * | 2002-10-31 | 2004-05-13 | Nokia Corporation | Codeur-decodeur vocal a debit variable |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE19742124C2 (de) * | 1997-09-24 | 2001-10-18 | Siemens Ag | Verfahren und Basisstationssystem zur Sprachübertragung über eine Funkschnittstelle in einem digitalen Funk-Kommunikationssystem |
US7801953B1 (en) * | 2001-02-12 | 2010-09-21 | Nortel Networks Limited | Push-to-talk wireless telecommunications system utilizing an voice-over-IP network |
FI20011715L (fi) * | 2001-08-27 | 2003-02-28 | Nokia Corp | Koodekin toiminnallisen moodin valinta |
US20030195006A1 (en) * | 2001-10-16 | 2003-10-16 | Choong Philip T. | Smart vocoder |
KR100646343B1 (ko) * | 2004-07-12 | 2006-11-23 | 에스케이 텔레콤주식회사 | 멀티미디어 링백톤 서비스를 위한 단말 코덱 설정 방법 및시스템 |
US7783316B1 (en) * | 2004-09-09 | 2010-08-24 | Rockwell Collins, Inc. | Public safety communications network using asymmetric receive/transmit channels |
US8804575B2 (en) * | 2005-12-13 | 2014-08-12 | Cisco Technology, Inc. | Central entity to adjust redundancy and error correction on RTP sessions |
US20080025300A1 (en) * | 2006-07-31 | 2008-01-31 | Texas Instruments Incorporated | Method and/or apparatus for enabling voice packet redundancy |
US8103274B2 (en) * | 2007-11-15 | 2012-01-24 | Airwalk Communications, Inc. | System, method, and computer-readable medium for IP-femtocell provisioned radio access network |
US7830817B1 (en) * | 2007-12-05 | 2010-11-09 | Sprint Spectrum L.P. | Vocoder selection based on location of client device |
US8204022B2 (en) * | 2008-02-20 | 2012-06-19 | Alcatel Lucent | Method of providing transcoding during voice-over-internet protocol handoff |
GB0807976D0 (en) * | 2008-05-01 | 2008-06-11 | Romalon Plc | Improvements relating to multi-jurisdictional telecommunications services |
US20100121977A1 (en) * | 2008-11-10 | 2010-05-13 | Nokia Corporation | Predictive Bit-Rate Modification of Content Delivery in a Wireless Network |
US8520699B2 (en) * | 2010-12-09 | 2013-08-27 | Qualcomm Incorporated | Apparatus and methods for providing a communication quality feedback of an end-to-end communication path |
US8913997B2 (en) * | 2011-09-09 | 2014-12-16 | Nokia Siemens Networks Oy | Application performance improvement in radio networks |
-
2011
- 2011-10-25 WO PCT/IB2011/054759 patent/WO2013061113A1/fr active Application Filing
- 2011-10-25 US US14/352,273 patent/US20140257826A1/en not_active Abandoned
- 2011-10-25 CN CN201180074381.2A patent/CN103890842A/zh active Pending
- 2011-10-25 EP EP11874853.2A patent/EP2771995A4/fr not_active Withdrawn
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997041549A1 (fr) * | 1996-04-26 | 1997-11-06 | Telefonaktiebolaget Lm Ericsson | Procede de commande de mode de codage et appareil de determination de mode de decodage |
US20030060192A1 (en) * | 2000-03-20 | 2003-03-27 | Stefan Oestreich | Method for monitoring the transmission quality in a cellular radio communications system |
WO2002061954A2 (fr) * | 2001-02-02 | 2002-08-08 | Telefonaktiebolaget Lm Ericsson (Publ) | Procede et dispositif servant a regler la puissance d'emission selon le recepteur dans un systeme de communication cellulaire |
WO2003084091A1 (fr) * | 2002-04-03 | 2003-10-09 | Telefonaktiebolaget L M Ericsson (Publ) | Regulation de la puissance dans des systemes de communication mobiles |
WO2004040830A1 (fr) * | 2002-10-31 | 2004-05-13 | Nokia Corporation | Codeur-decodeur vocal a debit variable |
Non-Patent Citations (1)
Title |
---|
See also references of EP2771995A4 * |
Also Published As
Publication number | Publication date |
---|---|
EP2771995A4 (fr) | 2015-04-01 |
EP2771995A1 (fr) | 2014-09-03 |
US20140257826A1 (en) | 2014-09-11 |
CN103890842A (zh) | 2014-06-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8495237B1 (en) | Techniques for providing a media stream to a mobile computing device based on a predicted route of the mobile computing device | |
CN103299366B (zh) | 用于编码及检测带水印信号的装置 | |
EP2923509B1 (fr) | Cartographie de points d'accès sans fil | |
CN103109321B (zh) | 估计音调滞后 | |
KR101642213B1 (ko) | 모바일 장치의 위치 추정 장치, 시스템 및 방법 | |
KR101548846B1 (ko) | 워터마킹된 신호의 적응적 인코딩 및 디코딩을 위한 디바이스 | |
US20130297053A1 (en) | Audio scene processing apparatus | |
US20090177462A1 (en) | Wireless terminals, language translation servers, and methods for translating speech between languages | |
KR101590239B1 (ko) | 워터마킹된 신호를 인코딩 및 디코딩하는 디바이스들 | |
KR20070085673A (ko) | 저 비트 레이트 스피치 코딩 방법 및 장치 | |
US8436769B2 (en) | Apparatus for performing global navigation satellite system control, and associated methods and storage media | |
CN103098127A (zh) | 译码及解码瞬时帧 | |
CN108008432B (zh) | 卫星定位方法、装置和智能手表 | |
US20140330415A1 (en) | Method and apparatus for detecting audio sampling rate | |
JP2013171036A (ja) | 位置測位方法及び装置並びに電子装置 | |
CN101518133B (zh) | 移动通信终端和用于移动通信终端的移动速度检测方法 | |
US20140257826A1 (en) | Method and apparatus for audio coding using context dependent information | |
CN103109319A (zh) | 确定音调循环能量及按比例缩放激励信号 | |
CN103180907B (zh) | 音频场景装置 | |
WO2023031498A1 (fr) | Descripteur de silence utilisant des paramètres spatiaux | |
JP5141384B2 (ja) | 圏外エリア検出システム、携帯端末、圏外情報管理サーバ及び圏外エリア検出方法。 | |
KR20090006600A (ko) | 디지털 멀티미디어 방송 필드 측정 장치 및 방법 | |
KR20080023067A (ko) | 휴대인터넷을 이용한 단말기에서의 정보제공 방법 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11874853 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2011874853 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14352273 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |