WO2009126426A1 - Time-shifting for push to talk voice communication systems
- Publication number
- WO2009126426A1 (PCT/US2009/037966)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- time
- communication device
- based media
- network
- media
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W4/00—Services specially adapted for wireless communication networks; Facilities therefor
- H04W4/06—Selective distribution of broadcast services, e.g. multimedia broadcast multicast service [MBMS]; Services to user groups; One-way selective calling services
- H04W4/10—Push-to-Talk [PTT] or Push-On-Call services
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W76/00—Connection management
- H04W76/40—Connection management for selective distribution or broadcast
- H04W76/45—Connection management for selective distribution or broadcast for Push-to-Talk [PTT] or Push-to-Talk over cellular [PoC] services
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W4/00—Services specially adapted for wireless communication networks; Facilities therefor
- H04W4/12—Messaging; Mailboxes; Announcements
Definitions
- the present invention relates to telecommunications, and more particularly, to an apparatus for applying time-shifting functionality to voice communications.
- Voice communication systems consist of one or more "channels."
- PTT Push To Talk
- PoC PTT over Cellular
- a single channel is used whenever someone is transmitting. A user cannot transmit while their device is receiving.
- These systems are said to be “half-duplex”.
- full-duplex systems like landline telephones, cellular telephones, or VoIP systems such as Skype or SIP.
- Each of these full-duplex systems uses two channels, one for voice being received and the other for voice being transmitted.
- User communication devices generally "connect" these channels, either to a speaker, a microphone, or both, depending on the duplex and current mode of operation.
- the recording of the message stops while the parties engage in a telephone conversation. With other answering machines, however, the live conversation will be recorded unless the called party manually stops the recording. In either recording situation, there is no way for the called party to review the recorded message until after the recording has stopped. There is no way for the recipient to review any portion of the recorded message other than the current point of the conversation while the conversation is ongoing and is being recorded. Only after the conversation has concluded and the parties have hung up or the recipient has manually stopped the recording can the recipient go back and review the recorded message or conversation.
- Some more recent call management systems provide a "virtual answering machine", allowing callers to leave a message in a voicemail system, while giving called users the ability to hear the message as it is being left.
- the actual answering "machine" is typically a voicemail-style server, operated by the telephony service provider.
- Virtual answering machine systems differ from standard voice mail systems in that the called party may use either their phone or a computer to listen to messages as they are being left. Similar to an answering machine as described in the preceding paragraph, however, the called party can only listen at the current point of the message as it is being left. There is no way to review previous portions of the message before the message is left in its entirety and the caller hangs up.
- a method and network communication device for providing on a Push To Talk (PTT) communication network time-shifting capabilities to a user of a PTT communication device involve progressively receiving time-based media on the PTT communication network from a sending device and progressively storing the received time-based media in a time-shifting buffer located at the network communication device as the time-based media is progressively received.
- the method and network communication device further include controlling the rendering of the time-based media at the PTT communication device in either (i) a near real-time mode as the time-based media is progressively received at the network communication device or (ii) in a time-shifted mode at an arbitrary later time after the receipt and storage of the time-based media in the time-shifting buffer. If the received rendering control instructions indicate the PTT user would like to review the media in the near real-time mode, the method and network communication device further involve progressively transmitting the time-based media to the PTT communication device as the time-based media is progressively received at the network communication device.
- the time-based media is progressively transmitted from the time-shifting buffer to the PTT communication device at the arbitrary later time.
- the user of the PTT device is provided a number of rendering options, including playing the time-based media, pausing the time-based media as it is being rendered, jumping backward to review previous time-based media, jumping to the most current time-based media, playing the time-based media either faster or slower than it was originally encoded, and catching up to live.
- Figure 1 illustrates a block diagram of a Store and Stream (SaS) module (i.e., a time-shifting buffer) of the present invention.
- Figures 2A through 2E are a series of flow diagrams illustrating a store and stream function of the communication and management system of the invention.
- Figure 3 is a block diagram of the SaS module embedded in a VoIP client in accordance with the present invention.
- Figure 4 is a block diagram of the SaS module embedded in a Push To Talk (PTT) client according to the present invention.
- Figure 5 is a block diagram of the SaS module embedded in a mobile phone with PTT over cellular capabilities according to the present invention.
- Figure 6 is a block diagram of the SaS module embedded in a legacy telephone according to the present invention.
- Figure 7 is a diagram illustrating a full duplex conversation with the SaS module for the participants located on the network between user devices according to the present invention.
- Figure 8 is a diagram illustrating a PTT transmission with the SaS module located on the network.
- This attribute allows users to begin conversations, participate in conversations, and review previously received time-shifted messages of conversations even when there is no network available, when the network is of poor quality, or other participants are unavailable; (v) enabling the system to save media payload data at the sender and, after network transmission, saving the media payload data at all receivers; (vi) enabling the system to organize messages by threading them sequentially into semantically meaningful conversations in which each message can be identified and tied to a given participant in a given conversation; (vii) enabling users to manage each conversation with a set of user controlled functions, such as reviewing "live", pausing or time shifting the conversation until it is convenient to review, replaying in a variety of modes (e.g., playing faster, catching up to live, jump to the head of the conversation) and methods for managing conversations (archiving, tagging, searching, and retrieving from archives); (viii) enabling the system to manage and share presence data with all conversation participants, including online status, intentions with respect to reviewing any given message in either the live or time-shifted mode, current attention
- SaS Store and Stream
- one or more SaS module(s) may be provided on end-user communication devices and/or at various nodes or hops on a circuit-based network, such as the Public Switched Telephone Network (PSTN), analog voice networks, cellular networks, Push To Talk (PTT) networks, or any other circuit type based network.
- PSTN Public Switched Telephone Network
- One or more SaS module(s) may also be provided on end-user devices and/or nodes of a packet-based network or a VoIP network or other network system designed to replicate the functionality of telephone or PTT radio systems using packets.
- Time-based media is any media that changes meaningfully with respect to time. Voice or video clips are both considered time-based media because they both substantively change as time passes. By way of comparison, a still photo is generally not considered time-based media because it is static and does not change with the passage of time.
- the SaS module acts as a time-shifting buffer between the existing audio/voice hardware on the device (i.e., a microphone or speaker) and the transmitting and receiving hardware on the device, such as a radio transceiver. Any media either generated at or received at the end-user device is stored in a time-based format in the time-shifting buffer.
- the time-shifting buffer functionality is located on the network, between the transmitting and receiving end-user communication devices. Transmissions between the two communication devices are stored in a time-based format in the time-shifting buffer on the network.
- SaS modules may be provided at both end user communication devices and on the network in one or more network communication devices. With this arrangement, the transmissions are stored both on the SaS enabled end user devices and on the one or more network communication devices on the network.
- the Store and Stream module persistently stores time-based media in the time-based format as the media is received. With embodiments where the SaS module is located on a network communication device, the time-based media is stored as it is received from one or more end-user communication devices as the one or more end-user communication devices send transmissions back and forth over the network.
- the received media that is stored includes both time-based media created by or otherwise originating at the end-user communication device itself and time-based media received over the network from others who have sent messages.
- a user may participate in an ongoing exchange of messages in either (i) a "live" near real-time mode, similar to a conventional phone call where spoken words are sent back and forth between the participants, (ii) a time-shifted mode, where the user hears a message at some point "behind" the live point, along with the ability to catch up to the live point by playing the intervening message or messages faster or directly skipping over messages to arrive at the live point; or (iii) seamlessly transition between the real-time mode and the time-shifted mode.
- the SaS module records the message from the sender. By default, missed messages would be in a time-shifted state. As missed messages are received, they are queued in storage for later review by the recipient. Alternatively, if a called party is present when a message is received, the called party may choose to engage in the conversation in the real-time mode or review the message at an arbitrary time later in the time-shifted mode. In addition, since the message is being recorded in the SaS module, the recipient also has the option of reviewing any portion of the message as it is being received with a number of playback controls, described in more detail below.
- Storing the media at the SaS module provides a number of functions previously not available on communication systems: (i) it enables users to leave a message for another party, even when the sender and/or the recipient has poorly functioning or otherwise unavailable network connectivity; (ii) the recipient has the ability to pause, replay, fast-forward, and catch-up-to-live with an ongoing exchange of messages between two or more parties such as during a conversation; (iii) the ability to retrieve and review previously sent, stored and archived messages; (iv) locally generated messages can be mixed as appropriate to create overlapping messages (generated by the normal overlap of multiple speakers during a conversation); (v) it enables the transcription or translation of voice media into either text or other languages; and (vi) it provides users with several rendering options, including reviewing messages faster or slower, for example.
- When listening to messages in the time-shifted mode at some time after the messages have been received, the SaS module provides the user with the ability to speed up the review of the recorded messages at a rendering rate faster than the rate the messages were originally encoded. This allows for more rapid reviewing of older content and also allows users to "catch up" to the live point of a conversation, seamlessly transitioning from the review of previously received and stored media of the conversation with new media as it is being received, hereafter referred to as "catch-up-to-live".
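The catch-up-to-live behavior can be illustrated with a small worked calculation. The sketch below is not taken from the patent: the function name `catch_up_seconds` and its parameters are hypothetical, and it assumes the worst case in which the sender keeps talking, so the live point advances one second per second while the listener reviews at a faster rate.

```python
def catch_up_seconds(lag_seconds: float, playback_rate: float) -> float:
    """Wall-clock time needed to reach the live point (illustrative only).

    Assumption: the live point keeps advancing at 1x while the listener
    renders stored media at `playback_rate` x, so the lag shrinks by
    (playback_rate - 1) seconds of media per wall-clock second.
    """
    if lag_seconds < 0:
        raise ValueError("lag_seconds must be non-negative")
    if playback_rate <= 1.0:
        raise ValueError("playback_rate must exceed 1.0 to catch up")
    return lag_seconds / (playback_rate - 1.0)


# A listener who is 30 seconds behind and reviews at 1.5x speed.
print(catch_up_seconds(30.0, 1.5))
```

If the sender pauses, the listener reaches the live point sooner than this estimate; skipping over messages, the other option named above, closes the gap immediately.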
- the SaS module provides new functionality, including the ability to communicate in either the real-time mode or time-shifted mode, and the ability to seamlessly transition between the two modes, similar to that described above with regard to full duplex communication systems.
- PTT users will have the ability to skip forward and backward on a per-message basis. For each message, the user has the option of implementing all the above-listed playback features (e.g., play faster, play slower, catch up to live, etc.).
- incoming messages may optionally be played as they are being received.
- All received messages are stored for review at an arbitrary later time defined by the recipient. If a user is reviewing an old message while a new message is received, an indication may be provided. Users have the option of reviewing other messages simultaneously with the arrival of new messages or serializing the review of incoming messages. Users can also replay messages that were missed without asking the sending party to retransmit. Further, the SaS module enables a PTT user to set a "do not disturb" mode of interaction where incoming transmissions that are recorded and queued for the recipient are not played through the speaker. The time-indexing, recording, and the rendering of messages in the time-shifted mode can also apply to any locally generated transmissions. In this manner, a PTT user can review their message contributions, in addition to any received messages, of an ongoing conversation.
- With a SaS module embedded in the device of a sending user, the delays imposed on the user by the network for call setup and "volley" times are reduced.
- a sending user may begin speaking immediately while the SaS module time-indexes and stores the message as time-based media.
- the sending device carries out the usual network negotiation and setup processes.
- the SaS module will send the message from storage starting from the beginning of the message. The typical sequence of a caller waiting for a connection to be established and the phone to ring before speaking is thus eliminated.
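The sending-side behavior described above (speak immediately, store, then send from the beginning of the message once the connection exists) can be sketched as follows. This is an illustrative model only: the class `SendingSide`, its methods, and the `sent` list standing in for the network are all hypothetical names, not part of the patent.

```python
class SendingSide:
    """Toy model: capture starts at once; transmission waits for setup."""

    def __init__(self):
        self.stored = []        # media chunks in capture order (the buffer)
        self.next_to_send = 0   # index of the first chunk not yet sent
        self.connected = False
        self.sent = []          # hypothetical stand-in for the network

    def capture(self, chunk):
        # The user may speak before the connection is established.
        self.stored.append(chunk)
        self._flush()

    def on_connected(self):
        # Called when network negotiation and setup have completed.
        self.connected = True
        self._flush()

    def _flush(self):
        if not self.connected:
            return
        # Send from storage, starting at the beginning of the message.
        while self.next_to_send < len(self.stored):
            self.sent.append(self.stored[self.next_to_send])
            self.next_to_send += 1


side = SendingSide()
side.capture(b"hel")
side.capture(b"lo")
side.on_connected()   # backlog is sent from the start of the message
side.capture(b"!")    # later chunks go out as they are captured
```

Keeping a send cursor separate from the stored media means nothing is discarded after transmission, so the sender can still review the message later.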
- Referring to FIG. 1, a block diagram of the SaS module 24 according to one embodiment of the present invention is shown.
- the main function of the SaS module 24 is that it acts as a time-shifting buffer for communication devices connected to circuit-based or packet-based networks that replicate the functionality of telephone or PTT radio systems such as VoIP.
- the components of the SaS module 24 are described in detail below. It should be noted that in the embodiment illustrated in Figure 1, the SaS module 24 is intended for use in an end-user communication device. In embodiments where the SaS module is located at a node on the network in a network communication device, some of the functionality illustrated in Figure 1 is not required, as pointed out in the discussion below.
- PIMB Persistent Infinite Message Buffer
- the Persistent Infinite Message Buffer or PIMB 30 stores or records time-based media in a time-indexed format and provides a system for the retrieval of the media.
- the media in the PIMB 30 is arbitrarily persistent, meaning it is available virtually forever or at least until it is purposely deleted or deleted in accordance with a predefined retention policy. This persistence is in comparison to existing jitter buffer technology that discards media as soon as it is rendered.
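To make the contrast with a jitter buffer concrete, here is a minimal in-memory sketch of a time-indexed store whose contents survive rendering. The class name `TimeIndexedBuffer` and its methods are hypothetical, and true persistence (disk or archival storage) is deliberately left out.

```python
import bisect


class TimeIndexedBuffer:
    """Illustrative time-indexed media store; reads never discard media."""

    def __init__(self):
        self._times = []    # sorted time indexes, in seconds
        self._chunks = []   # media payloads, parallel to _times

    def store(self, time_index: float, chunk: bytes) -> None:
        # Insert in time order so that retrieval by time is a binary search.
        pos = bisect.bisect_right(self._times, time_index)
        self._times.insert(pos, time_index)
        self._chunks.insert(pos, chunk)

    def read_from(self, start: float):
        # Return all media at or after `start`; unlike a jitter buffer,
        # nothing is removed, so the same span can be reviewed again.
        pos = bisect.bisect_left(self._times, start)
        return list(zip(self._times[pos:], self._chunks[pos:]))


buf = TimeIndexedBuffer()
buf.store(0.00, b"frame-0")
buf.store(0.02, b"frame-1")
buf.store(0.04, b"frame-2")
print(buf.read_from(0.02))
```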
- Various retention rates and strategies may be employed to make effective use of storage resources.
- the PIMB 30 may be implemented using a small and relatively fast RAM cache memory coupled with a hard drive for persistent storage.
- the PIMB 30 is also "infinite" in size, meaning the amount of media that can be stored is not inherently limited. As the physical storage capacity of the PIMB 30 is exceeded, the media is maintained in secondary or archival storage for later retrieval.
- the archival storage may be either local or remote. In embodiments where the SaS module 24 is a node on a network, the archival storage may be located at the same node or at another node on the network.
- the archival storage may be either local at the device or at a remote location accessible over the network.
- predefined criteria or a replacement algorithm, such as least-recently-used or first-in-last-out, are used to control the actual media stored in the PIMB 30 or archived at any point in time.
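One way to picture the least-recently-used option is a two-tier store in which the least recently accessed message is moved to archival storage when primary capacity is exceeded. The sketch below is an assumption-laden illustration: `TieredStore`, `capacity`, and the dictionary standing in for archival storage are hypothetical, and capacity is counted in messages rather than bytes for simplicity.

```python
from collections import OrderedDict


class TieredStore:
    """Illustrative LRU demotion from primary storage to an archive."""

    def __init__(self, capacity: int):
        self.capacity = capacity       # max messages held in primary storage
        self.primary = OrderedDict()   # least recently used entry comes first
        self.archive = {}              # stand-in for local or remote archive

    def put(self, message_id, media):
        self.primary[message_id] = media
        self.primary.move_to_end(message_id)
        while len(self.primary) > self.capacity:
            # Demote (do not delete) the least recently used message.
            old_id, old_media = self.primary.popitem(last=False)
            self.archive[old_id] = old_media

    def get(self, message_id):
        if message_id in self.primary:
            self.primary.move_to_end(message_id)   # mark as recently used
            return self.primary[message_id]
        # Retrieve from the archive and promote back into primary storage.
        media = self.archive.pop(message_id)
        self.put(message_id, media)
        return media


store = TieredStore(capacity=2)
store.put("m1", b"first")
store.put("m2", b"second")
store.get("m1")             # m1 is now the most recently used
store.put("m3", b"third")   # m2 is demoted to the archive
```

Because demoted media is archived rather than dropped, the store still behaves as if it were unbounded from the user's point of view.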
- the PIMB 30 further provides the attributes of file system storage and the random access attributes of a database. Any number of conversations or messages, regardless of their duration, may be stored and later retrieved for review.
- the meta-data associated with messages may be also stored in the PIMB 30.
- the media and other data can be stored for a designated period of time (e.g. 30 days). Once the age of the media exceeds the designated period, the media is discarded. In another embodiment, media may be discarded based on the sender and/or the recipient of the message, or the topic of the message. In yet other embodiments, certain media may be marked for transience, meaning the messages will not be stored in the PIMB 30 beyond the requirements for immediate rendering.
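The age-based and transience-based policies above can be sketched as a simple filter. The names `StoredMessage`, `retained`, and the `transient`/`rendered` flags are hypothetical, and the 30-day default merely mirrors the example period given in the text.

```python
from dataclasses import dataclass

DAY = 24 * 60 * 60  # seconds


@dataclass
class StoredMessage:
    message_id: str
    stored_at: float          # seconds since an arbitrary epoch
    transient: bool = False   # marked for transience
    rendered: bool = False    # already rendered at least once


def retained(messages, now, max_age_seconds=30 * DAY):
    """Return the messages a simple retention policy would keep."""
    keep = []
    for m in messages:
        if m.transient and m.rendered:
            continue   # transient media is not kept beyond rendering
        if now - m.stored_at > max_age_seconds:
            continue   # older than the designated period
        keep.append(m)
    return keep


messages = [
    StoredMessage("old", stored_at=0.0),
    StoredMessage("recent", stored_at=29 * DAY),
    StoredMessage("temp", stored_at=30 * DAY, transient=True, rendered=True),
]
print([m.message_id for m in retained(messages, now=31 * DAY)])
```

Policies keyed on sender, recipient, or topic would add further conditions to the same loop.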
- the PIMB writer 28 writes data to the PIMB 30 for two basic purposes.
- the PIMB writer 28 writes media into the PIMB 30 derived from media received from a media-capturing device, such as a microphone or video camera ("Encode Receive").
- the PIMB writer 28 also writes media contained in messages received over the network from others into the PIMB 30 ("Net Receive").
- the Encode Receive and Net Receive functions are described in more detail below.
- For capturing locally generated media, the PIMB writer 28 includes an Encoder Receiver 28a and a Media Storer 28b.
- the hardware 34 receives the raw audio signals and provides them to the Encoder Receiver 28a, which encodes the voice into electrical signals, for example in digital form.
- the Media Storer 28b time-indexes the digital signals and stores the signals as time-based media in the PIMB 30. Other types of time-based media, such as video, are processed and stored in a similar manner.
- the Encode Receive function is typically implemented only on an end user communication device. In embodiments where the SaS module 24 is located on the network, the Encode Receive functionality may not be needed or implemented.
- the Net Receive function of PIMB writer 28 includes a Network Receiver 28c, a Media Bufferer 28d, and a Media Storer 28e.
- the Network Receiver 28c receives the time-based media of messages over the network.
- the Media Bufferer 28d buffers the incoming signals as necessary.
- the Media Storer 28e time-indexes and stores the time-based media in the PIMB 30.
- the Net Receive function would typically be implemented in a SaS module 24 located on both an end-user communication device and in a network communication device located at a node on the network.
- the PIMB Reader 26 reads data from the PIMB 30 for two basic purposes.
- the PIMB reader 26 accesses the PIMB 30 when a message is to be rendered ("Render") for the user.
- Data is also read from the PIMB 30 when media is to be transmitted ("Transmit") over the network.
- the PIMB reader 26 includes a Media Retriever 26f, a Media Mixer 26g and a Media Decoder 26h.
- the Media Retriever 26f retrieves the media selected for rendering from the PIMB 30. If the selected media of two or more messages overlap by time-index, the Media Mixer 26g mixes the overlapping retrieved media.
- the Media Decoder 26h decodes or converts the media (in either mixed or non-mixed form) into signals in a form suitable for the hardware driver 34.
- the hardware 34 then drives a speaker or video display, creating audio and/or video signals.
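The mixing step in the Render path can be illustrated with a short sketch. The patent does not specify a media encoding or mixing method, so this example assumes decoded 16-bit linear PCM samples whose time-index is expressed as a sample offset, and mixes by summing with clipping. The function name `mix_overlapping` is hypothetical.

```python
INT16_MIN, INT16_MAX = -32768, 32767


def mix_overlapping(messages):
    """Mix messages that overlap by time-index (illustrative only).

    `messages` is a list of (start_sample, samples) pairs, where `samples`
    is a list of 16-bit PCM values (an assumption, not from the source).
    Returns (start_sample, mixed_samples).
    """
    if not messages:
        return 0, []
    start = min(s for s, _ in messages)
    end = max(s + len(samples) for s, samples in messages)
    mixed = [0] * (end - start)
    for s, samples in messages:
        for i, value in enumerate(samples):
            mixed[s - start + i] += value   # overlapping samples are summed
    # Clip to the 16-bit range so overlaps cannot overflow.
    return start, [max(INT16_MIN, min(INT16_MAX, v)) for v in mixed]


speaker_a = (0, [100, 200, 300, 10])
speaker_b = (2, [1000, 32767])   # overlaps the last two samples of A
print(mix_overlapping([speaker_a, speaker_b]))
```

Non-overlapping messages pass through unchanged, which matches the "mixed or non-mixed form" handed to the decoder in the description.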
- the Render function is typically implemented only on an end user communication device. In embodiments where the SaS module 24 is located on the network in a network communication device, the Render functionality may not be needed or implemented.
2. Transmit
- the PIMB Reader 26 includes a Media Retriever 26i, and a Transmitter 26j.
- the Retriever 26i selects the media from the PIMB 30 for transmission and the Transmitter 26j transmits the selected media.
- the selected media may be either media previously stored in the PIMB 30 or media that is currently being created locally for transmission to a recipient over the network.
- the currently created media may be stored in the PIMB either (i) just before transmission, (ii) just after transmission, or (iii) at substantially the same time as the media is transmitted.
- any delays associated with storage and transmission are sufficiently small so that the recipient may perceive a "live" experience if rendering the media in the near real-time mode.
- the media selected for transmission is media that was either previously stored in the PIMB 30 or media that is being progressively received from a sender for transmission to a recipient.
- the media is typically being retrieved from the PIMB 30 when the recipient wishes to review a previously received and stored message in the time-shifted mode.
- the recipient is reviewing the transmission in the real-time mode, meaning the network communication device is progressively receiving, progressively storing in the time-indexed format in the PIMB 30, and progressively transmitting the message to the recipient as the media is being received.
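The two transmit paths at a network communication device (forwarding while receiving, and later transmission from storage) can be sketched with generators. The names `relay`, `replay`, and the plain list used as the buffer are hypothetical simplifications of the PIMB.

```python
def relay(incoming, buffer, live):
    """Progressively store each chunk and, in real-time mode, forward it.

    `incoming` yields (time_index, chunk) pairs as they arrive. Note that
    the generator must be consumed for storage to happen, even when
    `live` is False and nothing is forwarded.
    """
    for time_index, chunk in incoming:
        buffer.append((time_index, chunk))   # store as the media is received
        if live:
            yield time_index, chunk          # forward without waiting


def replay(buffer, start):
    """Time-shifted mode: transmit from storage from `start` onward."""
    for time_index, chunk in buffer:
        if time_index >= start:
            yield time_index, chunk


stored = []
arriving = [(0, b"a"), (1, b"b"), (2, b"c")]
forwarded = list(relay(arriving, stored, live=True))
later = list(replay(stored, start=1))
```

Because storage happens on both paths, a recipient who listened live can still go back and review the same media from the buffer.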
- the interface circuit 40 is responsible for providing an interface between the SaS module 24 and the underlying application module of an end-user communication device. For example, with a VoIP client communication device, the interface circuit 40 provides the interface between the SaS module 24 and the underlying VoIP application of the device. For a PTT radio, the interface circuit 40 provides the interface between the SaS module 24 and the PTT radio application module on the device. With a mobile phone with PTT capabilities over cellular or a standard legacy telephone, the interface circuit provides an interface between the PTT and underlying cellular radio module or standard telephone module of the devices respectively. In each case, the underlying application module is the hardware and/or software responsible for implementing the VoIP, PTT, and legacy phone functionality of the end-user communication device. A network communication device with an SaS module 24 would typically not include an interface circuit 40, unless the network communication device implemented some of the functionality described above.
Controller
- a controller 46 is provided to allow a user to interact with or control the operation of the time-shifting buffer capabilities of the SaS module 24 through the controller interface 48. Through the controller 46, a user can implement a host of functions such as the selection of a message among a plurality of messages or the selection of a number of available rendering options, such as: (i) pausing a live message, (ii) jump backward to review a previous message or messages, (iii) jump to the head (i.e.
- the controller 46 is designed to interface with any number of user input control or selection features provided on the end-user communication device itself. Control or selection features, such as a touch screen graphical user interface, touch input controls, knobs, a keyboard, slide bars, etc. may be provided to enter rendering and other control commands to the SaS module 24 through the controller 46.
- the controller 46 is configured to receive control signals over the network.
- a user of an end-user communication device may remotely operate the controller 46 on the SaS module 24 on the network through various control functions using Dual-Tone Multi-Frequency tones, Short Message Service (SMS) messages, or some other out-of-band signaling mechanism.
- SMS Short Message Service
- the controller 46 enables a user to remotely interact with the SaS module 24 to implement a wide variety of new functions. For example, PTT users can control the SaS module 24 to skip forward and backward on a per-message basis as well as the rendering controls described above within each message (e.g., play faster, play slower, etc.).
- Incoming messages may still be played as they are received, but all received messages will be stored for later review at an arbitrary later time defined by the receiving user. If a user is reviewing an old message, while a new message is received, an indication or flag may be provided through the controller 46. By manipulating the controller 46, users have the option of reviewing other messages simultaneously with the arrival of new messages or serializing the review of incoming messages. Users can also replay messages that were missed without asking the sending party to retransmit. In yet another embodiment, a "do not disturb" mode can be set where incoming transmissions are recorded and queued for later review without being immediately played or rendered. The controller 46 also allows a sender to review their sent messages, in addition to any received messages.
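The handling of incoming messages under a "do not disturb" setting can be sketched as a small state holder. The class `Inbox`, its attributes, and the boolean return convention are hypothetical; the only behavior taken from the text is that every message is stored, and that in "do not disturb" mode messages are queued for later review instead of being rendered immediately.

```python
class Inbox:
    """Illustrative receive-side policy for incoming messages."""

    def __init__(self):
        self.do_not_disturb = False
        self.stored = []   # every received message, kept for later review
        self.queued = []   # messages held back while in do-not-disturb

    def receive(self, message) -> bool:
        """Store the message; return True if it should be rendered now."""
        self.stored.append(message)
        if self.do_not_disturb:
            self.queued.append(message)
            return False
        return True


inbox = Inbox()
play_now = inbox.receive("message 1")    # rendered as it is received
inbox.do_not_disturb = True
held_back = inbox.receive("message 2")   # stored and queued, not rendered
```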
- FIG. 2A shows the sequence of operation of the SaS module when a user transmits messages to a recipient.
- Figures 2B and 2C illustrate the operation of the PIMB Writer 28 and PIMB Reader 26 during transmission.
- Figures 2D and 2E illustrate the operation of the PIMB Writer 28 and PIMB Reader 26 when the SaS module 24 receives a message.
- the flow charts below are described in the context of voice media.
- a user creates messages to be transmitted by speaking into the microphone of their end-user communication device.
- the voice signals of the message are encoded as the user speaks by the PIMB Writer 28 (box 130), which converts the voice into electrical signals and stores the electrical signals in the PIMB 30 (box 132) as Media in a time-indexed format.
- the Transmit function of the PIMB Reader 26 transmits the message to the recipient participant(s) over the network 133.
- the Net Receive function of the PIMB Writer 28 receives the message (box 136) and stores the message as time-based media into the PIMB 30 on the receive SaS module 24.
- the Render function of the PIMB reader 26 on the receive side renders the time-based media from the PIMB 30 into a medium suitable for human consumption, such as voice or video.
- Referring to FIG. 2B, the sequence of the Encode Receive function performed by the PIMB Writer 28 (step 130 of Figure 2A) is provided in detail.
- the transmitting user originates the voice signals or a message to be transmitted by, for example, speaking into a microphone.
- the Encode Receiver 28a progressively encodes the voice signals as they are being created.
- the Media Storer 28b associates a time-index with the encoded signals as the person is speaking (step 130₃) and then progressively stores the media in the PIMB 30 (step 132₄) in a time-indexed format.
- the sequence of the Transmit function performed by the PIMB Reader 26 (step 134 of Figure 2A) on the sending SaS module 24 is provided in detail.
- the media to be transmitted is selected by the Media Retriever 26i. If the selected media is the media that is currently being created on the end user device, then the Transmitter 26j progressively transmits the media in various embodiments either just before, just after or at substantially the same time as the media is stored in the PIMB 30 by the Encode Receive function. Regardless of the embodiment, the media is transmitted without a perceptible delay. As a result, the recipient(s) may optionally render the media in the real-time mode.
- the user may select media for transmission over the network from the PIMB 30 (step 134₃) that was previously stored. For example, a user may select an old message and transmit it over the network. In this latter case, the Transmitter 26j retrieves the media from the PIMB starting at the selected point and progressively transmits the media from storage.
- Referring to FIG. 2D, the sequence for the Net Receive function (step 136 of Figure 2A) of the PIMB Writer 28 of the receive SaS module 24 is illustrated.
- the Network Receiver 28c progressively receives the time-based media of the message over the network. As necessary the incoming media is buffered by the Media Bufferer 28d.
- the Media Storer 28e progressively associates the time-index for the received media (step 136 2) and then progressively stores media in the PIMB 30 (step 136 3) in the time-indexed format.
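A rough sketch of the Net Receive steps above (receive, buffer as necessary, time-index, store) is given below. The source only says that incoming media is buffered "as necessary"; using the buffer to reorder packets by sequence number is an assumption made here for illustration, as are the function name `net_receive`, the packet tuple layout, and the `depth` parameter.

```python
import heapq


def net_receive(packets, store, depth=3):
    """Buffer up to `depth` packets, releasing them in sequence order into `store`.

    `packets` yields (sequence_number, time_index, payload) tuples.
    `store` is a dict mapping time_index -> payload (hypothetical storage).
    """
    heap = []
    for pkt in packets:
        heapq.heappush(heap, pkt)
        if len(heap) > depth:
            # Release the oldest buffered packet once the buffer is over capacity.
            seq, t, payload = heapq.heappop(heap)
            store[t] = payload
    while heap:
        # Flush whatever remains once the incoming stream ends.
        seq, t, payload = heapq.heappop(heap)
        store[t] = payload
```

A small buffer depth keeps the added delay low, which matters if the recipient wants to render the media in the near real-time mode.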
- In FIG. 2E, the sequence for the Render function of the PIMB Reader 26 (box 140 of Figure 2A) on the receive SaS module 24 is illustrated.
- the media (e.g., a message) at a point in time is selected.
- the Media Retriever 26f then progressively retrieves the media from the PIMB 30 starting at the selected point of time (step 140 2).
- the retrieved time-based media is then progressively mixed by the Media Mixer 26g if appropriate (step 140 3).
- the Decoder 26h progressively decodes either the mixed or non-mixed media (step 140 4) into electrical signals suitable for the hardware driver 34, which drives a media-generating device such as a speaker or video display (step 140 5) to generate audio or video.
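The Render steps above (select a point in time, retrieve, mix if appropriate, hand off for output) could look roughly like this. It is a simplified sketch: the decode step is omitted, samples are assumed to be 16-bit integers, mixing is assumed to be sum-and-clip, and the names `mix`, `render`, and the `play` callback are hypothetical.

```python
def mix(streams):
    """Mix equal-length lists of 16-bit samples by summing and clipping."""
    mixed = []
    for samples in zip(*streams):
        total = sum(samples)
        mixed.append(max(-32768, min(32767, total)))
    return mixed


def render(buffer, start_time, play):
    """Retrieve frames at or after start_time, mix overlapping ones, and play them.

    `buffer` maps a time index to a list of streams (each a list of samples).
    `play` is a callback standing in for the decoder and hardware driver.
    """
    for t in sorted(buffer):
        if t < start_time:
            continue
        streams = buffer[t]
        # Mix only when more than one stream shares the same time index.
        frame = mix(streams) if len(streams) > 1 else streams[0]
        play(t, frame)
```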
- the media selected for rendering is the media being progressively received.
- an incoming message may be time-indexed and stored in the PIMB either just before, just after or at substantially the same time the media is being rendered at the end-user device of the recipient. In either case, the delay associated with storing the media is very small. As a result, the recipient may render the media live in the real-time mode. In the time-shifted mode, the media is selected from a previous point in time, retrieved from the PIMB 30, and rendered from the selected point forward.
- progressive or progressively is intended to be broadly construed and generally means the continuous processing of a data stream based on availability of the data. For example, as a person speaks, their voice is continuously encoded, stored, and transmitted, so long as the voice media is being created. When the person pauses or stops speaking, there is no voice media to continuously process. But when the person resumes speaking, the progressive processing and transmission of the voice media resumes. On the receive side, the media is also progressively processed (i.e. stored) in any recipient SaS module, either on the network in a network communication device or at a recipient end-user communication device.
- any intermediate network communication device will continuously or progressively transmit media to the recipient, so long as the media is available.
- the recipient end-user communication device also continually renders the media as it is progressively received, so long as the media is available.
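One way to picture "progressive" processing as defined above is a chain of generators, where each stage handles a frame as soon as it is available instead of waiting for the whole message. The stage names below (`capture`, `encode`, `store_and_forward`) and the upper-casing placeholder codec are assumptions made for this sketch.

```python
def capture(utterances):
    # Yields frames only while the speaker is producing media; pauses yield nothing.
    for utterance in utterances:
        for frame in utterance:
            yield frame


def encode(frames):
    for frame in frames:
        yield frame.upper()       # placeholder for a real codec (assumption)


def store_and_forward(frames, storage):
    for frame in frames:
        storage.append(frame)     # persist first...
        yield frame               # ...then pass downstream without waiting for the rest
```

Pulling a single frame from the end of the chain drives exactly one frame through every stage, so storage and transmission keep pace with creation and stop when the speaker pauses.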
- the flow diagrams 2A through 2E describe the operation of the SaS module 24 on an end-user communication device as noted above.
- the SaS module 24 on the network in a network communication device operates in essentially the same way, with two notable exceptions. Since media is typically not created or rendered on the network, the Encode Receive (Figure 2B) and Render (Figure 2E) functions are typically not implemented. Instead, only the Transmit (Figure 2C) and Net Receive (Figure 2D) functions are required.
- Referring to Figure 3, a block diagram of the SaS module embedded in a VoIP client in accordance with one embodiment of the present invention is shown.
- "VoIP" should be understood as a system that provides full-duplex voice and/or video communication using the Internet Protocol (IP).
- the VoIP client 50 includes a network interface 52 to connect the client 50 to a network 54, a VoIP application module 56 for providing the VoIP functionality, a speaker 57, a microphone 58, a VoIP controller 60, the SaS module 24, and the controller 46.
- the VoIP application module 56 controls the VoIP functionality of the client 50 as is well known in the art.
- the SaS module 24, which is provided between the network 54 and the speaker 57 and microphone 58 on the client device 50, provides the time shifting buffer and SaS functionality, as described above, directly on the device. The operation of the SaS module 24 is controlled through the controller 46, also as described above.
- the VoIP controller 60 controls the standard operation of the VoIP application module 56, as is well known in the art, through the SaS module 24.
- the standard VoIP controls for call initiation and management are routed through the SaS module 24 first before they are sent to the VoIP application module 56 so that the SaS module 24 can allocate necessary resources for the activities of the VoIP application.
- a user of the device sends VoIP related requests to SaS module 24, which in turn, relays them to VoIP application module 56.
- the PTT client 70 includes a radio antenna 72 to connect the client 70 to a PTT radio network (not illustrated), a PTT radio application module 74 for providing the PTT functionality, a speaker 76, a microphone 78, a PTT radio controller 79, the SaS module 24, and the controller 46.
- the PTT radio application module 74 controls the PTT functionality of the client 70 as is well known in the art.
- the SaS module 24, which is provided between the PTT radio network (not illustrated) through the antenna 72 and the speaker 76 and microphone 78 on the client device 70, provides the time shifting buffer and SaS functionality, as described above, directly on the device.
- the operation of the SaS module 24 is controlled through controller 46, also as described above.
- the PTT radio controller 79 controls the standard operation of the PTT radio application module 74, as is well known in the art, through the SaS module 24.
- the mobile phone with PTT over cellular client 80 includes a radio antenna 82 to connect the client 80 to a cellular network (not illustrated), a PTT and cellular radio module 84 for providing standard cellular and PTT functionality, a speaker 86, a microphone 88, a cellular phone/PTT controller 89, the SaS module 24, and the SaS controller 46.
- the PTT and cellular radio module 84 controls PTT and full-duplex voice functionality of the client 80 as is well known in the art.
- the SaS module 24, which is provided between the cellular radio network (not illustrated) through the antenna 82 and the speaker 86 and microphone 88 on the client device 80, provides the time shifting buffer and SaS functionality, as described above, directly on the device.
- the operation of the SaS module 24 is controlled through controller 46, also as described above.
- the cellular phone/PTT radio controller 89 controls the standard operation of the PTT and cellular radio application module 84 through the SaS module 24.
- the SaS module 24 may be provided on a standard mobile phone without the PTT capability and which operates over a cellular network. Such an embodiment would be very similar to the client 80 as illustrated in Figure 5, except the module 84 and the controller 89 would not have PTT functionality or PTT controls respectively.
- the telephone client 90 includes a legacy telephone transceiver 92 to connect the client 90 to a legacy telephone network (not illustrated), a legacy or standard telephone module 94 for providing standard telephone functionality, a speaker 96, a microphone 98, a telephone keypad 99, the SaS module 24, and the SaS controller 46.
- the legacy telephone module 94 controls PTT and full-duplex voice functionality of the client 90 as is well known in the art.
- the SaS module 24, which is provided between the legacy telephone network (not illustrated) through the transceiver 92 and the speaker 96 and microphone 98 on the client device 90, provides the time shifting buffer and SaS functionality, as described above, directly on the device. The operation of the SaS module 24 is controlled through controller 46, also as described above.
- the telephone keypad 99 controls the standard operation of the legacy telephone module 94 through the SaS module 24.
- the SaS module 24 is provided on the client.
- the SaS module 24 may be provided on the network, providing legacy land-line phones, mobile phones, PTT radios, and PTT enabled mobile phones with SaS functionality.
- Referring to FIG. 7, a diagram illustrating a full-duplex conversation where the SaS modules 24 for the participants are located on the network is shown.
- two non-SaS enabled end-user devices A and B are engaged in a full-duplex conversation and two network communication devices 100, each including an SaS module 24, are provided on the network for user A and B respectively.
- media transmitted by user A to user B is transmitted using a first channel.
- a second channel is used for transmissions from user B to user A.
- FIG. 8 is a diagram illustrating a half-duplex PTT system with two network communication devices 100, each including an SaS module 24, for storing the transmissions from end user device A to end user device B (e.g., conventional PTT devices).
- user B controls its respective SaS module 24 through various control functions using DTMF tones, SMS messages, or some other out-of-band signaling mechanism.
- the controller 46 of the SaS module 24 corresponding to user B, in response to the control signals received from the communication device of user B, is configured to control the rendering of the time-based media from user A at the communication device of user B in either the near real-time or time-shifted modes.
- In the near real-time mode, as the time-based media is progressively received at the SaS module 24 from the communication device A, it is progressively stored in the PIMB 30 and progressively transmitted to the communication device B for immediate rendering.
- In the time-shifted mode, the transmission is not immediately rendered at device B as it is being transmitted by device A.
- the transmission is received and stored in the PIMB 30 of the SaS module 24 corresponding to user B, but is not progressively forwarded to the device of user B.
- the media of the missed transmission may be reviewed.
- To review the media in the time-shifted mode, user B generates control signals requesting the message.
- the SaS module 24 retrieves the time-based media of the message from the PIMB 30 and progressively transmits the retrieved time-based media to the communication device of user B for rendering.
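The network-side behaviour described above, where media for user B is always stored but forwarded immediately only in the near real-time mode, might be sketched as follows. The class `NetworkSaS`, its `live` flag, and the `forward` callback are hypothetical names introduced for illustration; how control signals actually reach the module (DTMF, SMS, or other signaling) is left out.

```python
class NetworkSaS:
    """Hypothetical network-side module holding one user's buffer."""

    def __init__(self, forward):
        self.buffer = []          # stored (time_index, payload) pairs
        self.forward = forward    # callable that delivers a chunk to the user's device
        self.live = True          # True: near real-time mode; False: store only

    def on_media(self, time_index, payload):
        self.buffer.append((time_index, payload))   # always store
        if self.live:
            self.forward((time_index, payload))     # forward immediately when live

    def on_review_request(self, start_time=0.0):
        # Time-shifted mode: replay stored media from start_time onward.
        for chunk in self.buffer:
            if chunk[0] >= start_time:
                self.forward(chunk)
```

Since every chunk is stored regardless of mode, a missed transmission remains available for review at any later time.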
- user B may further generate control signals to select rendering options beyond simply playing the time-based media, such as pausing the time-based media as it is being rendered, jumping backward to review previous time-based media, jumping to the most current time-based media, playing the time-based media either faster or slower than it was originally encoded, and catching up to live.
- User B may also seamlessly transition between the near real-time and time-shifted modes.
- user B may pause or stop the rendering of a message from user A in the real-time mode. In this situation, user B may subsequently review the message at an arbitrary later time in the time-shifted mode. Alternatively, user B may be reviewing an old message from user A when a new message arrives. If this occurs, user B may stop the review of the old message and immediately transition to the review of the new message as the media arrives in the real-time mode. Alternatively, user B can implement the catch up to live feature, speeding up the rendering of the old media until caught up with the new media as it progressively arrives over the network. In either case, user B seamlessly transitions from the time-shifted mode to the near real-time mode.
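The rendering options and mode transitions described above can be illustrated with a playback cursor that trails a growing stream. This is a minimal sketch, assuming time is measured in seconds of media; the class `RenderCursor` and all of its method names are hypothetical and not taken from the source.

```python
class RenderCursor:
    """Tracks the playback position within a stream that is still growing."""

    def __init__(self):
        self.position = 0.0   # seconds of media already rendered
        self.live_edge = 0.0  # seconds of media received so far
        self.speed = 1.0      # playback rate; > 1.0 renders faster than recorded
        self.paused = False

    def receive(self, seconds):
        self.live_edge += seconds

    def pause(self):
        self.paused = True

    def play(self, speed=1.0):
        self.paused = False
        self.speed = speed

    def jump_back(self, seconds):
        self.position = max(0.0, self.position - seconds)

    def jump_to_live(self):
        self.position = self.live_edge

    def tick(self, wall_seconds):
        # Advance playback for wall_seconds of real time, never passing the live edge.
        if not self.paused:
            self.position = min(self.live_edge, self.position + wall_seconds * self.speed)
        if self.position >= self.live_edge:
            self.speed = 1.0  # caught up: fall back to normal speed

    @property
    def mode(self):
        return "near real-time" if self.position >= self.live_edge else "time-shifted"
```

In this model, "catch up to live" is simply playing at a speed above 1.0 until the cursor reaches the live edge, at which point the mode reads as near real-time again.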
- a single network communication device 100 may support both end user devices A and B for the embodiments shown in Figures 7 or 8 respectively. With this embodiment, although there is only one network communication device 100, at least a dedicated PIMB would typically be provided for each of users A and B. In yet other embodiments, each communication device 100 may support multiple users besides A and B in either a half-duplex or full-duplex arrangement. In either case, at least a dedicated PIMB would again typically be provided for each user.
- In situations where SaS modules support multiple users, transmit and receive functions may be shared. But each user would ideally have a separate PIMB or a dedicated portion of a larger PIMB. Also with the embodiments shown in Figures 7 and 8, either or both user devices A and B may have their own SaS modules 24.
- Although the above description was generally provided in the context of voice media, it should be understood that all types of time-based media could be processed in a similar manner. SaS modules 24, regardless of their location, can process other types of time-based media, such as other audio content besides voice, video, GPS or positional data, or sensor data such as time, temperature, pressure, etc.
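The multi-user arrangement above, with shared receive logic but a separate buffer per user and support for media types other than voice, could be sketched like this. The class `SharedSaS`, the string media-type tags, and the tuple layout are assumptions made for illustration only.

```python
from collections import defaultdict


class SharedSaS:
    """Hypothetical module serving several users with one receive path."""

    def __init__(self):
        self.buffers = defaultdict(list)   # user id -> that user's own buffer

    def receive(self, recipient, time_index, media_type, payload):
        # Shared receive function; storage is kept separate per recipient.
        self.buffers[recipient].append((time_index, media_type, payload))

    def media_for(self, user, media_type=None):
        # Return one user's stored media, optionally filtered by media type.
        return [c for c in self.buffers[user]
                if media_type is None or c[1] == media_type]
```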
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
The invention relates to a method and a network communication device for providing time-shifting capabilities on a push-to-talk (PTT) communication network to a user of a PTT communication device. The method and network communication device involve progressively receiving time-based media over the PTT communication network from a sending device and progressively storing the received time-based media in a time-shifting buffer located at the network communication device as the time-based media is progressively received. In response to rendering control instructions received from the user of the PTT communication device, the method and network communication device further involve controlling the rendering of the time-based media at the PTT communication device either (i) in a near real-time mode as the time-based media is progressively received at the network communication device or (ii) in a time-shifted mode at an arbitrary later time after the time-based media has been received and stored in the time-shifting buffer. If the received rendering control instructions indicate that the PTT user would like to review the media in the near real-time mode, the method and network communication device further involve progressively transmitting the time-based media to the PTT communication device as the time-based media is progressively received at the network communication device. If the received rendering instructions indicate the time-shifted mode, then the time-based media is progressively transmitted from the time-shifting buffer to the PTT communication device at the arbitrary later time.
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US4427208P | 2008-04-11 | 2008-04-11 | |
US61/044,272 | 2008-04-11 | ||
US12/336,206 | 2008-12-16 | ||
US12/336,232 US8401583B2 (en) | 2008-04-11 | 2008-12-16 | Time-shifting for push to talk voice communication systems |
US12/336,232 | 2008-12-16 | ||
US12/336,206 US8401582B2 (en) | 2008-04-11 | 2008-12-16 | Time-shifting for push to talk voice communication systems |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2009126426A1 (fr) | 2009-10-15 |
Family
ID=40740157
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2009/037966 WO2009126426A1 (fr) | 2008-04-11 | 2009-03-23 | Time-shifting for push to talk voice communication systems |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2009126426A1 (fr) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102647685A (zh) * | 2011-02-17 | 2012-08-22 | 深圳市源通世纪科技有限公司 | Multifunctional system simultaneously supporting PoC intercom over TD-LTE and iDEN intercom |
CN102647684A (zh) * | 2011-02-17 | 2012-08-22 | 深圳市源通世纪科技有限公司 | Multifunctional system simultaneously supporting PoC intercom over TD-LTE and TETRA intercom |
EP2819381A4 (fr) * | 2012-02-21 | 2015-04-01 | Tencent Tech Shenzhen Co Ltd | Method and system for transferring speech information |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005101697A1 (fr) * | 2004-04-19 | 2005-10-27 | Samsung Electronics Co., Ltd. | System and method for the simultaneous control of push-to-talk cellular telephone sessions |
US20060040683A1 (en) * | 2004-08-18 | 2006-02-23 | Nokia Corporation | Cellular radio telecommunications terminal, a system, a method, a computer program and a user interface |
EP1694044A1 (fr) * | 2005-02-17 | 2006-08-23 | Sony Ericsson Mobile Communications AB | Method and communication system for transferring messages containing audio and/or voice information |
US20070021131A1 (en) * | 2004-03-03 | 2007-01-25 | Josef Laumen | Method for delayed entering into a conversation or renewed entering into a conversation of at least one radio communication device in an already existing push-to-talk group discussion, radio communication device, exchange unit and radio communication network |
-
2009
- 2009-03-23 WO PCT/US2009/037966 patent/WO2009126426A1/fr active Application Filing
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102647685A (zh) * | 2011-02-17 | 2012-08-22 | 深圳市源通世纪科技有限公司 | Multifunctional system simultaneously supporting PoC intercom over TD-LTE and iDEN intercom |
CN102647684A (zh) * | 2011-02-17 | 2012-08-22 | 深圳市源通世纪科技有限公司 | Multifunctional system simultaneously supporting PoC intercom over TD-LTE and TETRA intercom |
EP2819381A4 (fr) * | 2012-02-21 | 2015-04-01 | Tencent Tech Shenzhen Co Ltd | Method and system for transferring speech information |
US9232371B2 (en) | 2012-02-21 | 2016-01-05 | Tencent Technology (Shenzhen) Company Limited | Method and system for transferring speech information |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8538471B2 (en) | Time-shifting for push to talk voice communication systems | |
US8542804B2 (en) | Voice and text mail application for communication devices | |
US8233598B2 (en) | Telecommunication and multimedia management method and apparatus | |
US20240064119A1 (en) | Telecommunication and multimedia management method and apparatus | |
US8428559B2 (en) | Method for recording mobile phone calls | |
US20120114108A1 (en) | Messaging communication application | |
US8509123B2 (en) | Communication application for conducting conversations including multiple media types in either a real-time mode or a time-shifted mode | |
US8533611B2 (en) | Browser enabled communication device for conducting conversations in either a real-time mode, a time-shifted mode, and with the ability to seamlessly shift the conversation between the two modes | |
US20050210394A1 (en) | Method for providing concurrent audio-video and audio instant messaging sessions | |
US9054912B2 (en) | Communication application for conducting conversations including multiple media types in either a real-time mode or a time-shifted mode | |
US20040224678A1 (en) | Reduced latency in half-duplex wireless communications | |
US8699383B2 (en) | Method and apparatus for real-time synchronization of voice communications | |
KR20100086484A (ko) | 네트워크를 통한 실시간 미디어 동기화 방법 및 시스템 | |
WO2009126426A1 (fr) | Time-shifting for push to talk voice communication systems | |
US20120300915A1 (en) | Message storage device and a moving image message processor |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 09729945 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 09729945 Country of ref document: EP Kind code of ref document: A1 |