
US9843768B1 - Audience engagement feedback systems and techniques - Google Patents


Info

Publication number
US9843768B1
US9843768B1
Authority
US
United States
Prior art keywords
audience
speech
engagement
members
aggregated
Prior art date
Legal status
Expired - Fee Related
Application number
US15/274,338
Inventor
Ansuya Negi
Igor Tatourian
Rita H Wouhaybi
Current Assignee
Intel Corp
Original Assignee
Intel Corp
Priority date
Filing date
Publication date
Application filed by Intel Corp
Priority to US15/274,338
Assigned to Intel Corporation (assignors: Ansuya Negi, Igor Tatourian, Rita H. Wouhaybi)
Application granted
Publication of US9843768B1

Classifications

    • H04N 7/15: Conference systems (two-way television systems)
    • G10L 15/08: Speech classification or search
    • G10L 25/63: Speech or voice analysis for estimating an emotional state
    • H04H 60/33: Arrangements for monitoring the users' behaviour or opinions of broadcast services
    • H04L 65/4015: Services involving a main real-time session and one or more additional parallel real-time or time-sensitive sessions (e.g., white board sharing, collaboration, or spawning of a subconference)
    • H04L 65/403: Arrangements for multi-party communication, e.g. for conferences
    • H04L 67/10: Protocols in which an application is distributed across nodes in the network
    • G06V 40/174: Facial expression recognition
    • G06V 40/20: Movements or behaviour, e.g. gesture recognition
    • G10L 2015/088: Word spotting
    • H04L 67/535: Tracking the activity of the user

Definitions

  • a person attempting to get an idea across is not always understood as intended. This type of misunderstanding may lead to unintended consequences or results.
  • Clear communication is a crucial success factor for people involved in day-to-day conferencing and collaboration. While talking to a live audience, it is often easier for a speaker to see and correct misunderstandings. Unfortunately, in remote sessions a speaker is relatively isolated and may “miss the audience” or fail to get a point across. Solutions such as relying on the audience to interrupt the speaker and say that an idea is unclear have low accuracy and are limited by the audience's ability to notify the speaker that there has been a misunderstanding.
  • FIG. 1 illustrates a conference call user interface including an aggregated engagement level indicator and a plurality of individual audience engagement level indicators in accordance with some embodiments.
  • FIG. 2 illustrates a block diagram showing physical (e.g., sensor-based) inputs and contextual inputs used to evaluate engagement metrics in accordance with some embodiments.
  • FIG. 3 illustrates a diagram showing display indicators for audience engagement levels in accordance with some embodiments.
  • FIG. 4 illustrates a system for indicating audience engagement in accordance with some embodiments.
  • FIG. 5 illustrates a flowchart showing a technique for indicating audience engagement in accordance with some embodiments.
  • FIG. 6 illustrates generally an example of a block diagram of a machine upon which any one or more of the techniques (e.g., methodologies) discussed herein may perform in accordance with some embodiments.
  • Systems and methods for indicating audience engagement are described herein.
  • the systems and methods described herein may include using a user interface for displaying various indications of audience engagement.
  • a system described herein may use a camera and computer vision to recognize facial emotions, and may use wearable sensors to gauge electrical body response, remote heart monitoring, or other sensor capabilities. Captured information may be aggregated and anonymized and serve as input into an engagement process.
  • the system may collect facial emotions, body gesturing, or body responses and aggregate these or other emotional indicators into a feedback-based engagement process.
  • the system may output an anonymized indication of the audience's emotional response to a speaker (or presenter) or to the content of the presentation.
  • the audience emotional response may be represented via a color-coded thermometer, emoticons or emojis (e.g., an emotional representation such as a yellow smiley face), or other visual means.
  • a speaker may use a virtual reality headset, and audience emotional response may be represented as 3D avatars with changing expressions, such as smiling, bored, etc.
  • the system may provide feedback on emails and articles. For example, the system may display an article and monitor a reader's engagement. The monitored and stored information may be anonymized and reported to the author. The system may also be used for measuring interest in movies or TV shows.
  • a state machine middleware may be used to record changing interest and to tie the changing interest to content or to a presenter's voice. As a presenter gives a talk, the presenter or the audience may see the engagement indication.
  • the system may include a set of predefined audience profiles (e.g., analogous to workload for a server) and a speaker may practice giving a speech to different audiences. The feedback given to the speaker may be directed to allowing the speaker to adjust the speech based on the predictive output. For example, a fresh graduate may practice a speech before an interview with a company and receive coaching feedback.
  • the system may collect emotional feedback from many participants in a teleconference and collate the responses to form an idea of the success of a meeting.
  • Participants may opt-in and the system may identify and alert a presenter or observer to situations where the meeting had issues, broke down, or failed, such as when there was no consensus established in the meeting.
  • the participant opt-in may be used to allow provided information to be public (e.g., not anonymous).
  • the system may point out cultural differences that may be affecting a speech or meeting and lead the participants towards internationally or culturally preferred language.
  • a user may share success or failure regarding audience engagement with other users by uploading the data to the cloud or tagging the audience engagement with relevant keywords.
  • an applicant to a hardware engineer position at a company may share the reaction and outcome of an interview with others on a job website. The service may then show that the interviewers tended to be serious and not appreciate jokes during a presentation or that sports references were viewed favorably.
  • the data may be analyzed by different parties for different outcomes.
  • a human resources department may analyze the data to understand some characteristics of successful applicants for use in future postings. This may result in a more efficient hiring process.
  • an audience may receive feedback, such as by sending information to a student that other students are also frustrated or do not understand a professor, or that other audience members are angry, happy, etc.
  • a system may be used with a child or a person with autism or special needs to detect certain changes in mood, emotion, or behavior that may be otherwise undetectable by a person.
  • the systems and methods described herein may include an opt-in option for an audience member to make suggestions or provide information so that a speaker may tailor a speech with a predicted likelihood of success.
  • an email or a blog post may be used with a mock audience or a sample audience to give feedback and automatically determine an emotional engagement of a potential or actual reader.
  • a presenter may use too much jargon or terms of art for a wider audience.
  • the systems and methods described herein may be applied for direct in-person communication.
  • a business presenter may be giving a presentation, such as a sales presentation, presentation at a board meeting, corporate meeting, etc., and may view an emotional engagement indicator on a personal device while giving the presentation.
  • FIG. 1 illustrates a conference call user interface 100 including an aggregated engagement level indicator 110 and a plurality of individual audience engagement level indicators (e.g., 102 - 108 ) in accordance with some embodiments.
  • the aggregated level indicator 110 may be a representation of a thermometer, as shown in FIG. 1 , or other indicator, such as a dial, progress bar, or other visual representation.
  • the aggregated level indicator 110 may represent an average, a weighted average, an aggregation, sample set, or other combination of emotional engagement metrics of audience members.
  • the aggregated level indicator 110 may represent an average engagement of audience members represented on the conference call user interface 100 (e.g., represented by the individual audience engagement level indicators 102 - 108 ) or an average engagement of audience members not represented.
  • the individual audience engagement level indicators 102 - 108 may include emojis, emoticons, avatars, text representations, or other visual representations of an emotion, such as an interested emoji 102 , a surprised emoji 104 , a content or happy emoji 106 , or a disengaged emoji 108 .
  • Additional complexity in emotional display may be conveyed through a variety of emojis or the like. For example, colors may be used to indicate mood, while the facial expressions of the emojis represent engagement level.
  • the conference call user interface 100 may include long-term learning or prediction based on aggregated data in the cloud from multiple users of the conference call user interface 100 . For example, when speakers upload their data to the cloud, the conference call user interface 100 may become trained with the data and predict an outcome. A speaker may then rehearse a pitch using the conference call user interface 100 with no audience and receive feedback from the aggregated level indicator 110 . The speaker may specify an audience based on context (e.g., location, ages, affiliations, level of expertise in the subject, etc.) or information may be extracted from a calendar of the speaker.
  • the speaker may rehearse a talk and the conference call user interface 100 may predict a level of engagement or highlight areas to be worked on based on previous data of a sample or mock audience. Other elements may be factored into the prediction or the aggregated level indicator 110 , such as the context of the speaker (e.g., is the speaker going to be nervous, is the speaker going to be sleep deprived, etc.) or other environmental factors (e.g., is the venue or podium a place where speakers tend to get hot, is the amount of light going to hurt the performance of the speaker, etc.). Based on the data, the conference call user interface 100 may provide a predictive success metric.
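  • As a minimal sketch of how such a predictive success metric might be computed (the field names, weights, and penalty model below are illustrative assumptions, not taken from this patent), a rehearsal score could combine an audience-profile match with speaker-context and environmental penalties:

```python
# Illustrative sketch only: one way a predictive success metric could combine
# audience profile, speaker context, and environmental factors. All field
# names and weights are hypothetical, not taken from the patent.
from dataclasses import dataclass

@dataclass
class RehearsalContext:
    audience_profile_match: float  # 0..1, how well the speech historically fits this audience profile
    speaker_nervousness: float     # 0..1, expected nervousness
    sleep_deprivation: float       # 0..1, expected sleep deprivation
    venue_heat: float              # 0..1, tendency of the venue/podium to get hot
    poor_lighting: float           # 0..1, expected lighting problems

def predictive_success_metric(ctx: RehearsalContext) -> float:
    """Return a 0..1 success prediction, penalizing speaker/environment factors."""
    penalty = (0.25 * ctx.speaker_nervousness + 0.25 * ctx.sleep_deprivation
               + 0.15 * ctx.venue_heat + 0.15 * ctx.poor_lighting)
    score = ctx.audience_profile_match * (1.0 - penalty)
    return max(0.0, min(1.0, score))

# Example: a well-matched audience but a nervous, somewhat sleep-deprived speaker.
print(predictive_success_metric(RehearsalContext(0.9, 0.6, 0.4, 0.1, 0.0)))
```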
  • the conference call user interface 100 may be used by each individual participant.
  • the conference call user interface 100 may be implemented on a mobile device, smartphone, smart device, computer (e.g., video conferencing or audio conferencing), or the like.
  • individual emoticons may be generated for audience members and displayed as individual audience engagement level indicators (e.g., 102 - 108 ) for feedback.
  • anonymous feedback may be presented to a speaker, such as using aggregated results.
  • the conference call user interface 100 may provide feedback based on demographics, such as a separate indicator for age, gender, interest (e.g., science interest or humanities interest or art interest), or the like.
  • language specific demographics may be used, such as indicating that a speaker insulted people from a certain ethnic background.
  • the demographics may be aggregated over previous speeches, such as to develop a pattern over different speeches, including by using machine learning.
  • the conference call user interface 100 may collect inputs and correlate audio tones or key words with particular emotions, such as by using machine learning.
  • the conference call user interface 100 may use a practice speech to predict emotional reactions (e.g., a practice speech without an audience).
  • the aggregated level indicator 110 may be displayed as a thermometer to show overall engagement.
  • a plurality of aggregated level indicators may be used to show separate engagement levels for different types of speech (e.g., healthcare, education, information, etc.), different types of audience (e.g., based on demographics, number of members, etc.), different types of desired engagement levels (e.g., a speech intended to inspire a crowd, a speech intended to calm an audience, a speech intended to convey factual information without eliciting an emotional response, etc.), or the like.
  • the conference call user interface 100 may inform a speaker as to which audiences would be better suited for a particular speech or suggest modifications to the speech based on predicted (or pre-signed up) audience.
  • the conference call user interface 100 may tailor a lecture per time or per hour, such as for different student groups or classes.
  • the conference call user interface 100 may provide feedback after the speech is given (or after a series of speeches are given, such as lectures over a period of semesters or years), such as the overall engagement.
  • FIG. 2 illustrates a block diagram 200 showing physical (e.g., sensor-based) inputs (e.g., 202 - 208 ) and contextual inputs (e.g., 212 - 218 ) used to evaluate engagement metrics 224 in accordance with some embodiments.
  • the physical inputs may include facial data 202 , hand gestures 204 , wearable data 206 , or body language 208 . These inputs may be aggregated, averaged, weighted, or anonymized to create a physical data structure 210 . One or more of these inputs may be ignored or not used for a given time when determining a physical data structure 210 .
  • the physical data structure 210 along with a contextual data structure 220 may be used as an input data structure 222 .
  • the physical data structure 210 and the contextual data structure 220 may be combined as an average, weighted average, etc.
  • the contextual data structure 220 may include personal data 212 , location data 214 , cultural data 216 , or familiarity data 218 . These data inputs may be ignored or not used for a given time in determining the contextual data structure 220 .
  • the input data structure 222 may be used to determine engagement metrics 224 .
  • the engagement metrics 224 may include agreement 226 , engagement level 228 , emotional response 230 , focus 232 , or feedback 234 .
  • the facial data 202 includes information about facial expressions, such as smiling, frowning, squinting, etc.
  • the hand gestures 204 may include predetermined hand gestures, or may include information about whether hand gestures are made or not.
  • the wearable data 206 may include a heart rate, oxygen saturation, amount of movement, etc.
  • the body language 208 may include posture information, orientation, etc.
  • the personal data 212 may include demographic information, such as age, gender, etc.
  • the location data 214 may include a current location, an origin location, etc. (e.g., location data 214 may indicate that an audience member is currently traveling).
  • the cultural data 216 may include a country of origin, self-identified ethnic or cultural groups (e.g., religion), generation, etc.
  • the familiarity data 218 may include familiarity with the speaker, the topic, the speech itself (e.g., an audience member who is part of a team working on similar subject matter, or someone that helped write the speech), a company of the speaker, or the like.
  • the engagement metric 224 may be determined or displayed separately or in any combination.
  • agreement 226 may be displayed using a first indicator, and an average of the remaining factors may be displayed using a second indicator. Other combinations may be used.
  • the agreement 226 metric may include a determined amount of agreement with a statement or statements made by a speaker.
  • the engagement level 228 metric may include a level of interest in a statement or speech.
  • the emotional response 230 may include an emotional connection or response (intended or not intended by the speaker) to a statement or speech.
  • the focus 232 metric may represent an amount of attention being paid to a statement or speech.
  • the feedback 234 metric may include audience member submitted feedback, automatically determined feedback (e.g., based on previously input desired responses from the audience), or the like.
  • the engagement metrics 224 may be displayed for a single audience member, a subset of members of an audience, or the audience as a whole.
  • the engagement metrics 224 may be based on one or more of the physical data 210 or the contextual data 220 , which may be based on one or more of the sub-inputs (e.g., 202 - 208 for physical data or 212 - 218 for contextual data).
  • Each engagement metric of the engagement metrics 224 may be based on the same or different inputs or combinations of inputs.
  • The block diagram 200 illustrates a quantification of human engagement, using metrics or weighted metrics (the engagement metrics 224 ) to monitor participants over time.
  • the human emotions may include collective indicators taken from facial expressions, body language, gestures, and sensor responses as well as contextual data.
  • User input may be monitored to understand and measure body language responses.
  • Physical data 210 may include posture, eye gazing, hand positions, head tilt and so on.
  • Contextual data may include demographics such as age, gender, time of day, day of week, location, expertise, and cultural background, for example to normalize different reactions based on the topic and other human and cultural variations.
  • An input may be assigned a weight depending upon a situation and a collective set of metrics (e.g., the engagement metrics 224 ) may be deduced.
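  • A minimal sketch of this weighted combination, assuming illustrative input names, normalized 0..1 values, and arbitrary weights (none of which come from the patent), might look like the following:

```python
# Minimal sketch (assumed names/weights, not from the patent) of combining
# weighted physical inputs (202-208) and contextual inputs (212-218) into an
# input data structure (222) and deducing engagement metrics (224).
from typing import Dict

def weighted_combine(inputs: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted average of normalized 0..1 inputs; inputs without a weight are skipped."""
    present = {k: v for k, v in inputs.items() if k in weights}
    total_w = sum(weights[k] for k in present)
    if total_w == 0:
        return 0.0
    return sum(v * weights[k] for k, v in present.items()) / total_w

physical = {"facial": 0.8, "hand_gestures": 0.4, "wearable": 0.6, "body_language": 0.5}
contextual = {"personal": 0.7, "location": 0.5, "cultural": 0.6, "familiarity": 0.3}

physical_score = weighted_combine(physical, {"facial": 0.4, "hand_gestures": 0.2,
                                             "wearable": 0.2, "body_language": 0.2})
contextual_score = weighted_combine(contextual, {"personal": 0.3, "location": 0.2,
                                                 "cultural": 0.3, "familiarity": 0.2})

# Input data structure (222): physical and contextual parts combined, here 70/30.
input_score = 0.7 * physical_score + 0.3 * contextual_score

# Engagement metrics (224): each metric may weight the same inputs differently.
engagement_metrics = {
    "agreement": weighted_combine(physical, {"facial": 0.6, "body_language": 0.4}),
    "engagement_level": input_score,
    "focus": weighted_combine(physical, {"body_language": 0.5, "wearable": 0.5}),
}
print(engagement_metrics)
```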
  • feedback 234 may include participants asking questions, or sending online messages or emails. There may be an incentive for a participant to choose to share wearable data (e.g., students taking a class, friends helping a speaker rehearse, etc.).
  • the data described above may be aggregated to maintain the privacy of the audience.
  • the aggregated data may be binned or profiled based on the different demographics or inputs extracted from the context and demographics.
  • the feedback 234 may provide recommendations based on input data, such as location data 214 .
  • For example, the geography of a participant may be used to surface potential cultural references, or local news may be determined.
  • For example, when talking to participants in India, the feedback 234 may suggest using a cricket reference, and when talking to participants in the U.S., a baseball reference.
  • Other examples may include using a day of week, time of day, age, gender, etc., to tailor the displayed feedback (e.g., one type of feedback for teenagers and another for an older audience).
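  • A toy sketch of such location-aware feedback, with a purely hypothetical country-to-reference mapping, could look like this:

```python
# Hypothetical sketch of location-aware feedback (the mapping below is
# illustrative only): recommend a culturally relevant reference based on
# the participants' location data (214).
SPORT_REFERENCE_BY_COUNTRY = {
    "IN": "cricket",
    "US": "baseball",
}

def recommend_reference(country_code: str) -> str:
    sport = SPORT_REFERENCE_BY_COUNTRY.get(country_code)
    if sport is None:
        return "Consider a locally relevant reference (e.g., local news)."
    return f"Consider a {sport} reference for this audience."

print(recommend_reference("IN"))  # suggests a cricket reference
print(recommend_reference("US"))  # suggests a baseball reference
```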
  • FIG. 3 illustrates a diagram 300 showing display indicators (e.g., 302 , 306 , 310 ) for audience engagement levels in accordance with some embodiments.
  • the diagram 300 includes an example set of inputs (e.g., 302 ) weighted according to predetermined weights (e.g., 304 ) that may be shown over time on a user interface (e.g., using indicator 306 or 310 ).
  • the current displayed indicator 306 may be determined using a cumulative weighted average of inputs, such as input 302 according to weights (e.g., weight 304 ).
  • the cumulative weighted average of inputs may be cumulative over audience members (e.g., aggregating or averaging a number of audience members for a given time) or may be cumulative over time (e.g., a single audience member over time, a subset of audience members over time, the audience as an aggregate over time, etc.).
  • engagement metrics for specific members of an audience may be given more weight (e.g., a decision maker is given more weight than an intern).
  • the inputs may be emoticons or emojis, and may be output along a timeline 308 .
  • the timeline 308 may include past information or the displayed indicator 306 may change over time (e.g., timeline 308 represents changes over time, with a single emoticon or emoji shown at a time).
  • the timeline 308 may start with a neutral input (e.g., input 310 ) and change according to the inputs (e.g., 302 ).
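  • A small sketch of a running, weighted indicator along the timeline 308 , assuming illustrative thresholds and ASCII emoticons in place of emojis, might be:

```python
# Sketch (assumed thresholds and symbols) of how the displayed indicator (306)
# could track a cumulative weighted average of inputs along a timeline (308),
# starting from a neutral value.
def to_emoticon(score: float) -> str:
    if score >= 0.75:
        return ":-D"   # engaged / happy
    if score >= 0.5:
        return ":-)"   # content
    if score >= 0.25:
        return ":-|"   # neutral
    return ":-("       # disengaged

def timeline_indicator(samples, weights, start=0.5):
    """Yield the indicator over time as new weighted samples arrive."""
    total, weight_sum = start, 1.0  # seed with a neutral input
    for value, w in zip(samples, weights):
        total = (total * weight_sum + value * w) / (weight_sum + w)
        weight_sum += w
        yield to_emoticon(total)

# Engagement samples over time; the second sample (e.g., a decision maker) is weighted more heavily.
print(list(timeline_indicator([0.9, 0.8, 0.3, 0.2], [1.0, 2.0, 1.0, 1.0])))
```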
  • FIG. 4 illustrates a system 400 for indicating audience engagement in accordance with some embodiments.
  • the system 400 includes processing circuitry 402 , memory 404 , a display 406 .
  • the system 400 may include a speaker 410 or a microphone 412 .
  • the display 406 includes an engagement level indicator 408 , such as the engagement level indicator 110 of FIG. 1 .
  • the system 400 may include a server, a cloud system, or a computing device.
  • the processing circuitry 402 may be used to send speech from a speaker to be played for a plurality of members of an audience.
  • the processing circuitry 402 is used to: receive audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience; process the audience reaction information to aggregate the audience reaction information; determine, using the processed information, an engagement factor; and send an aggregated engagement level indicator 408 to a speaker device, the aggregated engagement level indicator 408 based on the engagement factor and indicating overall engagement of the audience to the speech.
  • the display 406 may include a user interface for displaying the engagement level indicator 408 .
  • the audience engagement indication system 400 may include components remote from each other, such as including the processing circuitry 402 on a server and the display 406 on a speaker device.
  • a technique may be implemented using multiple audience engagement indication systems 400 , for example, a server used to process the information and a speaker device for receiving an aggregated engagement level indicator 408 and displaying the aggregated engagement level indicator 408 .
  • the audience engagement indication system 400 may include a camera 414 for capturing video of a presenter.
  • the camera 414 may capture physical responses of at least one of the plurality of members of the audience while the speech is playing.
  • the physical responses may include a facial emotion, a body gesture, or a hand gesture.
  • the microphone 412 may be used to capture audio of a presenter.
  • the audience engagement indication system 400 may receive information from a sensor or sensors of a wearable device or wearable devices, such as biometric data from at least one of the plurality of members of the audience while the speech is playing.
  • the audience reaction information may be received at the audience engagement indication system 400 from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data.
  • the selection to opt in may be saved in the memory 404 .
  • the display 406 may include a specific engagement level indicator 408 for the opted in member, an engagement level indicator 408 for the audience as a whole, or the like.
  • the engagement level indicator 408 may include an emoticon, an emoji, a 3D avatar, or the like.
  • the engagement level indicator 408 may be updated in real time on the display 406 , and the real time updates may be adjusted based on received updated audience reaction information.
  • the processing circuitry 402 may anonymize the audience reaction information.
  • the anonymized audience reaction information may be saved in the memory 404 .
  • the data received from the plurality of members of the audience may include location data, personal data, cultural data, familiarity data, time data, age data, gender data, or the like. This data may be saved in a structured database in the memory 404 of the audience engagement indication system 400 .
  • the processing circuitry 402 may save the processed information and the speech to the memory 404 .
  • the processing circuitry 402 may correlate audio tones or key words in the speech with emotional responses in the processed information using machine learning based on stored speech and processed information.
  • the processing circuitry 402 may predict emotional reactions of a hypothetical audience to a second speech using the correlated audio tones or key words.
  • the processing circuitry 402 may suggest modifications to the second speech based on the predicted emotional reactions of the hypothetical audience.
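  • As a hedged sketch of this idea, a very simple keyword-to-response table can stand in for the machine-learning correlation described above; the data and scoring below are illustrative assumptions only:

```python
# Hedged sketch: a simple stand-in for the machine-learning correlation the
# patent describes, mapping key words in stored speeches to the average
# emotional response observed for them, then scoring a second speech.
from collections import defaultdict
from statistics import mean

def learn_keyword_correlations(history):
    """history: iterable of (keyword, emotional_response) pairs from stored speeches."""
    by_keyword = defaultdict(list)
    for keyword, response in history:
        by_keyword[keyword.lower()].append(response)
    return {k: mean(v) for k, v in by_keyword.items()}

def predict_reaction(second_speech: str, correlations, default=0.5):
    """Predict per-word reactions for a new speech; unknown words get a neutral default."""
    words = second_speech.lower().split()
    scores = [correlations.get(w, default) for w in words]
    flagged = [w for w in words if correlations.get(w, default) < 0.3]  # areas to work on
    return mean(scores), flagged

history = [("synergy", 0.2), ("synergy", 0.25), ("results", 0.8), ("thanks", 0.7)]
correlations = learn_keyword_correlations(history)
print(predict_reaction("thanks for the great results despite the synergy", correlations))
```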
  • the processing circuitry 402 may send the aggregated engagement level indicator 408 to a device of a member of the plurality of members of the audience.
  • FIG. 5 illustrates a flowchart showing a technique 500 for indicating audience engagement in accordance with some embodiments.
  • the technique 500 includes an operation 502 to receive audience reaction information.
  • the audience reaction information may be in reaction to speech sent from a speaker (e.g., a presenter or other person) to be played for a plurality of members of an audience.
  • the audience reaction information may be received for the plurality of members of the audience, such as from a plurality of sensors.
  • the audience reaction information is captured by the plurality of sensors while the speech is playing for the plurality of members of the audience.
  • the audience reaction information may be captured after the speech has been played.
  • the audience reaction information may include pre-speech, during speech, or post-speech information, such as a change in reaction measured from a baseline.
  • the speech may include a video of the speaker.
  • the speaker may be remote from the audience.
  • the plurality of sensors may include a camera.
  • the camera may capture physical responses of at least one of the plurality of members of the audience, such as while the speech is playing.
  • the physical response may include at least one of a facial emotion, a body gesture, or a hand gesture.
  • the plurality of sensors may include a wearable sensor.
  • the wearable sensor may capture biometric data from at least one of the plurality of members of the audience, such as while the speech is playing.
  • the audience reaction information may be received from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data in an individual or an aggregated form.
  • the technique 500 includes an operation 504 to process the audience reaction information.
  • the audience reaction information may be processed to aggregate the audience reaction information. Processing the audience reaction information may include anonymizing the audience reaction information.
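  • A minimal sketch of this processing step (operation 504 ), with assumed record fields and a simple average-based aggregation, might look like the following:

```python
# Illustrative sketch of operation 504: anonymizing and aggregating audience
# reaction information. Field names are assumptions, not taken from the patent.
from statistics import mean

def process_reactions(reactions):
    """reactions: list of dicts like {"member_id": ..., "engagement": 0..1, "age_group": ...}."""
    # Anonymize: drop identifying fields, keep only coarse demographics and scores.
    anonymized = [{"age_group": r.get("age_group", "unknown"),
                   "engagement": r["engagement"]} for r in reactions]
    # Aggregate: overall average plus per-demographic bins.
    overall = mean(r["engagement"] for r in anonymized)
    bins = {}
    for r in anonymized:
        bins.setdefault(r["age_group"], []).append(r["engagement"])
    per_bin = {group: mean(vals) for group, vals in bins.items()}
    return {"overall_engagement": overall, "by_age_group": per_bin}

sample = [{"member_id": "alice", "engagement": 0.9, "age_group": "18-24"},
          {"member_id": "bob", "engagement": 0.4, "age_group": "35-44"}]
print(process_reactions(sample))
```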
  • the technique 500 includes an operation 506 to determine an engagement factor.
  • the engagement factor may be determined using the processed information. Determining the engagement factor may include using data of the plurality of members of the audience. The data may include location data, personal data, cultural data, familiarity data, time data, age data, or gender data.
  • the technique 500 includes an operation 508 to send an aggregated engagement level indicator to a speaker device.
  • the aggregated engagement level indicator may be based on the engagement factor.
  • the aggregated engagement level indicator indicates an overall engagement of the audience to the speech.
  • the overall engagement of the audience to the speech may include an average or a weighted average of engagement of the plurality of members of the audience.
  • Sending the aggregated engagement level indicator may include sending a plurality of aggregated engagement level indicators corresponding to the plurality of members (e.g., to be displayed on the speaker device).
  • the aggregated engagement level indicator may include an emoticon, an emoji, or a 3D avatar.
  • sending the aggregated engagement level indicator may include sending a series of aggregated engagement level indicators in real time updates to the speaker device.
  • the real time updates may be adjusted based on aggregated or individual received updated audience reaction information. For example, an emoji may appear to change over time in reaction to speech by the speaker changing.
  • the technique 500 includes an operation to save the processed information and the speech to memory.
  • the technique 500 may include an operation to correlate audio tones or key words in the speech with emotional responses in the processed information, such as by using machine learning. Correlating audio tones or key words may include using a time offset to account for a transmission delay or a language barrier issue. Emotional reactions may be predicted of a hypothetical audience to a second speech, such as by using the correlated audio tones or key words or previous reactions from past audiences stored in the memory. In an example, modifications may be suggested to the second speech, such as modifications based on the predicted emotional reactions of the hypothetical audience.
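  • A short sketch of the time-offset alignment mentioned above, assuming illustrative timestamps and a fixed transmission delay, could be:

```python
# Sketch (assumed data shapes) of applying a time offset when correlating speech
# key words with audience reactions, to account for a transmission delay.
def align_reactions(keyword_times, reaction_times, delay_seconds=2.0, window=1.5):
    """Pair each (keyword, t) with reactions observed near t + delay_seconds."""
    pairs = []
    for keyword, t in keyword_times:
        expected = t + delay_seconds
        nearby = [r for r, rt in reaction_times if abs(rt - expected) <= window]
        pairs.append((keyword, nearby))
    return pairs

keywords = [("budget", 10.0), ("deadline", 25.0)]
reactions = [(0.2, 12.1), (0.8, 26.9), (0.5, 40.0)]
print(align_reactions(keywords, reactions))
```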
  • the technique 500 includes an operation to send the aggregated engagement level indicator to a device of a member of the plurality of members of the audience.
  • FIG. 6 illustrates generally an example of a block diagram of a machine 600 upon which any one or more of the techniques (e.g., methodologies) discussed herein may perform in accordance with some embodiments.
  • the machine 600 may operate as a standalone device or may be connected (e.g., networked) to other machines.
  • the machine 600 may operate in the capacity of a server machine, a client machine, or both in server-client network environments.
  • the machine 600 may act as a peer machine in peer-to-peer (P2P) (or other distributed) network environment.
  • the machine 600 may be a personal computer (PC), a tablet PC, a set-top box (STB), a personal digital assistant (PDA), a mobile telephone, a web appliance, a network router, switch or bridge, or any machine capable of executing instructions (sequential or otherwise) that specify actions to be taken by that machine.
  • The term “machine” shall also be taken to include any collection of machines that individually or jointly execute a set (or multiple sets) of instructions to perform any one or more of the methodologies discussed herein, such as cloud computing, software as a service (SaaS), or other computer cluster configurations.
  • Examples, as described herein, may include, or may operate on, logic or a number of components, modules, or mechanisms.
  • Modules are tangible entities (e.g., hardware) capable of performing specified operations when operating.
  • a module includes hardware.
  • the hardware may be specifically configured to carry out a specific operation (e.g., hardwired).
  • the hardware may include configurable execution units (e.g., transistors, circuits, etc.) and a computer readable medium containing instructions, where the instructions configure the execution units to carry out a specific operation when in operation. The configuring may occur under the direction of the execution units or a loading mechanism. Accordingly, the execution units are communicatively coupled to the computer readable medium when the device is operating.
  • the execution units may be a member of more than one module.
  • the execution units may be configured by a first set of instructions to implement a first module at one point in time and reconfigured by a second set of instructions to implement a second module.
  • Machine 600 may include a hardware processor 602 (e.g., a central processing unit (CPU), a graphics processing unit (GPU), a hardware processor core, or any combination thereof), a main memory 604 and a static memory 606 , some or all of which may communicate with each other via an interlink (e.g., bus) 608 .
  • the machine 600 may further include a display unit 610 , an alphanumeric input device 612 (e.g., a keyboard), and a user interface (UI) navigation device 614 (e.g., a mouse).
  • the display unit 610 , alphanumeric input device 612 and UI navigation device 614 may be a touch screen display.
  • the machine 600 may additionally include a storage device (e.g., drive unit) 616 , a signal generation device 618 (e.g., a speaker), a network interface device 620 , and one or more sensors 621 , such as a global positioning system (GPS) sensor, compass, accelerometer, or other sensor.
  • the machine 600 may include an output controller 628 , such as a serial (e.g., universal serial bus (USB), parallel, or other wired or wireless (e.g., infrared (IR), near field communication (NFC), etc.) connection to communicate or control one or more peripheral devices (e.g., a printer, card reader, etc.).
  • the storage device 616 may include a non-transitory machine readable medium 622 on which is stored one or more sets of data structures or instructions 624 (e.g., software) embodying or utilized by any one or more of the techniques or functions described herein.
  • the instructions 624 may also reside, completely or at least partially, within the main memory 604 , within static memory 606 , or within the hardware processor 602 during execution thereof by the machine 600 .
  • one or any combination of the hardware processor 602 , the main memory 604 , the static memory 606 , or the storage device 616 may constitute machine readable media.
  • While the machine readable medium 622 is illustrated as a single medium, the term “machine readable medium” may include a single medium or multiple media (e.g., a centralized or distributed database, or associated caches and servers) configured to store the one or more instructions 624 .
  • The term “machine readable medium” may include any medium that is capable of storing, encoding, or carrying instructions for execution by the machine 600 and that cause the machine 600 to perform any one or more of the techniques of the present disclosure, or that is capable of storing, encoding, or carrying data structures used by or associated with such instructions.
  • Non-limiting machine readable medium examples may include solid-state memories, and optical and magnetic media.
  • machine readable media may include: non-volatile memory, such as semiconductor memory devices (e.g., Electrically Programmable Read-Only Memory (EPROM), Electrically Erasable Programmable Read-Only Memory (EEPROM)) and flash memory devices; magnetic disks, such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks.
  • the instructions 624 may further be transmitted or received over a communications network 626 using a transmission medium via the network interface device 620 utilizing any one of a number of transfer protocols (e.g., frame relay, internet protocol (IP), transmission control protocol (TCP), user datagram protocol (UDP), hypertext transfer protocol (HTTP), etc.).
  • Example communication networks may include a local area network (LAN), a wide area network (WAN), a packet data network (e.g., the Internet), mobile telephone networks (e.g., cellular networks), Plain Old Telephone (POTS) networks, and wireless data networks (e.g., Institute of Electrical and Electronics Engineers (IEEE) 802.11 family of standards known as Wi-Fi®, IEEE 802.16 family of standards known as WiMax®), IEEE 802.15.4 family of standards, peer-to-peer (P2P) networks, among others.
  • the network interface device 620 may include one or more physical jacks (e.g., Ethernet, coaxial, or phone jacks) or one or more antennas to connect to the communications network 626 .
  • the network interface device 620 may include a plurality of antennas to wirelessly communicate using at least one of single-input multiple-output (SIMO), multiple-input multiple-output (MIMO), or multiple-input single-output (MISO) techniques.
  • The term “transmission medium” shall be taken to include any intangible medium that is capable of storing, encoding, or carrying instructions for execution by the machine 600 , and includes digital or analog communications signals or other intangible medium to facilitate communication of such software.
  • Example 1 is a system for indicating audience engagement, the system comprising: a server, including processing circuitry and memory, the processing circuitry to: send speech from a speaker to be played for a plurality of members of an audience; receive audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience; process the audience reaction information to aggregate the audience reaction information; determine, using the processed information, an engagement factor; and send an aggregated engagement level indicator to a speaker device, the aggregated engagement level indicator based on the engagement factor and indicating overall engagement of the audience to the speech.
  • In Example 2, the subject matter of Example 1 optionally includes wherein the speech includes a video of the speaker.
  • In Example 3, the subject matter of any one or more of Examples 1-2 optionally includes wherein the speaker is remote from the audience.
  • In Example 4, the subject matter of any one or more of Examples 1-3 optionally includes wherein the plurality of sensors include a camera, the camera to capture physical responses of at least one of the plurality of members of the audience while the speech is playing.
  • In Example 5, the subject matter of Example 4 optionally includes wherein the physical responses include at least one of a facial emotion, a body gesture, or a hand gesture.
  • In Example 6, the subject matter of any one or more of Examples 1-5 optionally includes wherein the plurality of sensors include a wearable sensor, the wearable sensor to capture biometric data from at least one of the plurality of members of the audience while the speech is playing.
  • In Example 7, the subject matter of any one or more of Examples 1-6 optionally includes wherein the audience reaction information is received from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data in an individual or an aggregated form.
  • In Example 8, the subject matter of any one or more of Examples 1-7 optionally includes wherein to process the audience reaction information, the processing circuitry is to anonymize the audience reaction information.
  • In Example 9, the subject matter of any one or more of Examples 1-8 optionally includes wherein to determine the engagement factor includes using data of the plurality of members of the audience, the data including at least one of location data, personal data, cultural data, familiarity data, time data, age data, or gender data.
  • In Example 10, the subject matter of any one or more of Examples 1-9 optionally includes wherein to send the aggregated engagement level indicator, the processing circuitry is to send a plurality of aggregated engagement level indicators corresponding to the plurality of members.
  • In Example 11, the subject matter of any one or more of Examples 1-10 optionally includes wherein the aggregated engagement level indicator is an emoticon.
  • In Example 12, the subject matter of any one or more of Examples 1-11 optionally includes wherein the aggregated engagement level indicator is a 3D avatar.
  • In Example 13, the subject matter of any one or more of Examples 1-12 optionally includes wherein the overall engagement of the audience to the speech is a weighted average of engagement of the plurality of members of the audience.
  • In Example 14, the subject matter of any one or more of Examples 1-13 optionally includes wherein to send the aggregated engagement level indicator, the processing circuitry is to send a series of aggregated engagement level indicators in real time updates to the speaker device, the real time updates adjusted based on aggregated or individual received updated audience reaction information.
  • In Example 15, the subject matter of any one or more of Examples 1-14 optionally includes wherein the processing circuitry is further to: save the processed information and the speech to the memory; and correlate audio tones or key words in the speech with emotional responses in the processed information using machine learning, wherein to correlate includes using a time offset to account for a transmission delay.
  • In Example 16, the subject matter of Example 15 optionally includes wherein the processing circuitry is further to predict emotional reactions of a hypothetical audience to a second speech using the correlated audio tones or key words and previous reactions from past audiences stored in the memory.
  • In Example 17, the subject matter of Example 16 optionally includes wherein the processing circuitry is further to suggest modifications to the second speech based on the predicted emotional reactions of the hypothetical audience.
  • In Example 18, the subject matter of any one or more of Examples 1-17 optionally includes wherein the processing circuitry is further to send the aggregated engagement level indicator to a device of a member of the plurality of members of the audience.
  • Example 19 is a method for indicating audience engagement, the method comprising: sending speech from a speaker to be played for a plurality of members of an audience; receiving audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience; processing the audience reaction information to aggregate the audience reaction information; determining, using the processed information, an engagement factor; and sending an aggregated engagement level indicator to a speaker device, the aggregated engagement level indicator based on the engagement factor and indicating overall engagement of the audience to the speech.
  • In Example 20, the subject matter of Example 19 optionally includes wherein the speech includes a video of the speaker.
  • In Example 21, the subject matter of any one or more of Examples 19-20 optionally includes wherein the speaker is remote from the audience.
  • In Example 22, the subject matter of any one or more of Examples 19-21 optionally includes wherein the plurality of sensors include a camera, the camera capturing physical responses of at least one of the plurality of members of the audience while the speech is playing.
  • In Example 23, the subject matter of Example 22 optionally includes wherein the physical responses include at least one of a facial emotion, a body gesture, or a hand gesture.
  • In Example 24, the subject matter of any one or more of Examples 19-23 optionally includes wherein the plurality of sensors include a wearable sensor, the wearable sensor capturing biometric data from at least one of the plurality of members of the audience while the speech is playing.
  • In Example 25, the subject matter of any one or more of Examples 19-24 optionally includes wherein the audience reaction information is received from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data in an individual or an aggregated form.
  • In Example 26, the subject matter of any one or more of Examples 19-25 optionally includes wherein processing the audience reaction information includes anonymizing the audience reaction information.
  • In Example 27, the subject matter of any one or more of Examples 19-26 optionally includes wherein determining the engagement factor includes using data of the plurality of members of the audience, the data including at least one of location data, personal data, cultural data, familiarity data, time data, age data, or gender data.
  • In Example 28, the subject matter of any one or more of Examples 19-27 optionally includes wherein sending the aggregated engagement level indicator includes sending a plurality of aggregated engagement level indicators corresponding to the plurality of members.
  • In Example 29, the subject matter of any one or more of Examples 19-28 optionally includes wherein the aggregated engagement level indicator is an emoticon.
  • In Example 30, the subject matter of any one or more of Examples 19-29 optionally includes wherein the aggregated engagement level indicator is a 3D avatar.
  • In Example 31, the subject matter of any one or more of Examples 19-30 optionally includes wherein the overall engagement of the audience to the speech is a weighted average of engagement of the plurality of members of the audience.
  • In Example 32, the subject matter of any one or more of Examples 19-31 optionally includes wherein sending the aggregated engagement level indicator includes sending a series of aggregated engagement level indicators in real time updates to the speaker device, the real time updates adjusted based on aggregated or individual received updated audience reaction information.
  • In Example 33, the subject matter of any one or more of Examples 19-32 optionally includes saving the processed information and the speech to memory; and correlating audio tones or key words in the speech with emotional responses in the processed information using machine learning, wherein correlating includes using a time offset to account for a transmission delay.
  • In Example 34, the subject matter of Example 33 optionally includes predicting emotional reactions of a hypothetical audience to a second speech using the correlated audio tones or key words and previous reactions from past audiences stored in the memory.
  • In Example 35, the subject matter of Example 34 optionally includes suggesting modifications to the second speech based on the predicted emotional reactions of the hypothetical audience.
  • In Example 36, the subject matter of any one or more of Examples 19-35 optionally includes sending the aggregated engagement level indicator to a device of a member of the plurality of members of the audience.
  • Example 37 is at least one machine-readable medium including instructions for operation of a computing system, which when executed by a machine, cause the machine to perform operations of any of the methods of Examples 19-36.
  • Example 38 is an apparatus comprising means for performing any of the methods of Examples 19-36.
  • Example 39 is an apparatus for indicating audience engagement, the apparatus comprising: means for sending speech from a speaker to be played for a plurality of members of an audience; means for receiving audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience; means for processing the audience reaction information to aggregate the audience reaction information; means for determining, using the processed information, an engagement factor; and means for sending an aggregated engagement level indicator to a speaker device, the aggregated engagement level indicator based on the engagement factor and indicating overall engagement of the audience to the speech.
  • In Example 40, the subject matter of Example 39 optionally includes wherein the speech includes a video of the speaker.
  • In Example 41, the subject matter of any one or more of Examples 39-40 optionally includes wherein the speaker is remote from the audience.
  • In Example 42, the subject matter of any one or more of Examples 39-41 optionally includes wherein the plurality of sensors include a camera, the camera capturing physical responses of at least one of the plurality of members of the audience while the speech is playing.
  • the plurality of sensors include a camera, the camera capturing physical responses of at least one of the plurality of members of the audience while the speech is playing.
  • Example 43 the subject matter of Example 42 optionally includes wherein the physical responses include at least one of a facial emotion, a body gesture, or a hand gesture.
  • Example 44 the subject matter of any one or more of Examples 39-43 optionally include wherein the plurality of sensors include a wearable sensor, the wearable sensor capturing biometric data from at least one of the plurality of members of the audience while the speech is playing.
  • the plurality of sensors include a wearable sensor, the wearable sensor capturing biometric data from at least one of the plurality of members of the audience while the speech is playing.
  • Example 45 the subject matter of any one or more of Examples 39-44 optionally include wherein the audience reaction information is received from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data in an individual or an aggregated form.
  • Example 46 the subject matter of any one or more of Examples 39-45 optionally include wherein the means for processing the audience reaction information include means for anonymizing the audience reaction information.
  • Example 47 the subject matter of any one or more of Examples 39-46 optionally include wherein the means for determining the engagement factor include means for using data of the plurality of members of the audience, the data including at least one of location data, personal data, cultural data, familiarity data, time data, age data, or gender data.
  • Example 48 the subject matter of any one or more of Examples 39-47 optionally include wherein the means for sending the aggregated engagement level indicator include means for sending a plurality of aggregated engagement level indicators corresponding to the plurality of members.
  • Example 49 the subject matter of any one or more of Examples 39-48 optionally include wherein the aggregated engagement level indicator is an emoticon.
  • Example 50 the subject matter of any one or more of Examples 39-49 optionally include wherein the aggregated engagement level indicator is a 3D avatar.
  • Example 51 the subject matter of any one or more of Examples 39-50 optionally include wherein the overall engagement of the audience to the speech is a weighted average of engagement of the plurality of members of the audience.
  • Example 52 the subject matter of any one or more of Examples 39-51 optionally include wherein the means for sending the aggregated engagement level indicator include means for sending a series of aggregated engagement level indicators in real time updates to the speaker device, the real time updates adjusted based on received aggregated or individual updated audience reaction information.
  • Example 53 the subject matter of any one or more of Examples 39-52 optionally include means for saving the processed information and the speech to memory; and means for correlating audio tones or key words in the speech with emotional responses in the processed information using machine learning, wherein the means for correlating include means for using a time offset to account for a transmission delay.
  • Example 54 the subject matter of Example 53 optionally includes means for predicting emotional reactions of a hypothetical audience to a second speech using the correlated audio tones or key words and previous reactions from past audiences stored in the memory.
  • Example 55 the subject matter of Example 54 optionally includes means for suggesting modifications to the second speech based on the predicted emotional reactions of the hypothetical audience.
  • Example 56 the subject matter of any one or more of Examples 39-55 optionally include means for sending the aggregated engagement level indicator to a device of a member of the plurality of members of the audience.
  • Method examples described herein may be machine or computer-implemented at least in part. Some examples may include a computer-readable medium or machine-readable medium encoded with instructions operable to configure an electronic device to perform methods as described in the above examples.
  • An implementation of such methods may include code, such as microcode, assembly language code, a higher-level language code, or the like. Such code may include computer readable instructions for performing various methods. The code may form portions of computer program products. Further, in an example, the code may be tangibly stored on one or more volatile, non-transitory, or non-volatile tangible computer-readable media, such as during execution or at other times.
  • Examples of these tangible computer-readable media may include, but are not limited to, hard disks, removable magnetic disks, removable optical disks (e.g., compact disks and digital video disks), magnetic cassettes, memory cards or sticks, random access memories (RAMs), read only memories (ROMs), and the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • General Health & Medical Sciences (AREA)
  • Social Psychology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Child & Adolescent Psychology (AREA)
  • Hospice & Palliative Care (AREA)
  • Psychiatry (AREA)
  • User Interface Of Digital Computer (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Embodiments of a system and method for indicating audience engagement are generally described herein. A method may include sending speech from a speaker to be played for a plurality of members of an audience, receiving audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience, processing the audience reaction information to aggregate the audience reaction information, determining, using the processed information, an engagement factor, and sending an aggregated engagement level indicator to a speaker device, the aggregated engagement level indicator based on the engagement factor and indicating overall engagement of the audience to the speech.

Description

BACKGROUND
A person communicating with other people, such as on a conference call, often has communication issues when audience reaction is difficult to determine. A person attempting to get across an idea is not always understood as intended when the person starts trying to communicate the idea. This type of misunderstanding may lead to unintended consequences or results. Clear communication is a crucial success factor for people involved in day-to-day conferencing and collaboration. While talking to an audience live, it is often easier for a speaker to see and correct misunderstandings. Unfortunately, in remote sessions, a speaker is relatively isolated and may “miss the audience” or fail to get a point across. Solutions, such as relying on the audience to interrupt the person to tell the person that the idea is unclear, have low accuracy and are limited by the ability of the audience to notify the person that there has been a misunderstanding.
BRIEF DESCRIPTION OF THE DRAWINGS
In the drawings, which are not necessarily drawn to scale, like numerals may describe similar components in different views. Like numerals having different letter suffixes may represent different instances of similar components. The drawings illustrate generally, by way of example, but not by way of limitation, various embodiments discussed in the present document.
FIG. 1 illustrates a conference call user interface including an aggregated engagement level indicator and a plurality of individual audience engagement level indicators in accordance with some embodiments.
FIG. 2 illustrates a block diagram showing physical (e.g., sensor-based) inputs and contextual inputs used to evaluate engagement metrics in accordance with some embodiments.
FIG. 3 illustrates a diagram showing display indicators for audience engagement levels in accordance with some embodiments.
FIG. 4 illustrates a system for indicating audience engagement in accordance with some embodiments.
FIG. 5 illustrates a flowchart showing a technique for indicating audience engagement in accordance with some embodiments.
FIG. 6 illustrates generally an example of a block diagram of a machine upon which any one or more of the techniques (e.g., methodologies) discussed herein may perform in accordance with some embodiments.
DETAILED DESCRIPTION
Systems and methods for indicating audience engagement are described herein. The systems and methods described herein may include using a user interface for displaying various indications of audience engagement. In an example, a system described herein may use a camera and computer vision to recognize facial emotions, may use wearable sensors to gauge electrical body response or remote heart monitoring, or may use other sensor capabilities. Captured information may be aggregated and anonymized to serve as inputs into an engagement process. The system may collect facial emotions, body gesturing, or body responses and aggregate these or other emotional indicators into a feedback-based engagement process.
In an example, the system may output an anonymized indication of audience emotional response to a speaker (or presenter), or regarding the content of the presentation. The audience emotional response may be represented via a color-coded thermometer, emoticons or emojis (e.g., an emotional representation such as a yellow smiley face), or by other visual means. In another example, a speaker may use a virtual reality headset, and audience emotional response may be represented as 3D avatars with changing expressions, such as smiling, bored, etc. In yet another example, the system may provide feedback on emails and articles. For example, the system may display an article and monitor a reader's engagement. The monitored and stored information may be anonymized and reported to an author. The system may be used for measuring interest in movies or TV shows. A state machine middleware may be used to record changing interest and to tie the changing interest to content or to a presenter's voice. As a presenter gives a talk, the presenter or the audience may see the engagement indication. The system may include a set of predefined audience profiles (e.g., analogous to workloads for a server), and a speaker may practice giving a speech to different audiences. The feedback given to the speaker may be directed to allowing the speaker to adjust the speech based on the predictive output. For example, a fresh graduate may practice a speech before an interview with a company and receive coaching feedback. The system may collect emotional feedback from many participants in a teleconference and collate the responses to form an idea of the success of a meeting. Participants may opt in, and the system may identify and alert a presenter or observer to situations where the meeting had issues, broke down, or failed, such as when no consensus was established in the meeting. In another example, the participant opt-in may be used to allow provided information to be public (e.g., not anonymous). The system may point out cultural differences that may be affecting a speech or meeting and lead the participants towards internationally or culturally preferred language.
A user may share success or failure regarding audience engagement with other users by uploading the data to the cloud or tagging the audience engagement with relevant keywords. For example, an applicant to a hardware engineer position at a company may share the reaction and outcome of an interview with others on a job website. The service may then show that the interviewers tended to be serious and not appreciate jokes during a presentation or that sports references were viewed favorably. The data may be analyzed by different parties for different outcomes. For example, a human resources department may analyze the data to understand some characteristics of successful applicants for use in future postings. This may result in a more efficient hiring process. In another example, an audience may receive feedback, such as by sending information to a student that other students are also frustrated or do not understand a professor, or that other audience members are angry, happy, etc. In an example, a system may be used with a child or a person with autism or special needs to detect certain changes in mood, emotion, or behavior that may be otherwise undetectable by a person.
The systems and methods described herein may include an opt-in option for an audience member to make suggestions or provide information such that a speaker may tailor a speech with a predicted likelihood of success. In another example, an email or a blog post may be used with a mock audience or a sample audience to give feedback and automatically determine an emotional engagement of a potential or actual reader.
The problems of misunderstanding a speaker exist in remote communication and in direct in-person communication, such as for a lecturer or business presenter. For example, a presenter may use too much jargon or terms of art for a wider audience. In another example, the systems and methods described herein may be applied for direct in-person communication. For example, a business presenter may be giving a presentation, such as a sales presentation, presentation at a board meeting, corporate meeting, etc., and may view an emotional engagement indicator on a personal device while giving the presentation.
FIG. 1 illustrates a conference call user interface 100 including an aggregated engagement level indicator 110 and a plurality of individual audience engagement level indicators (e.g., 102-108) in accordance with some embodiments. The aggregated level indicator 110 may be a representation of a thermometer, as shown in FIG. 1, or another indicator, such as a dial, progress bar, or other visual representation. The aggregated level indicator 110 may represent an average, a weighted average, an aggregation, a sample set, or another combination of emotional engagement metrics of audience members. For example, the aggregated level indicator 110 may represent an average engagement of audience members represented on the conference call user interface 100 (e.g., represented by the individual audience engagement level indicators 102-108) or an average engagement of audience members not represented. The individual audience engagement level indicators 102-108 may include emojis, emoticons, avatars, text representations, or other visual representations of an emotion, such as an interested emoji 102, a surprised emoji 104, a content or happy emoji 106, or a disengaged emoji 108. Additional emotional nuance may be conveyed by using a variety of emojis or the like. For example, colors may be used to indicate mood, while facial expressions of emojis represent engagement level.
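The mapping just described, from per-member engagement scores to emoji indicators 102-108 and to the thermometer-style aggregated indicator 110, can be pictured as a minimal Python sketch. The thresholds, emoji labels, and function names below are illustrative assumptions, not values specified in this disclosure.

def engagement_to_emoji(score):
    """Map an engagement score in [0.0, 1.0] to an individual indicator label."""
    if score >= 0.75:
        return "interested"   # e.g., indicator 102
    if score >= 0.5:
        return "content"      # e.g., indicator 106
    if score >= 0.25:
        return "surprised"    # e.g., indicator 104
    return "disengaged"       # e.g., indicator 108

def aggregated_indicator(scores, weights=None):
    """Weighted average of member scores, shown as the thermometer level 110."""
    if weights is None:
        weights = [1.0] * len(scores)
    total = sum(weights)
    return sum(s * w for s, w in zip(scores, weights)) / total if total else 0.0

members = {"member_a": 0.9, "member_b": 0.4, "member_c": 0.6}
level = aggregated_indicator(list(members.values()))
print({name: engagement_to_emoji(s) for name, s in members.items()}, round(level, 2))

In this sketch, the per-member labels would drive indicators such as 102-108, while the weighted average would set the fill level of the aggregated indicator 110.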
In an example, the conference call user interface 100 may include long-term learning or prediction based on aggregated data in the cloud from multiple users of the conference call user interface 100. For example, when speakers upload their data to the cloud, the conference call user interface 100 may become trained with the data and predict an outcome. A speaker may then rehearse a pitch using the conference call user interface 100 with no audience and receive feedback from the aggregated level indicator 110. The speaker may specify an audience based on context (e.g., location, ages, affiliations, level of expertise in the subject, etc.), or information may be extracted from a calendar of the speaker. The speaker may rehearse a talk, and the conference call user interface 100 may predict a level of engagement or highlight areas to be worked on based on previous data of a sample or mock audience. Other elements may be factored into the prediction or aggregated level indicator 110, such as the context of the speaker (e.g., is the speaker going to be nervous, is the speaker going to be sleep deprived, etc.) or other environmental factors (e.g., is the venue or podium a place where speakers tend to get hot, is the amount of light going to hurt the performance of the speaker, etc.). Based on the data, the conference call user interface 100 may provide a predictive success metric.
In an example, the conference call user interface 100 may be used by each individual participant. In an example, the conference call user interface 100 may be implemented on a mobile device, smartphone, smart device, computer (e.g., video conferencing or audio conferencing), or the like. In an example, individual emoticons may be generated for audience members and displayed as individual audience engagement level indicators (e.g., 102-108) for feedback. In another example, anonymous feedback may be presented to a speaker, such as using aggregated results. For example, if an audience is big enough (e.g., an auditorium full of high schoolers), the conference call user interface 100 may provide feedback based on demographics, such as a separate indicator for age, gender, interest (e.g., science interest, humanities interest, or art interest), or the like. In another example, language-specific demographics may be used, such as indicating that a speaker insulted people from a certain ethnic background. The demographics may be aggregated over previous speeches, such as to develop a pattern over different speeches, including by using machine learning. In an example, the conference call user interface 100 may collect inputs and correlate audio tones or key words with particular emotions, such as by using machine learning. The conference call user interface 100 may use a practice speech to predict emotional reactions (e.g., a practice speech without an audience).
In an example, the aggregated level indicator 110 may be displayed as a thermometer to show overall engagement. A plurality of aggregated level indicators (e.g., 110) may be used to show separate engagement levels for different types of speech (e.g., healthcare, education, information, etc.), different types of audience (e.g., based on demographics, number of members, etc.), different types of desired engagement levels (e.g., a speech intended to inspire a crowd, a speech intended to calm an audience, a speech intended to convey factual information without eliciting an emotional response, etc.), or the like.
In another example, the conference call user interface 100 may inform a speaker as to which audiences would be better suited for a particular speech or suggest modifications to the speech based on predicted (or pre-signed up) audience. The conference call user interface 100 may tailor a lecture per time or per hour, such as for different student groups or classes. The conference call user interface 100 may provide feedback after the speech is given (or after a series of speeches are given, such as lectures over a period of semesters or years), such as the overall engagement.
FIG. 2 illustrates a block diagram 200 showing physical (e.g., sensor-based) inputs (e.g., 202-208) and contextual inputs (e.g., 212-218) used to evaluate engagement metrics 224 in accordance with some embodiments. The physical inputs may include facial data 202, hand gestures 204, wearable data 206, or body language 208. These inputs may be aggregated, averaged, weighted, or anonymized to create a physical data structure 210. One or more of these inputs may be ignored or not used for a given time when determining a physical data structure 210. The physical data structure 210, along with a contextual data structure 220 may be used as an input data structure 222. The physical data structure 210 and the contextual data structure 220 may be combined as an average, weighted average, etc. The contextual data structure 220 may include personal data 212, location data 214, cultural data 216, or familiarity data 218. These data inputs may be ignored or not used for a given time in determining the contextual data structure 220. The input data structure 222 may be used to determine engagement metrics 224. The engagement metrics 224 may include agreement 226, engagement level 228, emotional response 230, focus 232, or feedback 234.
In an example, the facial data 202 includes information about facial expressions, such as smiling, frowning, squinting, etc. The hand gestures 204 may include predetermined hand gestures, or may include information about whether hand gestures are made or not. The wearable data 206 may include a heartrate, oxygen saturation, amount of movement, etc. The body language 208 may include posture information, orientation, etc. The personal data 212 may include demographic information, such as age, gender, etc. The location data 214 may include a current location, an origin location, etc. (e.g., location data 214 may indicate that an audience member is currently traveling). The cultural data 216 may include a country of origin, self-identified ethnic or cultural groups (e.g., religion), generation, etc. The familiarity data 218 may include familiarity with the speaker, the topic, the speech itself (e.g., an audience member who is part of a team working on similar subject matter, or someone that helped write the speech), a company of the speaker, or the like.
In an example, the engagement metric 224 may be determined or displayed separately or in any combination. For example, agreement 226 may be displayed using a first indicator, and an average of the remaining factors may be displayed using a second indicator. Other combinations may be used. The agreement 226 metric may include a determined amount of agreement with a statement or statements made by a speaker. The engagement level 228 metric may include a level of interest in a statement or speech. The emotional response 230 may include an emotional connection or response (intended or not intended by the speaker) to a statement or speech. The focus 232 metric may represent an amount of attention being paid to a statement or speech. The feedback 234 metric may include audience member submitted feedback, automatically determined feedback (e.g., based on previously input desired responses from the audience), or the like. The engagement metrics 224 may be displayed for a single audience member, a subset of members of an audience, or the audience as a whole. The engagement metrics 224 may be based on one or more of the physical data 210 or the contextual data 220, which may be based on one or more of the sub-inputs (e.g., 202-208 for physical data or 212-218 for contextual data). Each engagement metric of the engagement metrics 224 may be based on the same or different inputs or combinations of inputs.
The block diagram 200 shows a quantification of human engagement metrics 224 using metrics or weighted metrics to monitor participants over time. The human emotions may include collective indicators taken from facial expressions, body language, gestures, and sensor responses, as well as contextual data. User input may be monitored to understand and measure body language responses. Physical data 210 may include posture, eye gazing, hand positions, head tilt, and so on. Contextual data may include demographics such as age, gender, time of day, day of week, location, expertise, or cultural background, for example, to normalize different reactions based on the topics and other human and cultural variations. An input may be assigned a weight depending upon a situation, and a collective set of metrics (e.g., the engagement metrics 224) may be deduced. In an example, feedback 234 may include participants asking questions, or sending online messages or emails. There may be an incentive for a participant to choose to share wearable data (e.g., students taking a class, friends helping a speaker rehearse, etc.). The data described above may be aggregated to maintain the privacy of the audience. In an example, the aggregated data may be binned or profiled based on the different demographics or inputs extracted from the context and demographics. In an example, the feedback 234 may provide recommendations based on input data, such as location data 214. In an example, the geography of a participant or potential cultural references may be presented, or local news may be determined. For example, when talking to participants in India, use a cricket reference, or when talking to participants in the U.S., use a baseball reference. Other examples may include using a day of week, time of day, age, gender, etc., to display feedback (e.g., one feedback for teenagers, and another feedback for an older audience).
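Putting the pieces of FIG. 2 together, the following Python sketch reduces physical inputs 202-208 and contextual inputs 212-218 by weighted averaging into the data structures 210 and 220, blends them into the input data structure 222, and populates a few of the engagement metrics 224. The field names, weights, and blend ratio are assumptions made for illustration only.

def weighted_mean(values, weights):
    """Average the inputs that are present, honoring per-input weights."""
    used = {k: v for k, v in values.items() if v is not None}  # ignored inputs stay out
    if not used:
        return 0.0
    total = sum(weights.get(k, 1.0) for k in used)
    return sum(v * weights.get(k, 1.0) for k, v in used.items()) / total

physical_inputs = {"facial": 0.8, "hand_gestures": None, "wearable": 0.6, "body_language": 0.7}
contextual_inputs = {"personal": 0.5, "location": 0.9, "cultural": 0.7, "familiarity": 0.4}

physical_210 = weighted_mean(physical_inputs, {"facial": 2.0, "wearable": 1.5})
contextual_220 = weighted_mean(contextual_inputs, {"familiarity": 2.0})
input_222 = 0.7 * physical_210 + 0.3 * contextual_220  # assumed blend of 210 and 220

engagement_metrics_224 = {
    "agreement_226": input_222,            # each metric may draw on its own
    "engagement_level_228": physical_210,  # subset of inputs in practice
    "focus_232": physical_inputs["facial"],
}
print(engagement_metrics_224)

Note that a missing input (here, hand gestures) is simply left out of the average, mirroring the option above to ignore an input for a given time.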
FIG. 3 illustrates a diagram 300 showing display indicators (e.g., 302, 306, 310) for audience engagement levels in accordance with some embodiments. The diagram 300 includes an example set of inputs (e.g., 302) weighted according to predetermined weights (e.g., 304) that may be shown over time on a user interface (e.g., using indicator 306 or 310). In an example, the current displayed indicator 306 may be determined using a cumulative weighted average of inputs, such as input 302 weighted according to weights (e.g., weight 304). In an example, the cumulative weighted average of inputs may be cumulative over audience members (e.g., aggregating or averaging a number of audience members for a given time) or may be cumulative over time (e.g., a single audience member over time, a subset of audience members over time, the audience as an aggregate over time, etc.). In another example, engagement metrics for specific members of an audience may be given more weight (e.g., a decision maker is given more weight than an intern).
The inputs (e.g., 302) may be emoticons or emojis, and may be output along a timeline 308. The timeline 308 may include past information or the displayed indicator 306 may change over time (e.g., timeline 308 represents changes over time, with a single emoticon or emoji shown at a time). The timeline 308 may start with a neutral input (e.g., input 310) and change according to the inputs (e.g., 302).
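The timeline behavior of FIG. 3 can be approximated with a cumulative weighted running average, where each new weighted input nudges the value behind the displayed indicator 306. The neutral starting value and the specific weights below are assumptions; as noted above, a deployed system might weight a decision maker's reaction more heavily.

def running_indicator(inputs, weights, neutral=0.5):
    """Yield the cumulative weighted average after each (input, weight) pair."""
    numerator, denominator = neutral, 1.0  # start from a neutral input, as in 310
    for value, weight in zip(inputs, weights):
        numerator += value * weight
        denominator += weight
        yield numerator / denominator

timeline = list(running_indicator(
    inputs=[0.9, 0.4, 0.2, 0.8],   # e.g., interested, neutral, bored, interested
    weights=[1.0, 1.0, 2.0, 1.0],  # a decision maker's reaction weighted higher
))
print([round(value, 2) for value in timeline])

Each yielded value corresponds to one point on the timeline 308, so the emoji shown at a given moment reflects all weighted inputs received so far.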
FIG. 4 illustrates a system 400 for indicating audience engagement in accordance with some embodiments. The system 400 includes processing circuitry 402, memory 404, and a display 406. The system 400 may include a speaker 410 or a microphone 412. The display 406 includes an engagement level indicator 408, such as the aggregated engagement level indicator 110 of FIG. 1. The system 400 may include a server, a cloud system, or a computing device.
The processing circuitry 402 may be used to send speech from a speaker to be played for a plurality of members of an audience. In an example, the processing circuitry 402 is used to receive audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience, process the audience reaction information to aggregate the audience reaction information, determine, using the processed information, an engagement factor, and send an aggregated engagement level indicator 408 to a speaker device, the aggregated engagement level indicator 408 based on the engagement factor and indicating overall engagement of the audience to the speech.
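A non-authoritative sketch of that server-side flow is shown below: reaction records are aggregated, reduced to an engagement factor, and pushed to a speaker device as an aggregated indicator. The record fields and the send_to_device stand-in are hypothetical placeholders rather than an interface defined by this disclosure.

from statistics import mean

def process_reactions(reactions):
    """Aggregate per-member reaction records into one processed summary."""
    return {"mean_score": mean(r["score"] for r in reactions), "count": len(reactions)}

def engagement_factor(processed):
    return processed["mean_score"]  # could also be weighted by member metadata

def send_to_device(device_id, indicator):
    print(f"-> {device_id}: {indicator}")  # stand-in for a real network call

def handle_window(reactions, speaker_device="speaker-device-1"):
    processed = process_reactions(reactions)
    factor = engagement_factor(processed)
    send_to_device(speaker_device, {"level": round(factor, 2), "members": processed["count"]})

handle_window([{"member": "a", "score": 0.8}, {"member": "b", "score": 0.55}])

Running handle_window repeatedly over successive time windows would produce the series of real-time indicator updates described below.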
In an example, the display 406 may include a user interface for displaying the engagement level indicator 408. In an example, the audience engagement indication system 400 may include components remote from each other, such as including the processing circuitry 402 on a server and the display 406 on a speaker device. In another example, a technique may be implemented using multiple audience engagement indication systems 400, for example, a server used to process the information and a speaker device for receiving an aggregated engagement level indicator 408 and displaying the aggregated engagement level indicator 408. In an example, the audience engagement indication system 400 may include a camera 414 for capturing video of a presenter. In an example, the camera 414 may capture physical responses of at least one of the plurality of members of the audience while the speech is playing. The physical responses may include a facial emotion, a body gesture, or a hand gesture. The microphone 412 may be used to capture audio of a presenter.
The audience engagement indication system 400 may receive information from a sensor or sensors of a wearable device or wearable devices, such as biometric data from at least one of the plurality of members of the audience while the speech is playing. The audience reaction information may be received at the audience engagement indication system 400 from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data. The selection to opt in may be saved in the memory 404. The display 406 may include a specific engagement level indicator 408 for the opted in member, an engagement level indicator 408 for the audience as a whole, or the like. The engagement level indicator 408 may include an emoticon, an emoji, a 3D avatar, or the like. In an example, the engagement level indicator 408 may be updated in real time on the display 406, and the real time updates may be adjusted based on received updated audience reaction information.
In an example, the processing circuitry 402 may anonymize the audience reaction information. The anonymized audience reaction information may be saved in the memory 404. The data received from the plurality of members of the audience may include location data, personal data, cultural data, familiarity data, time data, age data, gender data, or the like. This data may be saved in a structured database in the memory 404 of the audience engagement indication system 400.
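One possible way to anonymize and structure such records before they are saved to the memory 404 is to drop direct identifiers and keep only coarse demographic bins with aggregated scores, as in the following sketch. The field names and bin edges are assumptions for illustration.

from collections import defaultdict

def anonymize(records):
    """Drop identifying fields; keep only coarse demographic bins and mean scores."""
    bins = defaultdict(list)
    for record in records:
        age_bin = "under_30" if record["age"] < 30 else "30_plus"  # coarse binning
        bins[(age_bin, record["gender"])].append(record["score"])  # names are not kept
    return {bin_key: sum(scores) / len(scores) for bin_key, scores in bins.items()}

records = [
    {"name": "member_a", "age": 24, "gender": "f", "score": 0.7},
    {"name": "member_b", "age": 41, "gender": "m", "score": 0.5},
]
print(anonymize(records))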
In an example, the processing circuitry 402 may save the processed information and the speech to the memory 404. The processing circuitry 402 may correlate audio tones or key words in the speech with emotional responses in the processed information using machine learning based on stored speech and processed information. In an example, the processing circuitry 402 may predict emotional reactions of a hypothetical audience to a second speech using the correlated audio tones or key words. In an example, the processing circuitry 402 may suggest modifications to the second speech based on the predicted emotional reactions of the hypothetical audience. In another example, the processing circuitry 402 may send the aggregated engagement level indicator 408 to a device of a member of the plurality of members of the audience.
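As a rough illustration of the correlation and prediction steps, the sketch below aligns audience responses to transcript words using a fixed time offset for transmission delay, accumulates a per-word average response, and reuses that table to score a second speech. A practical system would train a machine learning model; this running-average table, the offset, and the window value are stand-ins chosen for brevity.

from collections import defaultdict

TRANSMISSION_DELAY_S = 1.5  # assumed offset between speaker audio and audience reaction
WINDOW_S = 2.0              # responses within this window are tied to a word

def correlate(transcript, responses):
    """transcript: [(time, word)]; responses: [(time, score)] on the audience clock."""
    table = defaultdict(list)
    for word_time, word in transcript:
        for response_time, score in responses:
            if abs((response_time - TRANSMISSION_DELAY_S) - word_time) < WINDOW_S:
                table[word].append(score)
    return {word: sum(scores) / len(scores) for word, scores in table.items()}

def predict(second_speech_words, table, default=0.5):
    """Score a not-yet-delivered speech from previously learned word responses."""
    return sum(table.get(word, default) for word in second_speech_words) / len(second_speech_words)

table = correlate([(0.0, "synergy"), (5.0, "cricket")], [(1.6, 0.3), (6.4, 0.9)])
print(table, predict(["cricket", "baseball"], table))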
FIG. 5 illustrates a flowchart showing a technique 500 for indicating audience engagement in accordance with some embodiments. The technique 500 includes an operation 502 to receive audience reaction information. The audience reaction information may be in reaction to speech sent from a speaker (e.g., a presenter or other person) to be played for a plurality of members of an audience. The audience reaction information may be received for the plurality of members of the audience, such as from a plurality of sensors. In an example, the audience reaction information is captured by the plurality of sensors while the speech is playing for the plurality of members of the audience. In another example, the audience reaction information may be captured after the speech has been played. In yet another example, the audience reaction information may include pre-speech, during speech, or post-speech information, such as a change in reaction measured from a baseline.
In an example, the speech may include a video of the speaker. In another example, the speaker may be remote from the audience. In an example, the plurality of sensors may include a camera. The camera may capture physical responses of at least one of the plurality of members of the audience, such as while the speech is playing. The physical response may include at least one of a facial emotion, a body gesture, or a hand gesture. In another example, the plurality of sensors may include a wearable sensor. The wearable sensor may capture biometric data from at least one of the plurality of members of the audience, such as while the speech is playing. In an example, the audience reaction information may be received from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data in an individual or an aggregated form.
The technique 500 includes an operation 504 to process the audience reaction information. The audience reaction information may be processed to aggregate the audience reaction information. Processing the audience reaction information may include anonymizing the audience reaction information. The technique 500 includes an operation 506 to determine an engagement factor. The engagement factor may be determined using the processed information. Determining the engagement factor may include using data of the plurality of members of the audience. The data may include location data, personal data, cultural data, familiarity data, time data, age data, or gender data.
The technique 500 includes an operation 508 to send an aggregated engagement level indicator to a speaker device. The aggregated engagement level indicator may be based on the engagement factor. In an example, the aggregated engagement level indicator indicates an overall engagement of the audience to the speech. In an example, the overall engagement of the audience to the speech may include an average or a weighted average of engagement of the plurality of members of the audience. Sending the aggregated engagement level indicator may include sending a plurality of aggregated engagement level indicators corresponding to the plurality of members (e.g., to be displayed on the speaker device). In an example, the aggregated engagement level indicator may include an emoticon, an emoji, or a 3D avatar. In another example, sending the aggregated engagement level indicator may include sending a series of aggregated engagement level indicators in real time updates to the speaker device. The real time updates may be adjusted based on aggregated or individual received updated audience reaction information. For example, an emoji may appear to change over time as the speech given by the speaker changes.
In an example, the technique 500 includes an operation to save the processed information and the speech to memory. The technique 500 may include an operation to correlate audio tones or key words in the speech with emotional responses in the processed information, such as by using machine learning. Correlating audio tones or key words may include using a time offset to account for a transmission delay or a language barrier issue. Emotional reactions may be predicted of a hypothetical audience to a second speech, such as by using the correlated audio tones or key words or previous reactions from past audiences stored in the memory. In an example, modifications may be suggested to the second speech, such as modifications based on the predicted emotional reactions of the hypothetical audience. In another example, the technique 500 includes an operation to send the aggregated engagement level indicator to a device of a member of the plurality of members of the audience.
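The suggestion step can be pictured as flagging passages of the second speech whose predicted engagement falls below a threshold and proposing a locally relevant reference, echoing the cricket and baseball example above. The threshold, reference table, and scoring inputs in this sketch are assumptions, not requirements of the technique 500.

LOCAL_REFERENCES = {"IN": "cricket", "US": "baseball"}  # assumed lookup table

def suggest_modifications(passages, predicted_scores, audience_region, threshold=0.4):
    """Flag low-scoring passages of the second speech and propose a local reference."""
    suggestions = []
    for passage, score in zip(passages, predicted_scores):
        if score < threshold:
            reference = LOCAL_REFERENCES.get(audience_region, "a local example")
            suggestions.append(
                f"Rework '{passage}' (predicted engagement {score:.2f}); "
                f"consider a {reference} reference."
            )
    return suggestions

print(suggest_modifications(["dense jargon section", "opening joke"], [0.25, 0.8], "IN"))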
FIG. 6 illustrates generally an example of a block diagram of a machine 600 upon which any one or more of the techniques (e.g., methodologies) discussed herein may perform in accordance with some embodiments. In alternative embodiments, the machine 600 may operate as a standalone device or may be connected (e.g., networked) to other machines. In a networked deployment, the machine 600 may operate in the capacity of a server machine, a client machine, or both in server-client network environments. In an example, the machine 600 may act as a peer machine in peer-to-peer (P2P) (or other distributed) network environment. The machine 600 may be a personal computer (PC), a tablet PC, a set-top box (STB), a personal digital assistant (PDA), a mobile telephone, a web appliance, a network router, switch or bridge, or any machine capable of executing instructions (sequential or otherwise) that specify actions to be taken by that machine. Further, while only a single machine is illustrated, the term “machine” shall also be taken to include any collection of machines that individually or jointly execute a set (or multiple sets) of instructions to perform any one or more of the methodologies discussed herein, such as cloud computing, software as a service (SaaS), other computer cluster configurations.
Examples, as described herein, may include, or may operate on, logic or a number of components, modules, or mechanisms. Modules are tangible entities (e.g., hardware) capable of performing specified operations when operating. A module includes hardware. In an example, the hardware may be specifically configured to carry out a specific operation (e.g., hardwired). In an example, the hardware may include configurable execution units (e.g., transistors, circuits, etc.) and a computer readable medium containing instructions, where the instructions configure the execution units to carry out a specific operation when in operation. The configuring may occur under the direction of the execution units or a loading mechanism. Accordingly, the execution units are communicatively coupled to the computer readable medium when the device is operating. In this example, the execution units may be a member of more than one module. For example, under operation, the execution units may be configured by a first set of instructions to implement a first module at one point in time and reconfigured by a second set of instructions to implement a second module.
Machine (e.g., computer system) 600 may include a hardware processor 602 (e.g., a central processing unit (CPU), a graphics processing unit (GPU), a hardware processor core, or any combination thereof), a main memory 604 and a static memory 606, some or all of which may communicate with each other via an interlink (e.g., bus) 608. The machine 600 may further include a display unit 610, an alphanumeric input device 612 (e.g., a keyboard), and a user interface (UI) navigation device 614 (e.g., a mouse). In an example, the display unit 610, alphanumeric input device 612 and UI navigation device 614 may be a touch screen display. The machine 600 may additionally include a storage device (e.g., drive unit) 616, a signal generation device 618 (e.g., a speaker), a network interface device 620, and one or more sensors 621, such as a global positioning system (GPS) sensor, compass, accelerometer, or other sensor. The machine 600 may include an output controller 628, such as a serial (e.g., universal serial bus (USB), parallel, or other wired or wireless (e.g., infrared (IR), near field communication (NFC), etc.) connection to communicate or control one or more peripheral devices (e.g., a printer, card reader, etc.).
The storage device 616 may include a machine readable medium 622 that is non-transitory on which is stored one or more sets of data structures or instructions 624 (e.g., software) embodying or utilized by any one or more of the techniques or functions described herein. The instructions 624 may also reside, completely or at least partially, within the main memory 604, within static memory 606, or within the hardware processor 602 during execution thereof by the machine 600. In an example, one or any combination of the hardware processor 602, the main memory 604, the static memory 606, or the storage device 616 may constitute machine readable media.
While the machine readable medium 622 is illustrated as a single medium, the term “machine readable medium” may include a single medium or multiple media (e.g., a centralized or distributed database, or associated caches and servers) configured to store the one or more instructions 624.
The term “machine readable medium” may include any medium that is capable of storing, encoding, or carrying instructions for execution by the machine 600 and that cause the machine 600 to perform any one or more of the techniques of the present disclosure, or that is capable of storing, encoding or carrying data structures used by or associated with such instructions. Non-limiting machine readable medium examples may include solid-state memories, and optical and magnetic media. Specific examples of machine readable media may include: non-volatile memory, such as semiconductor memory devices (e.g., Electrically Programmable Read-Only Memory (EPROM), Electrically Erasable Programmable Read-Only Memory (EEPROM)) and flash memory devices; magnetic disks, such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks.
The instructions 624 may further be transmitted or received over a communications network 626 using a transmission medium via the network interface device 620 utilizing any one of a number of transfer protocols (e.g., frame relay, internet protocol (IP), transmission control protocol (TCP), user datagram protocol (UDP), hypertext transfer protocol (HTTP), etc.). Example communication networks may include a local area network (LAN), a wide area network (WAN), a packet data network (e.g., the Internet), mobile telephone networks (e.g., cellular networks), Plain Old Telephone (POTS) networks, and wireless data networks (e.g., Institute of Electrical and Electronics Engineers (IEEE) 802.11 family of standards known as Wi-Fi®, IEEE 802.16 family of standards known as WiMax®), IEEE 802.15.4 family of standards, peer-to-peer (P2P) networks, among others. In an example, the network interface device 620 may include one or more physical jacks (e.g., Ethernet, coaxial, or phone jacks) or one or more antennas to connect to the communications network 626. In an example, the network interface device 620 may include a plurality of antennas to wirelessly communicate using at least one of single-input multiple-output (SIMO), multiple-input multiple-output (MIMO), or multiple-input single-output (MISO) techniques. The term “transmission medium” shall be taken to include any intangible medium that is capable of storing, encoding or carrying instructions for execution by the machine 600, and includes digital or analog communications signals or other intangible medium to facilitate communication of such software.
VARIOUS NOTES & EXAMPLES
Each of these non-limiting examples may stand on its own, or may be combined in various permutations or combinations with one or more of the other examples.
Example 1 is a system for indicating audience engagement, the system comprising: a server, including processing circuitry and memory, the processing circuitry to: send speech from a speaker to be played for a plurality of members of an audience; receive audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience; process the audience reaction information to aggregate the audience reaction information; determine, using the processed information, an engagement factor; and send an aggregated engagement level indicator to a speaker device, the aggregated engagement level indicator based on the engagement factor and indicating overall engagement of the audience to the speech.
In Example 2, the subject matter of Example 1 optionally includes wherein the speech includes a video of the speaker.
In Example 3, the subject matter of any one or more of Examples 1-2 optionally include wherein the speaker is remote from the audience.
In Example 4, the subject matter of any one or more of Examples 1-3 optionally include wherein the plurality of sensors include a camera, the camera to capture physical responses of at least one of the plurality of members of the audience while the speech is playing.
In Example 5, the subject matter of Example 4 optionally includes wherein the physical responses include at least one of a facial emotion, a body gesture, or a hand gesture.
In Example 6, the subject matter of any one or more of Examples 1-5 optionally include wherein the plurality of sensors include a wearable sensor, the wearable sensor to capture biometric data from at least one of the plurality of members of the audience while the speech is playing.
In Example 7, the subject matter of any one or more of Examples 1-6 optionally include wherein the audience reaction information is received from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data in an individual or an aggregated form.
In Example 8, the subject matter of any one or more of Examples 1-7 optionally include wherein to process the audience reaction information, the processing circuitry is to anonymize the audience reaction information.
In Example 9, the subject matter of any one or more of Examples 1-8 optionally include wherein to determine the engagement factor includes using data of the plurality of members of the audience, the data including at least one of location data, personal data, cultural data, familiarity data, time data, age data, or gender data.
In Example 10, the subject matter of any one or more of Examples 1-9 optionally include wherein to send the aggregated engagement level indicator, the processing circuitry is to send a plurality of aggregated engagement level indicators corresponding to the plurality of members.
In Example 11, the subject matter of any one or more of Examples 1-10 optionally include wherein the aggregated engagement level indicator is an emoticon.
In Example 12, the subject matter of any one or more of Examples 1-11 optionally include wherein the aggregated engagement level indicator is a 3D avatar.
In Example 13, the subject matter of any one or more of Examples 1-12 optionally include wherein the overall engagement of the audience to the speech is a weighted average of engagement of the plurality of members of the audience.
In Example 14, the subject matter of any one or more of Examples 1-13 optionally include wherein to send the aggregated engagement level indicator, the processing circuitry is to send a series of aggregated engagement level indicators in real time updates to the speaker device, the real time updates adjusted based on aggregated or individual received updated audience reaction information.
In Example 15, the subject matter of any one or more of Examples 1-14 optionally include wherein the processing circuitry is further to: save the processed information and the speech to the memory; and correlate audio tones or key words in the speech with emotional responses in the processed information using machine learning, wherein to correlate includes using a time offset to account for a transmission delay.
In Example 16, the subject matter of Example 15 optionally includes wherein the processing circuitry is further to predict emotional reactions of a hypothetical audience to a second speech using the correlated audio tones or key words and previous reactions from past audiences stored in the memory.
In Example 17, the subject matter of Example 16 optionally includes wherein the processing circuitry is further to suggest modifications to the second speech based on the predicted emotional reactions of the hypothetical audience.
In Example 18, the subject matter of any one or more of Examples 1-17 optionally include wherein the processing circuitry is further to send the aggregated engagement level indicator to a device of a member of the plurality of members of the audience.
Example 19 is a method for indicating audience engagement, the method comprising: sending speech from a speaker to be played for a plurality of members of an audience; receiving audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience; processing the audience reaction information to aggregate the audience reaction information; determining, using the processed information, an engagement factor; and sending an aggregated engagement level indicator to a speaker device, the aggregated engagement level indicator based on the engagement factor and indicating overall engagement of the audience to the speech.
In Example 20, the subject matter of Example 19 optionally includes wherein the speech includes a video of the speaker.
In Example 21, the subject matter of any one or more of Examples 19-20 optionally include wherein the speaker is remote from the audience.
In Example 22, the subject matter of any one or more of Examples 19-21 optionally include wherein the plurality of sensors include a camera, the camera capturing physical responses of at least one of the plurality of members of the audience while the speech is playing.
In Example 23, the subject matter of Example 22 optionally includes wherein the physical responses include at least one of a facial emotion, a body gesture, or a hand gesture.
In Example 24, the subject matter of any one or more of Examples 19-23 optionally include wherein the plurality of sensors include a wearable sensor, the wearable sensor capturing biometric data from at least one of the plurality of members of the audience while the speech is playing.
In Example 25, the subject matter of any one or more of Examples 19-24 optionally include wherein the audience reaction information is received from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data in an individual or an aggregated form.
In Example 26, the subject matter of any one or more of Examples 19-25 optionally include wherein processing the audience reaction information includes anonymizing the audience reaction information.
In Example 27, the subject matter of any one or more of Examples 19-26 optionally include wherein determining the engagement factor includes using data of the plurality of members of the audience, the data including at least one of location data, personal data, cultural data, familiarity data, time data, age data, or gender data.
In Example 28, the subject matter of any one or more of Examples 19-27 optionally include wherein sending the aggregated engagement level indicator includes sending a plurality of aggregated engagement level indicators corresponding to the plurality of members.
In Example 29, the subject matter of any one or more of Examples 19-28 optionally include wherein the aggregated engagement level indicator is an emoticon.
In Example 30, the subject matter of any one or more of Examples 19-29 optionally include wherein the aggregated engagement level indicator is a 3D avatar.
In Example 31, the subject matter of any one or more of Examples 19-30 optionally include wherein the overall engagement of the audience to the speech is a weighted average of engagement of the plurality of members of the audience.
In Example 32, the subject matter of any one or more of Examples 19-31 optionally include wherein sending the aggregated engagement level indicator includes sending a series of aggregated engagement level indicators in real time updates to the speaker device, the real time updates adjusted based on aggregated or individual received updated audience reaction information.
In Example 33, the subject matter of any one or more of Examples 19-32 optionally include saving the processed information and the speech to memory; and correlating audio tones or key words in the speech with emotional responses in the processed information using machine learning, wherein correlating includes using a time offset to account for a transmission delay.
In Example 34, the subject matter of Example 33 optionally includes predicting emotional reactions of a hypothetical audience to a second speech using the correlated audio tones or key words and previous reactions from past audiences stored in the memory.
In Example 35, the subject matter of Example 34 optionally includes suggesting modifications to the second speech based on the predicted emotional reactions of the hypothetical audience.
In Example 36, the subject matter of any one or more of Examples 19-35 optionally include sending the aggregated engagement level indicator to a device of a member of the plurality of members of the audience.
Example 37 is at least one machine-readable medium including instructions for operation of a computing system, which when executed by a machine, cause the machine to perform operations of any of the methods of Examples 19-36.
Example 38 is an apparatus comprising means for performing any of the methods of Examples 19-36.
Example 39 is an apparatus for indicating audience engagement, the apparatus comprising: means for sending speech from a speaker to be played for a plurality of members of an audience; means for receiving audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience; means for processing the audience reaction information to aggregate the audience reaction information; means for determining, using the processed information, an engagement factor; and means for sending an aggregated engagement level indicator to a speaker device, the aggregated engagement level indicator based on the engagement factor and indicating overall engagement of the audience to the speech.
In Example 40, the subject matter of Example 39 optionally includes wherein the speech includes a video of the speaker.
In Example 41, the subject matter of any one or more of Examples 39-40 optionally include wherein the speaker is remote from the audience.
In Example 42, the subject matter of any one or more of Examples 39-41 optionally include wherein the plurality of sensors include a camera, the camera capturing physical responses of at least one of the plurality of members of the audience while the speech is playing.
In Example 43, the subject matter of Example 42 optionally includes wherein the physical responses include at least one of a facial emotion, a body gesture, or a hand gesture.
In Example 44, the subject matter of any one or more of Examples 39-43 optionally include wherein the plurality of sensors include a wearable sensor, the wearable sensor capturing biometric data from at least one of the plurality of members of the audience while the speech is playing.
In Example 45, the subject matter of any one or more of Examples 39-44 optionally include wherein the audience reaction information is received from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data in an individual or an aggregated form.
In Example 46, the subject matter of any one or more of Examples 39-45 optionally include wherein the means for processing the audience reaction information include means for anonymizing the audience reaction information.
In Example 47, the subject matter of any one or more of Examples 39-46 optionally include wherein the means for determining the engagement factor include means for using data of the plurality of members of the audience, the data including at least one of location data, personal data, cultural data, familiarity data, time data, age data, or gender data.
In Example 48, the subject matter of any one or more of Examples 39-47 optionally include wherein the means for sending the aggregated engagement level indicator include means for sending a plurality of aggregated engagement level indicators corresponding to the plurality of members.
In Example 49, the subject matter of any one or more of Examples 39-48 optionally include wherein the aggregated engagement level indicator is an emoticon.
In Example 50, the subject matter of any one or more of Examples 39-49 optionally include wherein the aggregated engagement level indicator is a 3D avatar.
In Example 51, the subject matter of any one or more of Examples 39-50 optionally include wherein the overall engagement of the audience to the speech is a weighted average of engagement of the plurality of members of the audience.
In Example 52, the subject matter of any one or more of Examples 39-51 optionally include wherein the means for sending the aggregated engagement level indicator include means for sending a series of aggregated engagement level indicators in real time updates to the speaker device, the real time updates adjusted based on received aggregated or individual updated audience reaction information.
In Example 53, the subject matter of any one or more of Examples 39-52 optionally include means for saving the processed information and the speech to memory; and means for correlating audio tones or key words in the speech with emotional responses in the processed information using machine learning, wherein the means for correlating include means for using a time offset to account for a transmission delay.
In Example 54, the subject matter of Example 53 optionally includes means for predicting emotional reactions of a hypothetical audience to a second speech using the correlated audio tones or key words and previous reactions from past audiences stored in the memory.
In Example 55, the subject matter of Example 54 optionally includes means for suggesting modifications to the second speech based on the predicted emotional reactions of the hypothetical audience.
In Example 56, the subject matter of any one or more of Examples 39-55 optionally include means for sending the aggregated engagement level indicator to a device of a member of the plurality of members of the audience.
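Examples 39-52 above describe a pipeline in which per-member reaction data are anonymized, aggregated into a weighted engagement factor, and mapped to an indicator such as an emoticon. The following is a minimal, hypothetical sketch (in Python) of one way such a pipeline could be arranged; the field names, weighting scheme, hash-based anonymization, and emoticon thresholds are illustrative assumptions only and are not the claimed implementation.

import hashlib
from dataclasses import dataclass
from typing import List

@dataclass
class MemberReaction:
    # Reaction captured for one audience member while the speech plays.
    member_id: str            # raw identifier supplied by the member's device or sensor
    engagement_score: float   # 0.0 (disengaged) .. 1.0 (fully engaged), derived from sensor data
    weight: float = 1.0       # optional per-member weight used in the aggregate

def anonymize(reaction: MemberReaction) -> MemberReaction:
    # Replace the member identifier with a one-way hash so the aggregate cannot be
    # traced back to an individual (cf. Example 46).
    hashed = hashlib.sha256(reaction.member_id.encode()).hexdigest()[:16]
    return MemberReaction(hashed, reaction.engagement_score, reaction.weight)

def engagement_factor(reactions: List[MemberReaction]) -> float:
    # Weighted average of per-member engagement (cf. Example 51).
    total_weight = sum(r.weight for r in reactions)
    if total_weight == 0.0:
        return 0.0
    return sum(r.engagement_score * r.weight for r in reactions) / total_weight

def engagement_indicator(factor: float) -> str:
    # Map the engagement factor to an emoticon indicator (cf. Example 49).
    # The thresholds are arbitrary illustrative values.
    if factor >= 0.7:
        return ":-D"
    if factor >= 0.4:
        return ":-|"
    return ":-("

raw = [MemberReaction("member-001", 0.9),
       MemberReaction("member-002", 0.5, weight=2.0),
       MemberReaction("member-003", 0.2)]
processed = [anonymize(r) for r in raw]
factor = engagement_factor(processed)
print(factor, engagement_indicator(factor))   # 0.525 :-|

In this sketch the aggregated engagement level indicator sent to the speaker device would simply be the emoticon string; a series of such indicators could be recomputed and pushed as updated reaction information arrives, consistent with the real-time updates of Example 52.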
Method examples described herein may be machine or computer-implemented at least in part. Some examples may include a computer-readable medium or machine-readable medium encoded with instructions operable to configure an electronic device to perform methods as described in the above examples. An implementation of such methods may include code, such as microcode, assembly language code, higher-level language code, or the like. Such code may include computer-readable instructions for performing various methods. The code may form portions of computer program products. Further, in an example, the code may be tangibly stored on one or more volatile, non-transitory, or non-volatile tangible computer-readable media, such as during execution or at other times. Examples of these tangible computer-readable media may include, but are not limited to, hard disks, removable magnetic disks, removable optical disks (e.g., compact disks and digital video disks), magnetic cassettes, memory cards or sticks, random-access memories (RAMs), read-only memories (ROMs), and the like.
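As one further illustration of such code, the correlation of audio tones or key words in the speech with emotional responses, using a time offset to account for a transmission delay (cf. Example 53 and the final limitations of claims 1, 18, and 22 below), could be sketched as follows. This is a hypothetical example: the timestamped event structures, the fixed delay value, and the counting window are assumptions introduced for illustration.

from dataclasses import dataclass
from typing import Dict, List

@dataclass
class KeywordEvent:
    timestamp: float   # seconds from the start of the speech, on the speaker's timeline
    keyword: str       # key word or audio-tone label detected in the speech

@dataclass
class EmotionSample:
    timestamp: float   # seconds from the start of playback, on the audience's timeline
    emotion: str       # aggregated emotional response label, e.g. "amused" or "bored"

def correlate(keywords: List[KeywordEvent],
              emotions: List[EmotionSample],
              transmission_delay: float = 1.5,
              window: float = 5.0) -> Dict[str, Dict[str, int]]:
    # Count which emotional responses follow each key word within a short window.
    # Audience timestamps are shifted by the transmission delay so the speech
    # timeline and the reaction timeline line up.
    counts: Dict[str, Dict[str, int]] = {}
    for kw in keywords:
        for em in emotions:
            aligned = em.timestamp - transmission_delay   # map reaction onto speech timeline
            if kw.timestamp <= aligned <= kw.timestamp + window:
                counts.setdefault(kw.keyword, {}).setdefault(em.emotion, 0)
                counts[kw.keyword][em.emotion] += 1
    return counts

keywords = [KeywordEvent(10.0, "joke"), KeywordEvent(42.0, "statistics")]
emotions = [EmotionSample(12.0, "amused"), EmotionSample(45.0, "bored")]
print(correlate(keywords, emotions))
# {'joke': {'amused': 1}, 'statistics': {'bored': 1}}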

Claims (24)

What is claimed is:
1. A system for indicating audience engagement, the system comprising:
a server, including processing circuitry and memory, the processing circuitry to:
send speech from a speaker to be played for a plurality of members of an audience;
receive audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience;
process the audience reaction information to aggregate the audience reaction information;
determine, using the processed information, an engagement factor;
send an aggregated engagement level indicator to a speaker device, the aggregated engagement level indicator based on the engagement factor and indicating overall engagement of the audience to the speech;
save the processed information and the speech to the memory; and
correlate audio tones or key words in the speech with emotional responses in the processed information, wherein to correlate includes using a time offset to account for a transmission delay.
2. The system of claim 1, wherein the speech includes a video of the speaker.
3. The system of claim 1, wherein the speaker is remote from the audience.
4. The system of claim 1, wherein the plurality of sensors include a camera, the camera to capture physical responses of at least one of the plurality of members of the audience while the speech is playing.
5. The system of claim 4, wherein the physical responses include at least one of a facial emotion, a body gesture, or a hand gesture.
6. The system of claim 1, wherein the plurality of sensors include a wearable sensor, the wearable sensor to capture biometric data from at least one of the plurality of members of the audience while the speech is playing.
7. The system of claim 1, wherein the audience reaction information is received from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data.
8. The system of claim 1, wherein to process the audience reaction information, the processing circuitry is to anonymize the audience reaction information.
9. The system of claim 1, wherein to determine the engagement factor includes using data of the plurality of members of the audience, the data including at least one of location data, personal data, cultural data, familiarity data, time data, age data, or gender data.
10. The system of claim 1, wherein to send the aggregated engagement level indicator, the processing circuitry is to send a plurality of aggregated engagement level indicators corresponding to the plurality of members.
11. The system of claim 1, wherein the aggregated engagement level indicator is an emoticon.
12. The system of claim 1, wherein the aggregated engagement level indicator is a 3D avatar.
13. The system of claim 1, wherein the overall engagement of the audience to the speech is a weighted average of engagement of the plurality of members of the audience.
14. The system of claim 1, wherein to send the aggregated engagement level indicator, the processing circuitry is to send a series of aggregated engagement level indicators in real time updates to the speaker device, the real time updates adjusted based on received aggregated or individual updated audience reaction information.
15. The system of claim 1, wherein the processing circuitry is further to predict emotional reactions of a hypothetical audience to a second speech using the correlated audio tones or key words and previous reactions from past audiences stored in the memory.
16. The system of claim 15, wherein the processing circuitry is further to suggest modifications to the second speech based on the predicted emotional reactions of the hypothetical audience.
17. The system of claim 1, wherein the processing circuitry is further to send the aggregated engagement level indicator to a device of a member of the plurality of members of the audience.
18. A method for indicating audience engagement, the method comprising:
sending speech from a speaker to be played for a plurality of members of an audience;
receiving audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience;
processing the audience reaction information to aggregate the audience reaction information;
determining, using the processed information, an engagement factor;
sending an aggregated engagement level indicator to a speaker device, the aggregated engagement level indicator based on the engagement factor and indicating overall engagement of the audience to the speech;
saving the processed information and the speech to a memory; and
correlating audio tones or key words in the speech with emotional responses in the processed information, wherein correlating includes using a time offset to account for a transmission delay.
19. The method of claim 18, wherein the speech includes a video of the speaker.
20. The method of claim 18, wherein the speaker is remote from the audience.
21. The method of claim 18, wherein the plurality of sensors include a camera, the camera capturing physical responses of at least one of the plurality of members of the audience while the speech is playing.
22. At least one non-transitory machine-readable medium including instructions for operation of a computing system, which when executed by a machine, cause the machine to:
send speech from a speaker to be played for a plurality of members of an audience;
receive audience reaction information, for the plurality of members of the audience, from a plurality of sensors, the audience reaction information captured by the plurality of sensors while the speech is playing for the plurality of members of the audience;
process the audience reaction information to aggregate the audience reaction information;
determine, using the processed information, an engagement factor;
send an aggregated engagement level indicator to a speaker device, the aggregated engagement level indicator based on the engagement factor and indicating overall engagement of the audience to the speech;
save the processed information and the speech to a memory; and
correlate audio tones or key words in the speech with emotional responses in the processed information, wherein to correlate includes using a time offset to account for a transmission delay.
23. The at least one machine-readable medium of claim 22, wherein the plurality of sensors include a wearable sensor, the wearable sensor capturing biometric data from at least one of the plurality of members of the audience while the speech is playing.
24. The at least one machine-readable medium of claim 22, wherein the audience reaction information is received from at least one of the plurality of members of the audience in response to the at least one of the plurality of members of the audience opting in to provide data in an individual or an aggregated form.
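Claims 15 and 16 (and Examples 54 and 55 above) further recite predicting emotional reactions of a hypothetical audience to a second speech from stored correlations and suggesting modifications based on those predictions. The sketch below is likewise hypothetical: it assumes the correlation dictionary produced by the previous sketch and uses a simple majority vote in place of the machine-learning model referenced in the claims.

from typing import Dict, List

def predict_reactions(correlations: Dict[str, Dict[str, int]],
                      second_speech_keywords: List[str]) -> Dict[str, str]:
    # For each key word of the new speech, predict the most frequent past reaction.
    predictions: Dict[str, str] = {}
    for kw in second_speech_keywords:
        history = correlations.get(kw)
        if history:
            predictions[kw] = max(history, key=history.get)
    return predictions

def suggest_modifications(predictions: Dict[str, str],
                          undesired: tuple = ("bored", "confused")) -> List[str]:
    # Flag passages predicted to produce undesired reactions.
    return ["Consider reworking the passage around '%s'; past audiences reacted '%s'." % (kw, emo)
            for kw, emo in predictions.items() if emo in undesired]

stored = {"statistics": {"bored": 7, "amused": 1}, "joke": {"amused": 9}}
preds = predict_reactions(stored, ["statistics", "joke", "conclusion"])
print(preds)                         # {'statistics': 'bored', 'joke': 'amused'}
print(suggest_modifications(preds))  # one suggestion, for 'statistics'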
US15/274,338 2016-09-23 2016-09-23 Audience engagement feedback systems and techniques Expired - Fee Related US9843768B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US15/274,338 US9843768B1 (en) 2016-09-23 2016-09-23 Audience engagement feedback systems and techniques

Publications (1)

Publication Number Publication Date
US9843768B1 true US9843768B1 (en) 2017-12-12

Family

ID=60516423

Country Status (1)

Country Link
US (1) US9843768B1 (en)

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160255384A1 (en) * 2006-09-05 2016-09-01 The Nielsen Company (Us), Llc Method and system for predicting audience viewing behavior
US20140237381A1 (en) * 2008-09-26 2014-08-21 Deep Rock Drive Partners Inc. Interactive Live Political Events
US20130205314A1 (en) * 2012-02-07 2013-08-08 Arun Ramaswamy Methods and apparatus to select media based on engagement levels
US20130339875A1 (en) * 2012-06-14 2013-12-19 Adobe Systems Inc. Method and apparatus for presenting a participant engagement level in an online interaction
US20140278933A1 (en) * 2013-03-15 2014-09-18 F. Gavin McMillan Methods and apparatus to measure audience engagement with media
US20160086205A1 (en) * 2014-09-23 2016-03-24 Brilliant Lime, Inc. Correcting for Poll Bias

Cited By (56)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12137186B2 (en) 2014-01-08 2024-11-05 Callminer, Inc. Customer journey contact linking to determine root cause and loyalty
US10582056B2 (en) 2014-01-08 2020-03-03 Callminer, Inc. Communication channel customer journey
US10992807B2 (en) * 2014-01-08 2021-04-27 Callminer, Inc. System and method for searching content using acoustic characteristics
US11277516B2 (en) 2014-01-08 2022-03-15 Callminer, Inc. System and method for AB testing based on communication content
US10645224B2 (en) 2014-01-08 2020-05-05 Callminer, Inc. System and method of categorizing communications
US12219093B2 (en) 2014-01-08 2025-02-04 Callminer, Inc. System and method of determining topics of a communication
US20190245972A1 (en) * 2014-01-08 2019-08-08 Callminer, Inc. System and method for searching content using acoustic characteristics
US10601992B2 (en) 2014-01-08 2020-03-24 Callminer, Inc. Contact center agent coaching tool
US11403961B2 (en) * 2014-08-13 2022-08-02 Pitchvantage Llc Public speaking trainer with 3-D simulation and real-time feedback
US11798431B2 (en) 2014-08-13 2023-10-24 Pitchvantage Llc Public speaking trainer with 3-D simulation and real-time feedback
US10108262B2 (en) 2016-05-31 2018-10-23 Paypal, Inc. User physical attribute based device and content management system
US11340699B2 (en) 2016-05-31 2022-05-24 Paypal, Inc. User physical attribute based device and content management system
US10037080B2 (en) * 2016-05-31 2018-07-31 Paypal, Inc. User physical attribute based device and content management system
US20170344109A1 (en) * 2016-05-31 2017-11-30 Paypal, Inc. User physical attribute based device and content management system
US11983313B2 (en) 2016-05-31 2024-05-14 Paypal, Inc. User physical attribute based device and content management system
US20180070052A1 (en) * 2016-09-07 2018-03-08 Panasonic Intellectual Property Management Co., Ltd. Information processing method, information processing system, and information processing apparatus
US10523897B2 (en) * 2016-09-07 2019-12-31 Panasonic Intellectual Property Management Co., Ltd. Information processing method, information processing system, and information processing apparatus
US20180295158A1 (en) * 2017-04-05 2018-10-11 Microsoft Technology Licensing, Llc Displaying group expressions for teleconference sessions
US10798337B2 (en) * 2017-10-06 2020-10-06 Fuji Xerox Co., Ltd. Communication device, communication system, and non-transitory computer readable medium storing program
US20190110022A1 (en) * 2017-10-06 2019-04-11 Fuji Xerox Co.,Ltd. Communication device, communication system, and non-transitory computer readable medium storing program
US11228625B1 (en) * 2018-02-02 2022-01-18 mmhmm inc. AI director for automatic segmentation, participant behavior analysis and moderation of video conferences
US10462422B1 (en) * 2018-04-09 2019-10-29 Facebook, Inc. Audio selection based on user engagement
CN112262367A (en) * 2018-04-09 2021-01-22 脸谱公司 Audio selection based on user engagement
US10838689B2 (en) * 2018-04-09 2020-11-17 Facebook, Inc. Audio selection based on user engagement
US20200050420A1 (en) * 2018-04-09 2020-02-13 Facebook, Inc. Audio selection based on user engagement
US11610044B2 (en) * 2018-07-30 2023-03-21 Primer Technologies, Inc. Dynamic management of content in an electronic presentation
US20200135045A1 (en) * 2018-12-27 2020-04-30 Intel Corporation Engagement level determination and dissemination
US11908345B2 (en) * 2018-12-27 2024-02-20 Intel Corporation Engagement level determination and dissemination
US11455599B2 (en) 2019-04-02 2022-09-27 Educational Measures, LLC Systems and methods for improved meeting engagement
US11328253B2 (en) * 2019-04-02 2022-05-10 Educational Measures, LLC Systems and methods for improved meeting engagement
US11093903B2 (en) * 2019-05-20 2021-08-17 International Business Machines Corporation Monitoring meeting participation level
CN110719436A (en) * 2019-10-17 2020-01-21 浙江同花顺智能科技有限公司 Conference document information acquisition method and device and related equipment
USD984457S1 (en) 2020-06-19 2023-04-25 Airbnb, Inc. Display screen of a programmed computer system with graphical user interface
US20210400236A1 (en) * 2020-06-19 2021-12-23 Airbnb, Inc. Aggregating audience member emotes in large-scale electronic presentation
US11552812B2 (en) * 2020-06-19 2023-01-10 Airbnb, Inc. Outputting emotes based on audience member expressions in large-scale electronic presentation
USD1070907S1 (en) 2020-06-19 2025-04-15 Airbnb, Inc. Display screen of a programmed computer system with graphical user interface
US20210398032A1 (en) * 2020-06-19 2021-12-23 Airbnb, Inc. Incorporating individual audience member participation and feedback in large-scale electronic presentation
USD1070908S1 (en) 2020-06-19 2025-04-15 Airbnb, Inc. Display screen of a programmed computer system with graphical user interface
USD985005S1 (en) 2020-06-19 2023-05-02 Airbnb, Inc. Display screen of a programmed computer system with graphical user interface
US11979245B2 (en) 2020-06-19 2024-05-07 Airbnb, Inc. Augmenting audience member emotes in large-scale electronic presentation
US11646905B2 (en) * 2020-06-19 2023-05-09 Airbnb, Inc. Aggregating audience member emotes in large-scale electronic presentation
US12021643B2 (en) 2020-06-19 2024-06-25 Airbnb, Inc. Outputting emotes based on audience member expressions in large-scale electronic presentation
US11991013B2 (en) * 2020-06-19 2024-05-21 Airbnb, Inc. Incorporating individual audience member participation and feedback in large-scale electronic presentation
WO2022031335A1 (en) * 2020-08-03 2022-02-10 Microsoft Technology Licensing, Llc Automatic reaction-triggering for live presentations
US11570307B2 (en) 2020-08-03 2023-01-31 Microsoft Technology Licensing, Llc Automatic reaction-triggering for live presentations
EP4195676A4 (en) * 2020-08-28 2024-01-24 Huawei Technologies Co., Ltd. VIDEO INTERACTION METHOD AND APPARATUS
JP2022055372A (en) * 2020-09-23 2022-04-08 三菱電機インフォメーションネットワーク株式会社 Dynamic element display system, online conference terminal, and dynamic element display program
WO2022093392A1 (en) * 2020-10-30 2022-05-05 Microsoft Technology Licensing, Llc Techniques for rich interaction in remote live presentation and accurate suggestion for rehearsal through audience video analysis
US12026948B2 (en) 2020-10-30 2024-07-02 Microsoft Technology Licensing, Llc Techniques for presentation analysis based on audience feedback, reactions, and gestures
WO2022093356A1 (en) * 2020-10-30 2022-05-05 Microsoft Technology Licensing, Llc Techniques for presentation analysis based on audience feedback, reactions, and gestures
EP4245037A1 (en) * 2020-12-18 2023-09-20 Sony Group Corporation Simulating audience reactions for performers on camera
WO2022130298A1 (en) * 2020-12-18 2022-06-23 Sony Group Corporation Simulating audience reactions for performers on camera
CN112954426A (en) * 2021-02-07 2021-06-11 咪咕文化科技有限公司 Video playing method, electronic equipment and storage medium
US11943074B2 (en) 2021-10-29 2024-03-26 Zoom Video Communications, Inc. Real-time video-based audience reaction sentiment analysis
WO2023075980A1 (en) * 2021-10-29 2023-05-04 Zoom Video Communications, Inc. Real-time video-based audience reaction sentiment analysis
WO2023233482A1 (en) * 2022-05-30 2023-12-07 日本電信電話株式会社 Conversation support device, conversation support method, and program

Similar Documents

Publication Publication Date Title
US9843768B1 (en) Audience engagement feedback systems and techniques
US12231809B2 (en) Optimizing video conferencing using contextual information
US10546235B2 (en) Relativistic sentiment analyzer
US11108991B2 (en) Method and apparatus for contextual inclusion of objects in a conference
Phua et al. Uses and gratifications of social networking sites for bridging and bonding social capital: A comparison of Facebook, Twitter, Instagram, and Snapchat
US20210076002A1 (en) Enhanced video conference management
US10425694B2 (en) Live broadcast on an online social network
Bricker et al. Randomized, controlled pilot trial of a smartphone app for smoking cessation using acceptance and commitment therapy
JP6734852B2 (en) System and method for tracking events and providing virtual conference feedback
US10073905B2 (en) Remote control and modification of live presentation
US20180191792A1 (en) Live Broadcast on an Online Social Network
US20160188125A1 (en) Method to include interactive objects in presentation
US11343460B2 (en) Participation management system for videoconferencing
US20170228135A1 (en) System and method for dynamic user interface gamification in conference calls
US20160128617A1 (en) Social cuing based on in-context observation
Lane et al. Making it Facebook official: The warranting value of online relationship status disclosures on relational characteristics
US20140136626A1 (en) Interactive Presentations
CN107533714B (en) System and method for enabling efficient digital marketing on portable wireless devices for parties with low capabilities
US9942516B1 (en) Optimizing video conferencing using contextual information
CN111369275B (en) Identifying and describing group methods, coordinating devices, and computer-readable storage media
Bell et al. Networking in the digital age: Identifying factors that influence adolescents’ online communication and relationship building
KR102467785B1 (en) Method for sharing schedule
US20150228034A1 (en) Photo booth system
US20230230496A1 (en) Smart phone usage monitoring and management system
KR102088410B1 (en) Method, system and non-transitory computer-readable recording medium for controlling flow of advertising contents based on video chat

Legal Events

Date Code Title Description
AS Assignment

Owner name: INTEL CORPORATION, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NEGI, ANSUYA;TATOURIAN, IGOR;WOUHAYBI, RITA H.;SIGNING DATES FROM 20161024 TO 20161025;REEL/FRAME:041782/0765

STCF Information on status: patent grant

Free format text: PATENTED CASE

FEPP Fee payment procedure

Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

LAPS Lapse for failure to pay maintenance fees

Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20211212
