USRE38401E1 - Interactive video icon with designated viewing position - Google Patents
Interactive video icon with designated viewing position Download PDFInfo
- Publication number
- USRE38401E1 USRE38401E1 US09/662,481 US66248100A USRE38401E US RE38401 E1 USRE38401 E1 US RE38401E1 US 66248100 A US66248100 A US 66248100A US RE38401 E USRE38401 E US RE38401E
- Authority
- US
- United States
- Prior art keywords
- frames
- image
- interface
- video
- root
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000002452 interceptive effect Effects 0.000 title claims abstract description 34
- 230000000694 effects Effects 0.000 claims abstract description 58
- 238000000926 separation method Methods 0.000 claims abstract description 5
- 230000033001 locomotion Effects 0.000 claims description 40
- 238000000034 method Methods 0.000 claims description 39
- 238000013515 script Methods 0.000 claims description 33
- 230000008569 process Effects 0.000 claims description 27
- 230000000007 visual effect Effects 0.000 claims description 14
- 238000013459 approach Methods 0.000 claims description 11
- 230000004044 response Effects 0.000 claims description 11
- 230000008859 change Effects 0.000 claims description 10
- 238000004422 calculation algorithm Methods 0.000 claims description 8
- 238000012545 processing Methods 0.000 claims description 8
- 230000001419 dependent effect Effects 0.000 claims description 4
- 230000000750 progressive effect Effects 0.000 claims 1
- 230000003993 interaction Effects 0.000 abstract description 11
- 238000013461 design Methods 0.000 abstract description 4
- 238000012550 audit Methods 0.000 description 9
- 238000000605 extraction Methods 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 238000012938 design process Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 3
- 238000005452 bending Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 238000000354 decomposition reaction Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 230000011218 segmentation Effects 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- UKUVVAMSXXBMRX-UHFFFAOYSA-N 2,4,5-trithia-1,3-diarsabicyclo[1.1.1]pentane Chemical compound S1[As]2S[As]1S2 UKUVVAMSXXBMRX-UHFFFAOYSA-N 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 230000004424 eye movement Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
- G06F16/738—Presentation of query results
- G06F16/739—Presentation of query results in form of a video summary, e.g. the video summary being a video sequence, a composite still image or having synthesized frames
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/54—Browsing; Visualisation therefor
Definitions
- the present invention relates to the field of interfaces for video information. More particularly, the present invention provides interactive interfaces for video information and toolkits for use in creation of such interactive interfaces.
- Video information is being produced at an ever-increasing rate and video sequences, especially short sequences, are increasingly being used, for example, in websites and on CD-ROM, and being created, for example, by domestic use of camcorders.
- video information consists of a sequence of frames recorded at a fixed time interval.
- the video information consists of 25 or 30 frames per second.
- Each frame is meaningful since it corresponds to an image which can be viewed.
- a frame may be made up of a number of interlaced fields, but this is not obligatory as is seen from more recently proposed video formats, such as those intended for high definition television.
- Frames describe the temporal decomposition of the video image information.
- Each frame contains image information structured in terms of lines and pixels, which represent the spatial decomposition of the video.
- video information or “video sequences” refer to data representing a visual image recorded over a given time period, without reference to the length of that time period or the structure of the recorded information.
- video sequence will be used to refer to any series of video frames, regardless of whether this series corresponds to a single camera shot (recorded between two cuts) or to a plurality of shots or scenes.
- the coded content may, for example, identify the types of objects present in the video sequence, their properties/motion, the type of camera movements involved in the video sequence (pan, tracking shot, zoom, etc.), and other properties.
- a “summary” of the coded document may be prepared, consisting of certain representative frames taken from the sequence, together with text information or icons indicating how the sequence has been coded.
- the interface for interacting with the video database typically includes a computer input device enabling the user to specify objects or properties of interest and, in response to the query, the computer determines which video sequences in the database correspond to the input search terms and displays the appropriate “summaries”. The user then indicates whether or not a particular video sequence should be reproduced. Examples of products using this approach are described in the article “Advanced Imaging Product Survey: Photo, Document and Video” from the journal “Advances Imaging”, October 1994, which document is incorporated herein by this reference.
- the video sequence is divided up into shorter series of frames based upon the scene changes or the semantic content of the video information.
- a hierarchical structure may be defined. Index “summaries” may be produced for the different series of frames corresponding to nodes in the hierarchical structure.
- the “summary” corresponding to a complete video sequence may be retrieved for display to the user who is then allowed to request display of “summaries” relating to sub-sections of the video sequence which are lower down in the hierarchical structure. If the user so wishes, a selected sequence or sub-section is reproduced on the display monitor.
- Such a scheme is described in EP-A-0 555 028 which is incorporated herein by this reference.
- Another approach derived from the field of video editing, consists of the “digital storyboard”.
- the video sequence is segmented into scenes and one or more representative frames from each scene is selected and displayed, usually accompanied by text information, side-by-side with representative frames from other segments.
- the user now has both a visual overview of all the scenes and a direct visual access to individual scenes.
- Each representative frame of the storyboard can be considered to be an icon. Selection of the icon via a pointing device (typically a mouse-controlled cursor) causes the associated video sequence or subsequence to be reproduced.
- Typical layouts for the storyboards are two-dimensional arrays or long one-dimensional strips. In the first case, the user scans the icons from the left to the right, line by line, whereas in the second case the user needs to move the strip across the screen.
- Digital storyboards are typically created by a video editor who views the video sequence, segments the data into individual scenes and places each scene, with a descriptive comment, onto the storyboard. As is well-known from technical literature, many steps of this process can be automated. For example, different techniques for automatic detection of scene changes are discussed in the following documents, each of which is incorporated herein by reference:
- a “trace” consisting of a single frame having superimposed images taken from different frames of the video sequence, these images being offset one from the other due to motion occurring between the different frames from which the images were taken.
- the corresponding “trace” will include multiple (probably overlapping) images of the sprinter, spaced in the direction in which the sprinter is running.
- Another approach of this kind generates a composite image, called a “salient still”, representative of the video sequence—see “Salient Video Stills: Content and Context Preserved” by Teodosio et al, Proc. ACM Multimedia 93, California, Aug. 1-6, 1993), pp 39-47 which article is incorporated herein by this reference in its entirety.
- Still another approach of this general type consists in creation of a “video icon”, as described in the papers “Developing Power Tools for Video Indexinor and retrieval” by Zhang et al, SPIE, Vol.2185, pp 140-149-, and “Video Representation tools using a unified object and perspective based approach” by the present inventors, IS&T/SPIE Conference on Storage and Perusal for Image and Video Databases, San Jose, Calif., February 1995 which are incorporated herein by reference.
- a “video icon”, as illustrated in FIG. 1A the scene is represented by a number of frames selected from the sequence and which are displayed as if they were stacked up one behind the other in the z-direction and are viewed in perspective.
- each individual frame is represented by a plane and the planes lie one behind the other with a slight offset.
- the first frame of the stack is displayed in its entirety whereas underlying frames are partially occluded by the frames in front.
- the envelope of the stack of frames has a parallelepiped shape.
- object based video icons Two special types have been proposed, “object based” video icons and video icons containing a representation of camera movement.
- object based video icon as illustrated in FIG. 1B, objects of interest are isolated in the individual frames and, for at least some of the stacked frames, the only image information included in the video icon is the image information corresponding to the selected object.
- video icon at least some of the individual frames are represented as if they were transparent except in the regions containing the selected object.
- Video icons containing an indication of camera movement may have, as illustrated in the example of FIG. 1C, a serpentine-shaped envelope corresponding to the case of side-to-side motion of the camera.
- the video icons discussed above present the user with information concerning the content of the whole of a video sequence and serve as a selection tool allowing the user to access -frames of the video sequence out of the usual order. In other words, these icons allow non-sequential access to the video sequence. Nevertheless, the ways in which the user can interact with the video sequence information are strictly limited. The user can select frames for playback in a non-sequential way but he has little or no means of obtaining a deeper level of information concerning the video sequence as a whole, short of watching a playback of the whole sequence.
- the present invention provides a novel type of interface to video information which allows the user to access information concerning a video sequence in a highly versatile manner.
- interactive video interfaces of the present invention enable a user to obtain deeper levels of information concerning an associated video sequence at positions in the sequence which are designated by the user as being of interest.
- the present invention provides an interface to information concerning an associated video sequence, one such interface comprising:
- the root image consisting of a plurality of basic frames selected from said video sequence, and/or a plurality of portions of video frames corresponding to selected objects represented in the video sequence, x and y directions in the root image corresponding to x and y directions in the video frames and the z direction in the root image corresponding to the time axis whereby the basic frames are spaced apart from one another in the z direction of the root image by distances corresponding to the time separation between the respective video frames;
- customized user interfaces may be created for video sequences.
- These interfaces comprise a displayable “root” image which directly represents the content and context of the image information in the video sequence and can be manipulated, either automatically or by the user, in order to display further image information, by designation of a viewing position with respect to the root image, the representation of the displayed image being changed in response to changes in the designated viewing position.
- the representation of the displayed image changes dependent upon the designated viewing position as if the root image were a three-dimensional object.
- the data necessary to form the displayed representation of the root image is calculated so as to provide the correct perspective view given the viewing angle, the distance separating the viewing position from the displayed quasi-object and whether the viewing position is above or below the displayed quasi-object.
- the present invention can provide non-interactive interfaces to video sequences, in which the root image information is packaged with an associated script defining a routine for automatically displaying a sequence of different views of the root image and performing a set of manipulations on the displayed image, no user manipulation being permitted.
- the full benefits of the invention are best seen in interactive interfaces where the viewing position of the root image is designated by the user, as follows.
- the user first accesses the interface he is presented with a displayed image which represents the root image seen from a particular viewpoint (which may be a predetermined reference viewpoint).
- the displayed image represents the root image seen from different perspectives.
- the displayed image increases or reduces the size and, preferably, resolution of the displayed information, accessing image data from additional video frames, if need be.
- the customized, interactive interfaces provided by the present invention involve displayed images, representing the respective associated video sequences, which, in some ways, could be considered to be a navigable environment or a manipulable object.
- This environment or object is a quasi-three-dimensional entity.
- the x and y dimensions of the environment/object correspond to true spatial dimensions (corresponding to the x and y direction in the associated video frames) whereas the z dimension of the environment/object corresponds to the time axis.
- These interfaces could be considered to constitute a development of the “video icons” discussed above, now rendered interactive and manipulable by the user.
- the user can select spatial and temporal information from a video sequence for access by designating a viewing position with respect to a video icon representing the video sequence.
- Arbitrarily chosen oblique “viewing direction” are possible whereby the user simultaneously accesses image information corresponding to portions of a number of different frames in the video sequence.
- the user's viewing position relative to the video icon changes, the amount of a given frame which is visible to him, and the number and selection of frames which he can see, changes correspondingly.
- the interactive video interfaces of the present invention make use of a “root” image comprising a plurality of basic frames arranged to form a quasi-three dimensional object. It is preferred that the relative placement positions of the basic frames be arranged so as to indicate visually some underlying motion in the video sequence.
- the envelope of the set of basic frames preferably does not have a parallelpiped shape but, instead, composes a “pipe” of rectangular section and bending, in a way corresponding to the camera travel during filming of the video sequence.
- the basic video frames making up the root image are chosen as a function of the amount of motion or change in the sequence.
- successive basic frames should include back-round information overlapping by, say, 50%.
- the root image corresponds to an “object-based video icon.”
- certain basic frames included in the root image are not included therein in full; only those portions corresponding to selected objects are included.
- certain basic frames may be included in full in the root image but may include “hot objects,” that is, representations of objects selectable by the user.
- the corresponding basic frames and, if necessary, additional frames
- the root image allows the user to selectively isolate objects of interest in the video sequence and obtain at a glance a visual impression of the appearance and movement of the objects during the video sequence.
- the interfaces of the present invention allow the user to select an arbitrary portion of the video sequence for playback.
- the user designates a portion of the video sequence which is of interest, by designating a corresponding portion of the displayed image forming part of the interface to the video sequence. This portion of the video sequence is than played back.
- the interface may include a displayed set of controls similar to those provided on a VCR in order to permit the user to select different modes for this playback, such as fast-forward, rewind, etc.
- the displayed image forming part of the interface remains visible whilst the designated portion of the sequence is being played back.
- This can be achieved in any number of ways, as for example, by providing a second display device upon which the playback takes place, or by designating a “playback window” on the display screen, this playback window being offset with respect to the screen area used by the interface, or by any other suitable means.
- the preferred embodiments of interfaces according to the invention also permit the user to designate an object of interest and to select a playback mode in which only image information concerning that selected object is included in the playback. Furthermore, the user can select a single frame from the video sequence for display separately from the interactive displayed image generated by the interface.
- the interfaces of the present invention allow the user to generate a displayed image corresponding to a distortion of the root image.
- the displayed image can correspond to the root image subjected to an “accordion effect”, where the root image is “cracked open”, for example, by bending around a bend line so as to “fan out” video frames in the vicinity of the opening point, or is modified by linearly spreading apart video frames at a point of interest.
- the accordion effect can also be applied repetitively or otherwise in a nested fashion according to the present invention.
- the present invention can provide user interfaces to “multi-threaded” video sequences, that is, video sequences consisting of numerous interrelated shorter segments such as are found, for example, in a video game where the user's choices change the scene which is displayed.
- Interfaces to such multi-threaded video sequences can include frames of the different video segments in the root image, such that the root image has a branching structure. Alternatively, some or all of the different threads may not be visible in the root image but may become visible as a result of user manipulation.
- a pointing device such as a mouse, or by touching a touch screen, etc.
- image portions for these different threads may be added to the displayed image.
- the root image for the video sequence concerned is associated with information defining how the corresponding displayed image will change in response to given types of user manipulation.
- this associated information may define how many, or which additional frames are displayed when the user moves the viewing position closer up to the root image.
- the associated information may identify which objects in the scene are “hot objects” and what image information will be displayed in relation to these hot objects when activated by the user.
- the user who is interested in a particular video sequence may first download only certain components of the associated interface. First of all he downloads information for generating a displayed view of the root image, together with an associated application program (if he does not already have an appropriate “interface player” loaded in his computer).
- the downloaded (or already-resident) application program includes basic routines for chancing the perspective of the displayed image in response to changes in the viewing position designated by the user.
- the application program is also adapted to consult any “associated information” (as mentioned above) which forms part of the interface and conditions the way in which the displayed image changes in response to certain predetermined user manipulations (such as “zoom-in” and “activate object”). If the interface does not contain any such “associated information” then the application program makes use of pre-set default parameters.
- the root image corresponds to a particular set of basic video frames and information designating relative placement positions thereof.
- the root image information downloaded to the user may include just the data necessary to create a reference view of the root image or it may include the image data for the set of basic frames (in order to enable the changes in user viewing angle to be catered for without the need to download additional information).
- this extra information can either be pre-packaged and supplied with the root image information or the extra information can be downloaded from the host website as and when it is needed.
- the present invention also provides apparatus for creation of interfaces according to the present invention.
- This may be dedicated hardware or, more preferably, a computer system programmed in accordance with specially designed computer programs.
- the selection of basic frames for inclusion in the “root image” of the interface can be made automatically according to one of a number of different algorithms, such as choosing one frame every n frames, or choosing 1 frame every time the camera movement has displaced he background by m%, etc.
- the relative placement positions of the basic frames in the root image can be set automatically taking into account the time separation between those frames and, if desired, other factors such as camera motion.
- the presence of objects or people in the video sequence can be detected automatically according to one of the known algorithms (such as those discussed in the references cited above), and an “object oriented” root image can be created automatically.
- the interface creation apparatus of the present invention has the capability of automatically processing video sequence information in order to produce a root image.
- These embodiments include means for associating with the root image a standard set of routines for changing the representation of the displayed image in response to user manipulations.
- the present invention provides a toolkit for use in creation of customized interfaces.
- the toolkit enables a designer to tailor the configuration and content of the root image, as well as to specify which objects in the video sequence are “hot objects” and to control the way in which the displayed interface image will change in response to manipulation by an end user.
- the toolkit enables the interface designer to determine which frames of the video sequence should be used as basic frames in the root image, and how many additional frames are added to the displayed image when the user designates a viewing position close to the root image.
- FIGS. 1A-C illustrates various types of video icon, wherein FIG. 1A shows an ordinary video icon, FIG. 1B shows an object-based video icon and FIG. 1C shows a video icon including a representation of camera motion;
- FIG. 2 is a block diagram indicating the components of an interactive interface according to a first embodiment of the present invention
- FIG. 3 is a diagram illustrating the content of the interface data file (FDI) used in the first embodiment of the invention
- FIG. 4 is a diagram illustrating a reference view of a root image and three viewing, positions designated by a user
- FIGS. 5A-C illustrate the displayed image in the case of the root image viewed from the different viewing positions of FIG. 4, wherein FIG. 5A represents the displayed image from viewing position A, wherein FIG. 5B represent the displayed image from viewing position B, and wherein FIG. 5C represents the displayed image from viewing position C;
- FIGS. 6A-B illustrate displayed images based on more complex root images according to the present invention, in which FIG. 6A is derived from a root image visually representing motion and FIG. 6B is derived from a root image visually representing a zoom effect;
- FIGS. 7A-B illustrate the effect of user selection of an object represented in the displayed image, in a second embodiment of interface according to the present invention
- FIG. 8 illustrates a user manipulation of a root image to produce an “according effect”
- FIG. 9 illustrates a displayed image corresponding to a view of a branching root image associated with a multi-threaded scenario
- FIG. 10 is a flow diagram indicating steps in a preferred process of designing an interface according to the present invention.
- FIG. 11 is a schematic representation of a preferred embodiment of an interface editor unit according to the present invention.
- FIG. 12 is a schematic representation of a preferred embodiment of an interface viewer according to the present invention.
- an interactive interface of the invention is associated with video sequences recorded on a CD-ROM.
- a CD-ROM reader 1 is connected to a computer system including a central processor portion 2 , a display screen 3 , and a user-operable input device which, in this case, includes a keyboard 4 and a mouse 5 .
- a user-operable input device which, in this case, includes a keyboard 4 and a mouse 5 .
- the user wishes to consult video sequences recorded on a CD-ROM 7 , he places the CD-ROM 7 in the CD-ROM reader and activates CD-ROM accessing software provided in the central processor portion 2 or an associated memory or unit.
- the CD-ROM has recorded thereon not only the video sequence image information 8 (in any convenient format), but also a respective interface data file (FDI i ) 10 for each video sequence, together with a video interface application program 11 .
- FDI i respective interface data file
- FIG. 3 The content of a typical data file is illustrated in FIG. 3 .
- Respective scripts 12 are optionally associated with the interface data files.
- FIG. 4 illustrates a simple interactive video icon according to the present invention.
- this video icon is represented on the display screen as a set of superposed images arranged within an envelope having the shape of a regular parallelepiped.
- Each of the superposed images corresponds to a video frame selected from the video sequence, but these frames are offset from one another. It may be considered that the displayed image corresponds to a cuboid viewed from a particular viewing position (above and to the right, in this example).
- This cuboid is a theoretical construct consisting of the set of selected video frames disposed such that their respective x and y axes correspond to the x and y axes of the cuboid and the z axis of the cuboid corresponds to the time axis.
- the selected frames are spaced apart in the z direction in accordance with their respective time separations in the video sequence.
- FIG. 4 When the user seeks to explore the video sequence via the interactive video icon displayed on the display screen, one of the basic operations he can perform is to designate a position on the screen as a viewing position relative to the displayed image (e.g. by “clicking” with the computer mouse). In FIG. 4, three such designated viewing positions are indicated by the letters A, B and C.
- FIGS. 5A, 5 B and 5 C correspond to “viewing positions” A, B and C, respectively, of FIG. 4 .
- the image displayed to the user changes so as to provide a perspective view of the theoretical cuboid as seen from an angle corresponding to the viewing position designated by the user.
- the above-mentioned cuboid is a special case of a “root image” according to the present invention.
- This “root image” is derived from the video sequence and conveys information concerning both the image content of the selected sub-set of frames (called below, “basic frames”) and the relative “position” of that image information in time as well as space.
- the “root image” is defined by information in the interface data file. The definition specifies which video frames are “basic frames” (for example, by storing the relevant frame numbers), as well as specifying the placement positions of the basic frames relative to one another within the root image.
- the central processor portion 2 of the computer system calculates the image data required to generate the displayed image from the root image definition contained in the appropriate interface data file, image data of the basic frames (and, where required, additional frames) and the viewing position designated by the user, using, standard ray-tracing techniques.
- the data required to generated the displayed image is loaded into the video buffer and displayed on the display screen.
- the image information in the area of interest should be enriched. This is achieved by including, in the displayed image, image data relating to additional video frames besides the basic video frames. Such a case is illustrated in FIG. 5B, where the basic frames BF 5 and BF 6 are displayed together with additional frames AF 1 and AF 2 . As the user-designated viewing position approaches closer and closer to the displayed image the video interface application program causes closely spaced additional frames to be added to the displayed image. Ultimately, successive video frames of the video sequences may be included in the displayed image. As is clear from FIG. 5B, image information corresponding to parts of the root image distant from the area of interest may be omitted from the displayed “closeup” image.
- the interface data file includes data specifying how the choice should be made of additional frames to be added as the user “moves close up” to the displayed image. More preferably, this data defines rules governing the choice of how many, and which, additional frames should be used to enrich the displayed image as the designated viewing position changes. These rules can, for example, define a mathematical relationship between the number of displayed frames and the distance separating the designated viewing position and the displayed quasi-object. In preferred embodiments of the invention, the number of frames which are added to the display as the viewing position approaches the displayed quasi-object depends upon the amount of motion or change in the video sequence at that location.
- the example illustrated in FIG. 4 is a simplification in which the displayed image corresponds to a root image having a simple, cuboid shape.
- the root image may have a variety of different forms.
- the relative placement positions of the basic frames may be selected such that the envelope of the root image has a shape which reflects motion in the corresponding video sequence (either camera motion, during tracking shots and the like, or motion of objects represented in the sequence)—see the corresponding interactive icon shown in FIG. 6 A.
- the dimensions of the basic frames in the root image may be scaled so as to visually represent a zoom effect occurring in the video sequence -see the corresponding interactive icon shown in FIG. 6 B.
- the interactive icon represented in FIG. 6B includes certain frames for which only a portion of the image information has been displayed. This corresponds to a case when an object of special interest has been selected.
- object selection can be made in various ways.
- the root image may be designed such that, instead of including basic frames in full, only those portions of frames which represent a particular object are included. This involves a choice being made, at the time of design of the root image portion of the interface, concerning which objects are interesting. The designer can alternatively or additionally decide that the root image will include basic frames in full but that certain objects represented in the video sequence are to be “selectable” or “extractable” at user request. This feature will now be discussed with reference to FIG. 7 .
- FIG. 7A illustrates an initial view presented to a user when he consults the interface for a particular selected video sequence.
- this sequence two people walk towards each other and their paths cross.
- the designer of the interface has decided that the two people are objects that may be of interest to the end user. Accordingly, he has included, in the interface data file, information designating these objects as “extractable”.
- This designation information may correspond to x, y co-ordinate range information identifying the position of the object in each video frame (or a subset of frames.
- the interface application program controls the displayed image such that extraneous portions of the displayed frames disappear from the display, leaving only a representation of the two people and their motion, as shown in FIG. 7 B.
- the objects of interest are “extracted” from their surroundings.
- the “missing” or transparent portions of the displayed frames can be restored to the displayed image at the user's demand (e.g. by a further “click” of the mouse button).
- interfaces may be designed such that particular “extractable” objects may be extracted simultaneously with some or all of the other extractable objects, or they may be extracted individually.
- Sophisticated interfaces according to the present invention can incorporate object-extraction routines permitting the user to arbitrarily select objects visible in the displayed view of the root image, for extraction.
- the user may use a pointing device to create a frame around an object visible in a displayed view of the root image and the application program that provides analysis routines permitting identification of the designated object in the other basic frames of the root image (and, if required, in additional frames) so as to cause display of that selected object as if it were located on transparent frames.
- Preferred embodiments of interface according to the invention thus provide a so-called “accordion” effect, as illustrated in FIG. 8 .
- the basic frames in the vicinity of the region of interest are spread so as to provide the user with a better view.
- the function of displaying additional frames so as to increase detail is inhibited during the “accordion” effect.
- FIG. 9 illustrates an interactive video icon derived from a simple example of such a root image.
- the designer may create secondary root images for the respective sub-sequences, these secondary root images being used to generate the displayed image only when the user designates a viewing position close to the video frame where the sub-sequence begins.
- this is a logical choice since it is at the point where the video sub-sequence branches from the main sequence that user choices during playing of the game, or using of the educational software, change the experienced scenario.
- VCR controls which permit the user to playback the video sequence with which the displayed video icon is associated.
- the user can select for playback portions or frames within the sequence by, for example, “clicking” with the mouse button on the frames of interest as displayed in the interactive video icon.
- the video playback can take place on a separate display screen or on a window defined on the display screen displaying the video icon.
- a particular video sequence may be associated with an interface data file and a script.
- the script is a routine defined by the interface designer which leads the user through the use of the interface.
- the script can, for example, consist of a routine to cause an automatic demonstration of the different manipulations possible of the displayed quasi-object.
- the user can alter the running of the script in the usual way, for example by pausing it, slowing it down, etc.
- the script may, if desired, include additional text, sound or graphic information which can be reproduced in association with the displayed view of the root image either automatically or in response to operations performed by the end user.
- Script functionality according to the present invention allows creation and editing of viewing scenarios that may be subsequently be played, in part or in whole, automatically, or interactively with user inputs. For example, in a completely automatic mode, the user can cause the scenario to begin to play by itself and take the user through the scenario and any associated information by simply reading the scenario and changing the view. In other situations the script may call for interaction by the user, such as to initiate a transaction. In this case the user may be asked to specify information, e.g. if he wants to purchase the video or any other items associated with what has been viewed.
- tags which when activated by the user will cause some information to be displayed on the display device; e.g. associated text, graphics, video, or sound files which are played through the speakers of the display device.
- tags are attached to objects selected and extracted from the video sequence, such as so-called “hot objects” according to the present invention.
- FIG. 10 is a flow diagram illustrating typical stages in the design of an interface according to the present invention, in the case where a designer is involved. It is to be understood that interfaces according to the present invention can also be generated entirely automatically. It will be noted that the designer's choices affect, notably, the content of the interface data file. It is to be understood, also, that not all of the steps illustrated in FIG. 10 are necessarily required—for example, steps concerning creation of secondary root images can be omitted in the case of a video sequence which is not multithreaded. Similarly, it may be desirable to include in the interface design process certain supplementary steps which are not shown in FIG. 10 . Thus, for example, it is often desirable to include in the interface data file (as indicated in the example of FIG.
- the interface design process can permit, for example, additional video frames to be added to the displayed image and positioned so as to provide a visual representation of the camera motion.
- the information on the characteristics of the video sequence can be determined either automatically (using, known cut-detection techniques and the like) and/or may be specified by the interface designer. It may also be desirable to include in the interface data file information which allows the sequence, or scripting for it, to be indexed and retrieved.
- the interface or sequence is accessed using such information applied according to a traditional method, such as standard database query language or through a browser via a channel of network; the interface data may be downloaded in its entirety or fetched on an as needed basis.
- the present invention provides toolkitd for use by designers wishing to create an interactive video interface according to the present invention.
- These toolkits are preferably implemented as a computer program for running on a general purpose computer.
- the toolkits present the designer with displayed menus and instructions to lead him through a process including steps such as the typical sequence illustrated in FIG. 10 .
- the designer first of all indicates for which video sequence he desires to create an interface, for example by typing in the name of a stored file containing the video sequence information.
- the toolkit accesses this video sequence information for display in a window on the screen for consultation by the designer during the interface design process.
- the designer may make his selection of basic frames/objects for the root image, extractable objects and the like by stepping slowly through the video sequence and, for example, using a mouse to place a cursor on frames or portions of frames which are of interest.
- the toolkit logs the frame number (and x, y locations or regions in a frame, where appropriate) of the frames/frame portions indicated by the designer and associates this positional information with the appropriate parameter being defined.
- the designer is presented with a displayed view of the root image for manipulation so that he may determine whether any changes to the interface data file are required.
- Different versions of the application program can be associated with the interface data file (and script, if present) depending upon the interface functions which are to be supported. Thus, if no script is associated with the interface data file, the application program does not require routines handling the running of scripts. Similarly, if the interface data file does not permit an accordion effect to be performed by the end user then the application program does not need to include routines required for calculating display information for such effects. If the interface designer believes that the end user is likely already to have an application program suitable for running interfaces according to the present invention then he may choose not to package an application program with the interface data file or else to associate with the interface data file merely information which identifies a suitable versions of application program for running this particular interface.
- the present invention has been described above in connection with video sequences stored on CD-ROM. It is to be understood that the present invention can be realized in numerous other applications.
- the content of the interface data file and the elements of the interface which are present at the same location as the end user can vary depending upon the application.
- the user may first download via his telecommunications connections just the interface data file applicable to the sequence. If the user does not already have software suitable for handling manipulation of the interactive video icon then he will also download the corresponding application program. As the user manipulates the interactive video icon, any extra image information that he may require which has not already been downloaded can be downloaded in a dynamic fashion as required.
- This process can be audited according to the present invention if desired.
- the user's interaction with the interface can be audited, and he can interact with the transaction/audit functionality for example to supply any information required by a script which may then be recorded and stored.
- the transaction/audit information can be stored and made available for externally (optional) located auditing and transaction processing facilities/applications.
- the auditing information can be transmitted at the end of a session whereas the transaction information may be performed on-line, i.e. the transaction information is submitted during the session. Real time transmission can also occur according to the present invention, however.
- the interface data frame includes the image information. Some additional image information may also be provided.
- Editors, readers and viewers according to the present invention can be implemented in hardware, hardware/software hybrid, or as software on a dedicated platform, a workstation, a personal computer, or any other hardware.
- Different units implemented in software run on a CPU or graphics boards or other conventional hardware in a conventional manner, and the various storage devices can be general purpose computer storage devices such as magnetic disks, CD-ROMs, DVD, etc.
- the editor connects to a database manager ( 101 ) and selects a video document and any other documents to be included in the interface by using a data chooser unit ( 102 ).
- the database manager may be implemented in various ways; e.g., as a simple file structure or even as a complete multimedia database.
- the data storage ( 100 ) contains the video data and any other information/documents required and can be implemented in various modes; e.g., in a simple stand-alone mode of operation it could be a CD-ROM or in a networked application it could be implemented as a bank of video servers.
- the user operating through the user interaction unit ( 120 ) is first presented a list of available videos or uses a standard database query language to choose the desired video and then chooses any other documents required.
- the video document chosen by the editor is first processed by the activity measure unit ( 103 ).
- the activity measure unit is responsible for computing various parameters related to the motion and changes in the video. This unit typically will implement one of a number of known techniques for measuring changes, e.g., by calculating the statistics of the differences between frames, by tracking objects in motion, or by estimating camera motions by separating foreground and background portions of the image. In other implementations this unit may use motion vector information stored in an MPEG-encoded sequence to detect important frames of activity in the video document.
- the activity measures template store is optional but would contain templates which can be used to calculate the frame ranking measure and could be specified by the user through the user interaction unit.
- the frame ranking measure is derived heuristically from these measures [e.g., by normalizing the values and taking an average of the parameters, and can be tailored for different kinds of sequences (traveling shots, single objects in motion, etc) or applications].
- the editor may choose a pre-defined set of parameters from the activity measures template store ( 108 ) to detect or highlight a specific kind of activity (rapid motion, abrupt changes, accelerations, etc.)
- the frame ranking measures can be employed by the user acting through the user interaction unit on the frame selection unit ( 104 ) to select the frames to be included within the interface. For example, if 10 frames are to be included in the interface then in default mode the 10 frames corresponding to the 10 largest frame making measures are selected for inclusion in the interface. The user can then interactively de-select some of these frames and add other frames.
- the camera motion analysis unit ( 105 ) is an optional unit which typically will implement one of a number of known techniques for measuring camera motion parameters. This information can be used to determine what shape to give to the outer envelope of the interface as shown in FIG. 1C; a default shape, stored in the interface template store ( 116 ) can be chosen. This information may be optionally stored in the FDI file.
- the object selection unit ( 106 A) is responsible for selecting or detecting individual objects in the video document.
- the information regarding the properties of the object may be optionally stored in the FDI file.
- the object extraction and tracking unit ( 106 B) is now responsible for extracting the object of interest from the frame and then tracking it by using known tracking algorithms.
- the algorithms used are either chosen by the user or by default. It is understood that the object selecting, detection, extraction, and tracking process may be highly interactive and that the user may be called upon or choose to intervene in the process a number of times.
- the information about the presence and location of objects may be optionally stored in the FDI file.
- the FDI file can be made available to an external program, for example when the interface editor is associated with an indexing program, the task of which is to attach indexes (identifiers) to the video documents, to portions thereof, or to objects located within the video document.
- the user acting through the user interaction unit ( 120 ) on the interface creation unit ( 109 ) determines the visual layout of the interface.
- Default shapes are stored in the interface template store ( 116 ).
- the user can also choose to vary the spacing of the frames seen on the interface; that is the distance between frames of the interface as perceived on the display unit.
- the user can also insert selections of the extracted and tracked objects from unit ( 106 B) as illustrated in FIG. 7 B. In this case, the corresponding frames are rendered transparent except at the locations of the objects.
- the different pieces of information generated by the units described above are gathered together by the interface creation unit ( 109 ) into an FDI file containing a description of the interface in terms of its layout i.e. shape and structure, the image frame numbers and their positions, and if available, the extracted features the ranking of the frames and the camera motion information. This information is transmitted to the interface effects creation unit ( 117 ).
- the editor can also specify three classes of interface features which serve to convey additional information to the user and which allow the user to interact with the interface.
- the editor performs this specification through the interface effects creation unit ( 117 ).
- the zooming effects creation unit ( 110 ) is used by the editor to specify which frames will be made visible, and also which will be rendered invisible to the user when he moves up closer to the interface (FIG. 5B) so as to view it from a new viewing position.
- the choice of frames to add depends upon factors such as, the distance of the viewing point from the interface, the degree of motion, the degree of scene change, the number of frames that can be made visible and optionally the frame ranking measures calculated by the activity measure unit ( 103 ).
- the editor can choose to use one or more of the default zooming effect templates contained in the zooming effect templates store ( 113 ) and assign these in a differential manner to different parts of the interface; alternatively the editor can choose to modify these templates and apply them differentially to the interface.
- the special effects creation unit ( 111 ) is used by the editor to create special visual effects on the interface.
- One such example is the accordion effect illustrated in FIG. 8 where parts of the interface are compressed and other parts are expanded.
- FIG. 7A and 7B Another example is illustrated in FIG. 7A and 7B where the editor was designated an extractable object and which is then shown in its extracted form; in other words, the background is removed.
- the editor creates the scripts by calling up templates from the specific effects templates store ( 114 ) and instantiating them by defining the positions where the special effect is to take place and by setting the appropriate parameters.
- the script effects creation unit ( 113 ) allows the editor of the interface to build an interface viewing scenario that may be subsequently be played, in part or in whole, automatically, or interactively with user inputs. For example, in a completely automatic mode when the user calls up the interface it begins to play by itself and takes the user through the interface and any associated information by simply reading the scenario and changing the view of the interface.
- the script may call for the user to interact with the interface, e.g. to initiate a transaction. In this case the user may be asked to specify information, e.g. if he wants to purchase the video or any other items associated with the interface.
- the editor may leave visible tags which when activated by the user will cause some information to be displayed on the display device; e.g.
- tags are attached to objects selected and extracted from the video sequence by units 6 A and 6 B and become so-called “hot object.”
- the editor creates the scripts by calling up templates from the script effects templates store ( 115 ) and instantiating them by defining the tag and the locations of the information to be called up.
- the interface effects creation unit ( 117 ) creates 4 files which are passed to the interface database manager ( 118 ) which will store these files either remotely or locally as the case may be: (1) The FDI file, completed by the special effect and script tags, text and graphics which have been added to the interface and which are directly visible to the user. (2) The zoom effect details, scripts and special effects. (3) The application programs (optional) to view the interface; i.e., allow the user to view the interface from different perspectives, traverse the interface, run the script, perform the special effects, or coded information which indicate which application program residing on the users machine can be used to perform these operations. (4) The video sequence and any other associated information (data) required for reading the interface.
- the user/editor can view the interface under construction, according to the current set of parameters, templates and designer preferences, on the interface viewer unit ( 121 ) (presented in FIG. 12 and described below), thus allowing the editor to interactively change its appearance and features.
- the interface viewer unit is then employed to read and interact with the interface.
- the storage units ( 201 ) are remotely located and accessed through the interface database manager ( 202 ) by way of a communication channel or network; depending upon the size and characteristics of the channel and the application the interface data may be loaded in its entirety or fetched on a as need bias.
- the data are then stored in a local memory unit ( 203 ) which may be either a cache memory, a disk store or any other writable storage element.
- the local memory unit ( 203 ) stores the 4 files created by the editor (see above) and in addition a transaction/audit file. In certain cases the applications programs are already resident in the interface viewer unit and so do not need to be transmitted.
- the CPU unit ( 204 ) fetches the application program, deduces which actions need to be performed, and then fetches the relevant interface information contained in the local memory unit ( 203 ). Typically the CPU unit fetches the required application program for the user interaction unit ( 205 ), the navigation unit ( 206 ), and the transaction/audit unit ( 207 ), then interface information is read from the local memory unit ( 203 ) passed to the interface renderer unit ( 208 ) which then calculates how the interface is to appear or be rendered for viewing on the display device ( 209 ).
- the user interacts with the interface through the user interaction unit ( 205 ) to the navigation unit ( 206 ) and all his actions are audited by the transaction/audit unit ( 207 ).
- the user can interact with the transaction/audit unit ( 207 ) for example to supply any information required by the script which is then recorded and stored in the transaction/audit portion of the local memory unit ( 203 ).
- this transaction/audit file or a portion thereof is transmitted by the interface database manager to the appropriate storage unit ( 201 ). This information is then available for externally (optional) located auditing and transaction processing facilities/applications.
- the auditing information is transmitted at the end of the session whereas the transaction information may be performed on-line, i.e. the transaction information is submitted during the session.
- the interface rendered unit ( 208 ) calculates how the interface is to appear or be rendered for viewing on the display device ( 209 ).
- the zoom effects unit ( 210 ) fetches the required application program, reads the zoom effect parameters stored in the local memory store ( 203 ), determines the frames to be dropped or added and supplies this information (including the additional frames if needed) to interface renderer unit ( 208 ) which then calculates how the interface is to appear or be rendered for viewing on the display device ( 209 ).
- the video play effects unit ( 211 ) fetches the required application program, then reads the required video data from the local memory unit ( 203 ) and plays the video on a second display device ( 209 ) or in a new window if only one display device is available.
- the special effects unit ( 212 ) fetches the required application program, reads the locations of the object and the corresponding frames are modified so as to be transparent wherever the objects do not occur; the new frames are passed to interface renderer unit ( 208 ) which then calculates how the interface is to appear or be rendered for viewing on the display device ( 209 ).
- the frames are passed to the video effects unit ( 211 ) which then plays the video on a second display device ( 209 ) or in a new window if only one display device is available.
- the special effects unit fetches the accordion effect store ( 203 ), determines the frames to be dropped or added and calculates parameters stored in the local memory the relative position of all the frames and supplies this information (including the additional frames if needed) to interface renderer unit ( 208 ) which then calculates how the interface is to appear or be rendered for viewing on the display device ( 209 ).
- the script effects unit ( 214 ) fetches the required application program, reads the corresponding portion of the script and the related information required to carry out the portion of the script associated with the tag designated. If the interface is to be played in automatic mode then the script effects unit ( 214 ) fetches the entire script and all the related information required to carry out the script. When needed the zoom effects unit ( 210 ), the video play unit ( 211 ), and the special effects unit ( 212 ) may be called into play. If the script calls for user input such as required for carrying out a transaction, then a new window may be opened on the display device (or on a second display device) where the information is supplied and transmitted to the transaction/audit unit ( 207 ).
- references above to user input or user selection processes cover the use of any input device whatsoever operable by the user including, but not limited to, a keyboard, a mouse (or other pointing device), a touch screen or panel, glove input devices, detectors of eye movements, voice actuated devices, etc.
- references above to “displays” cover the use of numerous different devices such as, but not limited to, conventional monitor screens, liquid crystal displays, etc.
- the respective root images each have a single characteristic feature, such as, giving a visual representation of motion, or giving a visual representation of zoom, or having a multi-threaded structure, etc. It is to be understood that a single root image can combine several of these features, as desired. Similarly, special effects such as object extraction, the accordion effect, etc. have been described separately. Again, it is to be understood that interfaces according to the invention can be designed to permit any desired combination of special effects.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
Claims (41)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/662,481 USRE38401E1 (en) | 1997-01-16 | 2000-09-13 | Interactive video icon with designated viewing position |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR9700423 | 1997-01-16 | ||
FR9700423A FR2758428B1 (en) | 1997-01-16 | 1997-01-16 | INTERACTIVE VIDEO INTERFACES |
US08/887,992 US5963203A (en) | 1997-07-03 | 1997-07-03 | Interactive video icon with designated viewing position |
US09/662,481 USRE38401E1 (en) | 1997-01-16 | 2000-09-13 | Interactive video icon with designated viewing position |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/887,992 Reissue US5963203A (en) | 1997-01-16 | 1997-07-03 | Interactive video icon with designated viewing position |
Publications (1)
Publication Number | Publication Date |
---|---|
USRE38401E1 true USRE38401E1 (en) | 2004-01-27 |
Family
ID=30118812
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/662,481 Expired - Lifetime USRE38401E1 (en) | 1997-01-16 | 2000-09-13 | Interactive video icon with designated viewing position |
Country Status (1)
Country | Link |
---|---|
US (1) | USRE38401E1 (en) |
Cited By (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020041339A1 (en) * | 2000-10-10 | 2002-04-11 | Klaus Diepold | Graphical representation of motion in still video images |
US20020113757A1 (en) * | 2000-12-28 | 2002-08-22 | Jyrki Hoisko | Displaying an image |
US20030115598A1 (en) * | 2001-03-23 | 2003-06-19 | Pantoja William E. | System and method for interactively producing a web-based multimedia presentation |
US20030132955A1 (en) * | 2002-01-16 | 2003-07-17 | Herve Le Floch | Method and device for temporal segmentation of a video sequence |
US20040155898A1 (en) * | 2002-09-19 | 2004-08-12 | Fuji Xerox Co., Ltd. | Image processing system |
US20040255337A1 (en) * | 2003-02-19 | 2004-12-16 | Impatica Inc. | Method of synchronization |
US20070174785A1 (en) * | 2006-01-23 | 2007-07-26 | Paavo Perttula | Mobile communication terminal and method therefore |
US20070260986A1 (en) * | 2006-05-08 | 2007-11-08 | Ge Security, Inc. | System and method of customizing video display layouts having dynamic icons |
US20080141180A1 (en) * | 2005-04-07 | 2008-06-12 | Iofy Corporation | Apparatus and Method for Utilizing an Information Unit to Provide Navigation Features on a Device |
US20080180394A1 (en) * | 2007-01-26 | 2008-07-31 | Samsung Electronics Co., Ltd. | Method for providing graphical user interface for changing reproducing time point and imaging apparatus therefor |
USD577034S1 (en) * | 2006-04-18 | 2008-09-16 | Samsung Electronics Co., Ltd. | Graphic user interface for dislay on a MP3 player |
US20090147010A1 (en) * | 2006-07-04 | 2009-06-11 | George Russell | Generation of video |
US20090199118A1 (en) * | 2008-02-05 | 2009-08-06 | Sivan Sabato | System and Method for Visualization of Time-Based Events |
US7595833B2 (en) | 2005-02-28 | 2009-09-29 | Canon Kabushiki Kaisha | Visualizing camera position in recorded video |
US20090300530A1 (en) * | 2008-05-29 | 2009-12-03 | Telcordia Technologies, Inc. | Method and system for multi-touch-based browsing of media summarizations on a handheld device |
US20090300498A1 (en) * | 2008-05-29 | 2009-12-03 | Telcordia Technologies, Inc. | Method and System for Generating and Presenting Mobile Content Summarization |
US20100013757A1 (en) * | 2006-03-14 | 2010-01-21 | Junichi Ogikubo | Image processing device and image processing method |
US20100073519A1 (en) * | 2008-09-22 | 2010-03-25 | Sony Corporation | Image processing apparatus, image pickup apparatus, image processing method, and computer program |
US20100146392A1 (en) * | 2008-12-10 | 2010-06-10 | Canon Kabushiki Kaisha | Method of selecting a frame from motion video |
US20100169777A1 (en) * | 2008-12-30 | 2010-07-01 | Apple Inc. | Light Table for Editing Digital Media |
US20100169783A1 (en) * | 2008-12-30 | 2010-07-01 | Apple, Inc. | Framework for Slideshow Object |
US20100229121A1 (en) * | 2009-03-09 | 2010-09-09 | Telcordia Technologies, Inc. | System and method for capturing, aggregating and presenting attention hotspots in shared media |
US7848598B2 (en) | 2002-09-19 | 2010-12-07 | Fuji Xerox Co., Ltd. | Image retrieval processing to obtain static image data from video data |
US8161389B1 (en) * | 2007-10-31 | 2012-04-17 | Adobe Systems Incorporated | Authoring tool sharable file format |
WO2013121239A1 (en) * | 2012-02-15 | 2013-08-22 | Thomson Licensing | User interface for depictive video editing |
US8626322B2 (en) | 2008-12-30 | 2014-01-07 | Apple Inc. | Multimedia display based on audio and visual complexity |
US8762890B2 (en) | 2010-07-27 | 2014-06-24 | Telcordia Technologies, Inc. | System and method for interactive projection and playback of relevant media segments onto the facets of three-dimensional shapes |
US20140258869A1 (en) * | 2013-03-08 | 2014-09-11 | Thomas Creative Productions, LLC | Interactive presentation system |
US9032300B2 (en) | 2010-08-24 | 2015-05-12 | Apple Inc. | Visual presentation composition |
US20150142799A1 (en) * | 2013-11-21 | 2015-05-21 | Nokia Corporation | Establishment of an association between an object and a mood media item |
US20180040128A1 (en) * | 2016-08-03 | 2018-02-08 | International Business Machines Corporation | Verification of business processes using spatio-temporal data |
US20210357110A1 (en) * | 2020-05-13 | 2021-11-18 | Cbs Interactive Inc. | Systems and methods for generating consistent user interfaces across multiple computing platforms |
US20220309938A1 (en) * | 2021-03-29 | 2022-09-29 | Panasonic Intellectual Property Management Co., Ltd. | Online video distribution support method and online video distribution support apparatus |
US20230333704A1 (en) * | 2020-12-21 | 2023-10-19 | Vivo Mobile Communication Co., Ltd. | Image display method and apparatus, and electronic device |
US11860931B1 (en) * | 2022-07-11 | 2024-01-02 | Truist Bank | Graphical user interface with insight hub and insight carousel |
US20240380945A1 (en) * | 2023-05-08 | 2024-11-14 | Apple Inc. | Video editing and key-framing animations using accessory on a touch device |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5235375A (en) * | 1990-04-12 | 1993-08-10 | Olympus Optical Co., Ltd. | Focusing position detecting and automatic focusing apparatus with optimal focusing position calculation method |
EP0555028A2 (en) * | 1992-02-06 | 1993-08-11 | International Business Machines Corporation | An apparatus and method for manipulating digital data relating to motion video images |
US5237648A (en) * | 1990-06-08 | 1993-08-17 | Apple Computer, Inc. | Apparatus and method for editing a video recording by selecting and displaying video clips |
EP0590759A1 (en) * | 1992-08-12 | 1994-04-06 | International Business Machines Corporation | System and method for locating video segment boundaries |
US5729471A (en) * | 1995-03-31 | 1998-03-17 | The Regents Of The University Of California | Machine dynamic selection of one video camera/image of a scene from multiple video cameras/images of the scene in accordance with a particular perspective on the scene, an object in the scene, or an event in the scene |
US5745710A (en) * | 1993-05-24 | 1998-04-28 | Sun Microsystems, Inc. | Graphical user interface for selection of audiovisual programming |
US5768447A (en) * | 1996-06-14 | 1998-06-16 | David Sarnoff Research Center, Inc. | Method for indexing image information using a reference model |
-
2000
- 2000-09-13 US US09/662,481 patent/USRE38401E1/en not_active Expired - Lifetime
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5235375A (en) * | 1990-04-12 | 1993-08-10 | Olympus Optical Co., Ltd. | Focusing position detecting and automatic focusing apparatus with optimal focusing position calculation method |
US5237648A (en) * | 1990-06-08 | 1993-08-17 | Apple Computer, Inc. | Apparatus and method for editing a video recording by selecting and displaying video clips |
EP0555028A2 (en) * | 1992-02-06 | 1993-08-11 | International Business Machines Corporation | An apparatus and method for manipulating digital data relating to motion video images |
EP0590759A1 (en) * | 1992-08-12 | 1994-04-06 | International Business Machines Corporation | System and method for locating video segment boundaries |
US5745710A (en) * | 1993-05-24 | 1998-04-28 | Sun Microsystems, Inc. | Graphical user interface for selection of audiovisual programming |
US5729471A (en) * | 1995-03-31 | 1998-03-17 | The Regents Of The University Of California | Machine dynamic selection of one video camera/image of a scene from multiple video cameras/images of the scene in accordance with a particular perspective on the scene, an object in the scene, or an event in the scene |
US5768447A (en) * | 1996-06-14 | 1998-06-16 | David Sarnoff Research Center, Inc. | Method for indexing image information using a reference model |
Non-Patent Citations (28)
Title |
---|
"Advanced Imaging Product Survey: Photo, Document and Video," Advanced Imaging, Oct. 1994, pp. 66-72.* * |
Akutsu, A. et al., "Video Indexing Using Motion Vectors," Proceedings of the SPIE, vol. 1818, No. 3, pp. 1522-1530, Nov. 18, 1992.* * |
Ariki et al., "Extraction of News Articles based on Scene Cut Detection using DCT Clustering," International Conference on Image Processing, Sep. 1996, Lausanne, Switzerland, pp. 847-850.* * |
Ding, W. et al. "Previewing Video Data: Browsing Key Frames Rates Using a Video Slide Show Interface," Proceeding of the Intl. Symposium on Research, 1997.* * |
Hampapur et al., "Digital Video Segmentation," ACM Multimedia '94 Proceedings, ACM Press-1, Oct. 1994, pp. 357-364.* * |
Hirotada, Ueda et al., "Automatic Structure Visualization for Video Editing," Bridges Between Worlds, Amsterdam, Apr. 24-29, 1993; pp. 137-141.* * |
Hoetter, M., "Differential Estimation of the Global Motion Parameters Zoom and Pan," Signal Processing, vol. 16, pp. 249-265, 1989.* * |
Huang, J., et al., "Image Indexing Using Color Correlograms," IEEE Computer Vision and Pattern Recognition Conference, San Juan, Puerto Rico, Jun. 17-19, 1997, pp. 762-768.* * |
Huang, T.S., "Modeling Analysis and Visualization of Nonrigid Object Motion," International Conf. on Pattern Recognition, vol. 1, pp. 361-364, Atlantic City, NJ Jun. 1990.* * |
Kataoka, R., et al. "Architecture and Storage Structure of an Interactive Multimedia Information System," IEICE Transactions on Information and Systems, vol. E78-D, No. 11, Nov. 1, 1995, pp. 1354-1365.* * |
Kobla, V., et al. "Archiving, Indexing, and Retrievel of Video in the Compressed Domain," SPIE Conference on Multimedia Storage and Archiving Systems, Boston, Massachusetts Nov. 18-19, 1996, vol. 2916, pp. 78-89.* * |
Madrane, et al. "Video Representation Tools Using a Unified object and perspective Base Approach," Proccedings of the SPIE, vol. 2420, 1995.* * |
Madrane, N., et al., "Video Presentation Tools Using a Unified Object and Perspective Base Approach," Proceedings of the SPIE, vol. 2420, 1995, pp. 160, 162.* * |
Madrane, N., et al., "Video Representation Tools Using a Unified Object and Perspective Base Approach," Proceedings of the SPIE, vol. 2420, Feb. 9, 1995, pp. 152-163.* * |
Massey, M. et al. "Salient Stills: Process and Practice," IBM Systems Journal, vol. 35, No. 3/04, 1996, pp. 557-573.* * |
Mazest, et al. "Video Representation Tools Using a Unified Object and Perspective Base Approach," IS&T/SPIE Conference on Storage and Perusal for Image Video Databases, San Jose, California, Feb. 1995.* * |
Meng, J. and S. Chang, "CVEPS-A Compressed Video Editing and Parsing System," Proceedings of ACM Multimedia 1996, Boston, Massachusetts, Nov. 18-22, 1996, pp. 43-53.* * |
Mills et al, "A Magnifier Tool for Video Data", ACM CHI '92, PP. 93-98, May 7, 1992.* * |
Pass, G., et al. "Comparing Images Using Color Coherence Vectors," Proceedings of ACM Multimedia 1996, Boston, Massachusetts, Nov. 18-22, 1996, pp. 65-73.* * |
Shio et al., "Segmentation of People in Motion," Proc. IEEE, vol. 79, pp. 325-332, 1991.* * |
Taniguchi, Y., et al., "Panorama Excerpts: Extracting and Packing Panoramas for Video Browsing," Proceedings of the ACM Multimedia, Seattle, Washington, Nov. 9-13, 1997, pp. 427-436.* * |
Teodosio et al., "Salient Video Stills: Content and Context Preserved," Proc. ACM Multimedia 93, California, Aug. 1-6, 1993, pp. 39-47.* * |
Tonomura et al., Content Oriented Visual Interface using Video Icons for Visual Database Systems, IEEE, pp. 68-73, Oct. 6, 1989.* * |
Tonomura, Y., et al. "Content Oriented Visual Interface Using Video Icons for Visual Database Systems," Journal of Visual Languages and Computing, vol. 1, Jan. 1, 1990, pp. 183-198.* * |
Tse et al, "Global Zoom/Pan Estimation and Compensation for Video Compression," Proc. ICASSP, vol. 4, pp. 2725-2728, May 1991.* * |
Yeung, M. et al. "Video Browsing Using Clustering and Scene Transitions on Compressed Sequences," Proceedings, IS&T/SPIE Multimedia Computing and Networking, pp. 399-413 (1995).* * |
Zhang et al, "Developing Power Tools for Video Indexinor and Retrieval," SPIE, vol. 2185, pp. 140-149.* * |
Zhang, HoncJiang et al., "Automatic Partitioning of Full-Motion Video," Multimedia Systems (Springer-Verfaa, 1991) 1, pp.10-28. * |
Cited By (67)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020041339A1 (en) * | 2000-10-10 | 2002-04-11 | Klaus Diepold | Graphical representation of motion in still video images |
US7755566B2 (en) * | 2000-12-28 | 2010-07-13 | Nokia Corporation | Displaying an image |
US20020113757A1 (en) * | 2000-12-28 | 2002-08-22 | Jyrki Hoisko | Displaying an image |
US20030115598A1 (en) * | 2001-03-23 | 2003-06-19 | Pantoja William E. | System and method for interactively producing a web-based multimedia presentation |
US7228502B2 (en) * | 2002-01-16 | 2007-06-05 | Canon Kabushiki Kaisha | Method and device for temporal segmentation of a video sequence |
US20030132955A1 (en) * | 2002-01-16 | 2003-07-17 | Herve Le Floch | Method and device for temporal segmentation of a video sequence |
US7873905B2 (en) * | 2002-09-19 | 2011-01-18 | Fuji Xerox Co., Ltd. | Image processing system |
US7848598B2 (en) | 2002-09-19 | 2010-12-07 | Fuji Xerox Co., Ltd. | Image retrieval processing to obtain static image data from video data |
US20040155898A1 (en) * | 2002-09-19 | 2004-08-12 | Fuji Xerox Co., Ltd. | Image processing system |
US20040255337A1 (en) * | 2003-02-19 | 2004-12-16 | Impatica Inc. | Method of synchronization |
US7595833B2 (en) | 2005-02-28 | 2009-09-29 | Canon Kabushiki Kaisha | Visualizing camera position in recorded video |
US20110167390A1 (en) * | 2005-04-07 | 2011-07-07 | Ingram Dv Llc | Apparatus and method for utilizing an information unit to provide navigation features on a device |
US20080141180A1 (en) * | 2005-04-07 | 2008-06-12 | Iofy Corporation | Apparatus and Method for Utilizing an Information Unit to Provide Navigation Features on a Device |
US7587684B2 (en) * | 2006-01-23 | 2009-09-08 | Nokia Corporation | Mobile communication terminal and method therefore |
US20070174785A1 (en) * | 2006-01-23 | 2007-07-26 | Paavo Perttula | Mobile communication terminal and method therefore |
US20100013757A1 (en) * | 2006-03-14 | 2010-01-21 | Junichi Ogikubo | Image processing device and image processing method |
US8627206B2 (en) * | 2006-03-14 | 2014-01-07 | Sony Corporation | Image processing device and image processing method for displaying images in a spiral form |
USD577034S1 (en) * | 2006-04-18 | 2008-09-16 | Samsung Electronics Co., Ltd. | Graphic user interface for dislay on a MP3 player |
US8756528B2 (en) * | 2006-05-08 | 2014-06-17 | Ascom (Sweden) Ab | System and method of customizing video display layouts having dynamic icons |
US20070260986A1 (en) * | 2006-05-08 | 2007-11-08 | Ge Security, Inc. | System and method of customizing video display layouts having dynamic icons |
US20090147010A1 (en) * | 2006-07-04 | 2009-06-11 | George Russell | Generation of video |
US8405662B2 (en) * | 2006-07-04 | 2013-03-26 | Iti Scotland Limited | Generation of video |
US8875022B2 (en) * | 2007-01-26 | 2014-10-28 | Samsung Electronics Co., Ltd. | Method for providing graphical user interface for changing reproducing time point and imaging apparatus therefor |
US9411497B2 (en) | 2007-01-26 | 2016-08-09 | Samsung Electronics Co., Ltd. | Method for providing graphical user interface for changing reproducing time point and imaging apparatus therefor |
US20080180394A1 (en) * | 2007-01-26 | 2008-07-31 | Samsung Electronics Co., Ltd. | Method for providing graphical user interface for changing reproducing time point and imaging apparatus therefor |
US8161389B1 (en) * | 2007-10-31 | 2012-04-17 | Adobe Systems Incorporated | Authoring tool sharable file format |
US8103966B2 (en) * | 2008-02-05 | 2012-01-24 | International Business Machines Corporation | System and method for visualization of time-based events |
US20090199118A1 (en) * | 2008-02-05 | 2009-08-06 | Sivan Sabato | System and Method for Visualization of Time-Based Events |
US20090300530A1 (en) * | 2008-05-29 | 2009-12-03 | Telcordia Technologies, Inc. | Method and system for multi-touch-based browsing of media summarizations on a handheld device |
US8584048B2 (en) * | 2008-05-29 | 2013-11-12 | Telcordia Technologies, Inc. | Method and system for multi-touch-based browsing of media summarizations on a handheld device |
US8171410B2 (en) | 2008-05-29 | 2012-05-01 | Telcordia Technologies, Inc. | Method and system for generating and presenting mobile content summarization |
US20090300498A1 (en) * | 2008-05-29 | 2009-12-03 | Telcordia Technologies, Inc. | Method and System for Generating and Presenting Mobile Content Summarization |
US8421871B2 (en) * | 2008-09-22 | 2013-04-16 | Sony Corporation | Method and apparatus for image pickup and image processing |
US20100073519A1 (en) * | 2008-09-22 | 2010-03-25 | Sony Corporation | Image processing apparatus, image pickup apparatus, image processing method, and computer program |
US8959436B2 (en) * | 2008-12-10 | 2015-02-17 | Canon Kabushiki Kaisha | Method of selecting a frame from motion video |
US20100146392A1 (en) * | 2008-12-10 | 2010-06-10 | Canon Kabushiki Kaisha | Method of selecting a frame from motion video |
US20100169777A1 (en) * | 2008-12-30 | 2010-07-01 | Apple Inc. | Light Table for Editing Digital Media |
US20100169783A1 (en) * | 2008-12-30 | 2010-07-01 | Apple, Inc. | Framework for Slideshow Object |
US8626322B2 (en) | 2008-12-30 | 2014-01-07 | Apple Inc. | Multimedia display based on audio and visual complexity |
US8621357B2 (en) * | 2008-12-30 | 2013-12-31 | Apple Inc. | Light table for editing digital media |
US8832555B2 (en) * | 2008-12-30 | 2014-09-09 | Apple Inc. | Framework for slideshow object |
US8296675B2 (en) | 2009-03-09 | 2012-10-23 | Telcordia Technologies, Inc. | System and method for capturing, aggregating and presenting attention hotspots in shared media |
US20100229121A1 (en) * | 2009-03-09 | 2010-09-09 | Telcordia Technologies, Inc. | System and method for capturing, aggregating and presenting attention hotspots in shared media |
US8762890B2 (en) | 2010-07-27 | 2014-06-24 | Telcordia Technologies, Inc. | System and method for interactive projection and playback of relevant media segments onto the facets of three-dimensional shapes |
US9032300B2 (en) | 2010-08-24 | 2015-05-12 | Apple Inc. | Visual presentation composition |
WO2013121239A1 (en) * | 2012-02-15 | 2013-08-22 | Thomson Licensing | User interface for depictive video editing |
US9830059B2 (en) | 2013-03-08 | 2017-11-28 | Thomas Creative Group Productions, Llc | Interactive presentation system |
US20210397331A1 (en) * | 2013-03-08 | 2021-12-23 | Thomas Creative Group Productions, Llc | Interactive presentation system |
US9753619B2 (en) * | 2013-03-08 | 2017-09-05 | Thomas Creative Group Productions, Llc | Interactive presentation system |
US20140258869A1 (en) * | 2013-03-08 | 2014-09-11 | Thomas Creative Productions, LLC | Interactive presentation system |
US11550458B2 (en) * | 2013-03-08 | 2023-01-10 | Trustees Of The Thomas Family Trust | Interactive presentation system |
US11126335B2 (en) * | 2013-03-08 | 2021-09-21 | Trustees Of The Thomas Family Trust | Interactive presentation system |
US10409466B2 (en) * | 2013-03-08 | 2019-09-10 | Thomas Creative Group Productions, Llc | Interactive presentation system |
US10599310B2 (en) * | 2013-03-08 | 2020-03-24 | Thomas Creative Group Productions, Llc | Interactive presentation system |
US20150142799A1 (en) * | 2013-11-21 | 2015-05-21 | Nokia Corporation | Establishment of an association between an object and a mood media item |
US10754888B2 (en) * | 2013-11-21 | 2020-08-25 | Nokia Technologies Oy | Establishment of an association between an object and a mood media item |
US20180040128A1 (en) * | 2016-08-03 | 2018-02-08 | International Business Machines Corporation | Verification of business processes using spatio-temporal data |
US10049456B2 (en) * | 2016-08-03 | 2018-08-14 | International Business Machines Corporation | Verification of business processes using spatio-temporal data |
US20210357110A1 (en) * | 2020-05-13 | 2021-11-18 | Cbs Interactive Inc. | Systems and methods for generating consistent user interfaces across multiple computing platforms |
US20230333704A1 (en) * | 2020-12-21 | 2023-10-19 | Vivo Mobile Communication Co., Ltd. | Image display method and apparatus, and electronic device |
US20220309938A1 (en) * | 2021-03-29 | 2022-09-29 | Panasonic Intellectual Property Management Co., Ltd. | Online video distribution support method and online video distribution support apparatus |
US11860931B1 (en) * | 2022-07-11 | 2024-01-02 | Truist Bank | Graphical user interface with insight hub and insight carousel |
US20240086456A1 (en) * | 2022-07-11 | 2024-03-14 | Truist Bank | Graphical user interface with insight hub and insight carousel |
US20240176813A1 (en) * | 2022-07-11 | 2024-05-30 | Truist Bank | Graphical user interface with insight hub and insight carousel |
US20240380945A1 (en) * | 2023-05-08 | 2024-11-14 | Apple Inc. | Video editing and key-framing animations using accessory on a touch device |
US12231715B2 (en) * | 2023-05-08 | 2025-02-18 | Apple Inc. | Video editing and key-framing animations using accessory on a touch device |
US12262077B2 (en) | 2023-05-08 | 2025-03-25 | Apple Inc. | User interfaces for video editing application on touch screen device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5963203A (en) | Interactive video icon with designated viewing position | |
USRE38401E1 (en) | Interactive video icon with designated viewing position | |
US7181757B1 (en) | Video summary description scheme and method and system of video summary description data generation for efficient overview and browsing | |
EP2127368B1 (en) | Concurrent presentation of video segments enabling rapid video file comprehension | |
US7421455B2 (en) | Video search and services | |
CN103702039B (en) | image editing apparatus and image editing method | |
EP1960990B1 (en) | Voice and video control of interactive electronically simulated environment | |
KR100464076B1 (en) | Video browsing system based on keyframe | |
KR101967036B1 (en) | Methods, systems, and media for searching for video content | |
US20080159708A1 (en) | Video Contents Display Apparatus, Video Contents Display Method, and Program Therefor | |
US20110093798A1 (en) | Automated Content Detection, Analysis, Visual Synthesis and Repurposing | |
EP1006459A2 (en) | Content-based video story browsing system | |
US20040125124A1 (en) | Techniques for constructing and browsing a hierarchical video structure | |
CN101398843B (en) | Device and method for browsing video summary description data | |
US20040012621A1 (en) | Hyper-media information providing method, hyper-media information providing program and hyper-media information providing apparatus | |
KR101440168B1 (en) | A method for generating a new overview of an audiovisual document that already includes an overview and report and a receiver capable of implementing the method | |
JP3579111B2 (en) | Information processing equipment | |
Lee et al. | Implementation and analysis of several keyframe-based browsing interfaces to digital video | |
JP2001175380A (en) | Information index display method and device | |
KR100319160B1 (en) | How to search video and organize search data based on event section | |
US20010033302A1 (en) | Video browser data magnifier | |
JP2001306579A (en) | Device and method for retrieving information and computer-readable recording medium recorded with program for computer to execute the same method | |
Brachmann et al. | Keyframe-less integration of semantic information in a video player interface | |
KR100518846B1 (en) | Video data construction method for video browsing based on content | |
KR101417614B1 (en) | Interaction method of object-oriented multi-media |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
FEPP | Fee payment procedure |
Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
AS | Assignment |
Owner name: SONY CORPORATION, JAPAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:OBVIOUS TECHNOLOGY, INC.;REEL/FRAME:016651/0895 Effective date: 20050523 |
|
AS | Assignment |
Owner name: SONY CORPORATION, JAPAN Free format text: RECORDATION COVER SHEET RECORDED AT REEL 016651 FRAME 0895 CONTAINED AN INCORRECT ADDRESS FOR ASSIGNEE. ASSIGNMENT AND RECORDATION COVER SHEET ARE RE-RECORDED TO CORRECT ERROR ON STATED REEL.;ASSIGNOR:OBVIOUS TECHNOLOGY, INC.;REEL/FRAME:016996/0988 Effective date: 20050523 |
|
FEPP | Fee payment procedure |
Free format text: PAT HOLDER NO LONGER CLAIMS SMALL ENTITY STATUS, ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: STOL); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
REFU | Refund |
Free format text: REFUND - PAYMENT OF MAINTENANCE FEE, 8TH YR, SMALL ENTITY (ORIGINAL EVENT CODE: R2552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
REMI | Maintenance fee reminder mailed | ||
FPAY | Fee payment |
Year of fee payment: 8 |
|
SULP | Surcharge for late payment |
Year of fee payment: 7 |
|
FEPP | Fee payment procedure |
Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
FPAY | Fee payment |
Year of fee payment: 12 |