US20080065697A1 - Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method - Google Patents


Info

Publication number
US20080065697A1
Authority
US
United States
Prior art keywords
metadata
scene
unit
content
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/980,544
Inventor
Yoshimi Moriya
Hirofumi Nishikawa
Shunichi Sekiguchi
Kohtaro Asai
Yoshihisa Yamada
Kenji Otoi
Shinichi Kuroda
Fuminobu Ogawa
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/8543Content authoring using a description language, e.g. Multimedia and Hypermedia information coding Expert Group [MHEG], eXtensible Markup Language [XML]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7847Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content
    • G06F16/785Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content using colour or luminescence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1415Saving, restoring, recovering or retrying at system level
    • G06F11/1435Saving, restoring, recovering or retrying at system level using file system or storage system metadata
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/71Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7847Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content
    • G06F16/786Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content using motion, e.g. object motion or camera motion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34Indicating arrangements 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234336Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by media transcoding, e.g. video is transformed into a slideshow of still pictures or audio is converted into text
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234363Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the spatial resolution, e.g. for clients with a lower screen resolution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/466Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • H04N21/4668Learning process for intelligent management, e.g. learning user preferences for recommending movies for recommending content, e.g. movies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/84Generation or processing of descriptive data, e.g. content descriptors
    • H04N21/8405Generation or processing of descriptive data, e.g. content descriptors represented by keywords
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments

Definitions

  • a conventional moving picture management apparatus which includes: a means for dividing a video into multiple scenes and editing and creating at least one index as an assembly of section information necessary for reproduction of each scene, a scene number assigned to each scene, and a representative image of each scene; means for giving a title to each index for the purpose of searching; and means for searching for a desired index using a corresponding title and successively reproducing scenes of the index in order of the scene numbers.
  • a metadata editing apparatus is provided with: a scene division unit for dividing multimedia content containing at least one of moving pictures and audio into a plurality of scenes to generate scene section information metadata indicating a scene start position and a scene end position for each scene obtained as a result of the division; a scene description edit unit for performing hierarchical editing of each scene of the multimedia content based on the scene section information metadata sent from the scene division unit and generating scene structure information metadata describing a hierarchical structure of the multimedia content; and a metadata description unit for integrating the scene section information metadata and the scene structure information metadata and generating metadata describing contents and a structure of the multimedia content in accordance with a predetermined format.
  • a metadata delivery method includes the steps of: analyzing metadata optimization hint information describing a type of each descriptor contained in metadata; re-generating second metadata by analyzing the metadata describing contents and a structure of multimedia content containing at least one of moving pictures and audio based on the analyzed metadata optimization hint information and a condition for re-generation of the metadata; and delivering the re-generated second metadata to a client terminal.
  • FIG. 1 is a block diagram showing a construction of a metadata editing apparatus according to a first embodiment of the present invention.
  • FIG. 3 shows an example of scene section information metadata of a scene division unit of the metadata editing apparatus according to the first embodiment of the present invention.
  • FIG. 5 shows examples of screen images of a content reproduction/display unit and a user input unit of the metadata editing apparatus according to the first embodiment of the present invention.
  • FIG. 6 is a block diagram showing a construction of a metadata editing apparatus according to a second embodiment of the present invention.
  • FIG. 8 is a block diagram showing a construction of a metadata reproduction apparatus according to a third embodiment of the present invention.
  • FIG. 9 illustrates how the metadata reproduction apparatus according to the third embodiment of the present invention operates.
  • FIG. 10 is a block diagram showing a construction of a content delivery system according to a fourth embodiment of the present invention.
  • FIG. 11 shows content (in this case, a news video) structure information outputted from a metadata analysis unit of a metadata delivery server according to the fourth embodiment of the present invention.
  • FIG. 13 is a block diagram showing a construction of a metadata delivery server according to a fifth embodiment of the present invention.
  • FIG. 14 shows an example of video content, with reference to which processing of metadata optimization hint information by the metadata delivery server according to the fifth embodiment of the present invention is described.
  • FIG. 15 shows how metadata is described in MPEG-7 by the metadata delivery server according to the fifth embodiment of the present invention.
  • FIG. 17 shows the metadata optimization hint information used by the metadata delivery server according to the fifth embodiment of the present invention.
  • FIG. 18 is a flowchart showing how a metadata analysis/re-generation unit of the metadata delivery server according to the fifth embodiment of the present invention operates.
  • FIG. 20 is a block diagram showing a construction of a metadata search server according to a sixth embodiment of the present invention.
  • FIG. 21 is a flowchart showing how a metadata analysis unit of the metadata search server according to the sixth embodiment of the present invention operates.
  • FIG. 22 is a block diagram showing a construction of a client terminal according to a seventh embodiment of the present invention.
  • FIG. 23 is a block diagram showing a construction of a content delivery server according to an eighth embodiment of the present invention.
  • Embodiments of the present invention will now be described with reference to the accompanying drawings, with a first embodiment relating to a metadata editing apparatus, a second embodiment relating to another metadata editing apparatus, a third embodiment relating to a metadata reproduction apparatus, a fourth embodiment relating to a content delivery system, a fifth embodiment relating to a metadata delivery server, a sixth embodiment relating to a metadata search server, a seventh embodiment relating to a client terminal, and an eighth embodiment relating to a content delivery server.
  • FIG. 1 is a block diagram showing a construction of the metadata editing apparatus according to the first embodiment of the present invention. Note that in each drawing, the same reference numerals denote the same or equivalent portions.
  • the content reproduction/display unit 2 reproduces and displays multimedia content 10 that includes video data and audio data and is a target of editing.
  • the scene division unit 3 divides the content into multiple scenes.
  • the thumbnail image generation unit 4 extracts a representative frame of each scene as a thumbnail image.
  • the scene description edit unit 5 hierarchically edits the scenes obtained as a result of the division by the scene division unit 3 through scene grouping, scene combining, scene deletion, generation of information that shows relations among the scenes, and the like.
  • the text information giving unit 6 gives various types of text information to each scene.
  • the characteristic extraction unit 7 extracts characteristics of each scene.
  • the user input unit 8 receives input of designation information from a user and outputs it to the content reproduction/display unit 2, the scene division unit 3, the thumbnail image generation unit 4, the scene description edit unit 5, and the text information giving unit 6 as user input information 11.
  • the metadata description unit 9 integrates scene section information metadata 12, scene thumbnail image information metadata 13, scene structure information metadata 14, text information metadata 15, and characteristic description metadata 16 outputted from the scene division unit 3, the thumbnail image generation unit 4, the scene description edit unit 5, the text information giving unit 6, and the characteristic extraction unit 7, respectively.
  • the metadata description unit 9 then generates metadata 17 describing the contents and structure of the multimedia content in accordance with a specified format.
  • FIG. 2 shows a construction of a news video that is an example of a target of editing by the metadata editing apparatus according to the first embodiment.
  • the content reproduction/display unit 2 of the metadata editing apparatus 100 receives input of the multimedia content 10 , such as video content, stored in a content storage unit (not shown) via a network or the like, and reproduces/displays the multimedia content 10 for editing.
  • When the user of the metadata editing apparatus 100 inputs positions for clipping a scene, that is, a scene start position and a scene end position, using the user input unit 8 while watching the reproduced video, the scene division unit 3 generates the scene section information metadata 12 showing the scene start position and the scene end position input by the user.
  • FIG. 3 shows an example of the scene section information metadata generated by the scene division unit of the metadata editing apparatus according to the first embodiment.
  • the scene section information metadata 12 shown in FIG. 3 was generated from the news video shown in FIG. 2 .
  • the scene section information metadata 12 generated by the scene division unit 3 gives the scene start position and the scene end position of each scene clipped from the news video content, such as a “news digest” scene, a “domestic news” scene, and an “international news” scene.
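The scene section information metadata described above can be pictured as a list of records, one per clipped scene. The following is a minimal sketch only: the class name `SceneSection`, the helper `clip_scene`, the use of frame numbers as positions, and all numeric values are assumptions made for illustration and do not come from FIG. 3.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SceneSection:
    """One entry of scene section information metadata (hypothetical layout)."""
    title: str
    start_frame: int  # scene start position
    end_frame: int    # scene end position (inclusive)

    def __post_init__(self):
        # A scene cannot end before it starts.
        if self.start_frame < 0 or self.end_frame < self.start_frame:
            raise ValueError("scene end must not precede scene start")

    @property
    def length(self) -> int:
        """Number of frames covered by the scene."""
        return self.end_frame - self.start_frame + 1


def clip_scene(title, start_frame, end_frame):
    """Mimic the scene division unit: turn designated positions into metadata."""
    return SceneSection(title, start_frame, end_frame)


# Frame numbers below are invented for illustration only.
sections = [
    clip_scene("news digest", 0, 899),
    clip_scene("domestic news", 900, 2699),
    clip_scene("international news", 2700, 4499),
]
```

Storing the positions as an immutable record keeps later units (thumbnail generation, characteristic extraction, text annotation) from accidentally altering the section boundaries they all depend on.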
  • On receiving designation of scene editing from the user via the user input unit 8, the scene description edit unit 5 performs hierarchical editing of the scenes continuously clipped by the scene division unit 3 based on the scene section information metadata 12 from the scene division unit 3, and then outputs the scene structure information metadata 14.
  • the scene hierarchical editing refers to scene grouping, scene re-division, scene combining, or scene deletion, for instance.
  • the scene grouping refers to grouping of scenes that are related to each other with respect to specific characteristics into a single group. For instance, as shown in FIG. 4 , the “domestic news” scene, the “international news” scene, and the “financial news” scene of the news video shown in FIG. 2 are grouped into a single “news” group.
  • the scene re-division refers to division of a single scene into multiple scenes.
  • the scene combining refers to generation of a single scene by combining multiple scenes with each other.
  • FIG. 4 shows an example of the scene structure information metadata generated by the scene description edit unit of the metadata editing apparatus according to the first embodiment.
  • the scene structure information metadata 14 shown in FIG. 4 describes the hierarchical structure of the video content generated as a result of the editing by the scene description edit unit 5 .
  • a “news” scene is edited into multiple scenes, such as a “news digest” scene, a “news” scene, a “special” scene, and a “sports” scene, and the “news” scene is further hierarchically edited into a “domestic news” scene, an “international news” scene, and a “financial news” scene by the scene description edit unit 5 through scene editing such as the scene grouping, scene re-division, and scene combining.
  • the metadata 14 generated by the scene description edit unit 5, such as the metadata shown in FIG. 4, is outputted to the metadata description unit 9.
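The scene grouping operation and the resulting hierarchy can be sketched as a small tree structure. This is an illustration under stated assumptions, not the apparatus's actual data model: the names `SceneNode`, `group_scenes`, and `outline`, and the root title "news video", are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class SceneNode:
    """A node of the scene hierarchy (hypothetical structure)."""
    title: str
    children: List["SceneNode"] = field(default_factory=list)

    def find(self, title: str) -> Optional["SceneNode"]:
        """Depth-first search for the first node with the given title."""
        if self.title == title:
            return self
        for child in self.children:
            hit = child.find(title)
            if hit is not None:
                return hit
        return None


def group_scenes(parent: SceneNode, titles, group_title: str) -> SceneNode:
    """Scene grouping: move the named children of `parent` under a new node.

    The new group node takes the position of its first member.
    """
    members = [c for c in parent.children if c.title in titles]
    if not members:
        raise ValueError("no scenes to group")
    position = parent.children.index(members[0])
    parent.children = [c for c in parent.children if c.title not in titles]
    group = SceneNode(group_title, members)
    parent.children.insert(position, group)
    return group


def outline(node: SceneNode, depth: int = 0) -> List[str]:
    """Render the tree as indented lines, one per scene."""
    lines = ["  " * depth + node.title]
    for child in node.children:
        lines.extend(outline(child, depth + 1))
    return lines


# Flat list of clipped scenes, then grouped as in the news example.
root = SceneNode("news video", [
    SceneNode("news digest"),
    SceneNode("domestic news"),
    SceneNode("international news"),
    SceneNode("financial news"),
    SceneNode("special"),
    SceneNode("sports"),
])
group_scenes(root, {"domestic news", "international news", "financial news"}, "news")
```

Calling `outline(root)` afterwards lists "news digest", "news" (with its three sub-scenes indented beneath it), "special", and "sports", which mirrors the hierarchy described for the news video.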
  • the thumbnail image generation unit 4 generates a representative frame of each scene clipped by the scene division unit 3 as a thumbnail image based on the scene section information metadata 12 from the scene division unit 3 , and outputs information concerning the generated thumbnail image as the thumbnail image information metadata 13 to the metadata description unit 9 , in which the thumbnail image information metadata 13 is registered.
  • it is possible for the user to select the thumbnail using the user input unit 8, although it is also possible to automatically set a head frame or each frame clipped at fixed time intervals as the representative frame, or to automatically detect each scene change point and set a frame at each detected point as the representative frame.
  • the thumbnail image information metadata 13 is information showing the position (such as the frame number or time) of the thumbnail in the video content or information giving the location (such as the URL) of the thumbnail image.
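Two of the automatic choices of representative frame mentioned above (the head frame, and frames clipped at fixed intervals) are simple enough to sketch. The function names, the dictionary keys, and the 30 frames-per-second default are assumptions for illustration; the text does not prescribe any of them.

```python
def head_frame(start_frame, end_frame):
    """Use the first frame of the scene as the representative frame."""
    return start_frame


def frames_at_fixed_interval(start_frame, end_frame, step):
    """Pick every `step`-th frame of the scene, starting from the head frame."""
    if step <= 0:
        raise ValueError("step must be positive")
    return list(range(start_frame, end_frame + 1, step))


def thumbnail_metadata(scene_title, frame_number, fps=30.0, url=None):
    """Build thumbnail image information metadata (hypothetical layout).

    The entry records the thumbnail's position in the content (frame number
    and time) and, when known, the location of the thumbnail image.
    """
    entry = {
        "scene": scene_title,
        "frame": frame_number,
        "time_sec": frame_number / fps,
    }
    if url is not None:
        entry["url"] = url
    return entry


# Invented scene boundaries: frames 900 to 2699.
candidates = frames_at_fixed_interval(900, 2699, 600)
meta = thumbnail_metadata("domestic news", head_frame(900, 2699))
```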
  • the characteristic extraction unit 7 extracts visual characteristic amounts possessed by each scene, such as motions, colors, or shapes of objects contained in the scene, from the scene based on the scene section information metadata 12 from the scene division unit 3 .
  • the extracted characteristic amounts are outputted to the metadata description unit 9 as the characteristic description metadata 16 and are registered therein.
  • the text information giving unit 6 gives various types of text information, such as a title, an abstract, a keyword, a comment, and scene importance, designated by the user to each scene based on the scene section information metadata 12 from the scene division unit 3 .
  • the text information may be given through the user's input using the user input unit 8 or be given automatically through analysis of audio information and captions contained in the content.
  • the text information is outputted to the metadata description unit 9 and is registered therein as the text information metadata 15 .
  • FIG. 5 shows examples of screen images displayed by the content reproduction/display unit 2 and the user input unit 8 of the metadata editing apparatus according to the first embodiment.
  • a video reproduction screen G 1 is an example of the screen image displayed by the content reproduction/display unit 2 , with content to be edited being reproduced/displayed on this video reproduction screen G 1 .
  • a user interface is also provided which includes buttons and the like for commanding “reproduction”, “stop”, “rewind”, “fast forward”, “frame advance”, and other operations.
  • a scene division designation screen G 2 is displayed which has a slider form, for instance.
  • the user designates a scene start position and a scene end position of the video displayed on the video reproduction screen G 1 through this scene division designation screen G 2 while watching the video displayed on the video reproduction screen G 1 . Also, the user simultaneously designates the position of a thumbnail between the scene start position and the scene end position through the scene division designation screen G 2 .
  • the thumbnail image generation unit 4 generates a thumbnail image from a frame of the video content at the designated position.
  • the thumbnail image, whose position has been designated through the scene division designation screen G 2, is displayed on a scene division information display screen G 3 as scene division information.
  • on this scene division information display screen G 3, it is also possible to display information showing the scene start position and the scene end position in addition to the thumbnail image, as shown in FIG. 3.
  • the user designates scene editing through a tree structure generation designation/display screen G 4 . That is, the user generates a tree showing the hierarchical structure possessed by the video content while watching the scene division information, such as the thumbnail image, displayed on the scene division information display screen G 3 .
  • the user uses a manipulation method with which, for instance, a new node is added to the tree and each scene that should be grouped is added to the node.
  • the user may use a method with which a scene that should be added is selected on the scene division information display screen G 3 and the selected scene is added to the node by a drag-and-drop operation.
  • the user may input text information for the selected scene from the scene division information display screen G 3 or the tree structure generation designation/display screen G 4 using the user input unit 8 provided as a user interface for giving the text information to the scene via the text information giving unit 6 .
  • the metadata description unit 9 generates a metadata file described in accordance with a specified description format by integrating the various types of metadata outputted from the scene division unit 3 , the thumbnail image generation unit 4 , the scene description edit unit 5 , the text information giving unit 6 , and the characteristic extraction unit 7 .
  • the specified metadata description format may be a uniquely determined format, although MPEG-7 standardized by ISO is used in this first embodiment.
  • the MPEG-7 stipulates a format for describing the structure and characteristics of content and includes an XML file format and a binary format.
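The integration step performed by the metadata description unit can be illustrated by serializing a scene hierarchy, with its section, thumbnail, and text information, into one XML document. This is a sketch only: the element and attribute names (`Scene`, `Section`, `Thumbnail`, `Keyword`) are invented for illustration and are not the MPEG-7 schema, and the input dictionary layout is likewise an assumption.

```python
import xml.etree.ElementTree as ET


def scene_to_element(scene):
    """Convert one scene dict (and its sub-scenes) into an XML element."""
    elem = ET.Element("Scene", {"title": scene["title"]})
    # Scene section information: start and end positions.
    if "start" in scene and "end" in scene:
        ET.SubElement(elem, "Section",
                      {"start": str(scene["start"]), "end": str(scene["end"])})
    # Thumbnail image information: position of the representative frame.
    if "thumbnail_frame" in scene:
        ET.SubElement(elem, "Thumbnail", {"frame": str(scene["thumbnail_frame"])})
    # Text information: keywords attached to the scene.
    for keyword in scene.get("keywords", []):
        ET.SubElement(elem, "Keyword").text = keyword
    # Scene structure information: nested sub-scenes.
    for child in scene.get("children", []):
        elem.append(scene_to_element(child))
    return elem


def describe(root_scene):
    """Integrate all per-scene metadata into a single XML string."""
    return ET.tostring(scene_to_element(root_scene), encoding="unicode")


# Invented example content.
content = {
    "title": "news video",
    "children": [
        {"title": "news digest", "start": 0, "end": 899, "thumbnail_frame": 0},
        {"title": "news", "children": [
            {"title": "domestic news", "start": 900, "end": 2699,
             "keywords": ["domestic"]},
        ]},
    ],
}
xml_text = describe(content)
```

A real implementation targeting MPEG-7 would follow the element names and data types defined by that standard; the nesting of sub-scenes inside their parent scene is the only aspect this sketch is meant to convey.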
  • the metadata editing apparatus 100 of the first embodiment is provided with the scene description edit unit 5 for hierarchically editing scenes and the characteristic extraction unit 7 for extracting characteristics from the scenes, so that it becomes possible to generate metadata describing the hierarchical structure possessed by content, such as video data, and characteristic amounts of each scene.
  • the multimedia content 10 inputted into the content reproduction/display unit 2 is obtained from a content server (not shown) existing on a network, from a content storage unit (not shown) in the metadata editing apparatus 100 , or from an accumulation medium (not shown) such as a CD or a DVD, for instance.
  • the metadata outputted from the metadata description unit 9 is accumulated in a metadata server (not shown) existing on a network, in a metadata accumulation unit (not shown) in the metadata editing apparatus, or in an accumulation medium (not shown), such as a CD or a DVD, together with content, for instance.
  • the metadata editing apparatus 100 is provided with both of the scene description edit unit 5 and the characteristic extraction unit 7 .
  • the present invention is not limited to this and it is of course possible to provide the metadata editing apparatus 100 with only one of the scene description edit unit 5 and the characteristic extraction unit 7 .
  • a metadata editing apparatus to be described in this second embodiment is provided with a scene change detection unit for automatically detecting each scene change point.
  • FIG. 6 is a block diagram showing a construction of the metadata editing apparatus according to the second embodiment of the present invention.
  • a metadata editing apparatus 100 A includes a content reproduction/display unit 2, a scene division unit 3, a thumbnail image generation unit 4, a scene description edit unit 5, a text information giving unit 6, a characteristic extraction unit 7, a user input unit 8, a metadata description unit 9, and a scene change detection unit 39.
  • reference numeral 40 denotes scene start position information which is automatically detected.
  • FIG. 7 illustrates how the metadata editing apparatus according to the second embodiment of the present invention operates.
  • the scene change detection unit 39 automatically detects each scene change/cut point. This scene change detection is performed based on a difference in pixel between frames, a difference in color between the frames, a difference in luminance histogram between the frames, or the like, for instance.
  • the scene division unit 3 determines a scene start position and a scene end position based on each scene change point detected by the scene change detection unit 39 .
  • the scene change detection unit 39 calculates a color histogram for each frame.
  • as the color system, HSV, RGB, YCbCr, and the like are available, although an HSV color space is used in this example.
  • This HSV color space is composed of three elements called “hue (H)”, “saturation (S)”, and “value (V)”.
  • a histogram of each element is calculated.
  • a difference in histogram between frames is calculated based on Equation 1 given below, for instance.
  • it is assumed that frames from a scene start frame to the Nth frame (N = 3, for instance) belong to the same scene, that is, do not contain any scene change point.
  • each frame, from the (N+1)th frame onward, that has an inter-frame difference in histogram greater than “mean + λ·sd” is regarded as a scene change point and is set as a new scene start position candidate.
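  • The thresholding described above can be sketched as follows. This is a minimal illustration, not the patent's Equation 1: the sum of absolute bin differences, the single-channel histogram (the text computes one histogram per H, S, and V element), the multiplier `lam`, and all function names are assumptions made for the example. The mean and standard deviation are taken once from the first N inter-frame differences and are not updated afterwards.

```python
import math

def histogram(frame, bins=4, max_value=256):
    """Count the pixel values of one frame (a flat list of ints) into equal-width bins."""
    counts = [0] * bins
    for value in frame:
        counts[value * bins // max_value] += 1
    return counts

def hist_diff(h1, h2):
    """Assumed difference metric: sum of absolute bin differences."""
    return sum(abs(a - b) for a, b in zip(h1, h2))

def detect_scene_changes(frames, n=3, lam=3.0):
    """Return indices of frames regarded as scene change points.

    The first n inter-frame differences are assumed to lie inside one scene
    and supply the mean and standard deviation used for the threshold.
    """
    hists = [histogram(f) for f in frames]
    diffs = [hist_diff(a, b) for a, b in zip(hists, hists[1:])]
    initial = diffs[:n]
    mean = sum(initial) / len(initial)
    sd = math.sqrt(sum((d - mean) ** 2 for d in initial) / len(initial))
    threshold = mean + lam * sd
    # diffs[i] compares frame i with frame i + 1, so a hit marks frame i + 1.
    return [i + 1 for i, d in enumerate(diffs) if i >= n and d > threshold]

# Four dark frames followed by two bright frames (toy data).
frames = [
    [10, 20, 30, 40],
    [10, 20, 30, 70],
    [10, 20, 30, 40],
    [10, 20, 30, 40],
    [200, 210, 220, 230],
    [200, 210, 220, 230],
]
changes = detect_scene_changes(frames)
```

  • Each index returned here is only a scene start position candidate; the template matching described next decides which candidates are registered.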
  • an image having a predetermined pattern, such as an image composed of an announcer, a studio set in the background, and a character description (caption), is inserted at switching between news items, for instance.
  • the image having the predetermined pattern (hereinafter referred to as the “template image”) or metadata describing the characteristic amounts of the template image is registered in advance.
  • the characteristic amounts of the template image are the color histogram of the template image, the motion pattern (for instance, little motion is observed in the area of an announcer at switching between news items), or the like.
  • each image corresponding to a scene change point is matched against the template image, as shown in FIG. 7 . Then, if the similarity therebetween is high, the scene change point is registered as a scene start position.
  • the similarity may be judged based on inter-frame differences, inter-frame differences in color histogram, or the like.
  • when characteristic amounts of the template image are registered in advance, characteristic amounts of each image corresponding to a scene change point are extracted and are matched against the characteristic amounts of the template image. If the similarity therebetween is high, the scene change point is registered as a scene start position. Then, information showing the scene start position is outputted to the scene division unit 3 .
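  • The matching of scene change candidates against registered template characteristics might look like the sketch below. Histogram intersection is one common similarity measure and is used here as an assumption; the text only says that similarity may be judged from inter-frame differences or color histogram differences. The threshold `min_similarity` and the function names are hypothetical.

```python
def histogram_intersection(h1, h2):
    """Similarity in [0, 1]: overlap of two histograms after normalization."""
    total1, total2 = sum(h1), sum(h2)
    return sum(min(a / total1, b / total2) for a, b in zip(h1, h2))

def select_scene_starts(candidates, template_hist, min_similarity=0.8):
    """Keep the candidate frames whose histogram resembles the template.

    candidates: list of (frame_index, color_histogram) pairs, one per
    detected scene change point.
    """
    return [
        index
        for index, hist in candidates
        if histogram_intersection(hist, template_hist) >= min_similarity
    ]

template = [8, 2, 0, 0]                 # registered in advance (toy values)
candidates = [
    (120, [7, 3, 0, 0]),                # resembles the template
    (300, [0, 1, 4, 5]),                # different color distribution
]
scene_starts = select_scene_starts(candidates, template)
```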
  • the scene division unit 3 determines a scene start position and a scene end position based on the information showing the scene start position automatically detected by the scene change detection unit 39 .
  • the scene division unit 3 of the second embodiment is also capable of determining the scene start position and the scene end position based on designation from the user, like in the first embodiment described above.
  • the scene change detection unit 39 can detect each scene change point contained in a scene with reference to each scene start position and each scene end position described in the scene section information metadata 12 outputted from the scene division unit 3 to the scene change detection unit 39 .
  • the scene description edit unit 5 re-divides or integrates the scenes automatically detected by the scene change detection unit 39 based on the scene section information metadata 12 from the scene division unit 3 . Note that the details of the scene description edit unit 5 are the same as those in the first embodiment described above.
  • As described above, with the metadata editing apparatus 100 A according to the second embodiment, it becomes possible to generate metadata describing the hierarchical structure possessed by content, such as video data, and characteristic amounts of each scene, like in the first embodiment described above.
  • the scene change detection unit 39 is provided, so that it becomes possible to automatically detect each scene change point in content.
  • in this third embodiment, a metadata reproduction apparatus will be described which performs summary reproduction of images, searching, and the like using the metadata generated by the metadata editing apparatus according to the first embodiment or the second embodiment described above.
  • FIG. 8 is a block diagram showing a construction of the metadata reproduction apparatus according to the third embodiment of the present invention.
  • a metadata reproduction apparatus 200 includes a metadata analysis unit 19 , a structure display unit 20 , a thumbnail image display unit 21 , a user input unit 22 , a search unit 23 , a search result display unit 24 , a summary creation unit 25 , a summary structure display unit 26 , and a content reproduction unit 27 .
  • the metadata analysis unit 19 performs analysis of metadata 28 describing the hierarchical scene structure possessed by content, information concerning the thumbnail of each scene, the characteristic amounts of each scene, and the like.
  • the structure display unit 20 displays a scene structure 29 obtained as a result of the metadata analysis, that is, the hierarchical structure of the content.
  • the thumbnail image display unit 21 displays thumbnail image information 30 obtained as a result of the metadata analysis.
  • the search unit 23 performs searching based on the search designation (search condition 31 ) from the user and the scene characteristic amounts or text information 32 obtained from the metadata.
  • the search result display unit 24 displays a result 33 of the searching.
  • the summary creation unit 25 performs creation of a summary based on summary creation designation (summary creation condition 34 ) from the user.
  • the summary structure display unit 26 displays a structure 38 of summarized content.
  • the content reproduction unit 27 reproduces/displays the content based on summary information 35 , content reproduction designation 36 , and content 37 to be reproduced.
  • the metadata analysis unit 19 receives input of the metadata 28 describing the hierarchical scene structure possessed by the content, information concerning the thumbnail of each scene, the characteristic amounts of each scene, and the like, and performs analysis of the metadata.
  • the metadata 28 is metadata generated by the metadata description unit 9 of the first embodiment or the second embodiment described above in a format stipulated by MPEG-7. Consequently, the metadata is a text file written in XML or a binary file encoded in the binary format.
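  • when the metadata 28 is a text file written in XML, the metadata analysis unit 19 serves as an XML parser that performs analysis of the XML file.
  • when the metadata 28 is encoded in the binary format, the metadata analysis unit 19 serves as a decoder that performs decoding of the metadata 28 .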
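  • For the XML case, the role of the metadata analysis unit can be illustrated with a small parser. The element names below (`VideoSegment`, `Title`) are a deliberately simplified stand-in and do not follow the actual MPEG-7 schema; the sample document and the function names are assumptions made for this sketch.

```python
import xml.etree.ElementTree as ET

# Simplified, hypothetical scene description (not schema-valid MPEG-7).
SAMPLE = """
<VideoSegment id="root">
  <Title>News</Title>
  <VideoSegment id="s1"><Title>Headlines</Title></VideoSegment>
  <VideoSegment id="s2">
    <Title>Sports</Title>
    <VideoSegment id="s2-1"><Title>Soccer</Title></VideoSegment>
  </VideoSegment>
</VideoSegment>
"""

def parse_scene_tree(element):
    """Turn nested VideoSegment elements into a nested dict."""
    return {
        "id": element.get("id"),
        "title": element.findtext("Title"),
        "children": [parse_scene_tree(c) for c in element.findall("VideoSegment")],
    }

def render(node, depth=0):
    """Return the scene titles as indented lines, one per scene."""
    lines = ["  " * depth + node["title"]]
    for child in node["children"]:
        lines.extend(render(child, depth + 1))
    return lines

tree = parse_scene_tree(ET.fromstring(SAMPLE.strip()))
outline = render(tree)
```

  • The nested dict corresponds to the hierarchical scene structure that the structure display unit would show in tree form.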
  • the structure display unit 20 receives input of a result of the analysis by the metadata analysis unit 19 and displays the hierarchical scene structure 29 of the content.
  • the scene structure of the content is displayed in a tree form together with the title of each scene, as shown in FIG. 4 .
  • the thumbnail image display unit 21 receives input of the result of the analysis by the metadata analysis unit 19 (thumbnail image information 30 ) and displays a list of thumbnail images of the content.
  • the search unit 23 receives search designation from the user via the user input unit 22 and searches for a scene contained in the content. At this time, the user inputs a search condition by giving a keyword, a sample image, or the like via the user input unit 22 .
  • the search unit 23 searches for each scene matching the search condition 31 , such as the keyword or the characteristics of the sample image, given by the user based on the scene characteristic amounts described in the metadata or the text information 32 giving scene titles and the like.
  • the search result display unit 24 receives input of the result 33 of the searching by the search unit 23 and performs displaying of the search result.
  • the thumbnail image of each scene matching the search condition is displayed, for instance.
  • the summary creation unit 25 creates a summary of the content based on summary creation designation from the user via the user input unit 22 .
  • the user inputs information showing the reproduction time of summarized content, user preferences, and the like using the user input unit 22 .
  • when the content is a news video, for instance, the user inputs preference information showing that he/she wishes to mainly watch sports news in the news video or to watch a 20-minute summary of the news video whose original length is one hour.
  • the summary creation unit 25 also creates the summary information 35 matching the summary condition based on the scene reproduction times described in the metadata and the text information 32 giving the scene titles and the like.
  • this summary information 35 is a reproduction list of scenes contained in the summarized content and is a list in which the location information, such as the URL, of the content is written together with the start position and end position of each scene in the content that the user wishes to reproduce.
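  • A reproduction list of this kind could be built as in the following sketch. The selection policy (keyword match on the scene title, then filling a time budget in scene order) is an assumption chosen for illustration, as are the field names and the example URL; the text does not fix a particular selection rule.

```python
def create_summary(scenes, budget_sec, keyword=None):
    """Build a reproduction list that fits within budget_sec seconds.

    scenes: list of dicts with "url", "title", "start", and "end" (seconds).
    Scenes whose title contains the keyword are taken in order while the
    total length stays within the budget.
    """
    playlist, used = [], 0
    for scene in scenes:
        if keyword is not None and keyword.lower() not in scene["title"].lower():
            continue
        length = scene["end"] - scene["start"]
        if used + length <= budget_sec:
            playlist.append(
                {"url": scene["url"], "start": scene["start"], "end": scene["end"]}
            )
            used += length
    return playlist

URL = "http://example.com/news.mpg"  # hypothetical content location
scenes = [
    {"url": URL, "title": "Sports: soccer", "start": 0, "end": 600},
    {"url": URL, "title": "Weather", "start": 600, "end": 900},
    {"url": URL, "title": "Sports: baseball", "start": 900, "end": 1400},
]
# A 20-minute (1200-second) summary that favors sports news.
summary = create_summary(scenes, budget_sec=1200, keyword="sports")
```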
  • the content reproduction/display unit 27 specifies target content based on the location information of the content contained in the summary information 35 , and performs obtainment/reproduction/display of each scene to be reproduced based on the scene list contained in the summary information.
  • the summary information hierarchically describes the scene structure of the summarized content.
  • FIG. 9 shows an example of a hierarchical scene structure.
  • FIG. 9 ( a ) shows an example of a scene structure of original content.
  • Each scene is given importance in a range of 0.0 to 1.0, with “1.0” meaning the highest importance and “0.0” meaning the lowest importance.
  • the importance is calculated based on the user preferences, for instance. If the user preferences are registered in advance and indicate that he/she wishes to watch scenes of a soccer game of a team A and, in particular, to necessarily watch a result of the game and goal scenes, each scene is given importance reflecting the user preferences.
  • each scene has metadata showing the location information, such as the URL, of the content containing the scene, the position information (the start position and the end position) of the scene in the content, and the like.
  • Information concerning the scene structure 38 of the summarized content is passed to the summary structure display unit 26 , which then displays the scene structure 38 in the tree form shown in FIG. 9 ( b ).
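  • One way to derive a summarized scene structure from per-scene importance is to prune the tree, as sketched below. The rule used here (drop a scene and its whole subtree when its importance falls below a threshold) and the threshold value are assumptions for illustration; the dict layout is hypothetical.

```python
def prune(node, threshold):
    """Return a copy of the scene tree without scenes below the threshold.

    A scene whose importance is below the threshold is dropped together
    with its descendants; None is returned when the node itself is dropped.
    """
    if node["importance"] < threshold:
        return None
    kept = []
    for child in node.get("children", []):
        pruned = prune(child, threshold)
        if pruned is not None:
            kept.append(pruned)
    return {**node, "children": kept}

game = {
    "title": "Soccer game",
    "importance": 1.0,
    "children": [
        {
            "title": "First half",
            "importance": 0.8,
            "children": [
                {"title": "Goal", "importance": 1.0},
                {"title": "Throw-in", "importance": 0.2},
            ],
        },
        {"title": "Interview", "importance": 0.3},
    ],
}
summarized = prune(game, 0.5)
```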
  • the content reproduction/display unit 27 reproduces/displays each selected scene contained in the content.
  • As described above, with the metadata reproduction apparatus 200 according to the third embodiment, it becomes possible to reproduce only each scene that the user wishes to watch using the metadata generated by the metadata editing apparatus according to the first embodiment or the second embodiment described above or to search for the scene desired by the user using the characteristic amounts described in the metadata.
  • the content reproduction/display unit 27 is provided within the metadata reproduction apparatus 200 .
  • this content reproduction/display unit may be provided in another apparatus.
  • manipulations and displaying concerning reproduction of the metadata such as displaying of the scene structure and the thumbnail images, may be performed by a mobile telephone, a portable information terminal, or the like, and processing and displaying concerning reproduction of the multimedia content may be performed by a terminal (PC, for instance) connected to the mobile telephone, the portable information terminal, or the like via a network.
  • a metadata delivery server (metadata delivery apparatus), which delivers the metadata of content to a client terminal, and a content delivery server, which scalably constructs the content with reference to the terminal capability of the client terminal and delivers the constructed content to the client terminal, will be described.
  • FIG. 10 is a block diagram showing a construction of the content delivery system according to the fourth embodiment of the present invention.
  • a content delivery system 300 includes a metadata delivery server 400 , various client terminals 481 to 48 n , and a content delivery server 500 .
  • the metadata delivery server 400 includes a metadata accumulation unit 41 , a metadata analysis unit 42 , a terminal capability judgment unit 43 , a metadata re-generation unit 44 , and a metadata delivery unit 45 .
  • the metadata accumulation unit 41 accumulates the metadata generated by the metadata editing apparatus of the first embodiment or the second embodiment described above, for instance.
  • the metadata analysis unit 42 performs analysis of metadata 49 describing the structure and characteristics of content.
  • the terminal capability judgment unit 43 judges the terminal capability of each client terminal based on information 51 concerning the capability of the client terminal.
  • the metadata re-generation unit 44 restructures the content in accordance with the judged terminal capability of the client terminal based on a result 50 of the analysis of the metadata, and re-generates metadata 52 of the restructured content.
  • the metadata delivery unit 45 delivers metadata 53 re-generated by the metadata re-generation unit 44 to the client terminals 481 to 48 n.
  • the metadata accumulation unit 41 may be provided outside the metadata delivery server 400 of the fourth embodiment.
  • the metadata delivery server 400 receives input of the metadata 49 from the metadata accumulation unit 41 via a network (not shown) or the like.
  • the content delivery server 500 includes a content accumulation unit 46 and a content delivery unit 47 .
  • the content delivery unit 47 delivers content 56 to the client terminals 481 to 48 n in accordance with content delivery requests 54 from the client terminals.
  • the content accumulation unit 46 may be provided outside the content delivery server 500 .
  • the content delivery server 500 receives input of the content data 55 via a network (not shown).
  • the metadata analysis unit 42 performs analysis of the metadata accumulated in the metadata accumulation unit 41 .
  • the metadata analysis unit 42 operates in the same manner as the metadata analysis unit 19 of the metadata reproduction apparatus 200 of the third embodiment described above. By performing the analysis of the metadata, the metadata analysis unit 42 obtains information concerning the structure and characteristics of the content.
  • FIG. 11 shows content structure information outputted from the metadata analysis unit of the metadata delivery server according to the fourth embodiment, with the illustrated example relating to a news video.
  • the hierarchical scene structure of the content is displayed in a tree form.
  • Each node of the tree corresponds to one scene and is associated with various types of scene information.
  • the various types of scene information include a scene title, an abstract, time information giving a scene start position and a scene end position, a scene thumbnail, a representative frame, a thumbnail shot, a representative shot, and scene characteristics such as visual characteristic amounts concerning colors, motions, and the like. Note that in FIG. 11 , among the various types of scene information, only the scene titles are shown.
  • the client terminals are various information household devices having different terminal capabilities.
  • the terminal capability refers to a communication speed, a processing speed, an image format that can be reproduced/displayed, an image resolution, a user input function, and the like.
  • the client terminal 481 is a personal computer (PC) that has sufficient performance with respect to the communication speed, processing speed, display performance, and user input function.
  • the client terminal 482 is a mobile telephone and the remaining client terminals are each a PDA or the like.
  • Each of the client terminals 481 to 48 n sends information concerning its terminal performance.
  • the terminal capability judgment unit 43 analyzes the information 51 that was sent from each of the client terminals 481 to 48 n and shows the terminal performance of the client terminal, determines a deliverable image format, a maximum image resolution, a length of the content, and the like, and outputs them to the metadata re-generation unit 44 .
  • it is assumed here that the original content is video content encoded in MPEG-2 and has a high resolution.
  • in this case, the original content can be reproduced by the client terminal 481 as it is because the client terminal 481 has sufficient performance as described above.
  • this client terminal 481 has a function with which it is possible to perform the image summary reproduction and searching described in the third embodiment described above.
  • the client terminal 482 is capable of reproducing only short video shots encoded in MPEG-4 and the maximum resolution displayable by the client terminal 482 is low.
  • the metadata re-generation unit 44 restructures the content in accordance with the terminal performance of each of the client terminals 481 to 48 n informed by the terminal capability judgment unit 43 , re-generates the metadata 52 describing the structure and contents of the restructured content, and outputs the metadata 52 to the metadata delivery unit 45 .
  • the original metadata is delivered to the client terminal 481 as it is, so that the restructuring of the content is not performed.
  • the client terminal 482 has only the function of reproducing short video shots and is incapable of reproducing every scene, so that the restructuring of the content is performed for the client terminal 482 using short video shots of important scenes.
  • FIG. 12 shows an example of a content structure after the restructuring by the metadata re-generation unit of the content delivery system according to the fourth embodiment.
  • each important scene, out of scenes of the news video is extracted and the content is restructured so as to include only the representative shot or representative frame of each extracted scene.
  • the client terminal 482 does not have the search function described in the above third embodiment, so that among the various types of scene information in the metadata, the scene characteristic amounts are not required to be included for searching. Therefore, the metadata re-generation unit 44 re-generates metadata describing only the structure of restructured scenes and the position information of the representative shots or representative frames of the scenes, and sends the metadata to the metadata delivery unit 45 .
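  • The capability-dependent re-generation can be sketched as follows. The capability flags, the importance threshold, the field names, and the file path are hypothetical; the sketch only mirrors the two cases in the text, namely delivering the original metadata unchanged to a fully capable terminal and delivering a reduced structure without scene characteristic amounts to a limited one.

```python
def regenerate(node, capability, min_importance=0.5):
    """Return metadata suited to a terminal with the given capability.

    capability: dict with boolean flags "full_scenes" and "search".
    """
    if capability["full_scenes"] and capability["search"]:
        return node  # original metadata is delivered as it is
    children = [
        regenerate(child, capability, min_importance)
        for child in node.get("children", [])
        if child.get("importance", 0.0) >= min_importance
    ]
    slim = {"title": node["title"], "children": children}
    # Keep only the location of the representative shot; characteristic
    # amounts are omitted because the terminal cannot search with them.
    if "representative_shot" in node:
        slim["representative_shot"] = node["representative_shot"]
    return slim

news = {
    "title": "News",
    "importance": 1.0,
    "children": [
        {
            "title": "Top story",
            "importance": 0.9,
            "representative_shot": "shots/top.mp4",  # hypothetical location
            "features": {"color_histogram": [8, 2, 0, 0]},
            "children": [],
        },
        {"title": "Weather", "importance": 0.2, "children": []},
    ],
}
pc = {"full_scenes": True, "search": True}
phone = {"full_scenes": False, "search": False}
for_pc = regenerate(news, pc)
for_phone = regenerate(news, phone)
```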
  • the metadata delivery unit 45 delivers the metadata 53 generated by the metadata re-generation unit 44 to the client terminals 481 to 48 n.
  • Each of the client terminals 481 to 48 n analyzes the metadata 53 delivered by the metadata delivery unit 45 and obtains scene structure information of the content.
  • the client terminal transmits position information of the selected scene to the content delivery unit 47 of the content delivery server 500 .
  • on receiving the scene position information from each of the client terminals 481 to 48 n , the content delivery unit 47 of the content delivery server 500 obtains corresponding content 55 from the content accumulation unit 46 and delivers the content to each of the client terminals 481 to 48 n .
  • the content delivery unit 47 sends a scene start position and a scene end position and delivers a corresponding scene of the original content.
  • the content delivery unit 47 sends the location information (such as the URL) of a scene representative shot.
  • the content delivery unit 47 performs various kinds of processing, such as format conversion, resolution conversion, and reduction in file size through content summarization, and sends resultant data to the client terminal 482 .
  • with the metadata delivery server 400 of the fourth embodiment, it becomes possible to re-generate metadata in accordance with the capability of each of the client terminals 481 to 48 n and to deliver the re-generated metadata to each of the client terminals 481 to 48 n.
  • the metadata delivery server 400 and the content delivery server 500 are shown as separated apparatuses, but the present invention is not limited to this.
  • the content delivery server may be provided in the metadata delivery server or the metadata delivery server may be provided in the content delivery server.
  • the metadata delivery server and the content delivery server may be provided in the same server. In this case, it becomes possible for the terminal capability judgment unit 43 to easily inform the content delivery unit 47 of the capability of each of the client terminals 481 to 48 n , which makes it possible to restructure the content through format conversion or the like in accordance with the capability of the client terminal and to deliver the restructured content to each of the client terminals 481 to 48 n.
  • the fourth embodiment has been described by assuming that in the metadata accumulation unit 41 , the metadata generated by the metadata editing apparatus of the first embodiment or the second embodiment described above is accumulated.
  • the present invention is not limited to this and, needless to say, metadata generated by an apparatus other than the metadata editing apparatus of the first embodiment or the second embodiment described above may be accumulated in the metadata accumulation unit 41 .
  • the metadata delivery server of the above fourth embodiment performs the metadata re-generation based on the terminal information sent from each client terminal.
  • the metadata delivery server (metadata delivery apparatus) is provided with a metadata analysis/re-generation unit that performs the metadata re-generation using metadata optimization hint information that is hint information for the metadata re-generation.
  • FIG. 13 is a block diagram showing a construction of the metadata delivery server according to the fifth embodiment of the present invention.
  • a metadata delivery server 400 A includes a hint information analysis unit 61 , a metadata analysis/re-generation unit 63 , and a metadata delivery unit 45 .
  • the hint information analysis unit 61 analyzes metadata optimization hint information 60 and outputs a result of the analysis.
  • the metadata analysis/re-generation unit 63 analyzes metadata 49 describing the structure and characteristics of content based on analyzed metadata optimization hint information 62 and a condition 65 concerning metadata re-generation such as information concerning the performances of the client terminals or user preferences, and outputs restructured metadata 64 . Then, the metadata delivery unit 45 delivers metadata 53 to the client terminals.
  • the metadata 49 describing the structure and characteristics of the content and the metadata optimization hint information 60 that is hint information for the re-generation of the metadata 49 are accumulated.
  • the metadata optimization hint information 60 for the re-generation of the metadata 49 is information describing the types of information contained in the metadata 49 , the amount of the contained information, and the outline and complexity of the metadata 49 .
  • the metadata optimization hint information 60 will be described in detail by taking, as an example, a case of video content having the structure shown in FIG. 14 .
  • a video content (Root) (Soccer game program) is broadly divided into two scenes (Scene 1 and Scene 2 ) corresponding to the first half and the second half, and the first half scene is further divided into multiple scenes (Scene 1 - 1 , Scene 1 - 2 , . . . , Scene 1 - n ) (such as goal scenes and corner kick scenes).
  • the temporal hierarchical structure among the scenes is indicated using a tree structure.
  • the metadata 49 corresponding to the video content describes the temporal hierarchical structure of the content, that is, the temporal relations among the scenes, and the start times and lengths of the scenes.
  • the metadata 49 also describes text information (such as a title, abstract, category, and explanatory notes), importance, and the like of each scene as well as the characteristics (for instance, a color histogram or motion complexity) possessed by the scene in accordance with the hierarchical level of the scene. Note that in this fifth embodiment, it is assumed that MPEG-7 standardized by ISO is used as a metadata description format.
  • FIG. 15 shows how the metadata is described in MPEG-7.
  • each scene is described in units called “video segment”.
  • each video segment describes time information (scene start point and length), a title, an outline, a category, and the like.
  • the information described in each video segment is changed in accordance with the hierarchical level of the video segment.
  • importance is described in each video segment at Level 2 and Level 3 , although no importance is described in each video segment at Level 4 .
  • the characteristic amounts concerning colors and motions are described only in each video segment at Level 4 .
  • the metadata optimization hint information 60 for the re-generation of the metadata 49 describes the types and contents of information (descriptors) contained in the metadata 49 . Accordingly, in the case of the metadata shown in FIG. 15 , the metadata optimization hint information 60 contains a descriptor (“time division”) expressing the temporal hierarchical structure possessed by the content, descriptors expressing the color histogram and the motion complexity, and descriptors expressing the title, abstract, category, and importance. Also, in order to express description contents and complexity, the depth of each video segment in the hierarchical structure is expressed with up to four levels (Level 1 to Level 4 ). Further, the importance assumes one of five discrete values ({0.0, 0.25, 0.5, 0.75, 1.0}). As importance with respect to viewpoints, there are described importance from the viewpoint of “Team A” and importance from the viewpoint of “Team B”. Also, there is described the hierarchical position at which the importance is described (video segment level at which the importance is described).
  • FIG. 16 shows an example of a format of the metadata optimization hint information 60 .
  • the metadata optimization hint information 60 shown in FIG. 16 contains metadata file information and metadata construction element information.
  • the metadata file information describes information for predicting resources required to process the metadata, such as the memory size required to accumulate/analyze the metadata and the processing system (S/W) required to analyze the metadata.
  • the metadata file information describes the location of a metadata file, the size of the metadata file, the format of the metadata file (for instance, the XML format or the binary format), syntax file information (location of a syntax file defining the syntax of the metadata), and an appearing element number showing the number of elements contained (appearing) in the metadata.
  • the syntax file defining the format of the metadata file corresponds to a DTD file, a schema file, or the like defining the description format (syntax) of the metadata, and the syntax file information describes the location of the DTD file or the schema file, for instance.
  • the metadata construction element information is information describing the type and contents of each descriptor constituting the metadata.
  • the metadata construction element information contains the name of each descriptor contained in the metadata, the appearing frequency (number of appearing times) of the descriptor in the metadata, and a description (completeness of description) showing whether or not the descriptor contains every descriptor that it may syntactically contain.
  • the metadata construction element information also contains the temporal or spatial hierarchical property (maximum value of the depth) possessed by the descriptor.
  • “video segment” is the descriptor recursively described and has a hierarchical structure with up to four levels, so that the maximum hierarchical depth possessed by the “video segment” descriptor becomes four.
  • the hint information also describes the appearing position (hierarchical level) at which the contained descriptor appears. For instance, “importance” is a descriptor contained in the “video segment” descriptor and, when the “importance” is contained in the video segment at up to Level 3 , that is, is not contained in the video segment at Level 4 , the appearing position of the “importance” becomes up to Level 3 . In this manner, the appearing position is specified using the hierarchical level. However, when an ID is assigned to each “video segment” containing the “importance” or the “video segment” itself, it is also possible to describe the appearing position as an ID list.
  • the hint information additionally describes the type of the descriptor and the range of values that the descriptor can assume.
  • for instance, the values that the “importance” can assume form the list {0.0, 0.25, 0.5, 0.75, 1.0} in floating-point form. The above description is repeated for each descriptor that is a construction element of the metadata.
  • FIG. 17 shows an example of the metadata optimization hint information described in accordance with the format shown in FIG. 16 . It can be seen that the example of the metadata optimization hint information 60 shown in FIG. 17 contains the metadata file information and the metadata construction element information for descriptors such as the “video segment” descriptor and the “title” descriptor.
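  • The two parts of the hint format (metadata file information and metadata construction element information) map naturally onto a small data model. The class names, field names, and every concrete value below are assumptions made for illustration; they are not taken from FIG. 16 or FIG. 17 .

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class FileHint:
    """Metadata file information: resources needed to process the file."""
    location: str
    size_bytes: int
    file_format: str        # for instance "XML" or "binary"
    syntax_file: str        # location of a DTD or schema file
    element_count: int      # number of elements appearing in the metadata

@dataclass
class ElementHint:
    """Metadata construction element information for one descriptor."""
    name: str
    occurrences: int
    complete: bool                          # completeness of description
    max_depth: Optional[int] = None         # hierarchical depth, if recursive
    appearing_level: Optional[int] = None   # deepest level at which it appears
    value_type: Optional[str] = None
    allowed_values: List[float] = field(default_factory=list)

@dataclass
class OptimizationHint:
    file: FileHint
    elements: List[ElementHint]

    def lookup(self, name):
        """Return the hint for a descriptor, or None when it never appears."""
        return next((e for e in self.elements if e.name == name), None)

hint = OptimizationHint(
    file=FileHint("metadata.xml", 32_000, "XML", "schema.xsd", 120),
    elements=[
        ElementHint("video segment", occurrences=40, complete=False, max_depth=4),
        ElementHint(
            "importance", occurrences=20, complete=True, appearing_level=3,
            value_type="float", allowed_values=[0.0, 0.25, 0.5, 0.75, 1.0],
        ),
    ],
)
```

  • With such a model, a re-generation step can ask `hint.lookup("importance")` before opening the metadata file at all.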
  • the hint information analysis unit 61 performs analysis of the metadata optimization hint information 60 described in the specified format.
  • the metadata analysis/re-generation unit 63 performs analysis of the metadata 49 using the analyzed metadata optimization hint information 62 outputted from the hint information analysis unit 61 , and outputs the metadata 64 re-generated based on the condition 65 concerning the metadata re-generation.
  • FIG. 18 shows an example of a method with which the metadata analysis/re-generation unit 63 analyzes the metadata using the analyzed metadata optimization hint information 62 .
  • the metadata analysis/re-generation unit 63 analyzes the metadata using the analyzed metadata optimization hint information 62 .
  • the metadata analysis/re-generation unit 63 specifies metadata necessary for re-generation based on the condition 65 for metadata re-generation (step S 1 ).
  • in this example, each video segment having importance of 0.5 or higher is extracted, so that “importance” and “video segment” are the descriptors necessary for the re-generation.
  • using the analyzed metadata optimization hint information 62 , the metadata analysis/re-generation unit 63 judges whether or not the descriptors specified in step S 1 are contained in the metadata 49 (step S 2 ) (the following description will be made by taking a case of the “importance” descriptor as an example).
  • the analysis processing is ended without performing the analysis for Level 4 and the following hierarchical levels (step S 6 ).
  • Also, when the metadata optimization hint information 62 specifies that the number of appearing times of the “importance” descriptor is 20, at the time when the analysis of 20 “importance” descriptors is finished (step S 5), the analysis of the metadata is ended (step S 6). After the metadata analysis processing is ended in step S 4 or step S 6, in order to perform the analysis of another piece of metadata if necessary, the operations in step S 1 and the following steps are repeated.
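  • The hint-guided analysis of FIG. 18 can be sketched as follows. This is only an illustrative sketch, not the implementation of the embodiment: the `Segment` and `DescriptorHint` classes and the `collect_importance` function are hypothetical names, and the hint information is assumed to give, for each descriptor, the deepest hierarchical level at which it appears and its number of appearing times.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Segment:
    """One video segment in the hierarchical metadata (hypothetical model)."""
    level: int
    importance: Optional[float] = None
    children: List["Segment"] = field(default_factory=list)


@dataclass
class DescriptorHint:
    """Hint entry for one descriptor (assumed layout)."""
    max_level: int  # deepest hierarchical level at which the descriptor appears
    count: int      # number of appearing times in the whole metadata


def collect_importance(root: Segment, hints: Dict[str, DescriptorHint],
                       threshold: float) -> List[Segment]:
    """Return segments whose importance is at or above the threshold."""
    hint = hints.get("importance")
    if hint is None:
        # The hint says the descriptor is absent (judgment of step S2),
        # so the metadata itself is never analyzed.
        return []
    selected: List[Segment] = []
    seen = 0
    stack = [root]
    # Stop as soon as every listed appearance has been analyzed (step S5).
    while stack and seen < hint.count:
        seg = stack.pop()
        if seg.level > hint.max_level:
            # Deeper levels cannot hold the descriptor, so they are skipped.
            continue
        if seg.importance is not None:
            seen += 1
            if seg.importance >= threshold:
                selected.append(seg)
        # Push children in reverse so they are visited in document order.
        stack.extend(reversed(seg.children))
    return selected
```

  • With a hint stating that “importance” appears three times and no deeper than Level 3, a Level 4 segment is never inspected, and the loop ends once the third appearance has been read.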
  • FIG. 19 shows another example of the method for analyzing the metadata using the analyzed metadata optimization hint information 62 .
  • metadata is re-generated by extracting only video segment containing the “title” descriptor.
  • the judgement whether or not the metadata contains the “title” descriptor is performed in the same manner as in the example shown in FIG. 18 .
  • the metadata analysis/re-generation unit 63 judges whether or not a video segment matches an appearing position ID described in the metadata optimization hint information 62 (step S 13 ).
  • when the video segment does not match the ID, the video segment does not contain the “title” descriptor, so that the analysis of the description of this video segment is skipped (step S 16).
  • step S 17 when the analysis of every video segment matching the appearing position ID is finished (step S 17 ), no video segment containing the “title” descriptor remains in the metadata, so that the analysis processing is ended (step S 18 ).
  • in order to perform the analysis of another piece of metadata if necessary, the operations in step S 11 and the following steps are repeated. Then, the metadata 64 restructured using the descriptor extracted through the analysis processing described above is outputted.
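  • The use of appearing position IDs in FIG. 19 can be sketched as follows. The function name `extract_titled_segments` and the dictionary layout of a video segment are assumptions made for illustration; the hint information is assumed to list the IDs of the video segments that contain the “title” descriptor.

```python
from typing import Dict, List, Optional, Set


def extract_titled_segments(segments: List[Dict[str, Optional[str]]],
                            title_position_ids: Set[str]
                            ) -> List[Dict[str, Optional[str]]]:
    """Keep only the segments whose ID the hint lists as holding a title."""
    remaining = set(title_position_ids)
    kept = []
    for seg in segments:
        if not remaining:
            # Every listed position has been analyzed (steps S17 and S18).
            break
        if seg["id"] not in remaining:
            # ID mismatch (step S13): skip this description (step S16).
            continue
        remaining.discard(seg["id"])
        kept.append(seg)  # only matching segments are analyzed in detail
    return kept
```

  • Segments whose ID is not listed are passed over without looking at their contents, and the loop ends early once the last listed ID has been handled.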
  • the metadata delivery unit 45 delivers the restructured metadata 64 to the client terminals.
  • metadata optimization hint information corresponding to the metadata after the re-generation may be re-generated.
  • the descriptor analysis of the metadata 49 is performed in the manner described above using the metadata optimization hint information 60 describing the list of each descriptor contained in the metadata 49 , the appearing position of the descriptor, the number of appearing times, and the like.
  • the analysis of each descriptor not matching the re-generation condition is omitted using the appearing position or the number of appearing times, so that it becomes possible to reduce the processing cost (such as the processing amount and the memory usage amount) required to perform the metadata analysis and re-generation.
  • the metadata delivery server that reduces the processing cost required to perform the metadata analysis and re-generation using the metadata optimization hint information for the metadata re-generation.
  • a metadata search server (metadata search apparatus)
  • FIG. 20 is a block diagram showing a construction of the metadata search server according to the sixth embodiment of the present invention.
  • a metadata search server 600 includes a hint information analysis unit 61, a metadata analysis unit 71, and a search unit 73.
  • the hint information analysis unit 61 is the same as that in the fifth embodiment described above and therefore the description thereof is omitted in this embodiment.
  • the metadata analysis unit 71 performs analysis of an enormous amount of metadata 49 describing the structure and characteristics of content with efficiency and with a less processing cost using analyzed metadata optimization hint information 62 and a search condition 70 .
  • the search unit 73 searches for content matching the search condition using a result 72 of the analysis of the metadata.
  • FIG. 21 is a flowchart showing how the metadata analysis unit of the metadata search server according to the sixth embodiment operates.
  • the metadata analysis unit 71 performs analysis of at least one piece of metadata using the metadata optimization hint information 62 corresponding to the metadata.
  • the analysis of the metadata is extraction of each characteristic description necessary for the searching from the metadata.
  • when a video segment having specific color characteristic amounts is given as the search condition and each video segment having characteristics close to those of the given video segment is to be searched for, for instance, it is required to extract each video segment having a color characteristic description.
  • a color characteristic description (“color histogram”) is added to each video segment at Level 4 , so that a description concerning each video segment at Level 4 is extracted.
  • the metadata analysis unit 71 analyzes the search condition 70 and specifies a descriptor that is valid for the searching (step S 21).
  • the search condition may be characteristic amounts described in a format defined in MPEG-7.
  • the search condition may be an image, a keyword, or the like.
  • each corresponding descriptor color arrangement information
  • each descriptor in a text form (such as a title, an abstract, or explanatory notes) becomes the descriptor that is valid for the searching.
  • it is judged whether the selected descriptor is contained in the metadata 49 (step S 22).
  • the analysis processing of the metadata 49 is ended (step S 24 ) and the analysis of another piece of metadata 49 is performed if necessary.
  • the analysis of the metadata is performed (step S 23 ).
  • the metadata analysis processing shown in FIG. 18 or 19 is performed with efficiency using the metadata optimization hint information 62 (steps S 25 and S 26 ).
  • the metadata analysis unit 71 extracts each characteristic description necessary for the searching.
  • the search unit 73 searches for content matching the search condition using the metadata analysis result (characteristic description necessary for the searching) 72 outputted from the metadata analysis unit 71 .
  • a description concerning each video segment having a color characteristic description (“color histogram”) is outputted by the metadata analysis unit 71 , so that the search unit 73 judges compatibility with the color characteristic amounts (histogram) given as the search condition and outputs information (“time information”, for instance) concerning each video segment, whose judgment result is positive, as a search result 74 .
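  • The compatibility judgment performed by the search unit 73 can be sketched as follows. The text does not specify the similarity measure, so histogram intersection is used here purely as an assumed example; the function names and the `min_similarity` parameter are likewise hypothetical.

```python
from typing import List, Sequence, Tuple


def histogram_intersection(a: Sequence[float], b: Sequence[float]) -> float:
    """Similarity in [0, 1] for two normalized histograms of equal length."""
    return sum(min(x, y) for x, y in zip(a, b))


def search_segments(query: Sequence[float],
                    segments: List[Tuple[Tuple[float, float], Sequence[float]]],
                    min_similarity: float = 0.8) -> List[Tuple[float, float]]:
    """Return the (start, end) time information of segments close to the query.

    Each entry of `segments` pairs time information with the color histogram
    extracted for that video segment by the metadata analysis.
    """
    return [time_info for time_info, hist in segments
            if histogram_intersection(query, hist) >= min_similarity]
```

  • Only the time information of the matching segments is returned, which corresponds to the search result 74 in the description above.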
  • the analysis of the metadata 49 is performed using the metadata optimization hint information 60 , so that it becomes possible to omit the analysis of the metadata 49 itself for the metadata re-generation. Also, the analysis of each descriptor that is not necessary for the searching is omitted based on the appearing position and the number of appearing times, so that it becomes possible to reduce the processing cost (such as the processing amount and the memory usage amount) required to perform the metadata searching.
  • FIG. 22 is a block diagram showing a construction of the client terminal according to the seventh embodiment of the present invention.
  • a client terminal 48 A includes a hint information analysis unit 80 , and a metadata re-generation condition setting unit 82 .
  • FIG. 22 shows only a portion of the function of the client terminal 48 A that relates to means for setting a condition for metadata re-generation using metadata optimization hint information 60 .
  • the hint information analysis unit 80 performs analysis of the metadata optimization hint information 60 described in a specified format. This hint information analysis unit 80 is the same as that in the fifth embodiment described above, so that the detailed description thereof is omitted in this embodiment.
  • the metadata re-generation condition setting unit 82 performs setting of a condition 83 for metadata re-generation based on a result 81 of the analysis outputted from the hint information analysis unit 80 .
  • the condition setting refers to selection of each descriptor that is unnecessary for the client terminal 48 A from among various descriptors contained in the metadata optimization hint information 60 , for instance.
  • each descriptor expressing the characteristic amounts such as a color histogram or motion complexity, is unnecessary.
  • the depth in the hierarchical structure processible by the client terminal is set based on the maximum value of the depth in the hierarchical structure described in the metadata optimization hint information 60 .
  • a viewpoint of a user and a threshold value of scene importance are set based on assumable values of the importance described in the metadata optimization hint information 60 .
  • the condition is set so that only each scene having the importance of 0.5 or higher from the viewpoint of “Team A” is selected, for instance.
  • the condition 83 for metadata re-generation set by the metadata re-generation condition setting unit 82 is sent to the metadata delivery server.
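  • The condition setting described above can be sketched as follows. The `HintSummary` and `RegenerationCondition` classes and the `set_condition` function are hypothetical; the sketch assumes that the analyzed hint information exposes the list of descriptors, the maximum depth of the hierarchy, and the assumable values of the importance.

```python
from dataclasses import dataclass
from typing import List, Set


@dataclass
class HintSummary:
    """Values read from the analyzed hint information (assumed layout)."""
    descriptors: Set[str]
    max_depth: int
    importance_values: List[float]


@dataclass
class RegenerationCondition:
    """Condition sent from the client terminal to the server (assumed layout)."""
    excluded_descriptors: Set[str]
    max_depth: int
    viewpoint: str
    importance_threshold: float


def set_condition(hint: HintSummary, unsupported: Set[str],
                  terminal_depth: int, viewpoint: str,
                  wanted_threshold: float) -> RegenerationCondition:
    """Derive a re-generation condition from the hint and terminal limits."""
    # Snap the requested threshold up to the nearest assumable value.
    candidates = [v for v in sorted(hint.importance_values)
                  if v >= wanted_threshold]
    threshold = candidates[0] if candidates else max(hint.importance_values)
    return RegenerationCondition(
        # Only descriptors that actually occur in the metadata can be excluded.
        excluded_descriptors=hint.descriptors & unsupported,
        # The terminal never asks for more depth than the metadata has.
        max_depth=min(terminal_depth, hint.max_depth),
        viewpoint=viewpoint,
        importance_threshold=threshold,
    )
```

  • Snapping the threshold to an assumable value is a design choice of this sketch, made so that the condition always refers to a value that can occur in the metadata.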
  • the metadata is restructured based on the metadata re-generation condition and the terminal performance of the client terminal.
  • when the maximum value of the depth in the hierarchical structure of the original metadata is four and the depth in the hierarchical structure processible by the client terminal is set at two in the metadata re-generation condition, for instance, the structure of the metadata is restructured so that the maximum value of the depth in the hierarchical structure becomes two.
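  • One possible way to restructure metadata to a given maximum depth is sketched below. The text does not state how the deeper levels are handled, so this sketch simply drops them, which is an assumption; `limit_depth` and `depth_of` are hypothetical helper names, and a segment is modeled as a plain dictionary with a `children` list.

```python
from typing import Any, Dict


def limit_depth(node: Dict[str, Any], max_depth: int,
                depth: int = 1) -> Dict[str, Any]:
    """Return a copy of the hierarchy with levels below max_depth removed."""
    copy = {k: v for k, v in node.items() if k != "children"}
    if depth < max_depth:
        copy["children"] = [limit_depth(child, max_depth, depth + 1)
                            for child in node.get("children", [])]
    else:
        copy["children"] = []  # assumed policy: deeper levels are dropped
    return copy


def depth_of(node: Dict[str, Any]) -> int:
    """Depth of the hierarchy, counting the root as level 1."""
    return 1 + max((depth_of(c) for c in node.get("children", [])), default=0)
```

  • The original hierarchy is left untouched, so the same source metadata can be restructured again for a terminal with a different limit.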
  • the metadata re-generation condition is set using the metadata optimization hint information 60 , so that it becomes possible to generate appropriate metadata in accordance with the client terminal or application.
  • a server that re-generates metadata using the metadata optimization hint information and delivers the re-generated metadata.
  • a content delivery server (content delivery apparatus) which analyzes metadata using the metadata optimization hint information, re-generates content suited for the client terminal or user preferences using a result of the analysis, and delivers the re-generated content.
  • FIG. 23 is a block diagram showing a construction of the content delivery server according to the eighth embodiment of the present invention.
  • a content delivery server 500 A includes a hint information analysis unit 61, a metadata analysis unit 86, and a content restructuring/delivery unit 88.
  • the hint information analysis unit 61 operates in the same manner as in the fifth embodiment described above, so that the description thereof is omitted in this embodiment.
  • the metadata analysis unit 86 performs analysis of metadata 49 using analyzed metadata optimization hint information 62 outputted from the hint information analysis unit 61 , and extracts each description matching information concerning the client terminal or a condition 85 concerning content restructuring such as user preferences.
  • the analysis using the hint information is the same as that in the fifth embodiment described above.
  • this eighth embodiment differs from the fifth embodiment in that not the metadata re-generation but content restructuring is performed using each extracted description.
  • Each description extracted by the metadata analysis unit 86, that is, analyzed metadata 87, is outputted to the content restructuring/delivery unit 88.
  • the content restructuring/delivery unit 88 performs restructuring of content 89 based on each description extracted by the metadata analysis unit 86 .
  • the following description will be made based on the example described in the above fifth embodiment.
  • only each video segment having the characteristics, whose importance is 0.5 or higher, is extracted from the metadata 49 and metadata composed of only each description concerning the extracted video segment is re-generated.
  • in this eighth embodiment, only each video segment having characteristics, whose importance is 0.5 or higher, is extracted from the metadata 49 and content 90 composed of only each scene corresponding to the extracted video segment is restructured and is delivered.
  • the location of corresponding content and the position (time information) of the video segment in the content are described. Therefore, it is possible to clip each corresponding scene from the content, to restructure a single content 90 using the clipped scene, and to deliver the restructured content 90 . Alternatively, it is possible to clip each corresponding scene from the content and to sequentially deliver the clipped scene.
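  • The selection of scenes to be clipped can be sketched as follows. The `VideoSegment` tuple, the function names, and the location string used in the usage note are hypothetical; the sketch only builds the list of clips in playback order and leaves the actual clipping and delivery of the video to other components.

```python
from typing import List, NamedTuple, Tuple


class VideoSegment(NamedTuple):
    """Description of one video segment taken from the analyzed metadata."""
    location: str      # where the corresponding content is stored
    start: float       # start time in seconds
    end: float         # end time in seconds
    importance: float


def build_clip_list(segments: List[VideoSegment],
                    threshold: float = 0.5) -> List[Tuple[str, float, float]]:
    """Return (location, start, end) for each scene to clip, in time order."""
    chosen = [s for s in segments if s.importance >= threshold]
    chosen.sort(key=lambda s: s.start)
    return [(s.location, s.start, s.end) for s in chosen]


def total_duration(clips: List[Tuple[str, float, float]]) -> float:
    """Length in seconds of the content restructured from the clips."""
    return sum(end - start for _, start, end in clips)
```

  • For example, given three segments stored at a placeholder location `"content-a"` with importance 0.75, 0.5, and 0.25, only the first two are listed for clipping. The list can then either be joined into a single piece of content or delivered clip by clip.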
  • the metadata analysis is performed using the metadata optimization hint information 60 describing a list of each descriptor contained in the metadata 49 , the appearing position of the descriptor, the number of appearing times, and the like, so that it becomes possible to omit the analysis of the metadata 49 itself for the metadata re-generation. Also, the analysis of each descriptor not matching the re-generation condition is omitted using the appearing position and the number of appearing times, so that it becomes possible to reduce the processing cost (such as the processing amount and the memory usage amount) required to perform the metadata analysis and the content restructuring at the time of re-generation and delivery of content suited for the client terminal and user preferences.
  • multimedia content containing moving pictures and audio is divided into multiple scenes, editing of the multiple scenes is performed, and metadata that is scene structure information describing the hierarchical structure of the multimedia content is generated.

Abstract

Multimedia content containing moving pictures and audio is divided into multiple scenes and metadata is generated for each of the scenes obtained as a result of the division. It is possible to generate metadata containing scene structure information metadata that describes the hierarchical structure of the content in addition to scene section information and titles. Also, a name or an identifier of each descriptor contained in the metadata is described as hint information for manipulation of metadata composed of at least one descriptor describing semantic content, a structure, and characteristics of content.

Description

  • This application is a Divisional of co-pending application Ser. No. 10/510,548, filed on Oct. 8, 2004, and for which priority is claimed under 35 U.S.C. § 120. Application Ser. No. 10/510,548 is the national phase of PCT International Application No. PCT/JP03/03450, filed on Mar. 20, 2003 under 35 U.S.C. § 371, which claims priority from Japanese Application Nos. 2002-110259 filed on Apr. 12, 2002, and 2002-178169 filed on Jun. 19, 2002. The entire contents of each of the above-identified applications are hereby incorporated by reference.
  • TECHNICAL FIELD
  • The present invention relates to a metadata editing apparatus, a metadata reproduction apparatus, a metadata delivery apparatus, a metadata search apparatus, a metadata re-generation condition setting apparatus, a content delivery apparatus, and a metadata delivery method, with which, for instance, multimedia content containing moving pictures and audio is divided into multiple scenes and metadata is generated for each of the scenes obtained as a result of the division.
  • BACKGROUND ART
  • A conventional moving picture management apparatus is known which includes: a means for dividing a video into multiple scenes and editing and creating at least one index as an assembly of section information necessary for reproduction of each scene, a scene number assigned to each scene, and a representative image of each scene; means for giving a title to each index for the purpose of searching; and means for searching for a desired index using a corresponding title and successively reproducing scenes of the index in order of the scene numbers. With this construction, it becomes possible to reproduce only necessary scenes by editing an index in which the necessary scenes are arranged (see Japanese Patent Laid-Open No. 2001-028722 (page 1, FIG. 1), for instance).
  • With the moving picture management apparatus described above, however, metadata is merely created using the section information necessary for the scene reproduction, the scene number, and the scene representative image. Therefore, there remains a problem that it is impossible to also manage the structure of video data such as the hierarchical property of the video data.
  • Also, at the time of searching for a registered image, the title given to a corresponding index is used, causing a disadvantage in that in order to obtain an appropriate search result, an appropriate title needs to be input.
  • The present invention has been made in order to solve the problems described above. Therefore, it is an object of the present invention to provide a metadata editing apparatus capable of generating metadata that is index information showing the structure and the like of content (video data, for instance) in addition to scene section information and titles.
  • It is another object of the present invention to provide a metadata reproduction apparatus, a metadata delivery apparatus, a metadata search apparatus, a metadata re-generation condition setting apparatus, a content delivery apparatus, and a metadata delivery method, with which it is possible to collect and reproduce only scenes which a user wishes to watch using the metadata generated by the metadata editing apparatus, or to search for the scenes desired by the user using characteristic amounts or the like described in the metadata.
  • DISCLOSURE OF INVENTION
  • A metadata editing apparatus according to the present invention is provided with: a scene division unit for dividing multimedia content containing at least one of moving pictures and audio into a plurality of scenes to generate scene section information metadata indicating a scene start position and a scene end position for each scene obtained as a result of the division; a scene description edit unit for performing hierarchical editing of each scene of the multimedia content based on the scene section information metadata sent from the scene division unit and generating scene structure information metadata describing a hierarchical structure of the multimedia content; and a metadata description unit for integrating the scene section information metadata and the scene structure information metadata and generating metadata describing contents and a structure of the multimedia content in accordance with a predetermined format.
  • Further, a metadata delivery apparatus according to the present invention is provided with: a hint information analysis unit for analyzing metadata optimization hint information describing a type and content of each descriptor contained in metadata; a metadata analysis/re-generation unit for analyzing metadata describing contents and a structure of multimedia content containing at least one of moving pictures and audio based on the analyzed metadata optimization hint information and a condition for metadata re-generation and re-generating second metadata; and a metadata delivery unit for delivering the second metadata re-generated by the metadata analysis/re-generation unit to a client terminal.
  • Further, a metadata delivery method according to the present invention includes the steps of: analyzing metadata optimization hint information describing a type of each descriptor contained in metadata; re-generating second metadata by analyzing the metadata describing contents and a structure of multimedia content containing at least one of moving pictures and audio based on the analyzed metadata optimization hint information and a condition for re-generation of the metadata; and delivering the re-generated second metadata to a client terminal.
  • Further, a hint information description method according to the present invention includes the step of: describing, as hint information for manipulation of metadata composed of at least one descriptor describing semantic content, a structure, and characteristics of content, a name or an identifier of each descriptor contained in the metadata.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a block diagram showing a construction of a metadata editing apparatus according to a first embodiment of the present invention.
  • FIG. 2 shows a news video that is an example of a target of editing by the metadata editing apparatus according to the first embodiment of the present invention.
  • FIG. 3 shows an example of scene section information metadata of a scene division unit of the metadata editing apparatus according to the first embodiment of the present invention.
  • FIG. 4 shows an example of scene structure information metadata of a scene description edit unit of the metadata editing apparatus according to the first embodiment of the present invention.
  • FIG. 5 shows examples of screen images of a content reproduction/display unit and a user input unit of the metadata editing apparatus according to the first embodiment of the present invention.
  • FIG. 6 is a block diagram showing a construction of a metadata editing apparatus according to a second embodiment of the present invention.
  • FIG. 7 illustrates how the metadata editing apparatus according to the second embodiment of the present invention operates.
  • FIG. 8 is a block diagram showing a construction of a metadata reproduction apparatus according to a third embodiment of the present invention.
  • FIG. 9 illustrates how the metadata reproduction apparatus according to the third embodiment of the present invention operates.
  • FIG. 10 is a block diagram showing a construction of a content delivery system according to a fourth embodiment of the present invention.
  • FIG. 11 shows content (in this case, a news video) structure information outputted from a metadata analysis unit of a metadata delivery server according to the fourth embodiment of the present invention.
  • FIG. 12 shows an example of a structure of content after restructuring by a metadata re-generation unit of the content delivery system according to the fourth embodiment of the present invention.
  • FIG. 13 is a block diagram showing a construction of a metadata delivery server according to a fifth embodiment of the present invention.
  • FIG. 14 shows an example of video content, with reference to which processing of metadata optimization hint information by the metadata delivery server according to the fifth embodiment of the present invention is described.
  • FIG. 15 shows how metadata is described in MPEG-7 by the metadata delivery server according to the fifth embodiment of the present invention.
  • FIG. 16 shows an example of a format of the metadata optimization hint information used by the metadata delivery server according to the fifth embodiment of the present invention.
  • FIG. 17 shows the metadata optimization hint information used by the metadata delivery server according to the fifth embodiment of the present invention.
  • FIG. 18 is a flowchart showing how a metadata analysis/re-generation unit of the metadata delivery server according to the fifth embodiment of the present invention operates.
  • FIG. 19 is another flowchart showing how the metadata analysis/re-generation unit of the metadata delivery server according to the fifth embodiment of the present invention operates.
  • FIG. 20 is a block diagram showing a construction of a metadata search server according to a sixth embodiment of the present invention.
  • FIG. 21 is a flowchart showing how a metadata analysis unit of the metadata search server according to the sixth embodiment of the present invention operates.
  • FIG. 22 is a block diagram showing a construction of a client terminal according to a seventh embodiment of the present invention.
  • FIG. 23 is a block diagram showing a construction of a content delivery server according to an eighth embodiment of the present invention.
  • BEST MODE FOR CARRYING OUT THE INVENTION
  • Embodiments of the present invention will now be described with reference to the accompanying drawings, with a first embodiment relating to a metadata editing apparatus, a second embodiment relating to another metadata editing apparatus, a third embodiment relating to a metadata reproduction apparatus, a fourth embodiment relating to a content delivery system, a fifth embodiment relating to a metadata delivery server, a sixth embodiment relating to a metadata search server, a seventh embodiment relating to a client terminal, and an eighth embodiment relating to a content delivery server.
  • FIRST EMBODIMENT
  • In this first embodiment, a metadata editing apparatus will be described which divides multimedia content containing moving pictures and audio into multiple scenes and creates metadata (index information) containing descriptions of a scene hierarchical structure and characteristic amounts of each scene.
  • The metadata editing apparatus according to the first embodiment of the present invention will be described with reference to the accompanying drawings. FIG. 1 is a block diagram showing a construction of the metadata editing apparatus according to the first embodiment of the present invention. Note that in each drawing, the same reference numerals denote the same or equivalent portions.
  • Referring to FIG. 1, a metadata editing apparatus 100 includes a content reproduction/display unit 2, a scene division unit 3, a thumbnail image generation unit 4, a scene description edit unit 5, a text information giving unit 6, a characteristic extraction unit 7, a user input unit 8, and a metadata description unit 9.
  • The content reproduction/display unit 2 reproduces and displays multimedia content 10 that includes video data and audio data and is a target of editing. The scene division unit 3 divides the content into multiple scenes. The thumbnail image generation unit 4 extracts a representative frame of each scene as a thumbnail image. The scene description edit unit 5 hierarchically edits the scenes obtained as a result of the division by the scene division unit 3 through scene grouping, scene combining, scene deletion, generation of information that shows relations among the scenes, and the like. The text information giving unit 6 gives various types of text information to each scene. The characteristic extraction unit 7 extracts characteristics of each scene.
  • Also, the user input unit 8 receives input of designation information from a user and outputs it to the content reproduction/display unit 2, the scene division unit 3, the thumbnail image generation unit 4, the scene description edit unit 5, and the text information giving unit 6 as user input information 11.
  • Further, the metadata description unit 9 integrates scene section information metadata 12, scene thumbnail image information metadata 13, scene structure information metadata 14, text information metadata 15, and characteristic description metadata 16 outputted from the scene division unit 3, the thumbnail image generation unit 4, the scene description edit unit 5, the text information giving unit 6, and the characteristic extraction unit 7, respectively. The metadata description unit 9 then generates metadata 17 describing the contents and structure of the multimedia content in accordance with a specified format.
  • Next, how the metadata editing apparatus according to the first embodiment operates will be described with reference to the accompanying drawings. FIG. 2 shows a construction of a news video that is an example of a target of editing by the metadata editing apparatus according to the first embodiment.
  • A case where the news video having the construction shown in FIG. 2 is edited will be described as an example.
  • First, the content reproduction/display unit 2 of the metadata editing apparatus 100 receives input of the multimedia content 10, such as video content, stored in a content storage unit (not shown) via a network or the like, and reproduces/displays the multimedia content 10 for editing.
  • When the user of the metadata editing apparatus 100 inputs positions for clipping a scene, which is to say a scene start position and a scene end position, using the user input unit 8 while watching the reproduced video, the scene division unit 3 generates the scene section information metadata 12 showing the scene start position and the scene end position inputted from the user.
  • FIG. 3 shows an example of the scene section information metadata generated by the scene division unit of the metadata editing apparatus according to the first embodiment.
  • Here, the scene section information metadata 12 shown in FIG. 3 was generated from the news video shown in FIG. 2. As shown in FIG. 3, the scene section information metadata 12 generated by the scene division unit 3 gives the scene start position and the scene end position of each scene clipped from the news video content, such as a “news digest” scene, a “domestic news” scene, and an “international news” scene.
  • On receiving designation of scene editing from the user via the user input unit 8, the scene description edit unit 5 performs hierarchical editing of the scenes continuously clipped by the scene division unit 3 based on the scene section information metadata 12 from the scene division unit 3, and then outputs the scene structure information metadata 14. Here, the scene hierarchical editing refers to scene grouping, scene re-division, scene combining, or scene deletion, for instance. The scene grouping refers to grouping of scenes that are related to each other with respect to specific characteristics into a single group. For instance, as shown in FIG. 4, the “domestic news” scene, the “international news” scene, and the “financial news” scene of the news video shown in FIG. 2 are grouped into a single “news” group. Also, the scene re-division refers to division of a single scene into multiple scenes and the scene combining refers to generation of a single scene by combining multiple scenes with each other.
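  • The scene grouping, scene re-division, and scene combining operations described above can be sketched as follows. The `Scene` class and the three functions are hypothetical, scene positions are modeled as times in seconds, and the naming of re-divided scenes is an arbitrary choice of this sketch.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Scene:
    """A scene with its section information and child scenes."""
    name: str
    start: float
    end: float
    children: List["Scene"] = field(default_factory=list)


def group(name: str, scenes: List[Scene]) -> Scene:
    """Scene grouping: place related scenes under one parent scene."""
    if not scenes:
        raise ValueError("at least one scene is required")
    ordered = sorted(scenes, key=lambda s: s.start)
    return Scene(name, ordered[0].start, max(s.end for s in ordered), ordered)


def redivide(scene: Scene, at: float) -> Tuple[Scene, Scene]:
    """Scene re-division: split one scene into two at the given position."""
    if not scene.start < at < scene.end:
        raise ValueError("split position must lie inside the scene")
    return (Scene(scene.name + " (1)", scene.start, at),
            Scene(scene.name + " (2)", at, scene.end))


def combine(name: str, first: Scene, second: Scene) -> Scene:
    """Scene combining: generate a single scene from two scenes."""
    return Scene(name, min(first.start, second.start),
                 max(first.end, second.end))
```

  • Grouping the “domestic news”, “international news”, and “financial news” scenes with `group("news", [...])` yields a parent scene whose section spans all three, which mirrors the “news” group of FIG. 4.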
  • FIG. 4 shows an example of the scene structure information metadata generated by the scene description edit unit of the metadata editing apparatus according to the first embodiment.
  • The scene structure information metadata 14 shown in FIG. 4 describes the hierarchical structure of the video content generated as a result of the editing by the scene description edit unit 5. In FIG. 4, a “news” scene is edited into multiple scenes, such as a “news digest” scene, a “news” scene, a “special” scene, and a “sports” scene, and the “news” scene is further hierarchically edited into a “domestic news” scene, an “international news” scene, and a “financial news” scene by the scene description edit unit 5 through scene editing such as the scene grouping, scene re-division, and scene combining.
  • Then, the metadata 14 generated by the scene description edit unit 5, such as the metadata shown in FIG. 4, is outputted to the metadata description unit 9.
  • On the other hand, the thumbnail image generation unit 4 generates a representative frame of each scene clipped by the scene division unit 3 as a thumbnail image based on the scene section information metadata 12 from the scene division unit 3, and outputs information concerning the generated thumbnail image as the thumbnail image information metadata 13 to the metadata description unit 9, in which the thumbnail image information metadata 13 is registered. Here, it is possible for the user to perform selection of the thumbnail using the user input unit 8, although it is also possible to automatically set a head frame or each frame clipped at fixed time intervals as the representative frame or to automatically detect each scene change point and set a frame at each detected point as the representative frame. The thumbnail image information metadata 13 is information showing the position (such as the frame number or time) of the thumbnail in the video content or information giving the location (such as the URL) of the thumbnail image.
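  • Two of the automatic ways of choosing representative frames mentioned above, the head frame and frames at fixed intervals, can be sketched as follows. The function names are hypothetical, and positions are expressed as frame numbers for simplicity; scene change detection is not covered by this sketch.

```python
from typing import List


def head_frame(start_frame: int, end_frame: int) -> int:
    """Use the first frame of the scene as the representative frame."""
    return start_frame


def frames_at_interval(start_frame: int, end_frame: int,
                       step: int) -> List[int]:
    """Pick frames at a fixed interval between the scene start and end."""
    if step <= 0:
        raise ValueError("step must be positive")
    return list(range(start_frame, end_frame + 1, step))
```

  • The returned frame numbers correspond to the position information that the thumbnail image information metadata 13 can carry for each thumbnail.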
  • Also, the characteristic extraction unit 7 extracts visual characteristic amounts possessed by each scene, such as motions, colors, or shapes of objects contained in the scene, from the scene based on the scene section information metadata 12 from the scene division unit 3. The extracted characteristic amounts are outputted to the metadata description unit 9 as the characteristic description metadata 16 and are registered therein.
  • Also, the text information giving unit 6 gives various types of text information, such as a title, an abstract, a keyword, a comment, and scene importance, designated by the user to each scene based on the scene section information metadata 12 from the scene division unit 3. Here, the text information may be given through user's input using the user input unit 8 or be automatically given through analysis of audio information and captions contained in the content. The text information is outputted to the metadata description unit 9 and is registered therein as the text information metadata 15.
  • FIG. 5 shows examples of screen images displayed by the content reproduction/display unit and the user input unit 8 of the metadata editing apparatus according to the first embodiment. In FIG. 5, a video reproduction screen G1 is an example of the screen image displayed by the content reproduction/display unit 2, with content to be edited being reproduced/displayed on this video reproduction screen G1. Although not clearly shown in FIG. 5, like in the case of an ordinary video reproduction apparatus, a user interface is also provided which includes buttons and the like for commanding “reproduction”, “stop”, “rewind”, “fast forward”, “frame advance”, and other operations. Also, below the video reproduction screen G1, a scene division designation screen G2 is displayed which has a slider form, for instance. The user designates a scene start position and a scene end position of the video displayed on the video reproduction screen G1 through this scene division designation screen G2 while watching the video displayed on the video reproduction screen G1. Also, the user simultaneously designates the position of a thumbnail between the scene start position and the scene end position through the scene division designation screen G2. Here, when the thumbnail position is designated through the scene division designation screen G2, the thumbnail image generation unit 4 generates a thumbnail image from a frame of the video content at the designated position.
  • Also, the thumbnail image, whose position has been designated through the scene division designation screen G2, is displayed on a scene division information display screen G3 as scene division information. Here, on this scene division information display screen G3, it is also possible to display information showing the scene start position and the scene end position in addition to the thumbnail image, as shown in FIG. 3.
  • Next, the user designates scene editing through a tree structure generation designation/display screen G4. That is, the user generates a tree showing the hierarchical structure possessed by the video content while watching the scene division information, such as the thumbnail image, displayed on the scene division information display screen G3.
  • When performing the scene grouping, the user uses a manipulation method with which, for instance, a new node is added to the tree and each scene that should be grouped is added to the node. In order to perform the scene addition, the user may use a method with which a scene that should be added is selected on the scene division information display screen G3 and the selected scene is added to the node by a drag-and-drop operation. Here, it is possible for the user to input text information for the selected scene from the scene division information display screen G3 or the tree structure generation designation/display screen G4 using the user input unit 8 provided as a user interface for giving the text information to the scene via the text information giving unit 6.
  • The metadata description unit 9 generates a metadata file described in accordance with a specified description format by integrating the various types of metadata outputted from the scene division unit 3, the thumbnail image generation unit 4, the scene description edit unit 5, the text information giving unit 6, and the characteristic extraction unit 7. The specified metadata description format may be a uniquely determined format, although MPEG-7 standardized by ISO is used in this first embodiment. The MPEG-7 stipulates a format for describing the structure and characteristics of content and includes an XML file format and a binary format.
  • As described above, the metadata editing apparatus 100 of the first embodiment is provided with the scene description edit unit 5 for hierarchically editing scenes and the characteristic extraction unit 7 for extracting characteristics from the scenes, so that it becomes possible to generate metadata describing the hierarchical structure possessed by content, such as video data, and characteristic amounts of each scene.
  • It should be noted here that the multimedia content 10 inputted into the content reproduction/display unit 2 is obtained from a content server (not shown) existing on a network, from a content storage unit (not shown) in the metadata editing apparatus 100, or from an accumulation medium (not shown) such as a CD or a DVD, for instance. In a like manner, the metadata outputted from the metadata description unit 9 is accumulated in a metadata server (not shown) existing on a network, in a metadata accumulation unit (not shown) in the metadata editing apparatus, or in an accumulation medium (not shown), such as a CD or a DVD, together with content, for instance.
  • Also, in the first embodiment, a case where the metadata editing apparatus 100 is provided with both of the scene description edit unit 5 and the characteristic extraction unit 7 has been described. However, the present invention is not limited to this and it is of course possible to provide the metadata editing apparatus 100 with only one of the scene description edit unit 5 and the characteristic extraction unit 7.
  • SECOND EMBODIMENT
  • In the first embodiment described above, every scene is divided manually. However, a metadata editing apparatus to be described in this second embodiment is provided with a scene change detection unit for automatically detecting each scene change point.
  • The metadata editing apparatus according to the second embodiment of the present invention will be described with reference to the accompanying drawings. FIG. 6 is a block diagram showing a construction of the metadata editing apparatus according to the second embodiment of the present invention.
  • Referring to FIG. 6, a metadata editing apparatus 100A includes a content reproduction/display unit 2, a scene division unit 3, a thumbnail image generation unit 4, a scene description edit unit 5, a text information giving unit 6, a characteristic extraction unit 7, a user input unit 8, a metadata description unit 9, and a scene change detection unit 39. Note that reference numeral 40 denotes scene start position information which is automatically detected.
  • Next, how the metadata editing apparatus according to the second embodiment operates will be described with reference to the accompanying drawings.
  • FIG. 7 illustrates how the metadata editing apparatus according to the second embodiment of the present invention operates.
  • The construction elements other than the scene change detection unit 39 and the scene division unit 3 operate in the same manner as in the first embodiment described above. Therefore, operations unique to the second embodiment will be described below.
  • The scene change detection unit 39 automatically detects each scene change/cut point. This scene change detection is performed based on a difference in pixel between frames, a difference in color between the frames, a difference in luminance histogram between the frames, or the like, for instance. The scene division unit 3 determines a scene start position and a scene end position based on each scene change point detected by the scene change detection unit 39.
  • Hereinafter, processing by the scene change detection unit 39 and the scene division unit 3 will be described in detail by taking, as an example, a case where a news video is content that is a target of editing.
  • A case where a color histogram is used as characteristic amounts for the scene change detection will be described as an example.
  • The scene change detection unit 39 calculates a color histogram for each frame. Color systems such as HSV, RGB, and YCbCr are available; an HSV color space is used in this example. This HSV color space is composed of three elements called “hue (H)”, “saturation (S)”, and “value (V)”, and a histogram of each element is calculated. Next, from the obtained histograms, a difference in histogram between frames is calculated based on Equation 1 given below, for instance. Here, it is assumed that frames from a scene start frame to the Nth frame (N=3, for instance) belong to the same scene, that is, do not contain any scene change point. Note that as the initial characteristic amounts of the scene, a mean value (mean) and a standard deviation (sd) of the differences in histogram between the first N frames are obtained based on Equation 2 given below.

$$\mathrm{sum}_i = \sum_{k=1}^{\mathrm{bin\_H}} \left| H_i(k) - H_{i-1}(k) \right| + \sum_{k=1}^{\mathrm{bin\_S}} \left| S_i(k) - S_{i-1}(k) \right| + \sum_{k=1}^{\mathrm{bin\_V}} \left| V_i(k) - V_{i-1}(k) \right| \qquad \text{(Equation 1)}$$

    sum_i: a sum of differences in histogram between a frame i and a frame i-1
    H_i(k): a hue histogram, bin_H: the number of elements of the histogram
    S_i(k): a saturation histogram, bin_S: the number of elements of the histogram
    V_i(k): a value histogram, bin_V: the number of elements of the histogram

$$\mathrm{mean} = \frac{1}{N-1} \sum_{i=1}^{N-1} \mathrm{sum}_i, \qquad \mathrm{sd} = \sqrt{\frac{1}{N-1} \sum_{i=1}^{N-1} \left( \mathrm{sum}_i - \mathrm{mean} \right)^2} \qquad \text{(Equation 2)}$$

    mean: a mean value of the differences in histogram between the frames
    sd: a standard deviation of the differences in histogram between the frames
  • Then, each of the (N+1)th and subsequent frames that has an inter-frame difference in histogram greater than “mean + λ·sd” is regarded as a scene change point and is set as a new scene start position candidate.
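  • The histogram difference, the initial statistics, and the threshold test described above can be sketched as follows. This is only an illustration under stated assumptions: absolute bin differences are summed, frames are indexed from 0, the function names and the default value of λ (`lam=3.0`) are hypothetical, and the statistics are computed once from the first N frames and not updated afterwards, which the specification does not state either way.

```python
import math
from typing import List, Sequence, Tuple

# One frame is represented by its (H, S, V) histograms.
Hist = Tuple[Sequence[float], Sequence[float], Sequence[float]]


def hist_diff(cur: Hist, prev: Hist) -> float:
    """Summed absolute bin differences over the H, S, and V histograms."""
    return sum(
        abs(a - b)
        for cur_ch, prev_ch in zip(cur, prev)
        for a, b in zip(cur_ch, prev_ch)
    )


def scene_start_candidates(frames: List[Hist], n: int = 3, lam: float = 3.0) -> List[int]:
    """Return 0-based indices of frames whose difference exceeds mean + lam * sd."""
    # sums[i] is the difference between frame i and frame i-1 (sums[0] is unused).
    sums = [0.0] + [hist_diff(frames[i], frames[i - 1]) for i in range(1, len(frames))]
    # Statistics over the first n frames, i.e. over n-1 differences.
    init = sums[1:n]
    mean = sum(init) / (n - 1)
    sd = math.sqrt(sum((s - mean) ** 2 for s in init) / (n - 1))
    threshold = mean + lam * sd
    # Only the (n+1)th and later frames (index n onward) are tested.
    return [i for i in range(n, len(frames)) if sums[i] > threshold]
```

In practice the histograms would be computed from decoded frames; here they are passed in directly so that the sketch stays self-contained.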
  • In this manner, multiple scene start position candidates are obtained. Next, there will be considered a case where, as in a news video, an image having a predetermined pattern is inserted at switching between news items or the like.
  • In many cases, in a news video, an image having a predetermined pattern, such as an image composed of an announcer, a studio set in the background, and a character description (caption), is inserted at switching between news items, for instance. Accordingly, the image having the predetermined pattern (hereinafter referred to as the “template image”) or metadata describing the characteristic amounts of the template image is registered in advance. For instance, the characteristic amounts of the template image are the color histogram of the template image, the motion pattern (for instance, little motion is observed in the area of an announcer at switching between news items), or the like.
  • When the template image is registered in advance, each image corresponding to a scene change point is matched against the template image, as shown in FIG. 7. Then, if the similarity therebetween is high, the scene change point is registered as a scene start position. The similarity may be judged based on inter-frame differences, inter-frame differences in color histogram, or the like.
  • Also, when the characteristic amounts of the template image are registered in advance, characteristic amounts of each image corresponding to a scene change point are extracted and are matched against the characteristic amounts of the template image. If the similarity therebetween is high, the scene change point is registered as a scene start position. Then, information showing the scene start position is outputted to the scene division unit 3.
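  • The matching of scene change points against registered template characteristics might look like the sketch below. It assumes that the characteristic amount is a single color histogram and that similarity is judged by a summed absolute bin difference compared with a tolerance; the function names and the `max_distance` parameter are hypothetical.

```python
from typing import Dict, List, Sequence


def histogram_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of absolute bin differences between two histograms of equal length."""
    return sum(abs(x - y) for x, y in zip(a, b))


def confirm_scene_starts(
    candidates: Dict[int, Sequence[float]],
    template: Sequence[float],
    max_distance: float,
) -> List[int]:
    """Keep the candidate frames whose histogram is close enough to the template.

    `candidates` maps a frame number to the histogram of the image at that
    scene change point.
    """
    return sorted(
        frame
        for frame, hist in candidates.items()
        if histogram_distance(hist, template) <= max_distance
    )
```

A small distance corresponds to high similarity, so only the scene change points that resemble the template are registered as scene start positions.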
  • The scene division unit 3 determines a scene start position and a scene end position based on the information showing the scene start position automatically detected by the scene change detection unit 39. Note that the scene division unit 3 of the second embodiment is also capable of determining the scene start position and the scene end position based on designation from the user, like in the first embodiment described above.
  • It is also possible for the scene change detection unit 39 to detect each scene change point contained in a scene with reference to each scene start position and each scene end position described in the scene section information metadata 12 outputted from the scene division unit 3 to the scene change detection unit 39.
  • The scene description edit unit 5 re-divides or integrates the scenes automatically detected by the scene change detection unit 39 based on the scene section information metadata 12 from the scene division unit 3. Note that the details of the scene description edit unit 5 are the same as those in the first embodiment described above.
  • As described above, with the metadata editing apparatus 100A according to the second embodiment, it becomes possible to generate metadata describing the hierarchical structure possessed by content, such as video data, and characteristic amounts of each scene, like in the first embodiment described above. In addition, the scene change detection unit 39 is provided, so that it becomes possible to automatically detect each scene change point in content.
  • THIRD EMBODIMENT
  • In this third embodiment, a metadata reproduction apparatus will be described which performs summary reproduction of images, searching, and the like using the metadata generated by the metadata editing apparatus according to the first embodiment or the second embodiment described above.
  • The metadata reproduction apparatus according to the third embodiment of the present invention will be described with reference to the accompanying drawings. FIG. 8 is a block diagram showing a construction of the metadata reproduction apparatus according to the third embodiment of the present invention.
  • Referring to FIG. 8, a metadata reproduction apparatus 200 includes a metadata analysis unit 19, a structure display unit 20, a thumbnail image display unit 21, a user input unit 22, a search unit 23, a search result display unit 24, a summary creation unit 25, a summary structure display unit 26, and a content reproduction unit 27.
  • The metadata analysis unit 19 performs analysis of metadata 28 describing the hierarchical scene structure possessed by content, information concerning the thumbnail of each scene, the characteristic amounts of each scene, and the like. The structure display unit 20 displays a scene structure 29 obtained as a result of the metadata analysis, that is, the hierarchical structure of the content. The thumbnail image display unit 21 displays thumbnail image information 30 obtained as a result of the metadata analysis.
  • With the user input unit 22, a user inputs search designation, reproduction designation, and the like. The search unit 23 performs searching based on the search designation (search condition 31) from the user and the scene characteristic amounts or text information 32 obtained from the metadata. The search result display unit 24 displays a result 33 of the searching. The summary creation unit 25 performs creation of a summary based on summary creation designation (summary creation condition 34) from the user. The summary structure display unit 26 displays a structure 38 of summarized content. The content reproduction unit 27 reproduces/displays the content based on summary information 35, content reproduction designation 36, and content 37 to be reproduced.
  • Next, how the metadata reproduction apparatus according to the third embodiment operates will be described with reference to the accompanying drawings.
  • First, the metadata analysis unit 19 receives input of the metadata 28 describing the hierarchical scene structure possessed by the content, information concerning the thumbnail of each scene, the characteristic amounts of each scene, and the like, and performs analysis of the metadata.
  • In the third embodiment, it is assumed that the metadata 28 is metadata generated by the metadata description unit 9 of the first embodiment or the second embodiment described above in a format stipulated by MPEG-7. Consequently, the metadata is a text file written in XML or a binary file encoded in the binary format.
  • If the metadata 28 is written in XML, the metadata analysis unit 19 serves as an XML parser that performs analysis of an XML file. On the other hand, if the metadata 28 is encoded in the binary format, the metadata analysis unit 19 serves as a decoder that performs decoding of the metadata 28.
  • The structure display unit 20 receives input of a result of the analysis by the metadata analysis unit 19 and displays the hierarchical scene structure 29 of the content. The scene structure of the content is displayed in a tree form together with the title of each scene, as shown in FIG. 4.
  • The thumbnail image display unit 21 receives input of the result of the analysis by the metadata analysis unit 19 (thumbnail image information 30) and displays a list of thumbnail images of the content.
  • The search unit 23 receives search designation from the user via the user input unit 22 and searches for a scene contained in the content. At this time, the user inputs a search condition by giving a keyword, a sample image, or the like via the user input unit 22. The search unit 23 searches for each scene matching the search condition 31, such as the keyword or the characteristics of the sample image, given by the user based on the scene characteristic amounts described in the metadata or the text information 32 giving scene titles and the like.
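  • As an illustration of the keyword case only, a search over the text information could be sketched as follows. The `SceneText` class and the case-insensitive substring matching are assumptions; searching by a sample image would instead compare characteristic amounts and is not shown.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SceneText:
    """Text information attached to one scene (illustrative fields only)."""
    title: str
    keywords: List[str] = field(default_factory=list)


def search_by_keyword(scenes: List[SceneText], keyword: str) -> List[str]:
    """Return the titles of scenes whose title or keywords contain the query."""
    query = keyword.lower()
    return [
        scene.title
        for scene in scenes
        if any(query in text.lower() for text in [scene.title, *scene.keywords])
    ]
```

The returned titles stand in for the search result 33; an actual apparatus would display the thumbnail of each matching scene.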
  • When the searching by the search unit 23 is finished, the search result display unit 24 receives input of the result 33 of the searching by the search unit 23 and performs displaying of the search result. As a method for displaying the search result, the thumbnail image of each scene matching the search condition is displayed, for instance.
  • Also, the summary creation unit 25 creates a summary of the content based on summary creation designation from the user via the user input unit 22. At this time, the user inputs information showing the reproduction time of summarized content, user preferences, and the like using the user input unit 22. When the content is a news video, for instance, the user inputs preference information showing that, for instance, he/she wishes to mainly watch sports news in the news video or to watch a 20-minute summary of the news video whose original length is one hour. The summary creation unit 25 also creates the summary information 35 matching the summary condition based on the scene reproduction times described in the metadata and the text information 32 giving the scene titles and the like. For instance, this summary information 35 is a reproduction list of scenes contained in the summarized content and is a list in which the location information, such as the URL, of the content is written together with the start position and end position of each scene in the content that the user wishes to reproduce.
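  • One way to picture the creation of such a reproduction list is the greedy sketch below. It assumes that the preference is a single keyword matched against scene titles and that scenes are added while they fit into the requested reproduction time; the selection rule, the class and function names, and the example URL are hypothetical and are not prescribed by the specification.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class SceneEntry:
    title: str
    start: float  # seconds from the beginning of the content
    end: float


def make_play_list(url: str, scenes: List[SceneEntry],
                   preferred: str, max_seconds: float) -> Dict[str, object]:
    """Pick scenes within the time budget, preferred titles first."""
    # sorted() is stable, so the original order is kept inside each group.
    ranked = sorted(scenes, key=lambda s: preferred.lower() not in s.title.lower())
    chosen: List[SceneEntry] = []
    used = 0.0
    for scene in ranked:
        length = scene.end - scene.start
        if used + length <= max_seconds:
            chosen.append(scene)
            used += length
    chosen.sort(key=lambda s: s.start)  # reproduce in original temporal order
    return {"content": url, "scenes": [(s.start, s.end) for s in chosen]}


news = [
    SceneEntry("news digest", 0, 300),
    SceneEntry("domestic news", 300, 1500),
    SceneEntry("sports", 1500, 2400),
    SceneEntry("special", 2400, 3600),
]
# A 20-minute summary of a one-hour program, favoring sports.
play_list = make_play_list("http://example.com/news.mpg", news, "sports", 20 * 60)
```

The resulting dictionary plays the role of the summary information 35: the location of the content together with the start and end position of each scene to be reproduced.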
  • Also, the content reproduction/display unit 27 specifies target content based on the location information of the content contained in the summary information 35, and performs obtainment/reproduction/display of each scene to be reproduced based on the scene list contained in the summary information. In another form, the summary information hierarchically describes the scene structure of the summarized content.
  • FIG. 9 shows an example of a hierarchical scene structure. FIG. 9(a) shows an example of a scene structure of original content. Each scene is given importance in a range of 0.0 to 1.0, with “1.0” meaning the highest importance and “0.0” meaning the lowest importance. The importance is calculated based on the user preferences, for instance. If the user preferences are registered in advance and indicate that he/she wishes to watch scenes of a soccer game of a team A and, in particular, to necessarily watch a result of the game and goal scenes, each scene is given importance reflecting the user preferences.
  • Following this, when summarization is performed using only scenes having the highest importance in FIG. 9(a), there is generated summarized content having the scene structure shown in FIG. 9(b). Note that each scene has metadata showing the location information, such as the URL, of the content containing the scene, the position information (the start position and the end position) of the scene in the content, and the like. Information concerning the scene structure 38 of the summarized content is passed to the summary structure display unit 26, which then displays the scene structure 38 in the tree form shown in FIG. 9(b).
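  • Summarization by importance can be read as pruning the scene tree. The sketch below is one possible interpretation, not a reproduction of FIG. 9: leaf scenes below a threshold are dropped, and a parent is kept only if at least one of its descendants survives. The `Node` class, the scene titles, and the importance values are invented for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    title: str
    importance: float  # 0.0 (lowest) to 1.0 (highest)
    children: List["Node"] = field(default_factory=list)


def summarize(node: Node, threshold: float) -> Optional[Node]:
    """Return a pruned copy of the tree, or None if nothing survives."""
    if not node.children:
        return node if node.importance >= threshold else None
    kept = [
        pruned
        for pruned in (summarize(child, threshold) for child in node.children)
        if pruned is not None
    ]
    return Node(node.title, node.importance, kept) if kept else None


game = Node("soccer game", 1.0, [
    Node("first half", 0.5, [Node("kickoff", 0.2), Node("goal", 1.0)]),
    Node("halftime", 0.0),
    Node("result", 1.0),
])
summary = summarize(game, threshold=1.0)
```

With a threshold of 1.0, only the scenes having the highest importance remain, together with the grouping nodes needed to display them in tree form.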
  • Also, when the user selects at least one scene that he/she wishes to reproduce using the scene structure displayed by the structure display unit 20 or the summary structure display unit 26 or using the scene thumbnails displayed by the thumbnail image display unit 21 or the search result display unit 24 via the user input unit 22, the content reproduction/display unit 27 reproduces/displays each selected scene contained in the content.
  • As described above, with the metadata reproduction apparatus 200 according to the third embodiment, it becomes possible to reproduce only each scene that the user wishes to watch using the metadata generated by the metadata editing apparatus according to the first embodiment or the second embodiment described above or to search for the scene desired by the user using the characteristic amounts described in the metadata.
  • In the third embodiment, the content reproduction/display unit 27 is provided within the metadata reproduction apparatus 200. However, this content reproduction/display unit may be provided in another apparatus. For instance, manipulations and displaying concerning reproduction of the metadata, such as displaying of the scene structure and the thumbnail images, may be performed by a mobile telephone, a portable information terminal, or the like, and processing and displaying concerning reproduction of the multimedia content may be performed by a terminal (PC, for instance) connected to the mobile telephone, the portable information terminal, or the like via a network.
  • FOURTH EMBODIMENT
  • In this fourth embodiment, a metadata delivery server (metadata delivery apparatus), which delivers the metadata of content to a client terminal, and a content delivery server, which scalably constructs the content with reference to the terminal capability of the client terminal and delivers the constructed content to the client terminal, will be described.
  • A content delivery system according to the fourth embodiment of the present invention will be described with reference to the accompanying drawings. FIG. 10 is a block diagram showing a construction of the content delivery system according to the fourth embodiment of the present invention.
  • Referring to FIG. 10, a content delivery system 300 includes a metadata delivery server 400, various client terminals 481 to 48 n, and a content delivery server 500.
  • The metadata delivery server 400 includes a metadata accumulation unit 41, a metadata analysis unit 42, a terminal capability judgment unit 43, a metadata re-generation unit 44, and a metadata delivery unit 45.
  • In the metadata accumulation unit 41, there is accumulated the metadata generated by the metadata editing apparatus of the first embodiment or the second embodiment described above, for instance. The metadata analysis unit 42 performs analysis of metadata 49 describing the structure and characteristics of content. The terminal capability judgment unit 43 judges the terminal capability of each client terminal based on information 51 concerning the capability of the client terminal. The metadata re-generation unit 44 restructures the content in accordance with the judged terminal capability of the client terminal based on a result 50 of the analysis of the metadata, and re-generates metadata 52 of the restructured content. The metadata delivery unit 45 delivers metadata 53 re-generated by the metadata re-generation unit 44 to the client terminals 481 to 48 n.
  • Note that the metadata accumulation unit 41 may be provided outside the metadata delivery server 400 of the fourth embodiment. In this case, the metadata delivery server 400 receives input of the metadata 49 from the metadata accumulation unit 41 via a network (not shown) or the like.
  • On the other hand, the content delivery server 500 includes a content accumulation unit 46 and a content delivery unit 47.
  • In the content accumulation unit 46, there is accumulated content 55. The content delivery unit 47 delivers content 56 to the client terminals 481 to 48 n in accordance with content delivery requests 54 from the client terminals.
  • Like in the case of the metadata delivery server 400 described above, the content accumulation unit 46 may be provided outside the content delivery server 500. In this case, the content delivery server 500 receives input of the content data 55 via a network (not shown).
  • Next, how the content delivery system according to the fourth embodiment operates will be described with reference to the accompanying drawings.
  • First, on the metadata delivery server 400 side, the metadata analysis unit 42 performs analysis of the metadata accumulated in the metadata accumulation unit 41. The metadata analysis unit 42 operates in the same manner as the metadata analysis unit 19 of the metadata reproduction apparatus 200 of the third embodiment described above. By performing the analysis of the metadata, the metadata analysis unit 42 obtains information concerning the structure and characteristics of the content.
  • FIG. 11 shows content structure information outputted from the metadata analysis unit of the metadata delivery server according to the fourth embodiment, with the illustrated example relating to a news video. In FIG. 11, the hierarchical scene structure of the content is displayed in a tree form. Each node of the tree corresponds to one scene and is associated with various types of scene information. Here, the various types of scene information include a scene title, an abstract, time information giving a scene start position and a scene end position, a scene thumbnail, a representative frame, a thumbnail shot, a representative shot, and scene characteristics such as visual characteristic amounts concerning colors, motions, and the like. Note that in FIG. 11, among the various types of scene information, only the scene titles are shown.
  • Here, it is assumed that the client terminals are various information household devices having different terminal capabilities. The terminal capability refers to a communication speed, a processing speed, an image format that can be reproduced/displayed, an image resolution, a user input function, and the like. For instance, it is assumed that the client terminal 481 is a personal computer (PC) that has sufficient performance with respect to the communication speed, processing speed, display performance, and user input function. Also, it is assumed that the client terminal 482 is a mobile telephone and the remaining client terminals are each a PDA or the like. Each of the client terminals 481 to 48 n sends information concerning its terminal performance.
  • The terminal capability judgment unit 43 analyzes the information 51 that was sent from each of the client terminals 481 to 48 n and shows the terminal performance of the client terminal, determines a deliverable image format, a maximum image resolution, a length of the content, and the like, and outputs them to the metadata re-generation unit 44. When the original content is video content encoded in MPEG-2 and has a high resolution, for instance, the original content can be reproduced by the client terminal 481 as it is because the client terminal 481 has sufficient performance as described above. Also, it is assumed that this client terminal 481 has a function with which it is possible to perform the image summary reproduction and searching described in the third embodiment described above. On the other hand, it is assumed that the client terminal 482 is capable of reproducing only short video shots encoded in MPEG-4 and the maximum resolution displayable by the client terminal 482 is low.
  • The metadata re-generation unit 44 restructures the content in accordance with the terminal capability of each of the client terminals 481 to 48 n reported by the terminal capability judgment unit 43, re-generates the metadata 52 describing the structure and contents of the restructured content, and outputs the metadata 52 to the metadata delivery unit 45. For instance, the original metadata is delivered to the client terminal 481 as it is, so that the restructuring of the content is not performed. On the other hand, the client terminal 482 has only the function of reproducing short video shots and is incapable of reproducing every scene, so that the restructuring of the content is performed for the client terminal 482 using short video shots of important scenes.
  • FIG. 12 shows an example of a content structure after the restructuring by the metadata re-generation unit of the content delivery system according to the fourth embodiment. As shown in FIG. 12, each important scene, out of scenes of the news video, is extracted and the content is restructured so as to include only the representative shot or representative frame of each extracted scene. Also, the client terminal 482 does not have the search function described in the above third embodiment, so that among the various types of scene information in the metadata, the scene characteristic amounts are not required to be included for searching. Therefore, the metadata re-generation unit 44 re-generates metadata describing only the structure of restructured scenes and the position information of the representative shots or representative frames of the scenes, and sends the metadata to the metadata delivery unit 45.
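  • The capability-dependent re-generation can be sketched as a filter over scene records. The sketch flattens the scene hierarchy into a list for brevity, and the dictionary keys (`full_scenes`, `search`, `min_importance`, `representative_shot`, `features`) are hypothetical names chosen for this illustration.

```python
from typing import Any, Dict, List


def regenerate_metadata(scenes: List[Dict[str, Any]],
                        capability: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Return metadata reduced to what the terminal can actually use."""
    if capability.get("full_scenes") and capability.get("search"):
        # Capable terminal: the original metadata is delivered as it is.
        return scenes
    reduced = []
    for scene in scenes:
        # Keep only scenes that are important enough for this terminal.
        if scene["importance"] < capability.get("min_importance", 0.0):
            continue
        entry = {
            "title": scene["title"],
            "representative_shot": scene["representative_shot"],
        }
        # Characteristic amounts are only useful to terminals that can search.
        if capability.get("search"):
            entry["features"] = scene["features"]
        reduced.append(entry)
    return reduced
```

For a terminal comparable to the client terminal 482, the result contains only the titles and representative shot positions of the important scenes, which mirrors the reduced metadata described above.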
  • The metadata delivery unit 45 delivers the metadata 53 generated by the metadata re-generation unit 44 to the client terminals 481 to 48 n.
  • Each of the client terminals 481 to 48 n analyzes the metadata 53 delivered by the metadata delivery unit 45 and obtains scene structure information of the content. When a user of each of the client terminals 481 to 48 n selects a scene that he/she wishes to reproduce, the client terminal transmits position information of the selected scene to the content delivery unit 47 of the content delivery server 500.
  • On receiving the scene position information from each of the client terminals 481 to 48 n, the content delivery unit 47 of the content delivery server 500 obtains corresponding content 55 from the content accumulation unit 46 and delivers the content to each of the client terminals 481 to 48 n. In the case of the client terminal 481, the content delivery unit 47 sends a scene start position and a scene end position and delivers a corresponding scene of the original content. On the other hand, in the case of the client terminal 482, the content delivery unit 47 sends the location information (such as the URL) of a scene representative shot. Note that when the representative shot is not reproducible/displayable by the client terminal 482 because of its image format, image resolution, image file size, or the like, the content delivery unit 47 performs various kinds of processing, such as format conversion, resolution conversion, and reduction in file size through content summarization, and sends resultant data to the client terminal 482.
  • As described above, with the metadata delivery server 400 of the fourth embodiment, it becomes possible to re-generate metadata in accordance with the capability of each of the client terminals 481 to 48 n and to deliver the re-generated metadata to each of the client terminals 481 to 48 n.
  • It should be noted here that in FIG. 10, the metadata delivery server 400 and the content delivery server 500 are shown as separated apparatuses, but the present invention is not limited to this. For instance, the content delivery server may be provided in the metadata delivery server or the metadata delivery server may be provided in the content delivery server. In addition, needless to say, the metadata delivery server and the content delivery server may be provided in the same server. In this case, it becomes possible for the terminal capability judgment unit 43 to easily inform the content delivery unit 47 of the capability of each of the client terminals 481 to 48 n, which makes it possible to restructure the content through format conversion or the like in accordance with the capability of the client terminal and to deliver the restructured content to each of the client terminals 481 to 48 n.
  • Also, the fourth embodiment has been described by assuming that in the metadata accumulation unit 41, the metadata generated by the metadata editing apparatus of the first embodiment or the second embodiment described above is accumulated. However, the present invention is not limited to this and, needless to say, metadata generated by an apparatus other than the metadata editing apparatus of the first embodiment or the second embodiment described above may be accumulated in the metadata accumulation unit 41.
  • FIFTH EMBODIMENT
  • In this fifth embodiment, another example of the metadata delivery server described in the above fourth embodiment will be described. The metadata delivery server of the above fourth embodiment performs the metadata re-generation based on the terminal information sent from each client terminal. In the fifth embodiment, however, in order to more appropriately perform the metadata re-generation, the metadata delivery server (metadata delivery apparatus) is provided with a metadata analysis/re-generation unit that performs the metadata re-generation using metadata optimization hint information that is hint information for the metadata re-generation.
  • The metadata delivery server according to the fifth embodiment of the present invention will be described with reference to the accompanying drawings. FIG. 13 is a block diagram showing a construction of the metadata delivery server according to the fifth embodiment of the present invention.
  • Referring to FIG. 13, a metadata delivery server 400A includes a hint information analysis unit 61, a metadata analysis/re-generation unit 63, and a metadata delivery unit 45.
  • The hint information analysis unit 61 analyzes metadata optimization hint information 60 and outputs a result of the analysis. The metadata analysis/re-generation unit 63 analyzes metadata 49 describing the structure and characteristics of content based on analyzed metadata optimization hint information 62 and a condition 65 concerning metadata re-generation such as information concerning the performances of the client terminals or user preferences, and outputs restructured metadata 64. Then, the metadata delivery unit 45 delivers metadata 53 to the client terminals.
  • In the metadata accumulation unit 41 (see FIG. 10), the metadata 49 describing the structure and characteristics of the content and the metadata optimization hint information 60 that is hint information for the re-generation of the metadata 49 are accumulated. Here, the metadata optimization hint information 60 for the re-generation of the metadata 49 is information describing the types of information contained in the metadata 49, the amount of the contained information, and the outline and complexity of the metadata 49.
  • Next, how the metadata delivery server according to the fifth embodiment operates will be described with reference to the accompanying drawings.
  • The metadata optimization hint information 60 will be described in detail by taking, as an example, a case of video content having the structure shown in FIG. 14.
  • The video content (Root), a soccer game program, is broadly divided into two scenes (Scene1 and Scene2) corresponding to the first half and the second half, and the first-half scene is further divided into multiple scenes (Scene1-1, Scene1-2, . . . , Scene1-n), such as goal scenes and corner kick scenes. In FIG. 14, the temporal hierarchical structure among the scenes is indicated using a tree structure.
  • The metadata 49 corresponding to the video content describes the temporal hierarchical structure of the content, that is, the temporal relations among the scenes, and the start times and lengths of the scenes. The metadata 49 also describes text information (such as a title, abstract, category, and explanatory notes), importance, and the like of each scene as well as the characteristics (for instance, a color histogram or motion complexity) possessed by the scene in accordance with the hierarchical level of the scene. Note that in this fifth embodiment, it is assumed that MPEG-7 standardized by ISO is used as a metadata description format.
  • FIG. 15 shows how the metadata is described in MPEG-7. In MPEG-7, each scene is described in units called “video segment”. In each video segment, there are described time information (scene start point and length), a title, an outline, a category, and the like. Note that there is a case where the information described in each video segment is changed in accordance with the hierarchical level of the video segment. In the example shown in FIG. 15, importance is described in each video segment at Level 2 and Level 3, although no importance is described in each video segment at Level 4. Also, the characteristic amounts concerning colors and motions are described only in each video segment at Level 4.
  • It is possible to express the temporal hierarchical relations among the scenes by recursively describing the video segments. In the description example shown in FIG. 15, with a “time division” description, there is described a state where one video segment is composed of multiple video segments temporally divided. In MPEG-7, it is also possible to describe the spatial hierarchical structure possessed by the content in a like manner. In this case, instead of the “time division” description, a “space division” description is used to express a state where one segment is composed of multiple segments spatially divided.
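  • As a rough illustration of the recursive description, the following Python sketch models a video segment that contains its temporally divided child segments. It is not MPEG-7 syntax. The class name `VideoSegment`, its field names, and all times and importance values are hypothetical and chosen only to mirror the soccer example of FIG. 14.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class VideoSegment:
    """Hypothetical in-memory stand-in for one recursively described video segment."""
    seg_id: str
    start: float                      # scene start point in seconds (made-up values)
    length: float                     # scene length in seconds (made-up values)
    title: Optional[str] = None
    importance: Dict[str, float] = field(default_factory=dict)   # viewpoint -> value
    children: List["VideoSegment"] = field(default_factory=list)  # "time division"


def depth(seg: VideoSegment) -> int:
    """Number of hierarchical levels at and below this segment."""
    return 1 + max((depth(child) for child in seg.children), default=0)


root = VideoSegment("Root", 0.0, 5400.0, title="Soccer game program", children=[
    VideoSegment("Scene1", 0.0, 2700.0, children=[
        VideoSegment("Scene1-1", 0.0, 60.0, importance={"Team A": 0.75}),
        VideoSegment("Scene1-2", 60.0, 45.0),
    ]),
    VideoSegment("Scene2", 2700.0, 2700.0),
])
```

  • A spatial hierarchy could be modelled in the same way by treating the child list as a "space division" instead of a "time division".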
  • The metadata optimization hint information 60 for the re-generation of the metadata 49 describes the types and contents of information (descriptors) contained in the metadata 49. Accordingly, in the case of the metadata shown in FIG. 15, the metadata optimization hint information 60 contains a descriptor (“time division”) expressing the temporal hierarchical structure possessed by the content, descriptors expressing the color histogram and the motion complexity, and descriptors expressing the title, abstract, category, and importance. Also, in order to express description contents and complexity, the depth of each video segment in the hierarchical structure is expressed with up to four levels (Level 1 to Level 4). Further, the importance assumes one of five discrete values ({0.0, 0.25, 0.5, 0.75, 1.0}). As importance with respect to viewpoints, there are described importance from the viewpoint of “Team A” and importance from the viewpoint of “Team B”. Also, there is described the hierarchical position at which the importance is described (video segment level at which the importance is described).
  • FIG. 16 shows an example of a format of the metadata optimization hint information 60. The metadata optimization hint information 60 shown in FIG. 16 contains metadata file information and metadata construction element information.
  • The metadata file information describes information for predicting resources required to process the metadata, such as the memory size required to accumulate/analyze the metadata and the processing system (S/W) required to analyze the metadata. In more detail, for instance, the metadata file information describes the location of a metadata file, the size of the metadata file, the format of the metadata file (for instance, the XML format or the binary format), syntax file information (location of a syntax file defining the syntax of the metadata), and an appearing element number showing the number of elements contained (appearing) in the metadata. Note that when the metadata is described in the XML format, the syntax file defining the format of the metadata file corresponds to a DTD file, a schema file, or the like defining the description format (syntax) of the metadata, and the syntax file information describes the location of the DTD file or the schema file, for instance.
  • The metadata construction element information is information describing the type and contents of each descriptor constituting the metadata. In more detail, the metadata construction element information contains the name of each descriptor contained in the metadata, the appearing frequency (number of appearing times) of the descriptor in the metadata, and a description (completeness of description) showing whether or not the descriptor contains every descriptor that can syntactically be contained in it. In addition, when the descriptor is recursively described, the metadata construction element information also contains the temporal or spatial hierarchical property (maximum value of the depth) possessed by the descriptor. In the case of the metadata description shown in FIG. 15, for instance, “video segment” is the descriptor recursively described and has a hierarchical structure with up to four levels, so that the maximum hierarchical depth possessed by the “video segment” descriptor becomes four.
  • In addition, as to a descriptor contained in the recursively described descriptor, the hint information also describes the appearing position (hierarchical level) at which the contained descriptor appears. For instance, “importance” is a descriptor contained in the “video segment” descriptor and, when the “importance” is contained in the video segments up to Level 3, that is, is not contained in the video segments at Level 4, the appearing position of the “importance” becomes up to Level 3. In this manner, the appearing position is specified using the hierarchical level. However, when an ID is assigned to each “video segment” containing the “importance” or the “video segment” itself, it is also possible to describe the appearing position as an ID list. Also, in the case of a descriptor having a value, the hint information additionally describes the type of the descriptor and the range of values that the descriptor can assume. When the importance is expressed using the five discrete values ({0.0, 0.25, 0.5, 0.75, 1.0}) with respect to each of the viewpoints of “Team A” and “Team B”, for instance, the assumable values of the “importance” become a list of {0.0, 0.25, 0.5, 0.75, 1.0} of floating-point type. The above description is repeated for each descriptor that is a construction element of the metadata.
  • FIG. 17 shows an example of the metadata optimization hint information described in accordance with the format shown in FIG. 16. It can be seen that the example of the metadata optimization hint information 60 shown in FIG. 17 contains the metadata file information and the metadata construction element information for descriptors such as the “video segment” descriptor and the “title” descriptor.
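  • The two parts of the hint information (metadata file information and metadata construction element information) can be pictured as a small record structure. The following Python sketch is only an assumed in-memory layout, not the format of FIG. 16 or FIG. 17. All class names, field names, file locations, sizes, and counts are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class FileHint:
    """Metadata file information (all example values below are made up)."""
    location: str
    size_bytes: int
    file_format: str          # for instance "XML" or "binary"
    syntax_file: str          # location of a DTD or schema file
    element_count: int        # appearing element number


@dataclass
class ElementHint:
    """Metadata construction element information for one descriptor."""
    name: str
    frequency: int                               # number of appearing times
    complete: bool                               # completeness of description
    max_depth: Optional[int] = None              # only for recursive descriptors
    appearing_levels: Optional[List[int]] = None
    appearing_ids: Optional[List[str]] = None
    value_type: Optional[str] = None
    values: Optional[List[float]] = None         # assumable values, if discrete


@dataclass
class OptimizationHint:
    file_info: FileHint
    elements: List[ElementHint]

    def find(self, name: str) -> Optional[ElementHint]:
        """Return the entry for a descriptor, or None if it never appears."""
        return next((e for e in self.elements if e.name == name), None)


hint = OptimizationHint(
    file_info=FileHint("metadata/soccer.xml", 120_000, "XML",
                       "schema/metadata.xsd", 250),
    elements=[
        ElementHint("video segment", 40, True, max_depth=4),
        ElementHint("importance", 20, True, appearing_levels=[2, 3],
                    value_type="float", values=[0.0, 0.25, 0.5, 0.75, 1.0]),
        ElementHint("title", 3, True,
                    appearing_ids=["Root", "Scene1", "Scene1-1"],
                    value_type="string"),
    ],
)
```

  • With such a record, a server could check `hint.find("importance")` before opening the metadata file at all, and could use `file_info.size_bytes` to estimate the memory needed for analysis.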
  • Next, a method for performing re-generation of metadata using the metadata optimization hint information 60 will be described with reference to FIG. 13.
  • The hint information analysis unit 61 performs analysis of the metadata optimization hint information 60 described in the specified format. The metadata analysis/re-generation unit 63 performs analysis of the metadata 49 using the analyzed metadata optimization hint information 62 outputted from the hint information analysis unit 61, and outputs the metadata 64 re-generated based on the condition 65 concerning the metadata re-generation.
  • FIG. 18 shows an example of a method with which the metadata analysis/re-generation unit 63 analyzes the metadata using the analyzed metadata optimization hint information 62. In this example, it is assumed that only video segments having an importance of 0.5 or higher are extracted from the original metadata 49, and metadata composed of only the descriptions concerning the extracted video segments is re-generated.
  • First, the metadata analysis/re-generation unit 63 specifies the descriptors necessary for re-generation based on the condition 65 for metadata re-generation (step S1). In this example, only video segments having an importance of 0.5 or higher are extracted, so that “importance” and “video segment” are the descriptors necessary for the re-generation.
  • Next, the metadata analysis/re-generation unit 63 judges, by referring to the analyzed metadata optimization hint information 62, whether or not the descriptors specified in step S1 are contained in the metadata 49 (step S2) (the following description will be made by taking a case of the “importance” descriptor as an example).
  • When the “importance” descriptor is contained in the metadata, analysis of the metadata is performed (step S3). On the other hand, when the “importance” descriptor is not contained, the metadata analysis processing is ended (step S4).
  • Also, when the analyzed metadata optimization hint information 62 specifies that the appearing position of the “importance” descriptor is up to Level 3 of the hierarchical structure, at the time when the analysis of the video segments up to Level 3 is finished (step S5), the analysis processing is ended without performing the analysis for Level 4 and the following hierarchical levels (step S6).
  • Also, when the metadata optimization hint information 62 specifies that the number of appearing times of the “importance” descriptor is 20, the analysis of the metadata is ended (step S6) at the time when the analysis of 20 “importance” descriptors is finished (step S5). After the metadata analysis processing is ended in step S4 or step S6, the operations in step S1 and the following steps are repeated if it is necessary to analyze another piece of metadata.
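  • The flow of FIG. 18 can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the implementation of the metadata analysis/re-generation unit 63. The metadata is assumed to be a nested dictionary, the hint is assumed to give a maximum appearing level (`max_level`) and an appearing count (`frequency`) for “importance”, and one “importance” entry is counted per video segment. All names and data values are hypothetical.

```python
def extract_important(root, hint, viewpoint, threshold=0.5):
    """Return (ids of segments meeting the threshold, number of segments visited)."""
    info = hint.get("importance")
    if info is None:                      # step S2 fails: end without analysis (S4)
        return [], 0
    max_level = info["max_level"]         # appearing position from the hint
    remaining = info["frequency"]         # number of appearing times from the hint
    selected, visited = [], 0
    stack = [(root, 1)]
    while stack and remaining > 0:        # stop once every "importance" was seen
        seg, level = stack.pop()
        visited += 1
        importance = seg.get("importance")
        if importance is not None:
            remaining -= 1
            if importance.get(viewpoint, 0.0) >= threshold:
                selected.append(seg["id"])
        if level < max_level:             # never descend below the hinted level
            children = seg.get("children", [])
            stack.extend((child, level + 1) for child in reversed(children))
    return selected, visited


# Made-up metadata: importance appears at Level 2 and Level 3 only.
metadata = {"id": "Root", "children": [
    {"id": "Scene1", "importance": {"Team A": 0.5}, "children": [
        {"id": "Scene1-1", "importance": {"Team A": 1.0},
         "children": [{"id": "Shot1"}, {"id": "Shot2"}]},
        {"id": "Scene1-2", "importance": {"Team A": 0.0}},
    ]},
    {"id": "Scene2", "importance": {"Team A": 0.25}},
]}
hint = {"importance": {"max_level": 3, "frequency": 4}}

selected, visited = extract_important(metadata, hint, "Team A")
```

  • In this made-up example the two Level 4 shots are never visited, which corresponds to ending the analysis at Level 3 as described above.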
  • FIG. 19 shows another example of the method for analyzing the metadata using the analyzed metadata optimization hint information 62. In this example, it is assumed that metadata is re-generated by extracting only the video segments containing the “title” descriptor. The judgment of whether or not the metadata contains the “title” descriptor is performed in the same manner as in the example shown in FIG. 18.
  • When the metadata contains the “title” descriptor, the metadata analysis/re-generation unit 63 judges whether or not a video segment matches an appearing position ID described in the metadata optimization hint information 62 (step S13).
  • When the video segment does not match the ID, the video segment does not contain the “title” descriptor, so that the analysis of the description of this video segment is skipped (step S16).
  • On the other hand, when the video segment matches the ID, in order to obtain the “title” descriptor, the analysis of the description of this video segment is performed (step S15).
  • Next, when the analysis of every video segment matching the appearing position ID is finished (step S17), no video segment containing the “title” descriptor remains in the metadata, so that the analysis processing is ended (step S18).
  • It should be noted here that in order to perform the analysis of another piece of metadata if necessary, the operations in step S11 and the following steps are repeated. Then, the metadata 64 restructured using the descriptor extracted through the analysis processing described above is outputted.
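  • The ID-based flow of FIG. 19 can be sketched in the same spirit. The following Python fragment is an assumption-laden illustration only. The video segments are assumed to be available as a flat list in description order, the hint is assumed to carry an `appearing_ids` list for “title”, and the function and field names are hypothetical.

```python
def collect_titles(segments, hint):
    """Analyze only the segments whose ID is listed in the hint for "title"."""
    info = hint.get("title")
    if info is None:                 # "title" is not contained in the metadata
        return {}
    pending = set(info["appearing_ids"])
    titles = {}
    for seg in segments:
        if not pending:              # step S17: every listed segment analyzed (S18)
            break
        if seg["id"] not in pending: # step S13 fails: skip this segment (S16)
            continue
        titles[seg["id"]] = seg["title"]   # step S15: analyze the description
        pending.discard(seg["id"])
    return titles


# Made-up data for illustration.
segments = [
    {"id": "Root", "title": "Soccer game program"},
    {"id": "Scene1"},
    {"id": "Scene1-1", "title": "Goal"},
    {"id": "Scene1-2"},
    {"id": "Scene2"},
]
hint = {"title": {"appearing_ids": ["Root", "Scene1-1"]}}

titles = collect_titles(segments, hint)
```

  • Once both listed IDs have been handled, the loop ends without looking at the remaining segments.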
  • The metadata delivery unit 45 delivers the restructured metadata 64 to the client terminals.
  • It should be noted here that although not shown, after the metadata re-generation, the location of the metadata file, the size of the metadata file, the number of elements appearing in the metadata, and the information concerning the metadata construction elements are also changed. Accordingly, metadata optimization hint information corresponding to the metadata after the re-generation may be re-generated.
  • It has conventionally been required to analyze every descriptor contained in metadata for metadata re-generation. In the fifth embodiment, however, the descriptor analysis of the metadata 49 is performed in the manner described above using the metadata optimization hint information 60 describing the list of each descriptor contained in the metadata 49, the appearing position of the descriptor, the number of appearing times, and the like. As a result, it becomes possible to omit the analysis of the metadata 49 itself for the metadata re-generation. Also, the analysis of each descriptor not matching the re-generation condition is omitted using the appearing position or the number of appearing times, so that it becomes possible to reduce the processing cost (such as the processing amount and the memory usage amount) required to perform the metadata analysis and re-generation.
  • SIXTH EMBODIMENT
  • In the fifth embodiment described above, there has been described the metadata delivery server that reduces the processing cost required to perform the metadata analysis and re-generation using the metadata optimization hint information for the metadata re-generation. In this sixth embodiment, however, a metadata search server (metadata search apparatus) will be described which reduces the processing cost required to perform metadata searching using the metadata optimization hint information.
  • The metadata search server according to the sixth embodiment of the present invention will be described with reference to the accompanying drawings. FIG. 20 is a block diagram showing a construction of the metadata search server according to the sixth embodiment of the present invention.
  • Referring to FIG. 20, a metadata search server 600 includes a hint information analysis unit 61, a metadata analysis unit 71, and a search unit 73.
  • The hint information analysis unit 61 is the same as that in the fifth embodiment described above and therefore the description thereof is omitted in this embodiment. The metadata analysis unit 71 analyzes an enormous amount of metadata 49 describing the structure and characteristics of content efficiently and at a lower processing cost, using analyzed metadata optimization hint information 62 and a search condition 70. The search unit 73 searches for content matching the search condition using a result 72 of the analysis of the metadata.
  • Next, how the metadata search server according to the sixth embodiment operates will be described with reference to the accompanying drawings.
  • FIG. 21 is a flowchart showing how the metadata analysis unit of the metadata search server according to the sixth embodiment operates.
  • The metadata analysis unit 71 performs analysis of at least one piece of metadata using the metadata optimization hint information 62 corresponding to the metadata. Here, in this example, the analysis of the metadata is extraction of each characteristic description necessary for the searching from the metadata. When a video segment having specific color characteristic amounts is given as the search condition and each video segment having characteristics close to those of the given video segment will be searched for, for instance, it is required to extract each video segment having a color characteristic description. In the metadata example shown in FIG. 15, a color characteristic description (“color histogram”) is added to each video segment at Level 4, so that a description concerning each video segment at Level 4 is extracted.
  • The metadata analysis unit 71 analyzes the search condition 70 and specifies a descriptor that is valid for the searching (step S21). Here, the search condition may be characteristic amounts described in a format defined in MPEG-7. Alternatively, the search condition may be an image, a keyword, or the like. When characteristic amounts (color arrangement information, for instance) described in the format defined in MPEG-7 are given as the search condition, each corresponding descriptor (color arrangement information) becomes a descriptor that is valid for the searching. Also, when a keyword is given as the search condition, each descriptor in a text form (such as a title, an abstract, or explanatory notes) becomes a descriptor that is valid for the searching.
  • Next, by referring to the metadata optimization hint information 62, it is judged whether the selected descriptor is contained in the metadata 49 (step S22). When the descriptor for the searching is not contained in the metadata 49, the analysis processing of the metadata 49 is ended (step S24) and the analysis of another piece of metadata 49 is performed if necessary.
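  • Steps S21 and S22 can be pictured as a lookup followed by a membership test against the hint. The following Python sketch assumes that the search condition is a dictionary with a `kind` field and that the hint has been reduced to a set of descriptor names. Both are hypothetical simplifications, and conditions given as an image are not handled here.

```python
TEXT_DESCRIPTORS = ("title", "abstract", "explanatory notes")


def descriptors_for(condition):
    """Step S21: pick the descriptors that are valid for this search condition."""
    if condition["kind"] == "keyword":
        return list(TEXT_DESCRIPTORS)
    if condition["kind"] == "feature":
        return [condition["descriptor"]]
    return []                         # other kinds (e.g. an image) are out of scope


def descriptors_to_analyze(condition, hint_names):
    """Step S22: keep only descriptors that the hint says the metadata contains."""
    return [name for name in descriptors_for(condition) if name in hint_names]


# Made-up set of descriptor names taken from the hint information.
hint_names = {"video segment", "title", "importance", "color histogram"}

keyword_hits = descriptors_to_analyze({"kind": "keyword", "text": "goal"}, hint_names)
feature_hits = descriptors_to_analyze(
    {"kind": "feature", "descriptor": "color arrangement"}, hint_names)
```

  • An empty result corresponds to step S24, in which the analysis of this piece of metadata is ended without parsing it.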
  • On the other hand, when the selected descriptor is contained in the metadata 49, the analysis of the metadata is performed (step S23). As to a metadata analysis method used in this embodiment, like in the case of the fifth embodiment described above, the metadata analysis processing shown in FIG. 18 or 19 is performed with efficiency using the metadata optimization hint information 62 (steps S25 and S26). As a result of the operations described above, the metadata analysis unit 71 extracts each characteristic description necessary for the searching.
  • The search unit 73 searches for content matching the search condition using the metadata analysis result (characteristic description necessary for the searching) 72 outputted from the metadata analysis unit 71. In this example, a description concerning each video segment having a color characteristic description (“color histogram”) is outputted by the metadata analysis unit 71, so that the search unit 73 judges compatibility with the color characteristic amounts (histogram) given as the search condition and outputs information (“time information”, for instance) concerning each video segment, whose judgment result is positive, as a search result 74.
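  • The text does not specify how compatibility between histograms is judged. Purely as an assumption, the following Python sketch uses the sum of absolute bin differences and a fixed threshold, and returns the time information of each matching video segment. The function name, the threshold, and the histogram values are all hypothetical.

```python
def search_by_histogram(segments, query, max_distance=0.2):
    """Return (id, start, length) for segments whose histogram is close to the query."""
    results = []
    for seg in segments:
        hist = seg.get("color_histogram")
        if hist is None:              # segment carries no color description
            continue
        # Assumed similarity measure: L1 distance between histograms.
        distance = sum(abs(a - b) for a, b in zip(hist, query))
        if distance <= max_distance:
            results.append((seg["id"], seg["start"], seg["length"]))
    return results


# Made-up Level 4 segments extracted by the metadata analysis.
segments = [
    {"id": "Shot1", "start": 0.0, "length": 12.0, "color_histogram": [0.5, 0.3, 0.2]},
    {"id": "Shot2", "start": 12.0, "length": 8.0, "color_histogram": [0.1, 0.1, 0.8]},
    {"id": "Shot3", "start": 20.0, "length": 5.0},
]

matches = search_by_histogram(segments, query=[0.5, 0.25, 0.25])
```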
  • As described above, in the sixth embodiment, the analysis of the metadata 49 is performed using the metadata optimization hint information 60, so that it becomes possible to omit the analysis of the entire metadata 49 for the metadata searching. Also, the analysis of each descriptor that is not necessary for the searching is omitted based on the appearing position and the number of appearing times, so that it becomes possible to reduce the processing cost (such as the processing amount and the memory usage amount) required to perform the metadata searching.
  • SEVENTH EMBODIMENT
  • In the fifth embodiment and the sixth embodiment described above, description has been made for a server side that uses the metadata optimization hint information. In this seventh embodiment, however, a client terminal (metadata re-generation condition setting apparatus) will be described which uses the metadata optimization hint information.
  • The client terminal according to the seventh embodiment of the present invention will be described with reference to the accompanying drawings. FIG. 22 is a block diagram showing a construction of the client terminal according to the seventh embodiment of the present invention.
  • Referring to FIG. 22, a client terminal 48A includes a hint information analysis unit 80, and a metadata re-generation condition setting unit 82.
  • It should be noted here that FIG. 22 shows only a portion of the function of the client terminal 48A that relates to means for setting a condition for metadata re-generation using metadata optimization hint information 60.
  • Next, how the client terminal according to the seventh embodiment operates will be described with reference to the accompanying drawing.
  • The hint information analysis unit 80 performs analysis of the metadata optimization hint information 60 described in a specified format. This hint information analysis unit 80 is the same as that in the fifth embodiment described above, so that the detailed description thereof is omitted in this embodiment.
  • Then, the metadata re-generation condition setting unit 82 performs setting of a condition 83 for metadata re-generation based on a result 81 of the analysis outputted from the hint information analysis unit 80. Here, the condition setting refers to selection of each descriptor that is unnecessary for the client terminal 48A from among various descriptors contained in the metadata optimization hint information 60, for instance. When the client terminal 48A is not provided with a search function using characteristic amounts, each descriptor expressing the characteristic amounts, such as a color histogram or motion complexity, is unnecessary.
  • As another example of the condition setting, when the complexity of the metadata is increased in accordance with an increase in depth in the hierarchical structure describing relations between scenes of content, the depth in the hierarchical structure processible by the client terminal is set based on the maximum value of the depth in the hierarchical structure described in the metadata optimization hint information 60. In still another example, a viewpoint of a user and a threshold value of scene importance are set based on assumable values of the importance described in the metadata optimization hint information 60.
  • When the importance assumes five discrete values ({0.0, 0.25, 0.5, 0.75, 1.0}) from each of the viewpoints of “Team A” and “Team B” as described above, the condition is set so that only each scene having the importance of 0.5 or higher from the viewpoint of “Team A” is selected, for instance.
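  • The three kinds of condition setting described above can be combined in one small routine. The following Python sketch is an assumed illustration of the metadata re-generation condition setting unit 82, not its actual design. The hint layout, the parameter names, and the rule of rounding a requested threshold up to the next assumable value are all hypothetical.

```python
FEATURE_DESCRIPTORS = {"color histogram", "motion complexity"}


def set_condition(hint, has_feature_search, terminal_max_depth,
                  viewpoint, min_importance):
    """Build a re-generation condition from analyzed hint information."""
    # Descriptors the terminal cannot use are marked as unnecessary.
    unnecessary = [] if has_feature_search else sorted(
        name for name in hint["descriptors"] if name in FEATURE_DESCRIPTORS)
    # The depth never exceeds what the metadata actually contains.
    depth = min(terminal_max_depth, hint["max_depth"])
    # Assumed rule: snap the threshold up to the next assumable importance value.
    allowed = sorted(hint["importance_values"])
    threshold = next((v for v in allowed if v >= min_importance), allowed[-1])
    return {"unnecessary": unnecessary, "max_depth": depth,
            "viewpoint": viewpoint, "importance_threshold": threshold}


# Made-up analysis result of the hint information.
hint = {
    "descriptors": ["video segment", "title", "importance",
                    "color histogram", "motion complexity"],
    "max_depth": 4,
    "importance_values": [0.0, 0.25, 0.5, 0.75, 1.0],
}

condition = set_condition(hint, has_feature_search=False, terminal_max_depth=2,
                          viewpoint="Team A", min_importance=0.4)
```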
  • The condition 83 for metadata re-generation set by the metadata re-generation condition setting unit 82 is sent to the metadata delivery server. On the metadata delivery server side, the metadata is restructured based on the metadata re-generation condition and the terminal performance of the client terminal. When the maximum value of the depth in the hierarchical structure of the original metadata is four and the depth in the hierarchical structure processible by the client terminal is set at two in the metadata re-generation condition, for instance, the structure of the metadata is restructured so that the maximum value of the depth in the hierarchical structure becomes two.
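  • How the hierarchy is reduced to the requested depth is not detailed in the text. One simple possibility, shown here only as an assumption, is to drop every segment below the permitted level. The Python sketch below uses hypothetical function names and a made-up four-level tree.

```python
def limit_depth(seg, max_depth, level=1):
    """Return a copy of the segment tree truncated to max_depth levels."""
    copy = {key: value for key, value in seg.items() if key != "children"}
    if level < max_depth:
        copy["children"] = [limit_depth(child, max_depth, level + 1)
                            for child in seg.get("children", [])]
    else:
        copy["children"] = []         # assumed policy: deeper segments are dropped
    return copy


def depth(seg):
    """Number of hierarchical levels at and below this segment."""
    return 1 + max((depth(child) for child in seg.get("children", [])), default=0)


# Made-up metadata with four levels.
original = {"id": "Root", "children": [
    {"id": "Scene1", "children": [
        {"id": "Scene1-1", "children": [{"id": "Shot1"}]},
    ]},
    {"id": "Scene2"},
]}

restructured = limit_depth(original, max_depth=2)
```

  • Other policies, such as promoting deeper segments to the lowest permitted level, would also satisfy the depth condition. The truncation above is merely the shortest to write down.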
  • Also, when the metadata re-generation condition has been set so that only each scene having the importance of 0.5 or higher from the viewpoint of “Team A” is selected, metadata composed of only each scene matching the condition is re-generated. As a result, like in the fifth embodiment described above, it becomes possible to perform the metadata re-generation with efficiency using the metadata optimization hint information.
  • As described above, in the seventh embodiment, the metadata re-generation condition is set using the metadata optimization hint information 60, so that it becomes possible to generate appropriate metadata in accordance with the client terminal or application.
  • EIGHTH EMBODIMENT
  • In the fifth embodiment and the sixth embodiment described above, there has been described a server that re-generates metadata using the metadata optimization hint information and delivers the re-generated metadata. In this eighth embodiment, however, a content delivery server (content delivery apparatus) will be described which analyzes metadata using the metadata optimization hint information, re-generates content suited for the client terminal or user preferences using a result of the analysis, and delivers the re-generated content.
  • The content delivery server according to the eighth embodiment of the present invention will be described with reference to the accompanying drawings. FIG. 23 is a block diagram showing a construction of the content delivery server according to the eighth embodiment of the present invention.
  • Referring to FIG. 23, a content delivery server 500A includes a hint information analysis unit 61, a metadata analysis unit 86, and a content restructuring/delivery unit 88.
  • Next, how the content delivery server according to the eighth embodiment operates will be described with reference to the accompanying drawings.
  • The hint information analysis unit 61 operates in the same manner as in the fifth embodiment described above, so that the description thereof is omitted in this embodiment.
  • The metadata analysis unit 86 performs analysis of metadata 49 using analyzed metadata optimization hint information 62 outputted from the hint information analysis unit 61, and extracts each description matching information concerning the client terminal or a condition 85 concerning content restructuring such as user preferences. The analysis using the hint information is the same as that in the fifth embodiment described above. However, this eighth embodiment differs from the fifth embodiment in that not the metadata re-generation but content restructuring is performed using each extracted description. Each description extracted by the metadata analysis unit 86, that is, analyzed metadata 87 is outputted to the content restructuring/delivery unit 88.
  • The content restructuring/delivery unit 88 performs restructuring of content 89 based on each description extracted by the metadata analysis unit 86. Here, the following description will be made based on the example described in the above fifth embodiment. In the fifth embodiment, only video segments having an importance of 0.5 or higher are extracted from the metadata 49, and metadata composed of only the descriptions concerning the extracted video segments is re-generated.
  • In a like manner, in this eighth embodiment, only video segments having an importance of 0.5 or higher are extracted from the metadata 49, and content 90 composed of only the scenes corresponding to the extracted video segments is restructured and delivered. In the description concerning each extracted video segment, the location of the corresponding content and the position (time information) of the video segment in the content are described. Therefore, it is possible to clip each corresponding scene from the content, to restructure a single content 90 using the clipped scenes, and to deliver the restructured content 90. Alternatively, it is possible to clip each corresponding scene from the content and to deliver the clipped scenes sequentially.
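  • Before any clipping takes place, the time information of the extracted video segments has to be turned into a list of clip ranges. The following Python sketch shows one way this could be done. Merging overlapping ranges (for instance when both a scene and one of its sub-scenes were extracted) is an assumption that is not stated in the text, and all names and times are hypothetical.

```python
def build_clip_list(segments):
    """Turn (start, length) time information into merged (start, end) clip ranges."""
    spans = sorted((s["start"], s["start"] + s["length"]) for s in segments)
    merged = []
    for start, end in spans:
        if merged and start <= merged[-1][1]:
            # Assumed policy: overlapping or touching ranges become one clip.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Made-up descriptions of extracted video segments (times in seconds).
extracted = [
    {"id": "Scene1-1", "start": 0.0, "length": 60.0},
    {"id": "Scene1-1-2", "start": 30.0, "length": 30.0},
    {"id": "Scene2-3", "start": 300.0, "length": 20.0},
]

clips = build_clip_list(extracted)
```

  • The resulting ranges could then either be concatenated into a single restructured content or delivered one after another, matching the two alternatives described above.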
  • As described above, with the content delivery server 500A according to the eighth embodiment, the metadata analysis is performed using the metadata optimization hint information 60 describing a list of each descriptor contained in the metadata 49, the appearing position of the descriptor, the number of appearing times, and the like, so that it becomes possible to omit the analysis of the metadata 49 itself for the metadata re-generation. Also, the analysis of each descriptor not matching the re-generation condition is omitted using the appearing position and the number of appearing times, so that it becomes possible to reduce the processing cost (such as the processing amount and the memory usage amount) required to perform the metadata analysis and the content restructuring at the time of re-generation and delivery of content suited for the client terminal and user preferences.
  • INDUSTRIAL APPLICABILITY
  • As described above, the present invention divides multimedia content containing moving pictures and audio into multiple scenes, edits those scenes, and generates metadata that is scene structure information describing the hierarchical structure of the multimedia content. As a result, metadata can be generated that describes the hierarchical structure of multimedia content containing video data and the like.

Claims (1)

1. A hint information description method comprising:
describing, as hint information for manipulation of metadata composed of at least one descriptor describing semantic content, a structure, and characteristics of content, a name or an identifier of each descriptor contained in the metadata.
US11/980,544 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method Abandoned US20080065697A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US11/980,544 US20080065697A1 (en) 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
JP2002110259 2002-04-12
JP2002-110259 2002-04-12
JP2002-178169 2002-06-19
JP2002178169 2002-06-19
PCT/JP2003/003450 WO2003088665A1 (en) 2002-04-12 2003-03-20 Meta data edition device, meta data reproduction device, meta data distribution device, meta data search device, meta data reproduction condition setting device, and meta data distribution method
US10/510,548 US7826709B2 (en) 2002-04-12 2003-03-20 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US11/980,544 US20080065697A1 (en) 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method

Related Parent Applications (2)

Application Number Title Priority Date Filing Date
US10/510,548 Division US7826709B2 (en) 2002-04-12 2003-03-20 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
PCT/JP2003/003450 Division WO2003088665A1 (en) 2002-04-12 2003-03-20 Meta data edition device, meta data reproduction device, meta data distribution device, meta data search device, meta data reproduction condition setting device, and meta data distribution method

Publications (1)

Publication Number Publication Date
US20080065697A1 (en) 2008-03-13

Family

ID=29253534

Family Applications (7)

Application Number Title Priority Date Filing Date
US10/510,548 Expired - Fee Related US7826709B2 (en) 2002-04-12 2003-03-20 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US11/980,523 Abandoned US20080071836A1 (en) 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US11/980,514 Abandoned US20080075431A1 (en) 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US11/980,544 Abandoned US20080065697A1 (en) 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US11/980,648 Expired - Fee Related US8811800B2 (en) 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US11/980,624 Abandoned US20080071837A1 (en) 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US12/555,510 Abandoned US20100005070A1 (en) 2002-04-12 2009-09-08 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, and metadata delivery method and hint information description method

Family Applications Before (3)

Application Number Title Priority Date Filing Date
US10/510,548 Expired - Fee Related US7826709B2 (en) 2002-04-12 2003-03-20 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US11/980,523 Abandoned US20080071836A1 (en) 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US11/980,514 Abandoned US20080075431A1 (en) 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method

Family Applications After (3)

Application Number Title Priority Date Filing Date
US11/980,648 Expired - Fee Related US8811800B2 (en) 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US11/980,624 Abandoned US20080071837A1 (en) 2002-04-12 2007-10-31 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US12/555,510 Abandoned US20100005070A1 (en) 2002-04-12 2009-09-08 Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, and metadata delivery method and hint information description method

Country Status (10)

Country Link
US (7) US7826709B2 (en)
EP (7) EP2202978A1 (en)
JP (5) JPWO2003088665A1 (en)
KR (4) KR100918725B1 (en)
CN (1) CN100367794C (en)
AU (1) AU2003221185A1 (en)
CA (2) CA2664732C (en)
SG (1) SG152905A1 (en)
TW (1) TWI231140B (en)
WO (1) WO2003088665A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090123086A1 (en) * 2005-10-31 2009-05-14 Sharp Kabushiki Kaisha View environment control system
US20140007159A1 (en) * 2012-06-29 2014-01-02 Moda E Tecnologia S.R.L. Video streams management system for a television apparatus
US9197926B2 (en) 2013-06-05 2015-11-24 International Business Machines Corporation Location based determination of related content
US20170308251A1 (en) * 2010-12-03 2017-10-26 Facebook, Inc. User Interface with Media Wheel Facilitating Viewing of Media Objects
US9912924B2 (en) 2014-10-06 2018-03-06 Samsung Electronics Co., Ltd. Image forming apparatus, image forming method, image processing apparatus and image processing method thereof
US10891032B2 (en) 2012-04-03 2021-01-12 Samsung Electronics Co., Ltd Image reproduction apparatus and method for simultaneously displaying multiple moving-image thumbnails
US11350069B2 (en) 2015-04-29 2022-05-31 Samsung Electronics Co., Ltd. Source device and control method thereof, and sink device and image quality improvement processing method thereof

Families Citing this family (139)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050162515A1 (en) * 2000-10-24 2005-07-28 Objectvideo, Inc. Video surveillance system
US7116716B2 (en) * 2002-11-01 2006-10-03 Microsoft Corporation Systems and methods for generating a motion attention model
JP4546467B2 (en) * 2003-07-08 2010-09-15 パナソニック株式会社 Content storage system, home server device, information providing device, integrated circuit, and program
TWI310545B (en) * 2003-10-04 2009-06-01 Samsung Electronics Co Ltd Storage medium storing search information and reproducing apparatus
DE60318451T2 (en) * 2003-11-12 2008-12-11 Sony Deutschland Gmbh Automatic summary for a TV program suggestion machine based on consumer preferences
JP4611209B2 (en) * 2004-01-30 2011-01-12 パナソニック株式会社 Content playback device
CN1957349A (en) * 2004-05-25 2007-05-02 三星电子株式会社 Method of reproducing multimedia data using musicphotovideo profiles and reproducing apparatus using the method
US20050289593A1 (en) * 2004-05-26 2005-12-29 Skipjam Corp. Method and system for displaying and selecting content of an electronic program guide
US8953908B2 (en) * 2004-06-22 2015-02-10 Digimarc Corporation Metadata management and generation using perceptual features
US8156123B2 (en) * 2004-06-25 2012-04-10 Apple Inc. Method and apparatus for processing metadata
US9053754B2 (en) * 2004-07-28 2015-06-09 Microsoft Technology Licensing, Llc Thumbnail generation and presentation for recorded TV programs
KR100619064B1 (en) * 2004-07-30 2006-08-31 삼성전자주식회사 Storage medium including meta data and apparatus and method thereof
JP2006066015A (en) * 2004-08-30 2006-03-09 Sony Corp Picture information recording device and picture information display device
KR100565080B1 (en) * 2004-09-13 2006-03-30 삼성전자주식회사 Information storage medium recording AV data including meta data with representative title information, reproducing apparatus thereof and searching method of meta data
KR100602388B1 (en) * 2004-11-17 2006-07-20 주식회사 픽스트리 Resource Reference Method of MPEG - 21 Multimedia Framework
KR20060065476A (en) 2004-12-10 2006-06-14 엘지전자 주식회사 Recording medium, method for searching the contents recorded in the recording medium, and method and apparatus for reproducing contents from the recording medium
JP4349277B2 (en) * 2004-12-24 2009-10-21 株式会社日立製作所 Movie playback device
JP5522896B2 (en) * 2004-12-24 2014-06-18 コーニンクレッカ フィリップス エヌ ヴェ Method and apparatus for editing program search information
EP1839190A4 (en) * 2005-01-07 2012-01-18 Korea Electronics Telecomm Apparatus and method for providing adaptive broadcast service using game metadata
WO2006080469A1 (en) * 2005-01-25 2006-08-03 Nec Corporation Structured document search device, structured document search method, and structured document search program
FR2883441A1 (en) * 2005-03-17 2006-09-22 Thomson Licensing Sa METHOD FOR SELECTING PARTS OF AUDIOVISUAL TRANSMISSION AND DEVICE IMPLEMENTING THE METHOD
TWI309389B (en) * 2005-05-06 2009-05-01 Sunplus Technology Co Ltd Digital audio-video information reproducing apparatus and reproducing method thereof
KR100654455B1 (en) 2005-05-26 2006-12-06 삼성전자주식회사 Apparatus and method for providing addition information using extension subtitle file
US8239558B2 (en) * 2005-06-27 2012-08-07 Core Wireless Licensing, S.a.r.l. Transport mechanisms for dynamic rich media scenes
US20070022215A1 (en) * 2005-07-19 2007-01-25 Singer David W Method and apparatus for media data transmission
US20080130989A1 (en) * 2005-07-22 2008-06-05 Mitsubishi Electric Corporation Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program
JP4514671B2 (en) * 2005-08-03 2010-07-28 シャープ株式会社 CONTENT EDITING DEVICE, COMPUTER-READABLE PROGRAM, AND RECORDING MEDIUM CONTAINING THE SAME
US20070078896A1 (en) * 2005-09-30 2007-04-05 Yahoo! Inc. Identifying portions within media files with location tags
US20070078898A1 (en) * 2005-09-30 2007-04-05 Yahoo! Inc. Server-based system and method for retrieving tagged portions of media files
US20070078883A1 (en) * 2005-09-30 2007-04-05 Yahoo! Inc. Using location tags to render tagged portions of media files
KR100977580B1 (en) * 2005-10-13 2010-08-23 엘지전자 주식회사 Method and apparatus for encoding/decoding
US7743363B2 (en) * 2005-10-13 2010-06-22 Microsoft Corporation Extensible meta-data
US8180826B2 (en) * 2005-10-31 2012-05-15 Microsoft Corporation Media sharing and authoring on the web
US8856118B2 (en) * 2005-10-31 2014-10-07 Qwest Communications International Inc. Creation and transmission of rich content media
US8196032B2 (en) * 2005-11-01 2012-06-05 Microsoft Corporation Template-based multimedia authoring and sharing
JP2007179435A (en) * 2005-12-28 2007-07-12 Sony Corp Information processor, information processing method, and program
US20070204238A1 (en) * 2006-02-27 2007-08-30 Microsoft Corporation Smart Video Presentation
US7421455B2 (en) * 2006-02-27 2008-09-02 Microsoft Corporation Video search and services
US7873946B2 (en) * 2006-03-23 2011-01-18 Oracle America, Inc. Scalable vector graphics, tree and tab as drag and drop objects
JP4377887B2 (en) * 2006-03-30 2009-12-02 株式会社東芝 Video dividing device
JP2007265341A (en) * 2006-03-30 2007-10-11 Sony Corp Content utilization method, content utilization device, content recording method, content recording device, content providing system, content receiving method, content receiving device, and content data format
US7827155B2 (en) * 2006-04-21 2010-11-02 Microsoft Corporation System for processing formatted data
US8549492B2 (en) * 2006-04-21 2013-10-01 Microsoft Corporation Machine declarative language for formatted data processing
JP4760572B2 (en) * 2006-06-30 2011-08-31 ソニー株式会社 Editing apparatus, editing method, and program
US8275814B2 (en) * 2006-07-12 2012-09-25 Lg Electronics Inc. Method and apparatus for encoding/decoding signal
US20080019281A1 (en) * 2006-07-21 2008-01-24 Microsoft Corporation Reuse of available source data and localizations
US7769363B2 (en) 2006-08-01 2010-08-03 Chew Gregory T H User-initiated communications during multimedia content playback on a mobile communications device
US20080065693A1 (en) * 2006-09-11 2008-03-13 Bellsouth Intellectual Property Corporation Presenting and linking segments of tagged media files in a media services network
WO2008032739A1 (en) * 2006-09-12 2008-03-20 Panasonic Corporation Content imaging device
TW200832370A (en) * 2006-09-29 2008-08-01 Sony Corp Reproducing device and method, information generation device and method, data storage medium, data structure, program storage medium, and program
JP2008099012A (en) * 2006-10-12 2008-04-24 Mitsubishi Electric Corp Content reproduction system and content storage system
US8271553B2 (en) 2006-10-19 2012-09-18 Lg Electronics Inc. Encoding method and apparatus and decoding method and apparatus
KR101317204B1 (en) * 2006-11-27 2013-10-10 삼성전자주식회사 Method for generating frame information on moving image and apparatus thereof
KR100827241B1 (en) * 2006-12-18 2008-05-07 삼성전자주식회사 Apparatus and method of organizing a template for generating moving image
FR2910769B1 (en) * 2006-12-21 2009-03-06 Thomson Licensing Sas METHOD FOR CREATING A SUMMARY OF AUDIOVISUAL DOCUMENT COMPRISING A SUMMARY AND REPORTS, AND RECEIVER IMPLEMENTING THE METHOD
EP2122482B1 (en) 2007-01-05 2018-11-14 Sonic IP, Inc. Video distribution system including progressive playback
US8671346B2 (en) * 2007-02-09 2014-03-11 Microsoft Corporation Smart video thumbnail
KR100864524B1 (en) * 2007-02-14 2008-10-21 주식회사 드리머 Method of processing digital broadcasting data application and computer-readable medium having thereon program performing function embodying the same
JP4511569B2 (en) * 2007-03-26 2010-07-28 株式会社エヌ・ティ・ティ・ドコモ Remote viewing system and remote viewing method
JP4469868B2 (en) * 2007-03-27 2010-06-02 株式会社東芝 Explanation expression adding device, program, and explanation expression adding method
WO2008129600A1 (en) * 2007-04-05 2008-10-30 Sony Computer Entertainment Inc. Content reproduction apparatus, content delivery apparatus, content delivery system, and method for generating metadata
EP1983755A1 (en) * 2007-04-17 2008-10-22 Thomson Licensing Method to transmit video data in a data stream and associated metadata
JPWO2008136466A1 (en) * 2007-05-01 2010-07-29 株式会社Dep Movie editing device
KR100935862B1 (en) * 2007-07-06 2010-01-07 드리머 System for providing contents based on media reproduction apparatus
JP4360428B2 (en) * 2007-07-19 2009-11-11 ソニー株式会社 Recording apparatus, recording method, computer program, and recording medium
JP4420085B2 (en) * 2007-08-20 2010-02-24 ソニー株式会社 Data processing apparatus, data processing method, program, and recording medium
KR101268987B1 (en) * 2007-09-11 2013-05-29 삼성전자주식회사 Method and apparatus for recording multimedia data by automatically generating/updating metadata
KR20090031142A (en) * 2007-09-21 2009-03-25 삼성전자주식회사 A method for providing gui to display related contents when contents are made by user, and a multimedia apparatus thereof
KR101034758B1 (en) * 2007-10-04 2011-05-17 에스케이 텔레콤주식회사 Method for Providing Initial Behavior of Multimedia Application Format Content and System therefor
US20090158157A1 (en) * 2007-12-14 2009-06-18 Microsoft Corporation Previewing recorded programs using thumbnails
JP2009152927A (en) * 2007-12-21 2009-07-09 Sony Corp Playback method and playback system of contents
KR20090079010A (en) * 2008-01-16 2009-07-21 삼성전자주식회사 Method and apparatus for displaying program information
JP5033653B2 (en) * 2008-01-21 2012-09-26 株式会社日立製作所 Video recording / reproducing apparatus and video reproducing apparatus
JP5188260B2 (en) * 2008-05-08 2013-04-24 キヤノン株式会社 Image processing apparatus, image processing method, program thereof, and storage medium
US20090287655A1 (en) * 2008-05-13 2009-11-19 Bennett James D Image search engine employing user suitability feedback
WO2009149440A1 (en) * 2008-06-06 2009-12-10 Divx, Inc. Multimedia distribution and playback systems and methods using enhanced metadata structures
KR20100002032A (en) * 2008-06-24 2010-01-06 삼성전자주식회사 Image generating method, image processing method, and apparatus thereof
US20090317062A1 (en) * 2008-06-24 2009-12-24 Samsung Electronics Co., Ltd. Image processing method and apparatus
US20090315981A1 (en) * 2008-06-24 2009-12-24 Samsung Electronics Co., Ltd. Image processing method and apparatus
US20090315980A1 (en) * 2008-06-24 2009-12-24 Samsung Electronics Co., Image processing method and apparatus
JP5649273B2 (en) * 2008-08-25 2015-01-07 株式会社東芝 Information processing apparatus, information processing method, and information processing program
JP5091806B2 (en) * 2008-09-01 2012-12-05 株式会社東芝 Video processing apparatus and method
JP5322550B2 (en) * 2008-09-18 2013-10-23 三菱電機株式会社 Program recommendation device
US8239359B2 (en) * 2008-09-23 2012-08-07 Disney Enterprises, Inc. System and method for visual search in a video media player
KR101592943B1 (en) * 2008-12-11 2016-02-12 엘지전자 주식회사 Mobile Terminal And Method Of Managing Data Thereof
KR20110032610A (en) * 2009-09-23 2011-03-30 삼성전자주식회사 Apparatus and method for scene segmentation
KR20110047768A (en) * 2009-10-30 2011-05-09 삼성전자주식회사 Apparatus and method for displaying multimedia contents
CN102065237B (en) * 2009-11-13 2014-12-24 新奥特(北京)视频技术有限公司 Title generator capable of being convenient to process title files
EP2499612A4 (en) * 2009-11-13 2013-07-31 Samsung Electronics Co Ltd Method and apparatus for managing data
JP2011130279A (en) * 2009-12-18 2011-06-30 Sony Corp Content providing server, content reproducing apparatus, content providing method, content reproducing method, program and content providing system
WO2011087814A2 (en) * 2009-12-21 2011-07-21 Estefano Emilio Isaias Video segment management and distribution system and method
JP2011188342A (en) * 2010-03-10 2011-09-22 Sony Corp Information processing apparatus, information processing method, and program
JP5913788B2 (en) * 2010-03-25 2016-04-27 ソニー株式会社 Content server, content providing system, and content providing method
KR101746453B1 (en) * 2010-04-12 2017-06-13 삼성전자주식회사 System and Method for Processing Sensory Effect
US8825488B2 (en) * 2010-04-12 2014-09-02 Adobe Systems Incorporated Method and apparatus for time synchronized script metadata
US9276986B2 (en) * 2010-04-27 2016-03-01 Nokia Technologies Oy Systems, methods, and apparatuses for facilitating remote data processing
GB2481185A (en) * 2010-05-28 2011-12-21 British Broadcasting Corp Processing audio-video data to produce multi-dimensional complex metadata
US8806340B2 (en) * 2010-09-01 2014-08-12 Hulu, LLC Method and apparatus for embedding media programs having custom user selectable thumbnails
JP5671288B2 (en) * 2010-09-24 2015-02-18 任天堂株式会社 Information processing apparatus, information processing program, information processing method, and information processing system
US8587672B2 (en) 2011-01-31 2013-11-19 Home Box Office, Inc. Real-time visible-talent tracking system
US9264484B1 (en) 2011-02-09 2016-02-16 Google Inc. Attributing preferences to locations for serving content
CN102143001B (en) * 2011-04-02 2013-10-09 西南科技大学 Semantic comprehension-based audio resource management method
GB2491894A (en) * 2011-06-17 2012-12-19 Ant Software Ltd Processing supplementary interactive content in a television system
US9146909B2 (en) * 2011-07-27 2015-09-29 Qualcomm Incorporated Web browsing enhanced by cloud computing
KR101315608B1 (en) * 2011-09-27 2013-10-18 엘지전자 주식회사 Contents managing method and display apparatus thereof
US20130089301A1 (en) * 2011-10-06 2013-04-11 Chi-cheng Ju Method and apparatus for processing video frames image with image registration information involved therein
US9536044B2 (en) 2011-12-06 2017-01-03 Microsoft Technology Licensing, Llc Metadata extraction pipeline
US9525642B2 (en) 2012-01-31 2016-12-20 Db Networks, Inc. Ordering traffic captured on a data connection
US9100291B2 (en) * 2012-01-31 2015-08-04 Db Networks, Inc. Systems and methods for extracting structured application data from a communications link
TWI461955B (en) * 2012-02-20 2014-11-21 Univ Nat Cheng Kung Malicious scripts suppression system and suppression method thereof
CN104126307B (en) 2012-02-29 2018-02-06 杜比实验室特许公司 Processor and method are created for improved image procossing and the image metadata of content transmission
KR101332834B1 (en) * 2012-04-03 2013-11-27 모젼스랩(주) A method for providing service related exhibition based on the ontology
SE537206C2 (en) * 2012-04-11 2015-03-03 Vidispine Ab Method and system for searching digital content
US20140136545A1 (en) 2012-10-05 2014-05-15 International Business Machines Corporation Multi-tier Indexing Methodology for Scalable Mobile Device Data Collection
TWI474201B (en) * 2012-10-17 2015-02-21 Inst Information Industry Construction system scene fragment, method and recording medium
KR101537665B1 (en) * 2013-02-26 2015-07-20 주식회사 알티캐스트 Method and apparatus for contents play
EP2809078A1 (en) * 2013-05-27 2014-12-03 Thomson Licensing Method and apparatus for managing metadata files
EP2809077A1 (en) * 2013-05-27 2014-12-03 Thomson Licensing Method and apparatus for classification of a file
BR122020018850B1 (en) 2013-07-30 2023-01-31 Dolby Laboratories Licensing Corporation METHOD FOR USING STABLE SCENE METADATA IN A VIDEO ENCODER AND METHOD FOR USING STABLE SCENE METADATA IN A VIDEO DECODER
US20150331551A1 (en) * 2014-05-14 2015-11-19 Samsung Electronics Co., Ltd. Image display apparatus, image display method, and computer-readable recording medium
EP3136655B1 (en) * 2014-05-19 2019-09-11 Huawei Technologies Co., Ltd. Multimedia display method, device and equipment
KR101640317B1 (en) 2014-11-20 2016-07-19 소프트온넷(주) Apparatus and method for storing and searching image including audio and video data
KR102380979B1 (en) * 2015-01-05 2022-04-01 삼성전자 주식회사 Image metadata managing method and apparatus
KR102306538B1 (en) 2015-01-20 2021-09-29 삼성전자주식회사 Apparatus and method for editing content
JP2016144080A (en) * 2015-02-03 2016-08-08 ソニー株式会社 Information processing device, information processing system, information processing method, and program
US9554160B2 (en) * 2015-05-18 2017-01-24 Zepp Labs, Inc. Multi-angle video editing based on cloud video sharing
US10462524B2 (en) * 2015-06-23 2019-10-29 Facebook, Inc. Streaming media presentation system
US10187684B2 (en) 2015-06-23 2019-01-22 Facebook, Inc. Streaming media presentation system
US10375443B2 (en) 2015-07-31 2019-08-06 Rovi Guides, Inc. Method for enhancing a user viewing experience when consuming a sequence of media
US10142583B1 (en) * 2015-10-16 2018-11-27 Tribune Broadcasting Company, Llc Computing system with external speaker detection feature
US10645465B2 (en) * 2015-12-21 2020-05-05 Centurylink Intellectual Property Llc Video file universal identifier for metadata resolution
US11023417B2 (en) * 2017-05-30 2021-06-01 Home Box Office, Inc. Video content graph including enhanced metadata
CN108829881B (en) * 2018-06-27 2021-12-03 深圳市腾讯网络信息技术有限公司 Video title generation method and device
JP6688368B1 (en) * 2018-11-13 2020-04-28 西日本電信電話株式会社 Video content structuring device, video content structuring method, and computer program
CN112150778A (en) * 2019-06-29 2020-12-29 华为技术有限公司 Environmental sound processing method and related device
KR102250642B1 (en) * 2019-10-31 2021-05-11 테크온미디어(주) Decentralized and distributed contents management system for efeective contents ditribution and computing device for executing the same
JP2021132281A (en) * 2020-02-19 2021-09-09 Jcc株式会社 Metadata generating system and metadata generating method
US20210319230A1 (en) * 2020-04-10 2021-10-14 Gracenote, Inc. Keyframe Extractor
US11526612B2 (en) 2020-09-22 2022-12-13 International Business Machines Corporation Computer file metadata segmentation security system

Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5119465A (en) * 1989-06-19 1992-06-02 Digital Equipment Corporation System for selectively converting plurality of source data structures through corresponding source intermediate structures, and target intermediate structures into selected target structure
US5986675A (en) * 1996-05-24 1999-11-16 Microsoft Corporation System and method for animating an object in three-dimensional space using a two-dimensional input device
US6266673B1 (en) * 1997-10-31 2001-07-24 Oracle Corporation Performing operations on objects in a database system in a response to a request that specifies references that indicate where the objects reside
US6370316B1 (en) * 1999-07-13 2002-04-09 Matsushita Electric Industrial Co., Ltd. Apparatus for retrieving and administrating moving pictures and related network system
US20020079833A1 (en) * 1997-09-01 2002-06-27 Seiko Epson Corporation Electroluminescent element
US20030110501A1 (en) * 2001-12-12 2003-06-12 Rafey Richter A. Personalizing media presentations based on a target duration
US6606117B1 (en) * 1997-09-15 2003-08-12 Canon Kabushiki Kaisha Content information gathering apparatus system and method
US6646676B1 (en) * 2000-05-17 2003-11-11 Mitsubishi Electric Research Laboratories, Inc. Networked surveillance and control system
US6651253B2 (en) * 2000-11-16 2003-11-18 Mydtv, Inc. Interactive system and method for generating metadata for programming events
US6704029B1 (en) * 1999-04-13 2004-03-09 Canon Kabushiki Kaisha Method and apparatus for specifying scene information in a moving picture
US6711590B1 (en) * 1998-07-10 2004-03-23 Canon Kabushiki Kaisha Linking metadata with a time-sequential digital signal
US6751623B1 (en) * 1998-01-26 2004-06-15 At&T Corp. Flexible interchange of coded multimedia facilitating access and streaming
US7082255B1 (en) * 1999-10-22 2006-07-25 Lg Electronics Inc. Method for providing user-adaptive multi-level digest stream
US7212972B2 (en) * 1999-12-08 2007-05-01 Ddi Corporation Audio features description method and audio video features description collection construction method
US7254570B2 (en) * 2001-03-21 2007-08-07 Nokia Corporation Query resolution system and service
US7362946B1 (en) * 1999-04-12 2008-04-22 Canon Kabushiki Kaisha Automated visual image editing system

Family Cites Families (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5758180A (en) * 1993-04-15 1998-05-26 Sony Corporation Block resizing function for multi-media editing which moves other blocks in response to the resize only as necessary
JP3194837B2 (en) 1994-07-19 2001-08-06 日本電信電話株式会社 Representative screen extraction method and apparatus
JP3144285B2 (en) 1995-11-30 2001-03-12 松下電器産業株式会社 Video processing equipment
JP3529540B2 (en) 1996-03-07 2004-05-24 Kddi株式会社 Video search device
WO1997034240A1 (en) * 1996-03-15 1997-09-18 University Of Massachusetts Compact tree for storage and retrieval of structured hypermedia documents
US20010038719A1 (en) * 1996-10-14 2001-11-08 Nikon Corporation Information processing apparatus
KR100284574B1 (en) * 1997-10-27 2001-03-15 정선종 Automatic Object Storage System and How It Works
JPH11238071A (en) 1998-02-20 1999-08-31 Toshiba Corp Device and method for digest generation
US6085198A (en) * 1998-06-05 2000-07-04 Sun Microsystems, Inc. Integrated three-tier application framework with automated class and table generation
KR100279735B1 (en) * 1998-11-20 2001-02-01 정선종 Multimedia Content Delivery Method Using Metadata
JP2000253337A (en) * 1999-02-24 2000-09-14 Sony Corp Method and device for controlling screen, method and device for reproducing video, method and device for recording video information, and computer readable recording medium
JP2001008136A (en) 1999-06-21 2001-01-12 Victor Co Of Japan Ltd Authoring device for multimedia data
JP2001119666A (en) * 1999-08-16 2001-04-27 Univ Of Washington Method of interactive processing of video sequence, storage medium thereof and system
AU7588100A (en) * 1999-09-16 2001-04-17 Ixl Enterprises, Inc. System and method for linking media content
KR100373371B1 (en) * 1999-09-20 2003-02-25 한국전자통신연구원 Method for searching video data using meta data weighting determination scheme
JP3738631B2 (en) * 1999-09-27 2006-01-25 三菱電機株式会社 Image search system and image search method
WO2001024046A2 (en) 1999-09-29 2001-04-05 Xml-Global Technologies, Inc. Authoring, altering, indexing, storing and retrieving electronic documents embedded with contextual markup
KR100371813B1 (en) 1999-10-11 2003-02-11 한국전자통신연구원 A Recorded Medium for storing a Video Summary Description Scheme, An Apparatus and a Method for Generating Video Summary Descriptive Data, and An Apparatus and a Method for Browsing Video Summary Descriptive Data Using the Video Summary Description Scheme
JP3478331B2 (en) * 1999-10-29 2003-12-15 株式会社リコー Structure display method and computer-readable recording medium storing a program for causing a computer to execute the method
JP2001167099A (en) * 1999-12-07 2001-06-22 Nippon Hoso Kyokai <Nhk> Data base system based on video/audio object
JP2001167109A (en) 1999-12-08 2001-06-22 Kddi Corp Method for constituting group of feature descriptions of audio video information
WO2001069936A2 (en) 2000-03-13 2001-09-20 Sony Corporation Method and apparatus for generating compact transcoding hints metadata
KR100739031B1 (en) * 2000-03-27 2007-07-25 주식회사 큐론 Method of mpeg-7 meta data hiding and detection to retrieve multimedia for multimedia indexing retrieval system
JP3517631B2 (en) * 2000-05-08 2004-04-12 株式会社リコー Digest video storage method and digest video storage device
JP2001320693A (en) * 2000-05-12 2001-11-16 Sony Corp Service providing device and method, reception terminal and method, service providing system
JP4953496B2 (en) * 2000-05-15 2012-06-13 ソニー株式会社 Content search / presentation system and method, and software storage medium
JP2001326901A (en) 2000-05-18 2001-11-22 Sharp Corp Information management device for moving picture scene
JP3600186B2 (en) * 2000-05-19 2004-12-08 株式会社リコー Video search device
JP2001357008A (en) * 2000-06-14 2001-12-26 Mitsubishi Electric Corp Device and method for retrieving and distributing contents
AUPQ867700A0 (en) 2000-07-10 2000-08-03 Canon Kabushiki Kaisha Delivering multimedia descriptions
KR100369222B1 (en) * 2000-07-26 2003-01-24 (학)창성학원 Digital Video Searching And Authoring Tool
US7054508B2 (en) * 2000-08-03 2006-05-30 Canon Kabushiki Kaisha Data editing apparatus and method
JP4403643B2 (en) * 2000-08-04 2010-01-27 ソニー株式会社 Program viewing service system
US6959326B1 (en) * 2000-08-24 2005-10-25 International Business Machines Corporation Method, system, and program for gathering indexable metadata on content at a data repository
JP4536940B2 (en) * 2001-01-26 2010-09-01 キヤノン株式会社 Image processing apparatus, image processing method, storage medium, and computer program
JP2003067397A (en) 2001-06-11 2003-03-07 Matsushita Electric Ind Co Ltd Content control system
US20030088876A1 (en) * 2001-11-08 2003-05-08 Liberate Technologies Video on demand gateway
JP3826043B2 (en) * 2002-01-31 2006-09-27 キヤノン株式会社 Information processing apparatus and method
JP3826048B2 (en) * 2002-02-15 2006-09-27 キヤノン株式会社 Information processing apparatus and method
JP2007179435A (en) * 2005-12-28 2007-07-12 Sony Corp Information processor, information processing method, and program
US8386438B2 (en) * 2009-03-19 2013-02-26 Symantec Corporation Method for restoring data from a monolithic backup

Patent Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5119465A (en) * 1989-06-19 1992-06-02 Digital Equipment Corporation System for selectively converting plurality of source data structures through corresponding source intermediate structures, and target intermediate structures into selected target structure
US5986675A (en) * 1996-05-24 1999-11-16 Microsoft Corporation System and method for animating an object in three-dimensional space using a two-dimensional input device
US20020079833A1 (en) * 1997-09-01 2002-06-27 Seiko Epson Corporation Electroluminescent element
US6606117B1 (en) * 1997-09-15 2003-08-12 Canon Kabushiki Kaisha Content information gathering apparatus system and method
US6266673B1 (en) * 1997-10-31 2001-07-24 Oracle Corporation Performing operations on objects in a database system in a response to a request that specifies references that indicate where the objects reside
US6751623B1 (en) * 1998-01-26 2004-06-15 At&T Corp. Flexible interchange of coded multimedia facilitating access and streaming
US6711590B1 (en) * 1998-07-10 2004-03-23 Canon Kabushiki Kaisha Linking metadata with a time-sequential digital signal
US7362946B1 (en) * 1999-04-12 2008-04-22 Canon Kabushiki Kaisha Automated visual image editing system
US6704029B1 (en) * 1999-04-13 2004-03-09 Canon Kabushiki Kaisha Method and apparatus for specifying scene information in a moving picture
US6370316B1 (en) * 1999-07-13 2002-04-09 Matsushita Electric Industrial Co., Ltd. Apparatus for retrieving and administrating moving pictures and related network system
US7082255B1 (en) * 1999-10-22 2006-07-25 Lg Electronics Inc. Method for providing user-adaptive multi-level digest stream
US7212972B2 (en) * 1999-12-08 2007-05-01 Ddi Corporation Audio features description method and audio video features description collection construction method
US6646676B1 (en) * 2000-05-17 2003-11-11 Mitsubishi Electric Research Laboratories, Inc. Networked surveillance and control system
US6651253B2 (en) * 2000-11-16 2003-11-18 Mydtv, Inc. Interactive system and method for generating metadata for programming events
US7254570B2 (en) * 2001-03-21 2007-08-07 Nokia Corporation Query resolution system and service
US20030110501A1 (en) * 2001-12-12 2003-06-12 Rafey Richter A. Personalizing media presentations based on a target duration

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090123086A1 (en) * 2005-10-31 2009-05-14 Sharp Kabushiki Kaisha View environment control system
US20170308251A1 (en) * 2010-12-03 2017-10-26 Facebook, Inc. User Interface with Media Wheel Facilitating Viewing of Media Objects
US10891032B2 (en) 2012-04-03 2021-01-12 Samsung Electronics Co., Ltd Image reproduction apparatus and method for simultaneously displaying multiple moving-image thumbnails
US20140007159A1 (en) * 2012-06-29 2014-01-02 Moda E Tecnologia S.R.L. Video streams management system for a television apparatus
US9197926B2 (en) 2013-06-05 2015-11-24 International Business Machines Corporation Location based determination of related content
US9912924B2 (en) 2014-10-06 2018-03-06 Samsung Electronics Co., Ltd. Image forming apparatus, image forming method, image processing apparatus and image processing method thereof
US11350069B2 (en) 2015-04-29 2022-05-31 Samsung Electronics Co., Ltd. Source device and control method thereof, and sink device and image quality improvement processing method thereof

Also Published As

Publication number Publication date
KR20080064875A (en) 2008-07-09
EP2200315A1 (en) 2010-06-23
KR100912984B1 (en) 2009-08-20
EP2202649A1 (en) 2010-06-30
KR20100055541A (en) 2010-05-26
JP4652462B2 (en) 2011-03-16
US20050149557A1 (en) 2005-07-07
CN100367794C (en) 2008-02-06
US20080071836A1 (en) 2008-03-20
KR100918725B1 (en) 2009-09-24
KR100997599B1 (en) 2010-11-30
SG152905A1 (en) 2009-06-29
CN1647528A (en) 2005-07-27
US20080075431A1 (en) 2008-03-27
JP4987907B2 (en) 2012-08-01
KR20040108726A (en) 2004-12-24
JPWO2003088665A1 (en) 2005-08-25
EP1496701A4 (en) 2009-01-14
EP1496701A1 (en) 2005-01-12
AU2003221185A1 (en) 2003-10-27
US20100005070A1 (en) 2010-01-07
TWI231140B (en) 2005-04-11
CA2482431C (en) 2013-03-19
JP2009171621A (en) 2009-07-30
CA2664732C (en) 2013-07-02
EP2202648A1 (en) 2010-06-30
EP2202979A1 (en) 2010-06-30
KR20060097767A (en) 2006-09-15
JP2009171624A (en) 2009-07-30
WO2003088665A1 (en) 2003-10-23
EP2202978A1 (en) 2010-06-30
CA2664732A1 (en) 2003-10-23
EP2202977A1 (en) 2010-06-30
KR100986401B1 (en) 2010-10-08
TW200306114A (en) 2003-11-01
US8811800B2 (en) 2014-08-19
US20080071838A1 (en) 2008-03-20
JP2009171623A (en) 2009-07-30
US20080071837A1 (en) 2008-03-20
JP2009171622A (en) 2009-07-30
US7826709B2 (en) 2010-11-02
CA2482431A1 (en) 2003-10-23

Similar Documents

Publication Publication Date Title
US8811800B2 (en) Metadata editing apparatus, metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus, metadata delivery method and hint information description method
US7181757B1 (en) Video summary description scheme and method and system of video summary description data generation for efficient overview and browsing
KR100493674B1 (en) Multimedia data searching and browsing system
CA2387404A1 (en) Video summary description scheme and method and system of video summary description data generation for efficient overview and browsing
JP4732418B2 (en) Metadata processing method
CN101132528A (en) Metadata reproduction apparatus, metadata delivery apparatus, metadata search apparatus, metadata re-generation condition setting apparatus
JP4652389B2 (en) Metadata processing method
Kim et al. MPEG-7-based metadata generator and its browser

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION