WO2003084241A2 - Context-adaptive macroblock type encoding/decoding methods and apparatuses - Google Patents

Context-adaptive macroblock type encoding/decoding methods and apparatuses

Info

Publication number
WO2003084241A2
Authority
WO
WIPO (PCT)
Prior art keywords
macroblock
picture
macroblocks
type
macroblock type
Prior art date
Application number
PCT/US2003/007882
Other languages
French (fr)
Other versions
WO2003084241A3 (en)
Inventor
Gregory J. Conklin
Original Assignee
Realnetworks, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Realnetworks, Inc. filed Critical Realnetworks, Inc.
Priority to US10/508,597 priority Critical patent/US7978765B2/en
Priority to AU2003214181A priority patent/AU2003214181A1/en
Publication of WO2003084241A2 publication Critical patent/WO2003084241A2/en
Publication of WO2003084241A3 publication Critical patent/WO2003084241A3/en

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157 Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/13 Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • H04N19/189 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
    • H04N19/196 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters
    • H04N19/197 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters including determination of the initial value of an encoding parameter
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/593 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
    • H04N19/60 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding

Definitions

  • the present invention relates to the field of video encoding/decoding. More specifically, the present invention is related to the encoding of macroblock types of macroblocks of pictures of video, and the decoding of the encodings.
  • Video devices include but are not limited to digital camcorders, digital versatile disk (DVD) players, video enabled laptop and desktop computing devices as well as servers, and so forth.
  • DVD digital versatile disk
  • Video delivery and rendering often involve encoding and decoding to reduce the amount of data to be stored, retrieved and/or transmitted.
  • Encoding/decoding of a video often involves processing the video as a stream of pictures. Each picture may be a field or a frame (typically consisting of two interleaved fields) comprising a number of macroblocks.
  • Each picture may be typed, e.g. an I-type, a P-type, or a B-type (also referred to as I picture, P picture and B picture).
  • An I picture is a picture coded using information only from itself.
  • a P picture is a picture coded using motion compensated prediction from previously-decoded reference fields or frames, using at most one motion vector and reference picture to predict the value of each individual region.
  • a B picture is a "predictive-coded" picture, where some macroblocks may use a weighted average of two distinct motion-compensated prediction values for the prediction of the macroblock sample values.
  • Each macroblock typically comprises tiles of pixels, e.g. tiles of 16 x 16 pixels.
  • each macroblock is typically typed, with the macroblock type indicating the specific method to encode (and therefore decode) this group of pixels, e.g. whether coding (and therefore decoding) is based on global motion, local motion, and so forth.
  • each macroblock type itself is typically coded into a codeword, along with coding of other aspects of the macroblock, e.g. its transform coefficients and so forth.
  • macroblock type is typically encoded in a static, i.e. non-adaptive, variable length encoding (VLC) manner.
  • VLC variable length encoding
  • ITU-T Recommendation H.263 (ITU-T stands for International Telecommunication Union - Telecommunication Standardisation Sector).
  • Figure 1 illustrates an overview of a context-adaptive encoder of the present invention for encoding macroblock types of macroblocks of a picture, in accordance with one embodiment
  • Figure 2 illustrates the operational flow of the relevant aspects of the encoder block of Fig. 1 for encoding macroblock types of macroblocks of a picture, in accordance with one embodiment
  • Figure 3a illustrates processing of macroblocks of a picture, in accordance with one embodiment
  • Figure 3b illustrates the neighboring macroblocks which macroblock types are considered in the selection of a codeword table for use to encode the macroblock type of a macroblock, in accordance with one embodiment
  • Figure 4 illustrates an overview of a context-adaptive decoder of the present invention for decoding macroblock type encodings generated in accordance with principles similar to those practiced by the encoder of Fig. 1, in accordance with one embodiment
  • Figure 5 illustrates the operational flow of the relevant aspects of the decoder block of Fig. 4 for decoding adaptively generated macroblock type encodings of macroblocks of a picture, in accordance with one embodiment
  • Figure 6 illustrates a video device having an encoder and a decoder incorporated with the encoding/decoding teachings of the present invention, in accordance with one embodiment
  • Figure 7 illustrates an article of manufacture with a recordable medium having a software implementation of the encoder/decoder of the present invention, designed for use to program a device to equip the device with the encoding/decoding capability of the present invention, in accordance with one embodiment
  • FIG. 8 illustrates a system having a video sender device and a video receiver device incorporated with the encoding/decoding teachings of the present invention, in accordance with one embodiment.
  • the present invention includes a context-adaptive macroblock type encoder, a complementary decoder, devices equipped with these encoders and/or decoders, systems made up of such devices, and methods of operations of these elements, devices and systems, and related subject matters.
  • various aspects of the present invention will be described. However, it will be apparent to those skilled in the art that the present invention may be practiced with only some or all aspects of the present invention. For purposes of explanation, specific numbers, materials and configurations are set forth in order to provide a thorough understanding of the present invention. However, it will be apparent to one skilled in the art that the present invention may be practiced without the specific details. In other instances, well-known features are omitted or simplified in order not to obscure the present invention.
  • Section Headings, Order of Descriptions and Embodiments Section headings are merely employed to improve readability, and they are not to be construed to restrict or narrow the present invention.
  • Encoder Figure 1 illustrates an overview of a context-adaptive encoder of the present invention for encoding macroblock types of macroblocks of a picture, in accordance with one embodiment.
  • context-adaptive encoder 100 includes codeword tables 102, coding logic/writer 104, macroblock type buffer 106, coupled to each other and to input 108 as shown, to receive macroblock types of macroblocks of pictures of a video.
  • the macroblock types are received in the form of a stream of binary data.
  • In response, for each received macroblock type, coding logic/writer 104 uses one or more macroblock type related characteristics of one or more macroblocks neighboring the macroblock to select one of codeword tables 102, and encodes the macroblock type in accordance with the selected one of codeword tables 102. Coding logic/writer 104 further outputs the codewords into a bit stream at output 110.
  • macroblock type buffer 106 is employed to store at least the macroblock types of the neighboring macroblocks of interest.
  • buffer 106 has sufficient capacity to store the macroblock types of all macroblocks of a picture, and for each macroblock type of a macroblock to be encoded, coding logic/writer 104 reads out only the macroblock types of the neighboring macroblocks of interest.
  • the macroblocks of a picture are processed left-to-right, top-to-bottom, starting with the top leftmost macroblock, as depicted by arrows 304a-304c in Figure 3a, superimposed on the macroblocks of an example picture 302.
  • the neighboring macroblocks 312a-312d whose macroblock types are considered in the selection of a codeword table 102 to encode the macroblock type of a macroblock 312e of a picture comprise a) macroblock 312d immediately preceding macroblock 312e "at the same horizontal level", i.e. to the left of macroblock 312e, if present, b) macroblock 312b immediately "above" macroblock 312e vertically, if present, c) macroblock 312a immediately preceding macroblock 312b "at the same horizontal level", i.e. to the left of macroblock 312b, if present, and d) macroblock 312c immediately following macroblock 312b "at the same horizontal level", i.e. to the right of macroblock 312b, if present.
  • neighboring macroblocks whose macroblock types are considered of interest include "preceding" macroblocks which are immediately adjacent to the macroblock whose macroblock type is to be encoded, in both a horizontal and a vertical direction, as well as "preceding" macroblocks which are one degree removed from the macroblock whose macroblock type is to be encoded. In alternate embodiments, the macroblock types of more or fewer preceding neighboring macroblocks may be considered.
  • Macroblocks 312a and 312d are “not present”, when the current macroblock which macroblock type is to be encoded is located at the left edge of the picture.
  • macroblocks 312a-312c are “not present”, when the current macroblock which macroblock type is to be encoded is located at the top edge of the picture.
  • the selection of a codeword table 102 is based at least in part on a macroblock type characteristic of the neighboring macroblocks of interest. More specifically, in various embodiments, the selection of a codeword table 102 is based at least in part on the most common macroblock type of the neighboring macroblocks of interest. In alternate embodiments, one or more other characteristics may be employed in the selection of the codeword tables 102, in addition to or in lieu of the most common macroblock type of the neighboring macroblocks of interest. Further, in various embodiments, the adaptive encoding of macroblock types of the present invention is practiced for pictures of certain picture types only. In various embodiments, it is practiced for P pictures and B pictures only.
  • the selection of a codeword table 102 is based at least in part on the picture type of the current picture which macroblocks' macroblock types are being encoded.
  • the picture type of a picture may be at least one of n picture types, n being an integer, and different sets of codeword tables are employed in the encoding of macroblock types of macroblocks of pictures of the different types. In various embodiments, n equals two.
  • the two picture types will simply be referred to as picture type I and picture type II.
  • the macroblock type of a macroblock of a Type I picture may be one of m1 macroblock types.
  • the set of codeword tables to be adaptively employed to encode macroblock types of macroblocks of a picture of Type I comprises m1 codeword tables, each having m1 codewords.
  • the codewords are VLC codewords, and m1 equals seven.
  • the macroblock type of a macroblock of a Type II picture may be one of m2 macroblock types.
  • the set of codeword tables to be adaptively employed to encode macroblock types of macroblocks of a picture of Type II comprises m2 codeword tables, each having m2 codewords.
  • the codewords are VLC codewords, and m2 equals six.
  • the exact meaning of each of the macroblock types of macroblocks of a picture of a particular type is also non-essential to the practice of the present invention. Accordingly, for ease of understanding, they shall simply be referred to as macroblock type A1 through macroblock type G1, in the case where there are seven macroblock types, and macroblock type A2 through F2, in the case where there are six macroblock types.
  • the codeword table selection criteria comprises the most common macroblock type characteristic of the neighboring macroblocks of interest and there are seven possible macroblock types for the macroblocks of a picture
  • the codeword tables for encoding macroblock types of the macroblocks of the picture may be
  • the codeword table selection criteria comprises the most common macroblock type characteristic of the neighboring macroblocks of interest and there are six possible macroblock types for the macroblocks of a picture
  • the codeword tables for encoding macroblock types of the macroblocks of the picture may be
  • MC also equals "Most Common Macroblock Type" of the neighboring macroblocks of interest.
  • a tie breaking rule may be a precedence rule.
  • the precedence rule may be employed as the selection criteria, and the macroblock type may be one of seven macroblock types, the precedence rule may be
  • precedence value 1 is highest and 7 is lowest.
  • the precedence rule may be
  • precedence value 1 is highest and 6 is lowest.
  • precedence rules are merely exemplary. They do not suggest that the precedence values of a precedence rule have to have either an ascending or a descending correlation with the manner in which the macroblock types are "labeled".
  • the present invention includes all possible combinations of macroblock type labeling and precedence ordering.
  • Figure 2 illustrates the operational flow of the relevant aspects of coding logic/writer 104 of Fig. 1 for adaptively encoding macroblock types of macroblocks of a picture, in accordance with one embodiment.
  • coding logic/writer 104 may first determine a picture type of a picture, if appropriate. Typically, the determination is performed once per picture. The determination may e.g. involve examining a picture type indicator in one or more of the leading data bits of a picture.
  • the macroblock type may be received in stream as illustrated in Fig. 1 or also retrieved from buffer 106 after it has been received and stored.
  • coding logic/writer 104 obtains macroblock type related characteristic data of neighboring macroblocks of interest.
  • coding logic/writer 104 retrieves from macroblock type buffer 106 the macroblock types of up to 4 macroblocks of interest as earlier described.
  • coding logic/writer 104 determines at least one macroblock type characteristic of the neighboring macroblocks of interest. In one embodiment, coding logic/writer 104 determines the most common macroblock type among the neighboring macroblocks of interest (employing one or more tie breaking rules, such as a precedence rule, if necessary).
  • coding logic/writer 104 selects one of the codeword tables 102 based at least in part on the one or more determined macroblock type characteristics of the neighboring macroblocks of interest. In various embodiments, the selection is further based on the picture type of the picture of which the macroblock (which macroblock type is to be encoded) is a member.
  • coding logic/writer 104 encodes the macroblock type of the macroblock accordingly, using an appropriate one of the codewords of the selected codeword table, and outputs the encoding, i.e. the VLC codeword (in embodiments where VLC codewords are used).
  • encoder 100 including codeword table 102, coding logic/writer 104, and macroblock type buffer 106 may be implemented in hardware, e.g. via application specific integrated circuit (ASIC), or in software, e.g. in programming languages such as C, or a combination of both.
  • ASIC application specific integrated circuit
  • coding logic/writer 104 also generates an encoding (DQUANT) indicating whether quantization parameters of the macroblocks have changed.
  • DQUANT is also looked up from the same codeword table 102 selected to encode a macroblock type of the macroblock.
  • the codeword tables 102 may further include the following DQUANT codewords, one each for the corresponding seven codeword tables:
  • the codeword tables 102 may further include the following DQUANT codewords, one each for the corresponding six codeword tables:
  • MC also equals "Most Common Macroblock Type" of the neighboring macroblocks of interest.
  • Decoder Figure 4 illustrates an overview of a context-adaptive decoder of the present invention for decoding macroblock type encodings generated as earlier described, in accordance with one embodiment.
  • context-adaptive decoder 400 is similarly constituted as encoder 100, having codeword tables 402, decoding logic/reader 404 and macroblock type buffer 406 coupled to each other and to input 410 as shown, to receive a bit stream comprising macroblock types encoded in codewords generated in accordance with the same principles as earlier described.
  • In response, for each received macroblock type encoding, decoding logic/reader 404 uses one or more macroblock types of one or more macroblocks neighboring the macroblock to select one of codeword tables 402, and decodes the macroblock type encoding in accordance with the selected one of codeword tables 402. Decoding logic/reader 404 further outputs the decoded macroblock type into a bit stream at output 110.
  • macroblock type buffer 406 is employed to store at least the decoded macroblock types of the neighboring macroblocks of interest.
  • buffer 406 has sufficient capacity to store the decoded macroblock types of all macroblocks of a picture, and for each macroblock type of a macroblock to be decoded, decoding logic/reader 404 reads out only the decoded macroblock type of the macroblocks of interest.
  • the selection of an appropriate one of codeword tables 402 for use in the decoding of a macroblock type encoding is complementary to the manner in which an appropriate one of codeword tables 102 is selected for use in encoding. That is, an appropriate one of codeword tables 402 is selected based at least in part on one or more macroblock type related attributes of the neighboring macroblocks of interest, if the appropriate one of codeword tables 102 is so selected.
  • selection of an appropriate one of codeword tables 402 is based at least in part on the most common macroblock type of the neighboring macroblocks of interest, if selection of an appropriate one of codeword tables 102 is so based.
  • One or more tie breaking rules corresponding to the ones used during encoding may be used during decoding.
  • Selection of an appropriate one of codeword tables 402 is further based on the picture type of the picture of which the macroblock is a member, if selection of an appropriate one of codeword tables 102 is so further based.
  • the neighboring macroblocks of interest are those illustrated in Fig. 3b, if they are the neighboring macroblocks of interest during encoding.
  • the codeword tables employed for pictures of different picture types are the tables earlier described, if they are the tables employed for encoding.
  • FIG. 5 illustrates the operational flow of the relevant aspects of decoding logic/reader 404 of Fig. 4 for adaptively decoding encoded macroblock types of macroblocks of a picture, in accordance with one embodiment.
  • decoding logic/reader 404 may first determine a picture type of a picture, if appropriate. Typically, the determination is performed once per picture. The determination may e.g. involve examining a picture type indicator in one or more of the leading data bits of a picture.
  • decoding logic/reader 404 obtains macroblock type related characteristic data of neighboring macroblocks of interest. In one embodiment, decoding logic/reader 404 retrieves from macroblock type buffer 406 the macroblock types of up to 4 macroblocks of interest as earlier described. At block 506, decoding logic/reader 404 determines at least one macroblock type characteristic of the neighboring macroblocks of interest. In one embodiment, decoding logic/reader 404 determines the most common macroblock type among the neighboring macroblocks of interest (employing a tie breaking rule if necessary). At block 508, decoding logic/reader 404 selects one of the codeword tables 402 based at least in part on the one or more determined macroblock type characteristics of the neighboring macroblocks of interest.
  • the selection is further based on the picture type of the picture of which the macroblock is a member.
  • decoding logic/reader 404 decodes the encoded macroblock type of the macroblock accordingly, using an appropriate one of the codewords of the selected codeword table, and outputs the decoded macroblock type.
  • decoder 400 including codeword table 402, decoding logic/reader 404, and macroblock type buffer 406 may be similarly implemented in hardware, e.g. via application specific integrated circuit (ASIC), or in software, e.g. in programming languages such as C, or a combination of both.
  • decoding logic/reader 404 accommodates the presence of an encoding (DQUANT) inter-mixed among the macroblock type encodings, with DQUANT, as earlier described, indicating whether quantization parameters of the macroblocks have changed.
  • the encoding to be recognized is also looked up from the same codeword table 402 selected to decode a macroblock type of the macroblock.
  • the codeword tables 402 may further include the DQUANT codewords, one each for the corresponding codeword tables, as set forth above.
  • video device 600 includes encoder 610 and decoder 620 coupled to the inputs and outputs of the device.
  • encoder 610 is designed to receive macroblock types of macroblocks of pictures of a video, and to adaptively encode them in response, into VLC codewords 634a.
  • Decoder 620 is designed to receive VLC codewords 634b of the macroblock types of macroblocks of pictures of another video, and to adaptively decode in response the codewords back into macroblock types 632b.
  • Encoder 610 and decoder 620 are similarly constituted as the earlier described encoder 100 and decoder 400.
  • encoder 610 and decoder 620 may share at least in part their constituting tables and coding/decoding logics (as denoted by the intersecting blocks of encoder 610 and decoder 620).
  • video device 600 may be a wireless mobile phone, a palm sized computing device, such as a personal digital assistant, a laptop computing device, a desktop computing device, a server, and other computing devices of the like.
  • video device 600 may be a circuit board component, such as a video "add-on" circuit board (also referred to as a daughter circuit board), a motherboard, and other circuit boards of the like.
  • video device 600 may include encoder 610 only, as in the case of a video camera, or decoder 620 only, as in the case of a DVD player, a television, a display monitor, or a set-top box.
  • Figure 7 illustrates an article of manufacture including a recordable medium 700 having programming instructions implementing a software embodiment of the earlier described encoder 100 and/or decoder 400.
  • the programming instructions are designed for use to program video device 710 to equip video device 710 with the encoding and decoding capabilities of the present invention.
  • video device 710 includes storage medium 712 to store at least a portion of a working copy of the programming instructions implementing the software embodiment of encoder 100 and/or decoder 400, and at least one processor 714 coupled to storage medium 712 to execute the programming instructions.
  • Video device 710 may be any one of the earlier enumerated example devices or other video devices of the like.
  • Article 700 may e.g. be a diskette, a compact disk (CD), a DVD or other computer readable medium of the like.
  • article 700 may be a distribution server distributing encoder 100 and/or decoder 400 on line, via private and/or public networks, such as the Internet.
  • article 700 is a web server.
  • Figure 8 illustrates an example system having video sender 802 and video receiver 804 communicatively coupled to each other as shown, with video sender 802 encoding a video in accordance with the teachings of the present invention and providing the encoded video to video receiver 804, and video receiver 804, in turn, decoding the encoded video to render the video.
  • Video sender 802 and video receiver 804 are equipped with the earlier described encoder 100 and decoder 400 respectively.
  • video sender 802 is a video server
  • video receiver 804 is a client device coupled to video sender 802.
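
The context determination described above for Fig. 3b (up to four preceding neighbors, the most common macroblock type among them, and a precedence rule to break ties) can be sketched as follows. This is a minimal illustration, not the patented implementation: the precedence values in `PRECEDENCE`, the function names, and the fallback used when no neighbor is present are all assumptions, since the actual precedence tables are not reproduced in this text.

```python
from collections import Counter

# Hypothetical precedence ranks (1 = highest). The real rule is not given here.
PRECEDENCE = {"A1": 1, "B1": 2, "C1": 3, "D1": 4, "E1": 5, "F1": 6, "G1": 7}


def neighbor_types(mb_types, row, col):
    """Return the macroblock types of the up-to-4 preceding neighbors.

    mb_types is a 2-D list of macroblock types in raster order. Neighbors
    that fall outside the picture are "not present" and are skipped.
    """
    width = len(mb_types[0])
    coords = [
        (row, col - 1),      # left of the current macroblock (312d)
        (row - 1, col),      # above (312b)
        (row - 1, col - 1),  # above-left (312a)
        (row - 1, col + 1),  # above-right (312c)
    ]
    return [mb_types[r][c] for r, c in coords if r >= 0 and 0 <= c < width]


def most_common_type(types, precedence=PRECEDENCE, default="A1"):
    """Most common type among the neighbors; ties go to the best precedence."""
    if not types:
        return default  # assumption: fallback when no neighbor is present
    counts = Counter(types)
    best = max(counts.values())
    tied = [t for t, n in counts.items() if n == best]
    return min(tied, key=lambda t: precedence[t])


picture = [["A1", "B1", "B1"],
           ["C1", "D1", "A1"]]
context = most_common_type(neighbor_types(picture, 1, 1))
```

The returned value would then serve as the key for choosing one of the codeword tables. Counting only neighbors that are present keeps the same rule usable at the left, top and right edges of the picture.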

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computing Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Macroblock types of macroblocks of a video picture are encoded by adaptively employing codewords of codeword tables, based at least in part on one or more macroblock type related characteristics of one or more neighboring macroblocks of interest. The codewords may be variable in length. The one or more macroblock type characteristics may include a most common macroblock type characteristic of the neighboring macroblocks of interest. The adaptive employment of the codeword tables may be further based on a picture type of the picture of which the macroblocks are members. Decoding may be performed in an inverse manner.
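
The scheme summarized in the abstract can be sketched end to end as a round trip. The sketch below is illustrative only and rests on several assumptions: it uses three macroblock types instead of six or seven, the codewords in `TABLES` and the tie-break order in `ORDER` are invented, and the bit stream is modeled as a string of "0" and "1" characters. What it does show is the key property of the approach: the decoder can select the same table as the encoder because the context depends only on macroblock types that have already been decoded.

```python
from collections import Counter

# Hypothetical tables: one per context (most common neighbor type), each
# mapping every macroblock type to a prefix-free VLC codeword.
TABLES = {
    "A": {"A": "0", "B": "10", "C": "11"},
    "B": {"B": "0", "A": "10", "C": "11"},
    "C": {"C": "0", "A": "10", "B": "11"},
}
ORDER = "ABC"  # assumed tie-break precedence, highest first


def context(types, row, col, width):
    """Most common type among the left, above, above-left and above-right
    neighbors that are present; falls back to ORDER[0] if none is present."""
    coords = [(row, col - 1), (row - 1, col),
              (row - 1, col - 1), (row - 1, col + 1)]
    seen = [types[r][c] for r, c in coords if r >= 0 and 0 <= c < width]
    if not seen:
        return ORDER[0]
    counts = Counter(seen)
    return max(ORDER, key=lambda t: (counts[t], -ORDER.index(t)))


def encode(types):
    """Encode a 2-D grid of macroblock types in raster order."""
    width = len(types[0])
    bits = []
    for r, row in enumerate(types):
        for c, mb_type in enumerate(row):
            table = TABLES[context(types, r, c, width)]
            bits.append(table[mb_type])
    return "".join(bits)


def decode(bits, height, width):
    """Inverse of encode(): rebuild the grid from the bit string."""
    decoded = [[None] * width for _ in range(height)]
    pos = 0
    for r in range(height):
        for c in range(width):
            table = TABLES[context(decoded, r, c, width)]
            inverse = {code: t for t, code in table.items()}
            end = pos + 1
            # Grow the candidate codeword until it matches a table entry.
            while bits[pos:end] not in inverse:
                if end >= len(bits):
                    raise ValueError("truncated or invalid bit string")
                end += 1
            decoded[r][c] = inverse[bits[pos:end]]
            pos = end
    return decoded


picture = [list("AAB"), list("ABB"), list("CBB")]
bits = encode(picture)
restored = decode(bits, height=3, width=3)
```

Because each hypothetical table is prefix-free, the decoder needs no length field: it extends the candidate codeword one bit at a time until it matches an entry in the selected table.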

Description

CONTEXT-ADAPTIVE MACROBLOCK TYPE ENCODING/DECODING METHODS AND APPARATUSES
Related Application This application is a non-provisional application of provisional application number 60/366,835, filed 03/22/02, "Adaptive Macroblock Type Coding for Block Based Video Compression", which specification is hereby fully incorporated by reference.
FIELD OF THE INVENTION The present invention relates to the field of video encoding/decoding. More specifically, the present invention is related to the encoding of macroblock types of macroblocks of pictures of video, and the decoding of the encodings.
BACKGROUND OF THE INVENTION Advances in microprocessor and video related technologies have led to widespread deployment and adoption of numerous types of video devices. Examples of such video devices include but are not limited to digital camcorders, digital versatile disk (DVD) players, video enabled laptop and desktop computing devices as well as servers, and so forth.
Advances in networking, telecommunication, satellite and other related technologies have also led to an increase in on demand and/or real time online delivery of video, including delivery over public networks, such as the Internet. Whether videos are delivered offline (e.g. from a DVD player) or online (e.g. from a video server), high quality video inherently requires a high volume of data. Thus, video delivery and rendering often involve encoding and decoding to reduce the amount of data to be stored, retrieved and/or transmitted. Encoding/decoding of a video often involves processing the video as a stream of pictures. Each picture may be a field or a frame (typically consisting of two interleaved fields) comprising a number of macroblocks.
Each picture may be typed, e.g. an I-type, a P-type, or a B-type (also referred to as an I picture, a P picture and a B picture). An I picture is a picture coded using information only from itself. A P picture is a picture coded using motion compensated prediction from previously-decoded reference fields or frames, using at most one motion vector and reference picture to predict the value of each individual region. A B picture is a "predictive-coded" picture, where some macroblocks may use a weighted average of two distinct motion-compensated prediction values for the prediction of the macroblock sample values.
Each macroblock typically comprises tiles of pixels, e.g. tiles of 16 x 16 pixels. Further, each macroblock is typically typed, with the macroblock type indicating the specific method to encode (and therefore decode) this group of pixels, e.g. whether coding (and therefore decoding) is based on global motion, local motion, and so forth. Moreover, each macroblock type itself is typically coded into a codeword, along with coding of other aspects of the macroblock, e.g. its transform coefficients and so forth.
However, in the prior art, macroblock type is typically encoded in a static, i.e. non-adaptive, variable length coding (VLC) manner. Experience has shown that static VLC encoding of macroblock types of macroblocks of a picture may be inefficient, at least at times.
Thus, it is desirable to encode and decode macroblock types of macroblocks of pictures of a video in a context-adaptive manner that is more effective than the static, non-adaptive techniques known to date.
For further information on macroblock type, and prior art approaches to encoding macroblock type, see e.g. ITU-T Recommendation H.263 (ITU-T stands for International Telecommunication Union - Telecommunication Standardisation Sector).
BRIEF DESCRIPTION OF THE DRAWINGS
The present invention will be described by way of exemplary embodiments, but not limitations, illustrated in the accompanying drawings in which like references denote similar elements, and in which:
Figure 1 illustrates an overview of a context-adaptive encoder of the present invention for encoding macroblock types of macroblocks of a picture, in accordance with one embodiment;
Figure 2 illustrates the operational flow of the relevant aspects of the encoder block of Fig. 1 for encoding macroblock types of macroblocks of a picture, in accordance with one embodiment;
Figure 3a illustrates processing of macroblocks of a picture, in accordance with one embodiment;
Figure 3b illustrates the neighboring macroblocks which macroblock types are considered in the selection of a codeword table for use to encode the macroblock type of a macroblock, in accordance with one embodiment;
Figure 4 illustrates an overview of a context-adaptive decoder of the present invention for decoding macroblock type encodings generated in accordance with principles similar to those practiced by the encoder of Fig. 1, in accordance with one embodiment;
Figure 5 illustrates the operational flow of the relevant aspects of the decoder block of Fig. 4 for decoding adaptively generated macroblock type encodings of macroblocks of a picture, in accordance with one embodiment;
Figure 6 illustrates a video device having an encoder and a decoder incorporated with the encoding/decoding teachings of the present invention, in accordance with one embodiment;
Figure 7 illustrates an article of manufacture with a recordable medium having a software implementation of the encoder/decoder of the present invention, designed for use to program a device to equip the device with the encoding/decoding capability of the present invention, in accordance with one embodiment; and
Figure 8 illustrates a system having a video sender device and a video receiver device incorporated with the encoding/decoding teachings of the present invention, in accordance with one embodiment.
DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION
The present invention includes a context-adaptive macroblock type encoder, a complementary decoder, devices equipped with these encoders and/or decoders, systems made up of such devices, and methods of operations of these elements, devices and systems, and related subject matters.
In the following description, various aspects of the present invention will be described. However, it will be apparent to those skilled in the art that the present invention may be practiced with only some or all aspects of the present invention. For purposes of explanation, specific numbers, materials and configurations are set forth in order to provide a thorough understanding of the present invention. However, it will be apparent to one skilled in the art that the present invention may be practiced without the specific details. In other instances, well-known features are omitted or simplified in order not to obscure the present invention.
Terminology
Parts of the description will be presented in video encoding and decoding terms consistent with the manner commonly employed by those skilled in the art to convey the substance of their work to others skilled in the art. These common video encoding and decoding terms are well understood by those skilled in the art. In particular, in a video device, these quantities may take the form of electrical, magnetic, or optical signals capable of being stored, transferred, combined, and otherwise manipulated through electrical and/or optical components of a processor, and its subsystems.
In various video encoding/decoding standards, encodings are organized in accordance with certain syntactical rules, thus they are also referred to as "syntax elements" at times.
Section Headings, Order of Descriptions and Embodiments
Section headings are merely employed to improve readability, and they are not to be construed to restrict or narrow the present invention.
Various operations will be described as multiple discrete steps in turn, in a manner that is helpful in understanding the present invention; however, the order of description should not be construed as to imply that these operations are necessarily order dependent. In particular, these operations need not be performed in the order of presentation.
The phrase "in one embodiment" is used repeatedly. The phrase generally does not refer to the same embodiment; however, it may. The terms "comprising", "having" and "including" are synonymous, unless the context dictates otherwise.
Encoder
Figure 1 illustrates an overview of a context-adaptive encoder of the present invention for encoding macroblock types of macroblocks of a picture, in accordance with one embodiment. As illustrated, for the embodiment, context-adaptive encoder 100 includes codeword tables 102, coding logic/writer 104, and macroblock type buffer 106, coupled to each other and to input 108 as shown, to receive macroblock types of macroblocks of pictures of a video. Typically, the macroblock types are received in the form of a stream of binary data. In response, for each received macroblock type, coding logic/writer 104 uses one or more macroblock type related characteristics of one or more macroblocks neighboring the macroblock to select one of codeword tables 102, and encodes the macroblock type in accordance with the selected one of codeword tables 102. Coding logic/writer 104 further outputs the codewords into a bit stream at output 110.
In other words, macroblock type buffer 106 is employed to store at least the macroblock types of the neighboring macroblocks of interest. In one embodiment, buffer 106 has sufficient capacity to store the macroblock types of all macroblocks of a picture, and for each macroblock type of a macroblock to be encoded, coding logic/writer 104 reads out only the macroblock types of the neighboring macroblocks of interest.
In various embodiments, the macroblocks of a picture are processed left-to-right, top-to-bottom, starting with the top leftmost macroblock, as depicted by arrows 304a-304c in Figure 3a, superimposed on the macroblocks of an example picture 302.
In various embodiments, as illustrated in Fig. 3b, the neighboring macroblocks 312a-312d whose macroblock types are considered in the selection of a codeword table 102 to encode the macroblock type of a macroblock 312e of a picture comprise a) macroblock 312d immediately preceding macroblock 312e "at the same horizontal level", i.e. to the left of macroblock 312e, if present, b) macroblock 312b immediately "above" macroblock 312e vertically, if present, c) macroblock 312a immediately preceding macroblock 312b "at the same horizontal level", i.e. to the left of macroblock 312b, if present, and d) macroblock 312c immediately following macroblock 312b "at the same horizontal level", i.e. to the right of macroblock 312b, if present.
In other words, for the embodiment, the neighboring macroblocks whose macroblock types are considered of interest include "preceding" macroblocks which are immediately adjacent to the macroblock whose macroblock type is to be encoded, in both a horizontal and a vertical direction, as well as "preceding" macroblocks which are one degree removed from the macroblock whose macroblock type is to be encoded. In alternate embodiments, the macroblock types of more or fewer preceding neighboring macroblocks may be considered.
Macroblocks 312a and 312d are "not present" when the current macroblock whose macroblock type is to be encoded is located at the left edge of the picture. Similarly, macroblocks 312a-312c are "not present" when the current macroblock whose macroblock type is to be encoded is located at the top edge of the picture.
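The neighbor selection described above can be illustrated with a short Python sketch. It assumes the macroblock types of a picture are held in a flat list in processing (left-to-right, top-to-bottom) order; the function name `neighbor_types` and the parameters `mb_types`, `width` and `index` are hypothetical and are used for illustration only. The right-edge check for macroblock 312c is likewise an assumption that follows from the "if present" qualifier.

```python
def neighbor_types(mb_types, width, index):
    """Return the macroblock types of the up-to-four neighbors of interest.

    mb_types: macroblock types in raster order (only entries before
              `index` are read, i.e. already processed macroblocks).
    width:    number of macroblocks per row of the picture.
    index:    raster position of the current macroblock (312e).
    """
    row, col = divmod(index, width)
    positions = []
    if col > 0:
        positions.append(index - 1)              # 312d: left
    if row > 0:
        positions.append(index - width)          # 312b: above
        if col > 0:
            positions.append(index - width - 1)  # 312a: above-left
        if col < width - 1:
            positions.append(index - width + 1)  # 312c: above-right
    return [mb_types[p] for p in positions]


# A picture three macroblocks wide; the second row is still being processed.
picture = ["A", "B", "C", "D", None, None]
print(neighbor_types(picture, 3, 4))  # neighbors of the macroblock at row 1, column 1
```

At the top-left macroblock the returned list is empty, which is why a rule for the "no neighbors present" case is needed in any complete implementation; the text above does not specify one.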
In various embodiments, the selection of a codeword table 102 is based at least in part on a macroblock type characteristic of the neighboring macroblocks of interest. More specifically, in various embodiments, the selection of a codeword table 102 is based at least in part on the most common macroblock type of the neighboring macroblocks of interest. In alternate embodiments, one or more other characteristics may be employed in the selection of the codeword tables 102, in addition to or in lieu of the most common macroblock type of the neighboring macroblocks of interest.
Further, in various embodiments, the adaptive encoding of macroblock types of the present invention is practiced for pictures of certain picture types only. In various embodiments, it is practiced for P pictures and B pictures only. Moreover, in various ones of these embodiments, the selection of a codeword table 102 is based at least in part on the picture type of the current picture whose macroblocks' macroblock types are being encoded. In various embodiments, the picture type of a picture may be at least one of n picture types, n being an integer, and different sets of codeword tables are employed in the encoding of macroblock types of macroblocks of pictures of the different types. In various embodiments, n equals two.
The exact nature of the picture types is non-essential to the practice of the present invention. Accordingly, for ease of understanding, for the two picture type embodiments, the two picture types will simply be referred to as picture type I and picture type II. In various embodiments, the macroblock type of a macroblock of a Type I picture may be one of m1 macroblock types. Accordingly, for the embodiments where the selection criteria comprise one attribute of the neighboring macroblocks of interest, such as the most common macroblock type, the set of codeword tables to be adaptively employed to encode macroblock types of macroblocks of a picture of Type I comprises m1 codeword tables, each having m1 codewords. In various embodiments, the codewords are VLC codewords, and m1 equals seven.
In various embodiments, the macroblock type of a macroblock of a Type II picture may be one of m2 macroblock types. Accordingly, for the embodiments where the selection criteria comprise one attribute of the neighboring macroblocks of interest, such as the most common macroblock type, the set of codeword tables to be adaptively employed to encode macroblock types of macroblocks of a picture of Type II comprises m2 codeword tables, each having m2 codewords. In various embodiments, the codewords are VLC codewords, and m2 equals six.
The exact meaning of each of the macroblock types of macroblocks of a picture of a particular type is also non-essential to the practice of the present invention. Accordingly, for ease of understanding, they shall simply be referred to as macroblock type A1 through macroblock type G1, in the case where there are seven macroblock types, and macroblock type A2 through F2, in the case where there are six macroblock types.
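The shape of the two table sets (m1 = 7 tables of 7 codewords for picture type I, m2 = 6 tables of 6 codewords for picture type II) can be sketched in Python as follows. The codewords generated here are placeholder unary codes, not the codewords of the actual tables, and giving the most common type the shortest codeword is an assumption made only to keep the example concrete. The names `unary_table`, `build_table_set`, `is_prefix_free` and `TABLE_SETS` are hypothetical.

```python
def unary_table(symbols):
    """Map symbols to prefix-free placeholder codewords '1', '01', '001', ..."""
    return {s: "0" * i + "1" for i, s in enumerate(symbols)}


def build_table_set(types):
    """Build one codeword table per possible most-common (MC) type.

    Assumption for illustration: within the table selected by MC,
    the MC type itself receives the shortest codeword.
    """
    return {mc: unary_table([mc] + [t for t in types if t != mc]) for mc in types}


def is_prefix_free(codewords):
    """True if no codeword is a prefix of another, so a VLC stream parses uniquely."""
    words = sorted(codewords)
    return all(not b.startswith(a) for a, b in zip(words, words[1:]))


TYPE_I = ["A1", "B1", "C1", "D1", "E1", "F1", "G1"]   # m1 = 7
TYPE_II = ["A2", "B2", "C2", "D2", "E2", "F2"]        # m2 = 6

# Outer key: picture type; middle key: MC; inner key: macroblock type.
TABLE_SETS = {"I": build_table_set(TYPE_I), "II": build_table_set(TYPE_II)}

print(len(TABLE_SETS["I"]), len(TABLE_SETS["II"]))
print(TABLE_SETS["I"]["C1"]["C1"], TABLE_SETS["I"]["C1"]["A1"])
```

Indexing by picture type first and by MC second mirrors the two selection criteria named in the text; any other layout that yields one table per (picture type, MC) pair would serve equally well.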
In one embodiment where the codeword table selection criteria comprises the most common macroblock type characteristic of the neighboring macroblocks of interest and there are seven possible macroblock types for the macroblocks of a picture, the codeword tables for encoding macroblock types of the macroblocks of the picture may be
Figure imgf000010_0001
Figure imgf000010_0002
Figure imgf000011_0001
where MC = Most Common Macroblock Type of the neighboring macroblocks of interest.
In one embodiment where the codeword table selection criteria comprises the most common macroblock type characteristic of the neighboring macroblocks of interest and there are six possible macroblock types for the macroblocks of a picture, the codeword tables for encoding macroblock types of the macroblocks of the picture may be
Figure imgf000011_0002
where MC also equals "Most Common Macroblock Type" of the neighboring macroblocks of interest.
In various embodiments, where the selection process may end with a tie, such as embodiments employing the "most common macroblock type" among the neighboring macroblocks of interest as the selection criteria, one or more tie breaking rules may be employed to break a tie in the event two or more macroblock types have the same frequency of occurrence. In various embodiments, a tie breaking rule may be a precedence rule. In one embodiment, where the "most common macroblock type" among the neighboring macroblocks of interest is employed as the selection criteria, and the macroblock type may be one of seven macroblock types, the precedence rule may be
Figure imgf000012_0001
where precedence value 1 is highest and 7 is lowest. In another similar embodiment, where there are six possible macroblock types, the precedence rule may be
Figure imgf000012_0002
where precedence value 1 is highest and 6 is lowest. The above precedence rules are merely exemplary. They do not suggest that the precedence values of a precedence rule have to have either an ascending or a descending correlation with the manner in which the macroblock types are "labeled". The present invention includes all possible combinations of macroblock type labeling and precedence ordering.
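A minimal Python sketch of the "most common macroblock type" determination with a precedence-based tie break follows. The precedence order used here is hypothetical, since the exemplary precedence tables are given only in the figures; the fallback to the highest-precedence type when no neighbor is present is also an assumption, as the text does not state what happens in that case. The names `PRECEDENCE` and `most_common_type` are illustrative.

```python
from collections import Counter

# Hypothetical precedence rule: position 0 has precedence value 1 (highest).
PRECEDENCE = ["A1", "B1", "C1", "D1", "E1", "F1", "G1"]


def most_common_type(neighbors, precedence=PRECEDENCE):
    """Return the most frequent type among `neighbors`.

    Ties in frequency are broken in favor of the type that appears
    earliest in `precedence`.
    """
    if not neighbors:
        return precedence[0]  # assumption: default when no neighbor is present
    counts = Counter(neighbors)
    top = max(counts.values())
    tied = [t for t in counts if counts[t] == top]
    return min(tied, key=precedence.index)


print(most_common_type(["B1", "C1", "B1", "A1"]))  # B1 occurs twice
print(most_common_type(["C1", "B1"]))              # tie, resolved by precedence
```

Because the precedence list is passed as a parameter, any ordering can be substituted, consistent with the statement that labeling and precedence need not be correlated.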
Figure 2 illustrates the operational flow of the relevant aspects of coding logic/writer 104 of Fig. 1 for adaptively encoding macroblock types of macroblocks of a picture, in accordance with one embodiment. As illustrated, for the embodiment, at block 202, on receipt of a macroblock type, coding logic/writer 104 may first determine a picture type of a picture, if appropriate. Typically, the determination is performed once per picture. The determination may e.g. involve examining a picture type indicator in one or more of the leading data bits of a picture. The macroblock type may be received in a stream as illustrated in Fig. 1, or retrieved from buffer 106 after it has been received and stored.
At block 204, coding logic/writer 104 obtains macroblock type related characteristic data of neighboring macroblocks of interest. In one embodiment, coding logic/writer 104 retrieves from macroblock type buffer 106 the macroblock types of up to 4 macroblocks of interest as earlier described.
At block 206, coding logic/writer 104 determines at least one macroblock type characteristic of the neighboring macroblocks of interest. In one embodiment, coding logic/writer 104 determines the most common macroblock type among the neighboring macroblocks of interest (employing one or more tie breaking rules, such as a precedence rule, if necessary).
At block 208, coding logic/writer 104 selects one of the codeword tables 102 based at least in part on the one or more determined macroblock type characteristics of the neighboring macroblocks of interest. In various embodiments, the selection is further based on the picture type of the picture of which the macroblock (whose macroblock type is to be encoded) is a member.
At block 210, coding logic/writer 104 encodes the macroblock type of the macroblock accordingly, using an appropriate one of the codewords of the selected codeword table, and outputs the encoding, i.e. the VLC codeword (in embodiments where VLC codewords are used).
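The flow of blocks 202-210 can be sketched end to end in Python. This is an illustrative sketch under several assumptions, not the encoder of the embodiment: the codeword tables are placeholder unary codes generated by the hypothetical `table_for`, the precedence order simply follows the type labels, and the highest-precedence type is used when no neighbor is present. All function and variable names are hypothetical.

```python
from collections import Counter

TYPES = {"I": ["A1", "B1", "C1", "D1", "E1", "F1", "G1"],
         "II": ["A2", "B2", "C2", "D2", "E2", "F2"]}


def table_for(picture_type, mc):
    """Placeholder stand-in for codeword tables 102 (unary codes, MC first)."""
    ordered = [mc] + [t for t in TYPES[picture_type] if t != mc]
    return {t: "0" * i + "1" for i, t in enumerate(ordered)}


def neighbors(buffer, width, i):
    """Types of the left, above, above-left and above-right macroblocks, if present."""
    row, col = divmod(i, width)
    idx = []
    if col > 0:
        idx.append(i - 1)
    if row > 0:
        idx.append(i - width)
        if col > 0:
            idx.append(i - width - 1)
        if col < width - 1:
            idx.append(i - width + 1)
    return [buffer[j] for j in idx]


def most_common(seen, precedence):
    """Most frequent type; ties go to the earliest entry in `precedence`."""
    if not seen:
        return precedence[0]  # assumption: default when no neighbor is present
    counts = Counter(seen)
    top = max(counts.values())
    return min((t for t in counts if counts[t] == top), key=precedence.index)


def encode_picture(picture_type, mb_types, width):
    """Encode the macroblock types of one picture into a string of bits."""
    precedence = TYPES[picture_type]          # block 202: once per picture
    buffer, bits = [], []                     # buffer plays the role of buffer 106
    for i, mb_type in enumerate(mb_types):
        near = neighbors(buffer, width, i)    # block 204
        mc = most_common(near, precedence)    # block 206
        table = table_for(picture_type, mc)   # block 208
        bits.append(table[mb_type])           # block 210
        buffer.append(mb_type)
    return "".join(bits)


print(encode_picture("I", ["A1", "A1", "B1", "A1"], width=2))
```

With these placeholder tables, a macroblock whose type matches the most common type of its neighbors costs a single bit, which illustrates why adapting the table to the context can shorten the output when neighboring types are correlated.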
Referring back to Fig. 1, except for codeword tables 102, the novel employment of buffer 106 to track macroblock types of neighboring macroblocks of interest, and the employment of these elements by coding logic/writer 104 to adaptively select an appropriate codeword table 102 to encode a macroblock type of a macroblock of a picture, other aspects of encoder 100 are known, and are therefore neither illustrated nor described.
In various embodiments, encoder 100, including codeword tables 102, coding logic/writer 104, and macroblock type buffer 106, may be implemented in hardware, e.g. via an application specific integrated circuit (ASIC), or in software, e.g. in programming languages such as C, or a combination of both.
In various embodiments, coding logic/writer 104 also generates an encoding (DQUANT) indicating whether quantization parameters of the macroblocks have changed. In various embodiments, DQUANT is also looked up from the same codeword table 102 selected to encode a macroblock type of the macroblock.
In one implementation of the earlier described codeword table designed for use to encode macroblock types of macroblocks of a picture having seven possible macroblock types, the codeword tables 102 may further include the following DQUANT codewords, one each for the corresponding seven codeword tables:
Figure imgf000014_0001
Figure imgf000015_0001
where MC = Most Common Macroblock Type of the neighboring macroblocks of interest.
In another implementation of the earlier described codeword table designed for use to encode macroblock types of macroblocks of a picture having six possible macroblock types, the codeword tables 102 may further include the following DQUANT codewords, one each for the corresponding six codeword tables:
Figure imgf000015_0002
where MC also equals "Most Common Macroblock Type" of the neighboring macroblocks of interest.
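One way to picture DQUANT sharing a table with the macroblock types is to treat it as an additional entry of each codeword table. The Python sketch below assumes this, and further assumes that the DQUANT codeword is emitted immediately before the macroblock type codeword when the quantization parameter has changed; neither the ordering nor the placeholder unary codewords are taken from the tables of the embodiments. The names `table_with_dquant` and `encode_macroblock` are hypothetical.

```python
TYPES = ["A1", "B1", "C1", "D1", "E1", "F1", "G1"]


def table_with_dquant(types, mc):
    """Placeholder table for context `mc` with one extra DQUANT entry.

    Assumption: DQUANT is given the longest codeword of the table.
    """
    ordered = [mc] + [t for t in types if t != mc] + ["DQUANT"]
    return {s: "0" * i + "1" for i, s in enumerate(ordered)}


def encode_macroblock(table, mb_type, quant_changed):
    """Emit DQUANT (if the quantization parameter changed), then the type."""
    prefix = table["DQUANT"] if quant_changed else ""
    return prefix + table[mb_type]


table = table_with_dquant(TYPES, "B1")
print(encode_macroblock(table, "B1", quant_changed=False))
print(encode_macroblock(table, "A1", quant_changed=True))
```

Because DQUANT lives in the same table as the type codewords, a reader of the bit stream needs no separate table lookup to recognize it.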
Decoder
Figure 4 illustrates an overview of a context-adaptive decoder of the present invention for decoding macroblock type encodings generated as earlier described, in accordance with one embodiment. As illustrated, for the embodiment, context-adaptive decoder 400 is similarly constituted as encoder 100, having codeword tables 402, decoding logic/reader 404 and macroblock type buffer 406 coupled to each other and to input 408 as shown, to receive a bit stream comprising macroblock types encoded in codewords generated in accordance with the same principles as earlier described. In response, for each received macroblock type encoding, decoding logic/reader 404 uses one or more macroblock types of one or more macroblocks neighboring the macroblock to select one of codeword tables 402, and decodes the macroblock type encoding in accordance with the selected one of codeword tables 402. Decoding logic/reader 404 further outputs the decoded macroblock type into a bit stream at output 410.
In other words, macroblock type buffer 406 is employed to store at least the decoded macroblock types of the neighboring macroblocks of interest. In one embodiment, buffer 406 has sufficient capacity to store the decoded macroblock types of all macroblocks of a picture, and for each macroblock type of a macroblock to be decoded, decoding logic/reader 404 reads out only the decoded macroblock types of the macroblocks of interest.
The selection of an appropriate one of codeword tables 402 for use in the decoding of a macroblock type encoding is complementary to the manner in which an appropriate one of codeword tables 102 is selected for use in encoding. That is, an appropriate one of codeword tables 402 is selected based at least in part on one or more macroblock type related attributes of the neighboring macroblocks of interest, if the appropriate one of codeword tables 102 is so selected.
In particular, selection of an appropriate one of codeword tables 402 is based at least in part on the most common macroblock type of the neighboring macroblocks of interest, if selection of an appropriate one of codeword tables 102 is so based. One or more tie breaking rules corresponding to the ones used during encoding may be used during decoding.
Selection of an appropriate one of codeword tables 402 is further based on the picture type of the picture of which the macroblock is a member, if selection of an appropriate one of codeword tables 102 is so further based.
The neighboring macroblocks of interest are those illustrated in Fig. 3b, if they are the neighboring macroblocks of interest during encoding. The codeword tables employed for pictures of different picture types are the tables earlier described, if they are the tables employed for encoding.
Figure 5 illustrates the operational flow of the relevant aspects of decoding logic/reader 404 of Fig. 4 for adaptively decoding encoded macroblock types of macroblocks of a picture, in accordance with one embodiment. As illustrated, for the embodiment, at block 502, on receipt of a macroblock type encoding, decoding logic/reader 404 may first determine a picture type of a picture, if appropriate. Typically, the determination is performed once per picture. The determination may e.g. involve examining a picture type indicator in one or more of the leading data bits of a picture.
At block 504, decoding logic/reader 404 obtains macroblock type related characteristic data of neighboring macroblocks of interest. In one embodiment, decoding logic/reader 404 retrieves from macroblock type buffer 406 the macroblock types of up to 4 macroblocks of interest as earlier described.
At block 506, decoding logic/reader 404 determines at least one macroblock type characteristic of the neighboring macroblocks of interest. In one embodiment, decoding logic/reader 404 determines the most common macroblock type among the neighboring macroblocks of interest (employing a tie breaking rule if necessary).
At block 508, decoding logic/reader 404 selects one of the codeword tables 402 based at least in part on the one or more determined macroblock type characteristics of the neighboring macroblocks of interest. In various embodiments, the selection is further based on the picture type of the picture of which the macroblock is a member.
At block 510, decoding logic/reader 404 decodes the encoded macroblock type of the macroblock accordingly, using an appropriate one of the codewords of the selected codeword table, and outputs the decoded macroblock type.
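The complementary decoding flow of blocks 502-510 can be sketched in Python as well. As with any sketch of this scheme, the decoder must derive the same context as the encoder from already-decoded macroblock types, so the table choice never has to be signaled. The tables here are placeholder unary codes produced by the hypothetical `table_for`, the precedence order follows the type labels, and the highest-precedence type is assumed when no neighbor is present; all names are illustrative.

```python
from collections import Counter

TYPES = {"I": ["A1", "B1", "C1", "D1", "E1", "F1", "G1"],
         "II": ["A2", "B2", "C2", "D2", "E2", "F2"]}


def table_for(picture_type, mc):
    """Placeholder stand-in for codeword tables 402 (unary codes, MC first)."""
    ordered = [mc] + [t for t in TYPES[picture_type] if t != mc]
    return {t: "0" * i + "1" for i, t in enumerate(ordered)}


def neighbors(buffer, width, i):
    """Decoded types of the left, above, above-left and above-right macroblocks."""
    row, col = divmod(i, width)
    idx = []
    if col > 0:
        idx.append(i - 1)
    if row > 0:
        idx.append(i - width)
        if col > 0:
            idx.append(i - width - 1)
        if col < width - 1:
            idx.append(i - width + 1)
    return [buffer[j] for j in idx]


def most_common(seen, precedence):
    """Most frequent type; ties go to the earliest entry in `precedence`."""
    if not seen:
        return precedence[0]  # assumption: default when no neighbor is present
    counts = Counter(seen)
    top = max(counts.values())
    return min((t for t in counts if counts[t] == top), key=precedence.index)


def decode_picture(picture_type, bits, width, count):
    """Decode `count` macroblock types from the bit string `bits`."""
    precedence = TYPES[picture_type]                       # block 502
    buffer, pos = [], 0                                    # buffer plays the role of buffer 406
    for i in range(count):
        near = neighbors(buffer, width, i)                 # block 504
        mc = most_common(near, precedence)                 # block 506
        inverse = {cw: t for t, cw in table_for(picture_type, mc).items()}  # block 508
        end = pos + 1
        while end <= len(bits) and bits[pos:end] not in inverse:
            end += 1                                       # extend until a codeword matches
        if end > len(bits):
            raise ValueError("truncated or invalid bit stream")
        buffer.append(inverse[bits[pos:end]])              # block 510
        pos = end
    return buffer


print(decode_picture("I", "11011", width=2, count=4))
```

Growing the candidate one bit at a time works because the placeholder codewords are prefix-free; a production decoder would more likely use a lookup tree or table for speed.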
Referring back to Fig. 4, except for codeword tables 402, the novel employment of buffer 406 to track macroblock types of neighboring macroblocks of interest, and the employment of these elements by decoding logic/reader 404 to adaptively select an appropriate codeword table 402 to decode a macroblock type encoding for a macroblock of a picture, other aspects of decoder 400 are known, and are therefore neither illustrated nor described.
In various embodiments, decoder 400 including codeword table 402, decoding logic/reader 404, and macroblock type buffer 406 may be similarly implemented in hardware, e.g. via application specific integrated circuit (ASIC), or in software, e.g. in programming languages such as C, or a combination of both.
In various embodiments, decoding logic/reader 404 accommodates the presence of an encoding (DQUANT) inter-mixed among the macroblock type encodings, with DQUANT, as earlier described, indicating whether quantization parameters of the macroblocks have changed.
In various embodiments, the encoding to be recognized is also looked up from the same codeword table 402 selected to decode a macroblock type of the macroblock.
In one implementation of the earlier described codeword tables designed for use with macroblock types of macroblocks of a picture having seven or six possible macroblock types, the codeword tables 402 may further include the DQUANT codewords, one each for the corresponding codeword tables, as set forth above.
Example Applications of the Present Invention
Figure 6 illustrates a video device incorporated with the teachings of the present invention, in accordance with one embodiment. As illustrated, video device 600 includes encoder 610 and decoder 620 coupled to the inputs and outputs of the device. As described earlier, encoder 610 is designed to receive macroblock types of macroblocks of pictures of a video, and to adaptively encode them in response, into VLC codewords 634a. Decoder 620 is designed to receive VLC codewords 634b of the macroblock types of macroblocks of pictures of another video, and to adaptively decode in response the codewords back into macroblock types 632b.
Encoder 610 and decoder 620 are similarly constituted as the earlier described encoder 100 and decoder 400. In various embodiments, encoder 610 and decoder 620 may share at least in part their constituting tables and coding/decoding logics (as denoted by the intersecting blocks of encoder 610 and decoder 620).
In various embodiments, video device 600 may be a wireless mobile phone, a palm sized computing device, such as a personal digital assistant, a laptop computing device, a desktop computing device, a server, and other computing devices of the like. In other embodiments, video device 600 may be a circuit board component, such as a video "add-on" circuit board (also referred to as a daughter circuit board), a motherboard, and other circuit boards of the like. In yet other embodiments, instead of having both encoder 610 and decoder 620, video device 600 may include encoder 610 only, as in the case of a video camera, or decoder 620 only, as in the case of a DVD player, a television, a display monitor, or a set-top box.
Figure 7 illustrates an article of manufacture including a recordable medium 700 having programming instructions implementing a software embodiment of the earlier described encoder 100 and/or decoder 400. The programming instructions are designed for use to program video device 710 to equip video device 710 with the encoding and decoding capabilities of the present invention.
For the embodiment, video device 710 includes storage medium 712 to store at least a portion of a working copy of the programming instructions implementing the software embodiment of encoder 100 and/or decoder 400, and at least one processor 714 coupled to storage medium 712 to execute the programming instructions.
Video device 710 may be any one of the earlier enumerated example devices or other video devices of the like. Article 700 may e.g. be a diskette, a compact disk (CD), a DVD or other computer readable medium of the like. In other embodiments, article 700 may be a distribution server distributing encoder 100 and/or decoder 400 on line, via private and/or public networks, such as the Internet. In one embodiment, article 700 is a web server.
Figure 8 illustrates an example system having video sender 802 and video receiver 804 communicatively coupled to each other as shown, with video sender 802 encoding a video in accordance with the teachings of the present invention, and providing the encoded video to video receiver 804, and video receiver 804, in turn, decoding the encoded video to render the video. Video sender 802 and video receiver 804 are equipped with the earlier described encoder 100 and decoder 400 respectively.
An example of video sender 802 is a video server, whereas an example of a video receiver 804 is a client device coupled to video sender 802.
Conclusion and Epilogue
Thus, it can be seen from the above descriptions that a novel method for encoding and decoding macroblock types of macroblocks of a picture, including encoders, decoders, devices and systems incorporated with the method, has been described.
While the present invention has been described in terms of the foregoing embodiments and example applications, those skilled in the art will recognize that the invention is not limited to the embodiments and example applications described. The present invention can be practiced with modification and alteration within the spirit and scope of the appended claims. For example, different numbers of encoder/decoder blocks, different numbers of codeword tables in the various encoder/decoder blocks, different codeword tables, and different codeword table selection logic may be employed.
Thus, the description is to be regarded as illustrative instead of restrictive on the present invention.

Claims

CLAIMS What is claimed is:
1. An apparatus comprising: storage medium; and a plurality of codeword tables stored in said storage medium, with each of said plurality of codeword tables having a plurality of codewords, to be selectively accessed, based at least in part on a macroblock type characteristic of one or more neighboring macroblocks of a macroblock of a picture, for performing a selected one of encoding a macroblock type of the macroblock of the picture; and decoding a macroblock type of a macroblock of the picture.
2. The apparatus of claim 1, wherein the plurality of codeword tables are to be selectively accessed based also on a picture type of the picture.
3. The apparatus of claim 1, wherein the one or more neighboring macroblocks comprise first one or more macroblocks immediately adjacent to the macroblock.
4. The apparatus of claim 3, wherein the one or more neighboring macroblocks further comprises second one or more macroblocks immediately adjacent to one of the first one or more macroblocks.
5. The apparatus of claim 1, wherein the macroblock type characteristic comprises a most common macroblock type of the neighboring macroblocks of the macroblock.
6. The apparatus of claim 5, wherein the codeword tables comprise at least one of the following codeword tables
Figure imgf000022_0001
Figure imgf000022_0002
7. The apparatus of claim 5, wherein the codeword tables comprise at least one of the following codeword tables
Figure imgf000022_0003
Figure imgf000023_0001
8. The apparatus of claim 5, wherein the most common macroblock type comprises a first macroblock type having equal frequency of occurrence among the one or more neighboring macroblocks, with at least a second macroblock type, but precedence over that of the second macroblock type.
9. The apparatus of claim 1, wherein at least one of the codeword tables further comprises a codeword to encode whether a quantization parameter of the macroblock has changed (DQUANT).
10. The apparatus of claim 9, wherein at least one of the DQUANTs is a selected one of "0000000", "01100", and "100000".
11. The apparatus of claim 9, wherein at least one of the DQUANTs is a selected one of "000000", "01000", and "10000".
12. The apparatus of claim 1, wherein the apparatus further comprises logic coupled to the plurality of codeword tables to perform at least one of encoding a macroblock type of a macroblock of a picture; and decoding a macroblock type of a macroblock of a picture.
13. The apparatus of claim 1, wherein the apparatus further comprises a processor coupled to the storage medium to selectively access said codewords of said codeword tables to perform said encoding/decoding.
14. The apparatus of claim 1, wherein the apparatus comprises a selected one of a palm sized computing device, a wireless mobile phone, a digital personal assistant, a laptop computing device, a desktop computing device, a set-top box, a server, a compact disk player, a digital versatile disk player, a television, and a display monitor.
15. The apparatus of claim 1, wherein the apparatus comprises a video daughter card and a motherboard having integrated video capability.
16. An article of manufacture comprising: a recordable medium; and a plurality of codeword tables recorded on the recordable medium to be retrieved to program an apparatus, with each of said plurality of codeword tables having a plurality of codewords, to be selectively accessed, based at least in part on a macroblock type characteristic of one or more neighboring macroblocks of a macroblock of a picture, for performing a selected one of encoding a macroblock type of the macroblock of the picture; and decoding a macroblock type of a macroblock of the picture.
17. The article of claim 16, wherein the plurality of codeword tables are to be selectively accessed based also on a picture type of the picture.
18. The article of claim 16, wherein the one or more neighboring macroblocks comprise first one or more macroblocks immediately adjacent to the macroblock.
19. The article of claim 18, wherein the one or more neighboring macroblocks further comprises second one or more macroblocks immediately adjacent to one of the first one or more macroblocks.
20. The article of claim 16, wherein the macroblock type characteristic comprises a most common macroblock type of the neighboring macroblocks of the macroblock.
21. The article of claim 20, wherein the codeword tables comprise at least one of the following codeword tables
Figure imgf000025_0001
Figure imgf000025_0002
22. The article of claim 20, wherein the codeword tables comprise at least one of the following codeword tables
Figure imgf000026_0001
23. The article of claim 20, wherein the most common macroblock type comprises a first macroblock type having equal frequency of occurrence among the one or more neighboring macroblocks, with a second macroblock type, but precedence over that of the second macroblock type.
24. The article of claim 16, wherein at least one of the codeword tables further comprises a codeword to encode whether a quantization parameter of the macroblock has changed (DQUANT).
25. The article of claim 16, wherein the article further comprises programming instructions recorded on the recordable medium, designed to access the plurality of codeword tables to perform at least one of encoding a macroblock type of a macroblock of a picture; and decoding a macroblock type of a macroblock of a picture.
26. A video encoding/decoding method comprising: determining a macroblock type characteristic of one or more neighboring macroblocks of a macroblock of a picture; selecting a codeword table comprising a plurality of codewords, based at least in part on the determined macroblock type characteristic of the one or more neighboring macroblocks of the macroblock; and performing a selected one of encoding and decoding of a macroblock type of the macroblock of the picture, using an appropriate one of the codewords of the selected codeword table.
27. The method of claim 26, wherein the method further comprises determining a picture type of the picture, and said selecting is further based on the determined picture type of the picture.
28. The method of claim 26, wherein the one or more neighboring macroblocks comprise first one or more macroblocks immediately adjacent to the macroblock.
29. The method of claim 28, wherein the one or more neighboring macroblocks further comprises second one or more macroblocks immediately adjacent to one of the first one or more macroblocks.
30. The method of claim 26, wherein the macroblock type characteristic comprises a most common macroblock type of the neighboring macroblocks of the macroblock.
31. The method of claim 30, wherein the codeword tables comprise at least one of the following codeword tables
Figure imgf000028_0001
Figure imgf000028_0002
32. The method of claim 30, wherein the codeword tables comprise at least one of the following codeword tables
Figure imgf000028_0003
Figure imgf000029_0001
33. The method of claim 30, wherein the most common macroblock type comprises a first macroblock type having equal frequency of occurrence among the one or more neighboring macroblocks, with a second macroblock type, but precedence over that of the second macroblock type.
34. The method of claim 26, wherein at least one of the codeword tables further comprises a codeword to encode whether a quantization parameter of the macroblock has changed (DQUANT).
35. The method of claim 34, wherein at least one of the DQUANTs is a selected one of "0000000", "01100", and "100000".
36. The method of claim 34, wherein at least one of the DQUANTs is a selected one of "000000", "01000", and "10000".
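
The method of claims 26, 30 and 33 can be illustrated with a minimal sketch. It is not the claimed implementation: the macroblock type labels, their precedence order, the fallback used when no neighbors are available, and the codewords themselves are all hypothetical, chosen only to show the mechanism (the claimed codeword tables appear only as figures and are not reproduced). Selection by picture type (claim 27) and DQUANT codewords (claim 34) are left out for brevity.

```python
from collections import Counter

# Hypothetical macroblock type labels, listed in tie-break precedence order
# (earlier entries win ties). The real types and order are not given here.
PRECEDENCE = ["SKIP", "INTER16x16", "INTER8x8", "INTRA"]

# Illustrative prefix-free codewords, shortest first. These are NOT the
# codewords of the claimed tables.
CODEWORDS = ["0", "10", "110", "111"]


def most_common_type(neighbor_types):
    """Return the most common neighbor type, breaking ties by precedence."""
    if not neighbor_types:
        # Assumption: with no available neighbors, fall back to the first type.
        return PRECEDENCE[0]
    counts = Counter(neighbor_types)
    # Highest count wins; on equal counts the lower precedence index wins.
    return max(PRECEDENCE, key=lambda t: (counts[t], -PRECEDENCE.index(t)))


def select_table(neighbor_types):
    """Build the codeword table for the context given by the neighbors."""
    context = most_common_type(neighbor_types)
    # The context type gets the shortest codeword; the rest keep precedence order.
    order = [context] + [t for t in PRECEDENCE if t != context]
    return dict(zip(order, CODEWORDS))


def encode_mb_type(mb_type, neighbor_types):
    """Encode a macroblock type with the table selected by its neighbors."""
    return select_table(neighbor_types)[mb_type]


def decode_mb_type(bits, neighbor_types):
    """Decode one macroblock type; return (type, remaining bits)."""
    inverse = {code: t for t, code in select_table(neighbor_types).items()}
    for end in range(1, len(bits) + 1):
        if bits[:end] in inverse:
            return inverse[bits[:end]], bits[end:]
    raise ValueError("no codeword matches the start of the bitstream")
```

Because the decoder derives the same context from already-decoded neighbors, no table index has to be transmitted. In this sketch a macroblock whose type matches the most common neighbor type always costs one bit, which is the intended benefit of context-adaptive table selection.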
PCT/US2003/007882 2002-03-22 2003-03-12 Context-adaptive macroblock type encoding/decoding methods and apparatuses WO2003084241A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US10/508,597 US7978765B2 (en) 2002-03-22 2003-03-12 Context-adaptive macroblock type encoding/decoding methods and apparatuses
AU2003214181A AU2003214181A1 (en) 2002-03-22 2003-03-12 Context-adaptive macroblock type encoding/decoding methods and apparatuses

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US36683502P 2002-03-22 2002-03-22
US60/366,835 2002-03-22

Publications (2)

Publication Number Publication Date
WO2003084241A2 true WO2003084241A2 (en) 2003-10-09
WO2003084241A3 WO2003084241A3 (en) 2004-04-01

Family

ID=28675286

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/007882 WO2003084241A2 (en) 2002-03-22 2003-03-12 Context-adaptive macroblock type encoding/decoding methods and apparatuses

Country Status (3)

Country Link
US (1) US7978765B2 (en)
AU (1) AU2003214181A1 (en)
WO (1) WO2003084241A2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4979355B2 (en) * 2006-11-30 2012-07-18 パナソニック株式会社 Image coding apparatus and image coding method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0613300A2 (en) * 1993-01-18 1994-08-31 Sony Corporation Apparatus for encoding and decoding header data in picture signal transmission
WO2000033583A1 (en) * 1998-11-30 2000-06-08 Microsoft Corporation Efficient macroblock header coding for video compression
US20010022855A1 (en) * 1996-11-07 2001-09-20 Matsushita Electric Industrial Co., Ltd Image coding method and an image coding apparatus
US20010043653A1 (en) * 1997-03-26 2001-11-22 Kazuhusa Hosaka Method and apparatus for image encoding method and appartus for image decoding and recording medium
DE10143063A1 (en) * 2001-01-08 2002-09-05 Siemens Ag Header compression during video encoding involves encoding all possible/most frequently occurring header elements with code word table, transmitting code words instead of elements

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5428396A (en) * 1991-08-03 1995-06-27 Sony Corporation Variable length coding/decoding method for motion vectors
US5400075A (en) * 1993-01-13 1995-03-21 Thomson Consumer Electronics, Inc. Adaptive variable length encoder/decoder
US5493513A (en) * 1993-11-24 1996-02-20 Intel Corporation Process, apparatus and system for encoding video signals using motion estimation
JP3013698B2 (en) * 1994-04-20 2000-02-28 松下電器産業株式会社 Vector quantization encoding device and decoding device
JP3474005B2 (en) * 1994-10-13 2003-12-08 沖電気工業株式会社 Video coding method and video decoding method
US5729527A (en) * 1995-12-29 1998-03-17 Tellabs Operations, Inc. Fault management in a multichannel transmission system
US5867221A (en) * 1996-03-29 1999-02-02 Iterated Systems, Inc. Method and system for the fractal compression of data using an integrated circuit for discrete cosine transform compression/decompression
JP4034380B2 (en) * 1996-10-31 2008-01-16 株式会社東芝 Image encoding / decoding method and apparatus
US7080319B1 (en) * 1999-09-29 2006-07-18 Lucent Technologies Inc. Technology to translate non-text display generation data representing an indicator into text variables
FI116819B (en) * 2000-01-21 2006-02-28 Nokia Corp Procedure for transferring images and an image encoder
JP3561485B2 (en) * 2000-08-18 2004-09-02 株式会社メディアグルー Coded signal separation / synthesis device, difference coded signal generation device, coded signal separation / synthesis method, difference coded signal generation method, medium recording coded signal separation / synthesis program, and difference coded signal generation program recorded Medium
US6856701B2 (en) * 2001-09-14 2005-02-15 Nokia Corporation Method and system for context-based adaptive binary arithmetic coding

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"Recommendation H.263: Video coding for low bit rate communication" ITU-T DRAFT RECOMMENDATION H.263, XX, XX, February 1998 (1998-02), pages 1-167, XP002176560 cited in the application *
WIEGAND T: "JOINT MODEL NUMBER 1, REVISION 1 (JM-1R1)" ITU STUDY GROUP 16 - VIDEO CODING EXPERTS GROUP, XX, XX, 3 December 2001 (2001-12-03), pages 1,3-75, XP001086627 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012044931A1 (en) * 2010-10-01 2012-04-05 Qualcomm Incorporated Indicating intra-prediction mode selection for video coding
US9025661B2 (en) 2010-10-01 2015-05-05 Qualcomm Incorporated Indicating intra-prediction mode selection for video coding
WO2012094506A1 (en) * 2011-01-06 2012-07-12 Qualcomm Incorporated Indicating intra-prediction mode selection for video coding using cabac
US8913662B2 (en) 2011-01-06 2014-12-16 Qualcomm Incorporated Indicating intra-prediction mode selection for video coding using CABAC
US10171810B2 (en) 2015-06-22 2019-01-01 Cisco Technology, Inc. Transform coefficient coding using level-mode and run-mode

Also Published As

Publication number Publication date
AU2003214181A8 (en) 2003-10-13
US20050147160A1 (en) 2005-07-07
US7978765B2 (en) 2011-07-12
WO2003084241A3 (en) 2004-04-01
AU2003214181A1 (en) 2003-10-13

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 10508597

Country of ref document: US

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP