USRE43567E1 - Methods and systems for image intra-prediction mode estimation - Google Patents

Methods and systems for image intra-prediction mode estimation Download PDF

Info

Publication number
USRE43567E1
USRE43567E1 US12/608,848 US60884809A USRE43567E US RE43567 E1 USRE43567 E1 US RE43567E1 US 60884809 A US60884809 A US 60884809A US RE43567 E USRE43567 E US RE43567E
Authority
US
United States
Prior art keywords
prediction mode
block
prediction
adjacent
mode
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US12/608,848
Inventor
Shijun Sun
Louis Joseph Kerofsky
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby Laboratories Licensing Corp
Original Assignee
Sharp Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sharp Corp filed Critical Sharp Corp
Priority to US12/608,848 priority Critical patent/USRE43567E1/en
Application granted granted Critical
Publication of USRE43567E1 publication Critical patent/USRE43567E1/en
Assigned to DOLBY LABORATORIES LICENSING CORPORATION reassignment DOLBY LABORATORIES LICENSING CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SHARP KABUSHIKI KAISHA
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/189Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
    • H04N19/196Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters
    • H04N19/198Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters including smoothing of a sequence of encoding parameters, e.g. by averaging, by choice of the maximum, minimum or median value
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/11Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/154Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/189Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
    • H04N19/196Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/189Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
    • H04N19/196Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters
    • H04N19/197Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters including determination of the initial value of an encoding parameter
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/593Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/63Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using sub-band based transform, e.g. wavelets

Definitions

  • Embodiments of the present invention relate to intra-prediction for an image.
  • Digital video requires a large amount of data to represent each and every frame of a digital video sequence (e.g., series of frames) in an uncompressed manner. It is not feasible for most applications to transmit uncompressed digital video across computer networks because of bandwidth limitations. In addition, uncompressed digital video requires a large amount of storage space.
  • the digital video is normally encoded in some manner to reduce the storage requirements and reduce the bandwidth requirements.
  • Interframe encoding exploits the fact that different frames of video typically include regions of pixels, normally selected as x by x blocks, that remain substantially the same.
  • a motion vector interrelates the movement of a block of pixels in one frame to a block of similar pixels in another frame. Accordingly, the system is not required to encode the block of pixels twice, but rather encodes the block of pixels once and provides a motion vector to predict the other block of pixels.
  • Intraframe encoding encodes a frame or a portion thereof without reference to pixels in other frames.
  • intraframe encoding encodes the frame, or portions thereof, on a block by block basis.
  • the intraframe encoding makes use of discrete cosine transforms of a block of pixels and subsequent encoding of the transformed coefficients.
  • Other intraframe encoding techniques exist, such as for example, wavelet encoding.
  • these techniques employ relatively large data tables for referencing prediction modes.
  • Memory for these data tables can be burdensomely expensive for many low cost machines.
  • it is also burdensomely expensive to provide sufficient memory within processing devices to store the data table.
  • the resulting system has increased complexity with the large data table.
  • FIG. 1 illustrates some forms of block adjacency
  • FIG. 2 illustrates a block of pixels and the adjacent pixels for prediction
  • FIG. 3 illustrates general prediction mode directions
  • FIG. 4 illustrates the general directions of prediction modes in an embodiment of the present invention
  • FIG. 5 illustrates the general directions of prediction modes in an embodiment of the present invention
  • FIG. 6 illustrates the general directions of prediction modes in an embodiment of the present invention
  • FIG. 7 illustrates the general directions of prediction modes in an embodiment of the present invention
  • FIG. 8 illustrates the general directions of prediction modes in an embodiment of the present invention
  • FIG. 9 is a block diagram illustrating mode estimation in some embodiments of the present invention.
  • FIG. 10 is a block diagram illustrating mode estimation in embodiments with an ordered set of prediction modes
  • FIG. 11 is a block diagram illustrating mode estimation with ordered sets associated with numerical values
  • FIG. 12 is a block diagram illustrating mode estimation options when some adjacent block data is unavailable
  • FIG. 13 is a block diagram illustrating mode order modification in some embodiments of the present invention.
  • FIG. 14 is a block diagram illustrating the methods of an embodiment of the present invention in which an estimated mode is used to modify mode order usage.
  • FIG. 15 is a block diagram illustrating the method of an embodiment of the present invention in which an estimate mode is used to modify mode order using specific designators.
  • Embodiments of the present invention comprise methods and systems related to intra-prediction of images. As all embodiments are related to intra-prediction, the terms “intra-prediction” and “prediction” may be used interchangeably to refer to intra-prediction processes.
  • Embodiments of the present invention use intraframe coding or intracoding to exploit spatial redundancies within a video image. Because adjacent blocks generally have similar attributes, the efficiency of the coding process is improved by referencing the spatial correlation between adjacent blocks. This correlation may be exploited by prediction of a target block based on prediction modes used in adjacent blocks.
  • a digital image may be divided into blocks for more efficient processing or for other reasons.
  • a target block “C” 12 may be situated adjacent to adjacent block “A,” 14 which is located immediately above target block “C” 12 .
  • Another adjacent block “B” 16 is located immediately to the left of target block “C” 12 .
  • Other blocks that share boundaries with target block “C” 12 may also be considered to be adjacent blocks to block “C” 12 .
  • Blocks may comprise various numbers of pixels in different configurations.
  • a block may comprise a 4 ⁇ 4 array of pixels.
  • a block may also comprise a 16 ⁇ 16 array of pixels or an 8 ⁇ 8 array.
  • Other pixel configurations, including both square and rectangular arrays may also make up a block.
  • Each pixel in a target block may be predicted with reference to data regarding pixels in adjacent blocks.
  • This adjacent pixel data or adjacent block data comprises the prediction modes used to predict those adjacent blocks or adjacent pixels.
  • Specific adjacent pixels and pixels within a target block may be referenced using an alphanumeric index as illustrated in FIG. 2 .
  • FIG. 2 illustrates a 4 ⁇ 4 target block, such as block “C” 12 comprising 16 pixels designated by lower case alphabetic characters 22 . Pixels in an adjacent block immediately above the target block are designated by capital alphabetic characters 24 . Pixels in an adjacent block immediately to the left of the target block are designated by capital alphabetical characters 26 . The bottom right pixel 25 in an adjacent block above and to the left of the target block 12 is designated by the capital letter “Q.”
  • Prediction modes may comprise instructions or algorithms for predicting specific pixels in a target block. These modes may refer to one or more adjacent block pixels as described in the following mode descriptions.
  • all samples A, B, C, D, I, J, K, L, are available all samples may be predicted by (A+B+C+D+I+J+K+L+4)>>3. If A, B, C, and D are not available and I, J, K, and L are available, all samples may be predicted by (I+J+K+L+2)>>2. If I, J, K, and L are not available and A, B, C, and D are available, all samples may be predicted by (A+B+C+D+2)>>2. If all eight samples are not available, the prediction for all lama samples in the block may be 128. A block may be always predicted in this mode.
  • diagonal down/left prediction a may be predicted by (A + 2B + C + I + 2J + K + 4) >> 3 b, e may be predicted by (B + 2C + D + J + 2K + L + 4) >> 3 c, f, i may be predicted by (C + 2D + E + K + 2L + M + 4) >> 3 d, g, j, m may be predicted by (D + 2E + F + L + 2M + N + 4) >> 3 h, k, n may be predicted by (E + 2F + G + M + 2N + O + 4) >> 3 l, o may be predicted by (F + 2G + H + N + 2O + P + 4) >> 3 p may be predicted by (G + H + O + P + 2) >> 2
  • diagonal down/right prediction m may be predicted by (J + 2K + L + 3) >> 2 i
  • n may be predicted by (I + 2J + K + 2) >> 2 e
  • j o may be predicted by (Q + 2I + J + 2) >> 2 a
  • f, k p may be predicted by (A + 2Q + I + 2) >> 2 b
  • g l may be predicted by (Q + 2A + B + 2) >> 2
  • h may be predicted by (A + 2B + C + 2) >> 2 d may be predicted by (B + 2C + D + 2) >> 2
  • Mode 5 vertical-left prediction a, j may be predicted by (Q + A + 1) >> 1 b, k, may be predicted by (A + B + 1) >> 1 c, l may be predicted by (B + C + 1) >> 1 d may be predicted by (C + D + 1) >> 1 e, n may be predicted by (I + 2Q + A + 2) >> 2 f, o may be predicted by (Q + 2A + B + 2) >> 2 g, p may be predicted by (A + 2B + C + 2) >> 2 h may be predicted by (B + 2C + D + 2) >> 2 i may be predicted by (Q + 2I + J + 2) >> 2 m may be predicted by (I + 2J + K + 2) >> 2
  • Mode 6 horizontal-down prediction a, g may be predicted by (Q + I + 1) >> 1 b, h may be predicted by (I + 2Q + A + 2) >> 2 c may be predicted by (Q + 2A + B + 2) >> 2 d may be predicted by (A + 2B + C + 2) >> 2 e, k may be predicted by (I + J + 1) >> 1 f, l may be predicted by (Q + 2I + J + 2) >> 2 i, o may be predicted by (J + K + 1) >> 1 j, p may be predicted by (I + 2J + K + 2) >> 2 m may be predicted by (K + L + 1) >> 1 n may be predicted by (J + 2K + L + 2) >> 2
  • vertical-right prediction a may be predicted by (2A + 2B + J + 2K + L + 4) >> 3 b, i may be predicted by (B + C + 1) >> 1 c, j may be predicted by (C + D + 1) >> 1 d, k may be predicted by (D + E + 1) >> 1 l may be predicted by (E + F + 1) >> 1 e may be predicted by (A + 2B + C + K + 2L + M + 4) >> 3 f, m may be predicted by (B + 2C + D + 2) >> 2 g, n may be predicted by (C + 2D + E + 2) >> 2 h, o may be predicted by (D + 2E + F + 2) >> 2 p may be predicted by (E + 2F + G + 2) >> 2
  • horizontal-up prediction a may be predicted by (B + 2C + D + 2I + 2J + 4) >> 3 b may be predicted by (C + 2D + E + I + 2J + K + 4) >> 3 c, e may be predicted by (J + K + 1) >> 1 d, f may be predicted by (J + 2K + L + 2) >> 2 g, i may be predicted by (K + L + 1) >> 1 h, j may be predicted by (K + 2L + M + 2) >> 2 l, n may be predicted by (L + 2M + N + 2) >> 2 k, m may be predicted by (L + M + 1) >> 1 o may be predicted by (M + N + 1) >> 1 p may be predicted by (M + 2N + O + 2) >> 2
  • the ordering process which is based upon the likelihood of producing a lesser prediction error for each of the modes, increases the coding efficiently, reduces the memory requirements, and may be at least partially mathematically defined.
  • Each prediction mode may be described by a general direction of prediction as described verbally in each of the mode titles above (i.e., horizontal up, vertical and diagonal down left).
  • a prediction mode may also be described graphically by an angular direction. This angular direction may be expressed through a diagram with arrows radiating outward from a center point as shown in FIG. 3 . In this type of diagram, each arrow and the center point may represent a prediction mode.
  • the angle corresponding to a prediction mode has a general relationship to the direction from the weighted average location of the adjacent pixels used to predict the target pixel to the actual target pixel location.
  • the modes are more precisely defined in the definitions above and in the NT standard. In FIG.
  • the center point 32 represents no direction so this point may be associated with a DC prediction mode.
  • a horizontal arrow 34 may represent a horizontal prediction mode.
  • a vertical arrow 36 may represent a vertical prediction mode.
  • An arrow extending from the center point diagonally downward to the right at approximately a 45 degree angle from horizontal 38 may represent a Diagonal Down/Right (DDR) prediction mode.
  • An arrow extending from the center point diagonally downward to the left at approximately a 45 degree angle from horizontal 40 may represent a Diagonal Down/Left (DDL) prediction mode.
  • DDR Diagonal Down/Right
  • DDL Diagonal Down/Left
  • An arrow extending from the center point diagonally upward to the right at approximately a 22.5 degree angle from horizontal 42 may represent a Horizontal Up (HU) prediction mode.
  • An arrow extending from the center point diagonally downward to the right at approximately a 22.5 degree angle from horizontal 44 may represent a Horizontal Down (HD) prediction mode.
  • An arrow extending from the center point diagonally downward to the right at approximately a 67.5 degree angle from horizontal 46 may represent a Vertical Right (VR) prediction mode.
  • An arrow extending from the center point diagonally downward to the left at approximately a 67.5 degree angle from horizontal 48 may represent a Vertical Left (VL) prediction mode.
  • the HU, HD, VR and VL prediction modes may be referred to collectively as intermediate angle prediction modes.
  • prediction modes may be ordered in a manner generally consistent with their likelihood of producing a reduced prediction error. With the prediction modes ordered according to their general likelihood of producing a lesser prediction error, the resulting data itself may have a greater tendency to be more consistently ordered. Furthermore, communication of modes may take advantage of coding techniques that reduce memory and bandwidth requirements. For example, the present inventors determined that the horizontal prediction mode and the vertical prediction mode are generally more likely than diagonal prediction modes, which are generally more likely than intermediate angle prediction modes. In addition, a DC prediction mode (e.g., when an adjacent block is coded in inter mode) is generally less likely than horizontal and vertical prediction modes and generally more likely than diagonal prediction modes.
  • the order established in some embodiments of the present invention may be expressed, in general terms, as follows: vertical and horizontal prediction modes are more likely to produce a reduced prediction error than a DC prediction mode and that a DC prediction mode is more likely to produce a reduced prediction error than diagonal prediction modes and that diagonal prediction modes are more likely to produce a reduced prediction error than intermediate angle prediction modes.
  • DC prediction mode is more likely to produce a reduced prediction error than vertical and horizontal prediction modes and vertical and horizontal prediction modes are more likely to produce a reduced prediction error than diagonal prediction modes and that diagonal prediction modes are more likely to produce a reduced prediction error than intermediate angle prediction modes.
  • modes may be defined in order as follows:
  • modes may be defined in order as follows:
  • modes may be defined in order as follows:
  • modes may be defined in order as follows:
  • modes may be defined in order as follows:
  • the horizontal prediction (mode 0) and the vertical prediction (mode 1) may be reversed, if desired.
  • the diagonal down/left prediction mode and the horizontal down prediction mode may be reversed, if desired.
  • the diagonal down/right prediction (mode 5), the vertical right prediction (mode 6), the vertical left prediction (mode 7), and the horizontal up prediction (mode 8) may be reordered, if desired.
  • the DC prediction is between the mode 0/mode 1 set and the mode 3/mode 4 set, but may be located between mode 3/mode 4 set and mode 5/mode 6/mode 7/mode 8 set, if desired, or any other location.
  • the angled modes 3-8 may be renumbered as desired without significant impact on the encoding efficiency.
  • the prediction modes may be reordered for all of the respective blocks (e.g., those blocks using the described prediction scheme) using such a prediction basis.
  • less than all of the respective blocks e.g., those blocks using the described prediction scheme
  • the order of the prediction modes may be the same or varied for different blocks.
  • the reordering of each of the modes of such a prediction basis is preferably at least 5 modes, 6 modes, or 7 modes, with the remaining being ordered in any other manner.
  • the order of the prediction modes is preferably 0, 1, 2, 3, 4, 5, 6, 7, 8. Other predefined ordering of the prediction modes may likewise be employed.
  • Some embodiments of the present invention may comprise one or more data tables for organization of mode data. With the modes being generally arranged in an ordered manner, this may be used together with each cell in a data table, to provide a more ordered set.
  • each entry in the data table may include the ordered set of numbers 0, 1, 2, 3, 4, 5, 6, 7, 8, and 9.
  • the ordered set of numbers in the data table may include 5, 6, 7, 8, or 9 sets of ordered numbers for each entry in the data table.
  • the data table entries may include the following sets of data entries ⁇ 1, 2, 3, 5, 7 ⁇ ; ⁇ 0, 1, 2, 3, 4, 5, 6 ⁇ ; ⁇ 0, 1, 3, 5, 6, 7, 8 ⁇ , where each of the numbers in the set are of increasing numerical value.
  • the data table entries may include the following sets of data entries ⁇ 1, 2, 3, 5, 7 ⁇ ; ⁇ 0, 1, 2, 3, 4, 5, 6 ⁇ ; ⁇ 0, 1, 3, 5, 6, 7, 8 ⁇ , where each set is included in at least 25%, or 35%, or 50%, or 75%, or 90%, or more, of the cells.
  • the table will have significantly more predictability than known data table methods, which decreases memory requirements.
  • the predetermined manner of the ordering of the sets of data entries should be independent of the prediction modes of adjoining sets of pixels (e.g. macroblocks).
  • the data table may be “static” in nature or may be effectively dynamically generated, in whole or in part, when needed based upon patterns in the data. Accordingly, a mathematical equation or an algorithm may be used to determine the entries, which in this case the “table” could be created by such a technique. Accordingly, a “data table” as used herein is not merely restricted to a static table, but further includes such a set of values, however determined, that are used for such prediction.
  • the most likely prediction mode should be ordered first, the second most likely prediction mode ordered second, if desired, followed by the remaining modes in a predetermined manner.
  • the predetermined manner should be independent of the prediction modes of adjoining macroblocks.
  • the preferred order of the remaining modes should be in a decreasing likelihood of occurrence of the remaining modes (most likely prediction mode, and if desired, second most likely prediction mode).
  • the intra prediction mode order for block C may be defined as follows:
  • block A is “outside” (e.g., not available) and block B is not “outside”, only DC prediction (mode 2) and horizontal prediction (mode 0) are permitted for block C, therefore;
  • intra prediction mode order for block C is ⁇ 1, 3, 0, 2, 4, 5, 6, 7, 8 ⁇ .
  • the modes arranged in a generally decreasing likelihood (or increasing) of occurrence then the automatic arrangement of the remaining modes of occurrence will still be generally arranged in the proper sequence.
  • the ordering of the sequence from higher to lower probability increases the likelihood of the proper prediction toward the front. With entropy encoding this decreases the resulting encoded bit stream.
  • Other arrangements may likewise be used.
  • the aforementioned selection scheme is based upon the principle that if the prediction of block A is X and the prediction of block B is Y, then it is likely the prediction of block C is X or Y.
  • the prediction for X and/or Y is located at the start of the list and the remaining modes are sequentially listed thereafter.
  • the most probable mode of C is given, namely, the minimum of the modes used for blocks A and B. If one of the blocks A or B is “outside” the most probable mode is equal to prediction mode 2.
  • the ordering of prediction modes assigned to blocks C is therefore the most probable mode followed by the remaining modes in the ascending order.
  • Embodiments of the present invention may be described with reference to FIG. 9 .
  • a target block is selected 50 for prediction.
  • a prediction mode used for prediction of a first adjacent block, which is immediately adjacent to said target block, is then determined 52 .
  • a prediction mode used for prediction of a second adjacent block, which is also adjacent to said target block is also determined 54 .
  • These adjacent block prediction modes are then examined 56 to determine which is more likely to produce a lesser prediction error.
  • a set of prediction modes is ordered 58 according to the modes' likelihood of producing a lesser prediction error.
  • a target block is selected 60 .
  • the prediction mode used for a first adjacent block is determined 62 and the prediction mode used for a second adjacent block is also determined 64 .
  • These two prediction modes are then examined 66 to determine which occurs first in the ordered set of modes thereby corresponding to the mode with the higher likelihood of producing a lesser prediction error.
  • a set of prediction modes is ordered 68 by likelihood of producing a lesser prediction error. These modes in the ordered set are then associated 70 with numerical values such that modes with a higher likelihood of producing a lesser prediction error are associated with lower numerical values.
  • the mode used to predict a first adjacent block is then determined 72 and the mode used to predict a second adjacent block is also determined 74 . These adjacent block modes are then examined to determine which mode is associated with a lower numerical value. This mode is designated as the estimated mode for prediction of the target block.
  • a set of prediction modes is ordered 78 by likelihood of producing a lesser prediction error. These modes in the ordered set are then associated 80 with numerical values such that modes with a higher likelihood of producing a lesser prediction error are associated with lower numerical values.
  • An attempt 82 is made to determine the mode used to predict a first adjacent block and an attempt 84 is made to determine the mode used to predict a second adjacent block. If the prediction mode used to predict the first adjacent block is not available 86 , a default prediction mode, such as a DC prediction mode, may be designated 90 as an estimated prediction mode for the target block.
  • a default prediction mode such as a DC prediction mode
  • this adjacent block modes may be examined to determine which mode is associated with a lower numerical value. This mode is then designated 92 as the estimated mode for prediction of the target block.
  • the prediction mode orders described above which have been determined independently of the adjacent block data, may be modified with adjacent block data.
  • Prediction mode estimates determined with reference to adjacent block data can be inserted into prediction mode orders to modify the orders to reflect the additional information obtained from adjacent block data.
  • a prediction mode estimate based on adjacent block data, can be inserted directly into a prediction mode order set.
  • the prediction mode estimate will be inserted or prepended at the front of the prediction mode order at the position of the mode most likely to produce a reduced prediction error.
  • the estimate may be inserted at different positions in the mode order.
  • a prediction mode order is selected 102 wherein the prediction mode order elements may be arranged according to their likelihood of producing a lesser prediction error.
  • the first element in the order represents the prediction mode most likely to yield a lesser prediction error
  • the next element in the order represents the prediction mode that is the next most likely to yield a lesser prediction error and so on to the last element in the order, which represents the prediction mode in the order that is least likely to yield a lesser prediction error.
  • a prediction mode estimate is also determined 104 , as described above. This estimate is determined using adjacent block data. Generally, the estimate is the prediction mode used in one or more adjacent blocks that is likely to yield a lesser prediction error. However, the estimate may be determined in other ways. When sufficient adjacent block prediction mode data is not available, such as at an image edge or a slice boundary, a prediction mode for the target block may be estimated based on the lack of one or more adjacent blocks or their prediction mode data. In many cases, a DC prediction mode will be estimated when adjacent block data is limited or unavailable.
  • the estimated prediction mode may be placed 106 into the mode order as the mode most likely to yield a lesser prediction error. In some embodiments, this will be the first mode in the order or the mode associated with the lowest numerical value.
  • the estimated prediction mode may take precedence over the pre-selected mode order.
  • a pre-selected mode order is designated 110 at the encoder and the decoder. This order comprises a set of prediction modes arranged in order of likelihood of yielding a lesser prediction error or some other order.
  • An estimated prediction mode is also determined 112 based on adjacent block data. This estimated prediction mode is determined at the encoder and the decoder according to the same algorithm or method.
  • the encoder also determines the actual best prediction mode 114 for predicting a pixel based on motion vectors or other known techniques. The encoder may, then, compare 116 the actual best prediction mode to the estimated prediction mode to determine whether they are the same.
  • the encoder may signal to the decoder that the estimated prediction mode is to be used 118 .
  • this estimated prediction mode signal may be performed with a 1-bit flag to signify whether the estimated mode is to be used on not.
  • the encoder may signal to the decoder that another mode may be used 120 . This may be performed by reference to the pre-established mode order. The encoder may determine which mode in the mode order is most equivalent to the actual best prediction mode and signal the decoder to use that mode.
  • the set order may be rearranged once further data is obtained.
  • an ordered set of prediction modes may be reordered when an estimated prediction mode is determined or when a best actual prediction mode is determined.
  • the modifying mode may be interjected into the ordered set, placed ahead of the ordered set or, in some cases, removed from the ordered set.
  • each mode in the mode order may be associated with a numerical value according to the order.
  • the numerical value associated with the mode to be used may be sent to the decoder to signal the decoder to use that prediction mode.
  • a mode order comprising 9 prediction modes may be selected 130 .
  • An estimated prediction mode based on adjacent block data, and which is one of the 9 modes in the order, may also be determined 132 .
  • a best prediction mode may also be determined 134 by motion vector methods or other methods. The best prediction mode may then be compared to the estimated prediction mode 136 .
  • the decoder may be signaled with a 1-bit designator to use the estimated prediction mode, which is already identified at the decoder. If the estimated prediction mode is not equivalent to the best prediction mode, the estimated prediction mode is essentially eliminated from the mode order 140 . This elimination may be performed by re-ordering the set, skipping the estimated mode in the order or by other means. The remaining order will effectively comprise 8 modes, which can be represented by a 3-bit designator. This 3-bit designator may be sent to the decoder 142 to designate which mode to use for prediction.

Abstract

Embodiments of the present invention relate to methods and systems for estimating a pixel prediction mode to be used in a video encoding or decoding process.

Description

RELATED REFERENCES
This application claims the benefits of U.S. patent application Ser. No. 60/319,272 filed May 28, 2002 and U.S. patent application Ser. No. 60/319,390 filed Jul. 11, 2002.
BACKGROUND
Embodiments of the present invention relate to intra-prediction for an image.
Digital video requires a large amount of data to represent each and every frame of a digital video sequence (e.g., series of frames) in an uncompressed manner. It is not feasible for most applications to transmit uncompressed digital video across computer networks because of bandwidth limitations. In addition, uncompressed digital video requires a large amount of storage space. The digital video is normally encoded in some manner to reduce the storage requirements and reduce the bandwidth requirements.
One technique for encoding digital video is interframe encoding. Interframe encoding exploits the fact that different frames of video typically include regions of pixels, normally selected as x by x blocks, that remain substantially the same. During the encoding process a motion vector interrelates the movement of a block of pixels in one frame to a block of similar pixels in another frame. Accordingly, the system is not required to encode the block of pixels twice, but rather encodes the block of pixels once and provides a motion vector to predict the other block of pixels.
Another technique for encoding digital video is intraframe encoding. Intraframe encoding encodes a frame or a portion thereof without reference to pixels in other frames. Typically intraframe encoding encodes the frame, or portions thereof, on a block by block basis. For example, in MEPG-2 the intraframe encoding makes use of discrete cosine transforms of a block of pixels and subsequent encoding of the transformed coefficients. Other intraframe encoding techniques exist, such as for example, wavelet encoding.
In general, these techniques employ relatively large data tables for referencing prediction modes. Memory for these data tables can be burdensomely expensive for many low cost machines. Moreover, it is also burdensomely expensive to provide sufficient memory within processing devices to store the data table. Also, the resulting system has increased complexity with the large data table.
BRIEF DESCRIPTION OF THE DRAWINGS
The following drawings depict only typical embodiments of the present invention and are not therefore to be considered to be limiting of its scope, the invention will be described and explained with additional specificity and detail through the use of the accompanying drawings in which:
FIG. 1 illustrates some forms of block adjacency;
FIG. 2 illustrates a block of pixels and the adjacent pixels for prediction;
FIG. 3 illustrates general prediction mode directions;
FIG. 4 illustrates the general directions of prediction modes in an embodiment of the present invention;
FIG. 5 illustrates the general directions of prediction modes in an embodiment of the present invention;
FIG. 6 illustrates the general directions of prediction modes in an embodiment of the present invention;
FIG. 7 illustrates the general directions of prediction modes in an embodiment of the present invention;
FIG. 8 illustrates the general directions of prediction modes in an embodiment of the present invention;
FIG. 9 is a block diagram illustrating mode estimation in some embodiments of the present invention
FIG. 10 is a block diagram illustrating mode estimation in embodiments with an ordered set of prediction modes;
FIG. 11 is a block diagram illustrating mode estimation with ordered sets associated with numerical values;
FIG. 12 is a block diagram illustrating mode estimation options when some adjacent block data is unavailable;
FIG. 13 is a block diagram illustrating mode order modification in some embodiments of the present invention;
FIG. 14 is a block diagram illustrating the methods of an embodiment of the present invention in which an estimated mode is used to modify mode order usage; and
FIG. 15 is a block diagram illustrating the method of an embodiment of the present invention in which an estimate mode is used to modify mode order using specific designators.
DETAILED DESCRIPTION
Embodiments of the present invention comprise methods and systems related to intra-prediction of images. As all embodiments are related to intra-prediction, the terms “intra-prediction” and “prediction” may be used interchangeably to refer to intra-prediction processes.
Embodiments of the present invention use intraframe coding or intracoding to exploit spatial redundancies within a video image. Because adjacent blocks generally have similar attributes, the efficiency of the coding process is improved by referencing the spatial correlation between adjacent blocks. This correlation may be exploited by prediction of a target block based on prediction modes used in adjacent blocks.
A digital image may be divided into blocks for more efficient processing or for other reasons. As illustrated in FIG. 1, a target block “C” 12 may be situated adjacent to adjacent block “A,” 14 which is located immediately above target block “C” 12. Another adjacent block “B” 16 is located immediately to the left of target block “C” 12. Other blocks that share boundaries with target block “C” 12 may also be considered to be adjacent blocks to block “C” 12.
Blocks may comprise various numbers of pixels in different configurations. For example, a block may comprise a 4×4 array of pixels. A block may also comprise a 16×16 array of pixels or an 8×8 array. Other pixel configurations, including both square and rectangular arrays may also make up a block.
Each pixel in a target block may be predicted with reference to data regarding pixels in adjacent blocks. This adjacent pixel data or adjacent block data comprises the prediction modes used to predict those adjacent blocks or adjacent pixels. Specific adjacent pixels and pixels within a target block may be referenced using an alphanumeric index as illustrated in FIG. 2. FIG. 2 illustrates a 4×4 target block, such as block “C” 12 comprising 16 pixels designated by lower case alphabetic characters 22. Pixels in an adjacent block immediately above the target block are designated by capital alphabetic characters 24. Pixels in an adjacent block immediately to the left of the target block are designated by capital alphabetical characters 26. The bottom right pixel 25 in an adjacent block above and to the left of the target block 12 is designated by the capital letter “Q.”
Prediction modes may comprise instructions or algorithms for predicting specific pixels in a target block. These modes may refer to one or more adjacent block pixels as described in the following mode descriptions.
Prediction Modes
    • Mode 0: vertical prediction
    • a, e, i, m may be predicted by A
    • b, f, j, n, may be predicted by B,
    • c, g, k, o, may be predicted by C
    • d, j, l, p may be predicted by D
    • Mode 1: horizontal prediction
    • a, b, c, d, may be predicted by I
    • e, f, g, h, may be predicted by J
    • i, j, k, l, may be predicted by K
    • m, n, o, p, may be predicted by L
    • Mode 2: DC prediction
If all samples A, B, C, D, I, J, K, L, are available, all samples may be predicted by (A+B+C+D+I+J+K+L+4)>>3. If A, B, C, and D are not available and I, J, K, and L are available, all samples may be predicted by (I+J+K+L+2)>>2. If I, J, K, and L are not available and A, B, C, and D are available, all samples may be predicted by (A+B+C+D+2)>>2. If all eight samples are not available, the prediction for all lama samples in the block may be 128. A block may be always predicted in this mode.
Mode 3: diagonal down/left prediction
a may be predicted by (A + 2B + C + I + 2J + K + 4) >> 3
b, e may be predicted by (B + 2C + D + J + 2K + L + 4) >> 3
c, f, i may be predicted by (C + 2D + E + K + 2L + M + 4) >> 3
d, g, j, m may be predicted by (D + 2E + F + L + 2M + N + 4) >> 3
h, k, n may be predicted by (E + 2F + G + M + 2N + O + 4) >> 3
l, o may be predicted by (F + 2G + H + N + 2O + P + 4) >> 3
p may be predicted by (G + H + O + P + 2) >> 2
Mode 4: diagonal down/right prediction
m may be predicted by (J + 2K + L + 3) >> 2
i, n may be predicted by (I + 2J + K + 2) >> 2
e, j, o may be predicted by (Q + 2I + J + 2) >> 2
a, f, k, p may be predicted by (A + 2Q + I + 2) >> 2
b, g, l may be predicted by (Q + 2A + B + 2) >> 2
c, h may be predicted by (A + 2B + C + 2) >> 2
d may be predicted by (B + 2C + D + 2) >> 2
Mode 5: vertical-left prediction
a, j may be predicted by (Q + A + 1) >> 1
b, k, may be predicted by (A + B + 1) >> 1
c, l may be predicted by (B + C + 1) >> 1
d may be predicted by (C + D + 1) >> 1
e, n may be predicted by (I + 2Q + A + 2) >> 2
f, o may be predicted by (Q + 2A + B + 2) >> 2
g, p may be predicted by (A + 2B + C + 2) >> 2
h may be predicted by (B + 2C + D + 2) >> 2
i may be predicted by (Q + 2I + J + 2) >> 2
m may be predicted by (I + 2J + K + 2) >> 2
Mode 6: horizontal-down prediction
a, g may be predicted by (Q + I + 1) >> 1
b, h may be predicted by (I + 2Q + A + 2) >> 2
c may be predicted by (Q + 2A + B + 2) >> 2
d may be predicted by (A + 2B + C + 2) >> 2
e, k may be predicted by (I + J + 1) >> 1
f, l may be predicted by (Q + 2I + J + 2) >> 2
i, o may be predicted by (J + K + 1) >> 1
j, p may be predicted by (I + 2J + K + 2) >> 2
m may be predicted by (K + L + 1) >> 1
n may be predicted by (J + 2K + L + 2) >> 2
Mode 7: vertical-right prediction
a may be predicted by (2A + 2B + J + 2K + L + 4) >> 3
b, i may be predicted by (B + C + 1) >> 1
c, j may be predicted by (C + D + 1) >> 1
d, k may be predicted by (D + E + 1) >> 1
l may be predicted by (E + F + 1) >> 1
e may be predicted by (A + 2B + C + K + 2L + M + 4) >> 3
f, m may be predicted by (B + 2C + D + 2) >> 2
g, n may be predicted by (C + 2D + E + 2) >> 2
h, o may be predicted by (D + 2E + F + 2) >> 2
p may be predicted by (E + 2F + G + 2) >> 2
Mode 8: horizontal-up prediction
a may be predicted by (B + 2C + D + 2I + 2J + 4) >> 3
b may be predicted by (C + 2D + E + I + 2J + K + 4) >> 3
c, e may be predicted by (J + K + 1) >> 1
d, f may be predicted by (J + 2K + L + 2) >> 2
g, i may be predicted by (K + L + 1) >> 1
h, j may be predicted by (K + 2L + M + 2) >> 2
l, n may be predicted by (L + 2M + N + 2) >> 2
k, m may be predicted by (L + M + 1) >> 1
o may be predicted by (M + N + 1) >> 1
p may be predicted by (M + 2N + O + 2) >> 2
The ordering process, which is based upon the likelihood of producing a lesser prediction error for each of the modes, increases the coding efficiently, reduces the memory requirements, and may be at least partially mathematically defined.
Each prediction mode may be described by a general direction of prediction as described verbally in each of the mode titles above (i.e., horizontal up, vertical and diagonal down left). A prediction mode may also be described graphically by an angular direction. This angular direction may be expressed through a diagram with arrows radiating outward from a center point as shown in FIG. 3. In this type of diagram, each arrow and the center point may represent a prediction mode. The angle corresponding to a prediction mode has a general relationship to the direction from the weighted average location of the adjacent pixels used to predict the target pixel to the actual target pixel location. However, the modes are more precisely defined in the definitions above and in the NT standard. In FIG. 3, the center point 32 represents no direction so this point may be associated with a DC prediction mode. A horizontal arrow 34 may represent a horizontal prediction mode. A vertical arrow 36 may represent a vertical prediction mode. An arrow extending from the center point diagonally downward to the right at approximately a 45 degree angle from horizontal 38 may represent a Diagonal Down/Right (DDR) prediction mode. An arrow extending from the center point diagonally downward to the left at approximately a 45 degree angle from horizontal 40 may represent a Diagonal Down/Left (DDL) prediction mode. Both the DDR and DDL prediction modes may be referred to as diagonal prediction modes.
An arrow extending from the center point diagonally upward to the right at approximately a 22.5 degree angle from horizontal 42 may represent a Horizontal Up (HU) prediction mode. An arrow extending from the center point diagonally downward to the right at approximately a 22.5 degree angle from horizontal 44 may represent a Horizontal Down (HD) prediction mode. An arrow extending from the center point diagonally downward to the right at approximately a 67.5 degree angle from horizontal 46 may represent a Vertical Right (VR) prediction mode. An arrow extending from the center point diagonally downward to the left at approximately a 67.5 degree angle from horizontal 48 may represent a Vertical Left (VL) prediction mode. The HU, HD, VR and VL prediction modes may be referred to collectively as intermediate angle prediction modes.
Many other prediction modes may be created and described using this angular description scheme.
Prediction Mode Order
The present inventors have determined that prediction modes may be ordered in a manner generally consistent with their likelihood of producing a reduced prediction error. With the prediction modes ordered according to their general likelihood of producing a lesser prediction error, the resulting data itself may have a greater tendency to be more consistently ordered. Furthermore, communication of modes may take advantage of coding techniques that reduce memory and bandwidth requirements. For example, the present inventors determined that the horizontal prediction mode and the vertical prediction mode are generally more likely than diagonal prediction modes, which are generally more likely than intermediate angle prediction modes. In addition, a DC prediction mode (e.g., when an adjacent block is coded in inter mode) is generally less likely than horizontal and vertical prediction modes and generally more likely than diagonal prediction modes.
For blocks that do not border discontinuities such as image edges or swipe/swath boundaries, the order established in some embodiments of the present invention may be expressed, in general terms, as follows: vertical and horizontal prediction modes are more likely to produce a reduced prediction error than a DC prediction mode and that a DC prediction mode is more likely to produce a reduced prediction error than diagonal prediction modes and that diagonal prediction modes are more likely to produce a reduced prediction error than intermediate angle prediction modes.
For blocks near edges or boundaries or where adjacent block or pixel prediction mode data is not available, the order established in some embodiments of the present invention may be expressed, in general terms, as follows: DC prediction mode is more likely to produce a reduced prediction error than vertical and horizontal prediction modes and vertical and horizontal prediction modes are more likely to produce a reduced prediction error than diagonal prediction modes and that diagonal prediction modes are more likely to produce a reduced prediction error than intermediate angle prediction modes.
In a first set of embodiments as illustrated in FIG. 4, modes may be defined in order as follows:
    • Mode 0: Vertical prediction
    • Mode 1: Horizontal prediction
    • Mode 2: DC prediction
    • Mode 3: Diagonal Down/Left prediction
    • Mode 4: Diagonal Down/Right prediction
    • Mode 5: Horizontal Down prediction
    • Mode 6: Vertical Right prediction
    • Mode 7: Vertical Left prediction
    • Mode 8: Horizontal Up prediction
In a second set of embodiments, as illustrated in FIG. 5, modes may be defined in order as follows:
    • Mode 0: Horizontal prediction
    • Mode 1: Vertical prediction
    • Mode 2: DC prediction
    • Mode 3: Diagonal Down/Left prediction
    • Mode 4: Diagonal Down/Right prediction
    • Mode 5: Horizontal Down prediction
    • Mode 6: Vertical Right prediction
    • Mode 7: Vertical Left prediction
    • Mode 8: Horizontal Up prediction
In a third set of embodiments, as illustrate in FIG. 6, modes may be defined in order as follows:
    • Mode 0: Vertical prediction
    • Mode 1: Horizontal prediction
    • Mode 2: DC prediction
    • Mode 3: Diagonal Down/Left prediction
    • Mode 4: Diagonal Down/Right prediction
    • Mode 5: Vertical Right prediction
    • Mode 6: Horizontal Down prediction
    • Mode 7: Vertical Left prediction
    • Mode 8: Horizontal Up prediction
In a fourth set of embodiments, as illustrated in FIG. 7, modes may be defined in order as follows:
    • Mode 0: Horizontal prediction
    • Mode 1: Vertical prediction
    • Mode 2: DC prediction
    • Mode 3: Diagonal Down/Left prediction
    • Mode 4: Diagonal Down/Right prediction
    • Mode 5: Vertical Right prediction
    • Mode 6: Horizontal Down prediction
    • Mode 7: Vertical Left prediction
    • Mode 8: Horizontal Up prediction
In a fifth set of embodiments, as illustrated in FIG. 8, modes may be defined in order as follows:
    • Mode 0: DC prediction
    • Mode 1: Vertical prediction
    • Mode 2: Horizontal prediction
    • Mode 3: Diagonal Down/Left prediction
    • Mode 4: Diagonal Down/Right prediction
    • Mode 5: Vertical Right prediction
    • Mode 6: Horizontal Down prediction
    • Mode 7: Vertical Left prediction
    • Mode 8: Horizontal Up prediction
It should be noted that the mode order may vary beyond these exemplary orders in various other embodiments of the present invention.
In some embodiments of the present invention, the horizontal prediction (mode 0) and the vertical prediction (mode 1) may be reversed, if desired. Also, it is to be understood that the diagonal down/left prediction mode and the horizontal down prediction mode may be reversed, if desired. In addition, it is to be understood the diagonal down/right prediction (mode 5), the vertical right prediction (mode 6), the vertical left prediction (mode 7), and the horizontal up prediction (mode 8) may be reordered, if desired. Further, it is desirable that the DC prediction is between the mode 0/mode 1 set and the mode 3/mode 4 set, but may be located between mode 3/mode 4 set and mode 5/mode 6/mode 7/mode 8 set, if desired, or any other location. Moreover, the angled modes 3-8 may be renumbered as desired without significant impact on the encoding efficiency.
In some embodiments of the present invention, the prediction modes may be reordered for all of the respective blocks (e.g., those blocks using the described prediction scheme) using such a prediction basis. In addition, less than all of the respective blocks (e.g., those blocks using the described prediction scheme) may use such a prediction basis, for example, more than 50%, 75% or 90%, if desired. Also, the order of the prediction modes may be the same or varied for different blocks. Further, the reordering of each of the modes of such a prediction basis (e.g., in a predetermined consistent manner) is preferably at least 5 modes, 6 modes, or 7 modes, with the remaining being ordered in any other manner. In addition, the order of the prediction modes is preferably 0, 1, 2, 3, 4, 5, 6, 7, 8. Other predefined ordering of the prediction modes may likewise be employed.
Some embodiments of the present invention may comprise one or more data tables for organization of mode data. With the modes being generally arranged in an ordered manner, this may be used together with each cell in a data table, to provide a more ordered set. For example, each entry in the data table may include the ordered set of numbers 0, 1, 2, 3, 4, 5, 6, 7, 8, and 9. Alternatively, the ordered set of numbers in the data table may include 5, 6, 7, 8, or 9 sets of ordered numbers for each entry in the data table. For example, the data table entries may include the following sets of data entries {1, 2, 3, 5, 7}; {0, 1, 2, 3, 4, 5, 6}; {0, 1, 3, 5, 6, 7, 8}, where each of the numbers in the set are of increasing numerical value. Alternatively for example, the data table entries may include the following sets of data entries {1, 2, 3, 5, 7}; {0, 1, 2, 3, 4, 5, 6}; {0, 1, 3, 5, 6, 7, 8}, where each set is included in at least 25%, or 35%, or 50%, or 75%, or 90%, or more, of the cells. In this manner, the table will have significantly more predictability than known data table methods, which decreases memory requirements.
The predetermined manner of the ordering of the sets of data entries should be independent of the prediction modes of adjoining sets of pixels (e.g. macroblocks). It is to be understood that the data table may be “static” in nature or may be effectively dynamically generated, in whole or in part, when needed based upon patterns in the data. Accordingly, a mathematical equation or an algorithm may be used to determine the entries, which in this case the “table” could be created by such a technique. Accordingly, a “data table” as used herein is not merely restricted to a static table, but further includes such a set of values, however determined, that are used for such prediction.
Unfortunately, the substitution of the previous mode numbers with the new mode numbers (e.g., a substitution of numbers into the cells of known data tables), while perhaps an improvement, still results in a generally unordered set of data.
Estimating a Pixel Prediction Mode Based on Adjacent Block Data
In contrast to the generally unordered set of data shown, even with substitutions, the present inventors came to the further realization that the most likely prediction mode should be ordered first, the second most likely prediction mode ordered second, if desired, followed by the remaining modes in a predetermined manner. The predetermined manner should be independent of the prediction modes of adjoining macroblocks. The preferred order of the remaining modes should be in a decreasing likelihood of occurrence of the remaining modes (most likely prediction mode, and if desired, second most likely prediction mode).
Based on the intra prediction modes of block A and block B, as shown in FIG. 1, the intra prediction mode order for block C may be defined as follows:
(1) If both block A and block B are “outside” (e.g., not available), only DC prediction (mode 2) is permitted, therefore the intra prediction mode order for block C is {2}.
(2) If block A is “outside” (e.g., not available) and block B is not “outside”, only DC prediction (mode 2) and horizontal prediction (mode 0) are permitted for block C, therefore;
    • (i) if block B is 2, intra prediction mode order for block C is {2, 0};
    • (ii) otherwise, intra prediction mode order for block C is {0, 2}.
(3) If block A is not “outside” but block B is “outside”, only DC prediction (mode 2) and vertical prediction (mode 1) are permitted for block C, therefore
    • (i) if block A is 2, intra prediction mode order for block C is {2, 1};
    • (ii) otherwise, intra prediction mode order for block C is {1, 2}.
(4) If neither block A nor block B is “outside”,
    • (i) if the prediction mode of block A is less than the prediction mode of block B, then intra prediction mode order for block C is {intra prediction block mode A, intra prediction block mode B, other modes in ascending order};
    • (ii) if the prediction mode of block A is greater than the prediction mode of block B, then intra prediction mode order for block C is {intra prediction block mode B, intra prediction block mode A, other modes in ascending order};
    • (iii) if the prediction mode of block A equals the prediction mode of block B, then intra prediction mode order for block C is {intra prediction block mode A, other modes in ascending order}.
For example, if the prediction mode of block A is 3 and the prediction mode of block B is 1, then intra prediction mode order for block C is {1, 3, 0, 2, 4, 5, 6, 7, 8}. With the modes arranged in a generally decreasing likelihood (or increasing) of occurrence, then the automatic arrangement of the remaining modes of occurrence will still be generally arranged in the proper sequence. The ordering of the sequence from higher to lower probability increases the likelihood of the proper prediction toward the front. With entropy encoding this decreases the resulting encoded bit stream. Other arrangements may likewise be used.
Conceptually the aforementioned selection scheme is based upon the principle that if the prediction of block A is X and the prediction of block B is Y, then it is likely the prediction of block C is X or Y. The prediction for X and/or Y is located at the start of the list and the remaining modes are sequentially listed thereafter.
Stated another way, when the prediction modes of A and B are known (including the case that A or B or both are outside the slice) the most probable mode of C is given, namely, the minimum of the modes used for blocks A and B. If one of the blocks A or B is “outside” the most probable mode is equal to prediction mode 2. The ordering of prediction modes assigned to blocks C is therefore the most probable mode followed by the remaining modes in the ascending order.
Embodiments of the present invention may be described with reference to FIG. 9. In these embodiments, a target block is selected 50 for prediction. A prediction mode used for prediction of a first adjacent block, which is immediately adjacent to said target block, is then determined 52. A prediction mode used for prediction of a second adjacent block, which is also adjacent to said target block is also determined 54. These adjacent block prediction modes are then examined 56 to determine which is more likely to produce a lesser prediction error.
In other embodiments of the present invention, as illustrated in FIG. 10, a set of prediction modes is ordered 58 according to the modes' likelihood of producing a lesser prediction error. A target block is selected 60. The prediction mode used for a first adjacent block is determined 62 and the prediction mode used for a second adjacent block is also determined 64. These two prediction modes are then examined 66 to determine which occurs first in the ordered set of modes thereby corresponding to the mode with the higher likelihood of producing a lesser prediction error.
In further embodiments of the present invention, as illustrated in FIG. 11, a set of prediction modes is ordered 68 by likelihood of producing a lesser prediction error. These modes in the ordered set are then associated 70 with numerical values such that modes with a higher likelihood of producing a lesser prediction error are associated with lower numerical values. The mode used to predict a first adjacent block is then determined 72 and the mode used to predict a second adjacent block is also determined 74. These adjacent block modes are then examined to determine which mode is associated with a lower numerical value. This mode is designated as the estimated mode for prediction of the target block.
In still further embodiments, as illustrated in FIG. 12, a set of prediction modes is ordered 78 by likelihood of producing a lesser prediction error. These modes in the ordered set are then associated 80 with numerical values such that modes with a higher likelihood of producing a lesser prediction error are associated with lower numerical values. An attempt 82 is made to determine the mode used to predict a first adjacent block and an attempt 84 is made to determine the mode used to predict a second adjacent block. If the prediction mode used to predict the first adjacent block is not available 86, a default prediction mode, such as a DC prediction mode, may be designated 90 as an estimated prediction mode for the target block. Also, if the prediction mode used to predict the second adjacent block is not available 88, a default prediction mode, such as a DC prediction mode, may be designated 90 as an estimated prediction mode for the target block. When the adjacent block prediction modes are available, these adjacent block modes may be examined to determine which mode is associated with a lower numerical value. This mode is then designated 92 as the estimated mode for prediction of the target block.
Modification of Prediction Mode Order Based on Adjacent Block Data
In some embodiments of the present invention the prediction mode orders described above, which have been determined independently of the adjacent block data, may be modified with adjacent block data. Prediction mode estimates determined with reference to adjacent block data can be inserted into prediction mode orders to modify the orders to reflect the additional information obtained from adjacent block data.
In some of these embodiments, a prediction mode estimate, based on adjacent block data, can be inserted directly into a prediction mode order set. Typically, the prediction mode estimate will be inserted or prepended at the front of the prediction mode order at the position of the mode most likely to produce a reduced prediction error. However, in some embodiments the estimate may be inserted at different positions in the mode order.
In some embodiments of the present invention, as shown in FIG. 13, a prediction mode order is selected 102 wherein the prediction mode order elements may be arranged according to their likelihood of producing a lesser prediction error. In other words, the first element in the order represents the prediction mode most likely to yield a lesser prediction error, the next element in the order represents the prediction mode that is the next most likely to yield a lesser prediction error and so on to the last element in the order, which represents the prediction mode in the order that is least likely to yield a lesser prediction error.
A prediction mode estimate is also determined 104, as described above. This estimate is determined using adjacent block data. Generally, the estimate is the prediction mode used in one or more adjacent blocks that is likely to yield a lesser prediction error. However, the estimate may be determined in other ways. When sufficient adjacent block prediction mode data is not available, such as at an image edge or a slice boundary, a prediction mode for the target block may be estimated based on the lack of one or more adjacent blocks or their prediction mode data. In many cases, a DC prediction mode will be estimated when adjacent block data is limited or unavailable.
In some embodiments, once the estimated prediction mode is estimated, the estimated prediction mode may be placed 106 into the mode order as the mode most likely to yield a lesser prediction error. In some embodiments, this will be the first mode in the order or the mode associated with the lowest numerical value.
In other embodiments, the estimated prediction mode may take precedence over the pre-selected mode order. In some of these embodiments, as illustrated in FIG. 14. a pre-selected mode order is designated 110 at the encoder and the decoder. This order comprises a set of prediction modes arranged in order of likelihood of yielding a lesser prediction error or some other order. An estimated prediction mode is also determined 112 based on adjacent block data. This estimated prediction mode is determined at the encoder and the decoder according to the same algorithm or method. The encoder also determines the actual best prediction mode 114 for predicting a pixel based on motion vectors or other known techniques. The encoder may, then, compare 116 the actual best prediction mode to the estimated prediction mode to determine whether they are the same. If the estimated prediction mode is the same mode as the actual best prediction mode, the encoder may signal to the decoder that the estimated prediction mode is to be used 118. In some embodiments, this estimated prediction mode signal may be performed with a 1-bit flag to signify whether the estimated mode is to be used on not.
If the estimated prediction mode is not the actual best prediction mode, the encoder may signal to the decoder that another mode may be used 120. This may be performed by reference to the pre-established mode order. The encoder may determine which mode in the mode order is most equivalent to the actual best prediction mode and signal the decoder to use that mode.
When an ordered set of prediction modes is used, the set order may be rearranged once further data is obtained. For example, an ordered set of prediction modes may be reordered when an estimated prediction mode is determined or when a best actual prediction mode is determined. In these cases, the modifying mode may be interjected into the ordered set, placed ahead of the ordered set or, in some cases, removed from the ordered set.
In some embodiments of the present invention, each mode in the mode order may be associated with a numerical value according to the order. In these embodiments, the numerical value associated with the mode to be used may be sent to the decoder to signal the decoder to use that prediction mode. In some of these embodiments, as illustrated in FIG. 15, a mode order comprising 9 prediction modes may be selected 130. An estimated prediction mode based on adjacent block data, and which is one of the 9 modes in the order, may also be determined 132. A best prediction mode may also be determined 134 by motion vector methods or other methods. The best prediction mode may then be compared to the estimated prediction mode 136. If the estimated prediction mode is substantially the same as the best prediction mode, the decoder may be signaled with a 1-bit designator to use the estimated prediction mode, which is already identified at the decoder. If the estimated prediction mode is not equivalent to the best prediction mode, the estimated prediction mode is essentially eliminated from the mode order 140. This elimination may be performed by re-ordering the set, skipping the estimated mode in the order or by other means. The remaining order will effectively comprise 8 modes, which can be represented by a 3-bit designator. This 3-bit designator may be sent to the decoder 142 to designate which mode to use for prediction.
The terms and expressions employed in the foregoing specification are used therein as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding equivalents of the features shown and described or portions thereof, it being recognized that the scope of the invention is defined and limited only by the claims that follow.

Claims (17)

1. A method for determining an estimated pixel intra-prediction mode, said method comprising:
a) ordering a set of prediction modes in an order;
b) selecting a target block for prediction mode estimation;
c) determining a first prediction mode that has been used for encoding a first adjacent block, said first adjacent block being adjacent to said target block;
d) determining a second prediction mode that has been used for encoding a second adjacent block, said second adjacent block being adjacent to said target block;
e) selecting one of said first prediction mode and said second prediction mode as an estimated prediction mode, said selecting being based on the relative positions of said first and second prediction modes in said order; and
f) calculating an actual prediction mode based on target block content analysis and comparing said actual prediction mode to said estimated prediction mode.
2. A method as described in claim 1 wherein said estimated pixel prediction mode is a DC prediction mode when at least one of said first prediction mode and said second prediction mode are not available.
3. A method as described in claim 1 wherein said estimated pixel prediction mode is a DC prediction mode when at least one of said first prediction mode and said second prediction mode is available, but coded in Inter prediction mode.
4. A method for determining an estimated pixel intra-prediction mode to be used for efficient prediction mode communication between an encoder and a decoder, said method comprising:
a) ordering a set of potential prediction modes in an order of decreasing likelihood of producing a lesser prediction error by associating a sequential numerical value with each prediction mode such that the lowest numerical value is associated with the prediction mode most likely to produce a lesser prediction error and the prediction mode that is least likely to produce a lesser prediction error is associated with the highest numerical value in the sequence;
b) selecting a target block for prediction mode estimation;
c) receiving a first prediction mode that has been used for encoding a first adjacent block, said first adjacent block being adjacent to said target block;
d) receiving a second prediction mode that has been used for encoding a second adjacent block, said second adjacent block being adjacent to said target block;
determining an estimated prediction mode among said first prediction mode and said second prediction mode, wherein said estimated prediction mode is the prediction mode associated with the lesser value; and
calculating an actual prediction mode based on target block content analysis and comparing said actual prediction mode to said estimated prediction mode.
5. A method as described in claim 4 wherein said estimated pixel prediction mode is a DC prediction mode when at least one of said first prediction mode and said second prediction mode are not available.
6. A method as described in claim 4 wherein said set of potential prediction modes comprises, in order:
a) one mode taken from a first subset consisting of a horizontal prediction mode and a vertical prediction mode;
b) the other mode of said first subset; and
c) a DC prediction mode.
7. A method for determining a pixel intra-prediction mode, said method comprising:
a) selecting a target block for prediction mode estimation;
b) receiving an actual prediction mode indicator, wherein said actual prediction mode is based on target block content analysis;
c) when said indicator indicates that an estimated prediction mode shall be used:
i) determining a first prediction mode that has been used for encoding a first adjacent block, said first adjacent block being adjacent to said target block;
ii) determining a second prediction mode that has been used for encoding a second adjacent block, said second adjacent block being adjacent to said target block;
iii) determining an estimated prediction mode among said first prediction mode and said second prediction mode, wherein said estimated prediction mode is the prediction mode that is least likely to produce a lesser prediction error; and
iv) using said estimated prediction mode to decode said target block;
d) when said indicator indicates that said estimated prediction mode shall not be used:
i) receiving a mode numerical value indicating a prediction mode to be used; and
ii) using the prediction mode associated with said mode numerical value to decode said target block.
8. An apparatus for estimating a pixel prediction mode, said apparatus comprising:
a) a plurality of prediction modes arranged in an order based on a likelihood of producing a lesser prediction error;
b) a selector for selecting a target block for prediction mode estimation;
c) a first determiner for determining a first prediction mode that has been used to decode a first adjacent block, said first adjacent block being adjacent to said target block;
d) a second determiner for determining a second prediction mode that has been used to decode a second adjacent block, said second adjacent block being adjacent to said target block;
e) a selector for selecting an estimated prediction mode for said target block from among said first prediction mode and said second prediction mode based on the locations of said first and second prediction modes in said order; and
f) a calculator for calculating an actual prediction mode based on target block content analysis and comparing said actual prediction mode to said estimated prediction mode.
9. A computer-readable medium comprising computer-executable instructions for estimating a pixel prediction mode, said instructions comprising the acts of:
a) ordering a set of prediction modes in an order based on a likelihood of producing a lesser prediction error;
b) selecting a target block for prediction mode estimation;
c) determining a first prediction mode that has been used for encoding a first adjacent block, said first adjacent block being adjacent to said target block;
d) determining a second prediction mode that has been used for encoding a second adjacent block, said second adjacent block being adjacent to said target block;
e) selecting one of said first prediction mode and said second prediction mode as an estimated prediction mode based on the relative positions of said first and second prediction modes in said order; and
f) calculating an actual prediction mode based on target block content analysis and comparing said actual prediction mode to said estimated prediction mode.
10. A method as described in claim 7 further comprising designating a DC prediction mode as an estimated prediction mode if at least one of said first prediction mode and said second prediction mode are not intra-coded.
11. A method as described in claim 7 further comprising designating a DC prediction mode as an estimated prediction mode if at least one of said first prediction mode and said second prediction mode are unavailable.
12. A method as described in claim 4 wherein said ordering is performed such that the modes most likely to yield a lower prediction error are associated with the lowest numbers.
13. A method for selecting a numbered prediction mode for effective estimation of an estimated pixel intra-prediction mode, said method comprising:
giving ascending mode numbers to prediction modes in the order of a vertical prediction mode, a horizontal prediction mode, and a DC prediction mode, wherein
the vertical prediction mode takes a pixel value of a block adjacent to an upper side of a block to be encoded as a prediction value,
the horizontal prediction mode takes a pixel value of a block adjacent to a left side of the block to be encoded as a prediction value, and
the DC prediction mode takes an average value of the pixel value of the block adjacent to the upper side of the block to be encoded and the pixel value of the block adjacent to the left side of the block to be encoded as a prediction value;
identifying a target block for prediction mode estimation;
identifying a mode number of a first prediction mode that has actually been used for encoding a first adjacent block, said first adjacent block being adjacent to an upper side of said target block;
identifying a mode number of a second prediction mode that has actually been used for encoding a second adjacent block, said second adjacent block being adjacent to a left side of said target block; and
selecting a prediction mode having the smaller mode number between the first prediction mode and the second prediction mode as the estimated prediction mode of the target block.
14. A method for selecting a numbered prediction mode for effective estimation of an estimated pixel intra-prediction mode to be used for efficient prediction mode communication between an encoder and a decoder, said method comprising:
giving ascending mode numbers to prediction modes in the order of a vertical prediction mode, a horizontal prediction mode, and a DC prediction mode, wherein
the vertical prediction mode takes a pixel value of a block adjacent to an upper side of a block to be encoded as a prediction value,
the horizontal prediction mode takes a pixel value of a block adjacent to a left side of the block to be encoded as a prediction value, and
the DC prediction mode takes an average value of the pixel value of the block adjacent to the upper side of the block to be encoded and the pixel value of the block adjacent to the left side of the block to be encoded as a prediction value;
identifying a target block for prediction mode estimation;
receiving a mode number of a first prediction mode that has actually been used for encoding a first adjacent block, said first adjacent block being adjacent to an upper side of said target block;
receiving a mode number of a second prediction mode that has actually been used for encoding a second adjacent block, said second adjacent block being adjacent to a left side of said target block; and
selecting a prediction mode having the smaller mode number between the first prediction mode and the second prediction mode as an estimated prediction mode of the target block.
15. A method for selecting a numbered prediction mode for effective estimation of a pixel intra-prediction mode, said method comprising:
a) giving ascending mode numbers to prediction modes in the order of a vertical prediction mode, a horizontal prediction mode, and a DC prediction mode, wherein
the vertical prediction mode takes a pixel value of a block adjacent to an upper side of a block to be encoded as a prediction value,
the horizontal prediction mode takes a pixel value of a block adjacent to a left side of the block to be encoded as a prediction value, and
the DC prediction mode takes an average value of the pixel value of the block adjacent to the upper side of the block to be encoded and the pixel value of the block adjacent to the left side of the block to be encoded as a prediction value;
b) identifying a target block for prediction mode estimation;
c) receiving a prediction mode indicator indicating whether or not an estimated prediction mode shall be used to decode the target block;
d) when said indicator indicates that an estimated prediction mode shall be used:
i) identifying a mode number of a first prediction mode that has actually been used for encoding a first adjacent block, said first adjacent block being adjacent to an upper side of said target block;
ii) identifying a mode number of a second prediction mode that has actually been used for encoding a second adjacent block, said second adjacent block being adjacent to a left side of said target block;
iii) selecting a prediction mode having the smaller mode number between the first prediction mode and the second prediction mode as an estimated prediction mode of the target block; and
iv) using said estimated prediction mode to decode said target block;
e) when said indicator indicates that said estimated prediction mode shall not be used:
i) receiving a mode information indicating a prediction mode to be used; and
ii) using the prediction mode indicated by said mode information to decode said target block.
16. An apparatus for selecting a numbered prediction mode for effective estimation of a pixel prediction mode, said apparatus comprising:
a) an assignor for assigning ascending mode numbers to prediction modes in the order of a vertical prediction mode, a horizontal prediction mode, and a DC prediction mode, wherein
the vertical prediction mode takes a pixel value of a block adjacent to an upper side of a block to be encoded as a prediction value,
the horizontal prediction mode takes a pixel value of a block adjacent to a left side of the block to be encoded as a prediction value, and
the DC prediction mode takes an average value of the pixel value of the block adjacent to the upper side of the block to be encoded and the pixel value of the block adjacent to the left side of the block to be encoded as a prediction value;
b) a first identifier for identifying a target block for prediction mode estimation;
c) a second identifier for identifying a first prediction mode that has actually been used to decode a first adjacent block, said first adjacent block being adjacent to an upper side of said target block;
d) a third identifier for identifying a second prediction mode that has actually been used to decode a second adjacent block, said second adjacent block being adjacent to a left side of said target block; and
e) a selector for selecting a prediction mode having the smaller mode number between the first prediction mode and the second prediction mode as an estimated prediction mode of the target block.
17. A computer-readable medium comprising computer-executable instructions for selecting a numbered prediction mode for effective estimation of a pixel prediction mode, said instructions comprising the acts of:
assigning ascending mode numbers to prediction modes in the order of a vertical prediction mode, a horizontal prediction mode, and a DC prediction mode, wherein
the vertical prediction mode takes a pixel value of a block adjacent to an upper side of a block to be encoded as a prediction value,
the horizontal prediction mode takes a pixel value of a block adjacent to a left side of the block to be encoded as a prediction value, and
the DC prediction mode takes an average value of the pixel value of the block adjacent to the upper side of the block to be encoded and the pixel value of the block adjacent to the left side of the block to be encoded as a prediction value;
identifying a target block for prediction mode estimation;
identifying a first prediction mode that has actually been used for encoding a first adjacent block, said first adjacent block being adjacent to an upper side of said target block;
identifying a second prediction mode that has actually been used for encoding a second adjacent block, said second adjacent block being adjacent to a left side of said target block; and
selecting a prediction mode having the smaller mode number between the first prediction mode and the second prediction mode as an estimated prediction mode of the target block.
US12/608,848 2002-05-28 2009-10-29 Methods and systems for image intra-prediction mode estimation Active 2025-04-06 USRE43567E1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/608,848 USRE43567E1 (en) 2002-05-28 2009-10-29 Methods and systems for image intra-prediction mode estimation

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US31927202P 2002-05-28 2002-05-28
US31939002P 2002-07-11 2002-07-11
US10/404,211 US7289672B2 (en) 2002-05-28 2003-03-31 Methods and systems for image intra-prediction mode estimation
US12/608,848 USRE43567E1 (en) 2002-05-28 2009-10-29 Methods and systems for image intra-prediction mode estimation

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US10/404,211 Reissue US7289672B2 (en) 2002-05-28 2003-03-31 Methods and systems for image intra-prediction mode estimation

Publications (1)

Publication Number Publication Date
USRE43567E1 true USRE43567E1 (en) 2012-08-07

Family

ID=29587614

Family Applications (2)

Application Number Title Priority Date Filing Date
US10/404,211 Ceased US7289672B2 (en) 2002-05-28 2003-03-31 Methods and systems for image intra-prediction mode estimation
US12/608,848 Active 2025-04-06 USRE43567E1 (en) 2002-05-28 2009-10-29 Methods and systems for image intra-prediction mode estimation

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US10/404,211 Ceased US7289672B2 (en) 2002-05-28 2003-03-31 Methods and systems for image intra-prediction mode estimation

Country Status (1)

Country Link
US (2) US7289672B2 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150334389A1 (en) * 2012-09-06 2015-11-19 Sony Corporation Image processing device and image processing method
US20170142423A1 (en) * 2002-05-28 2017-05-18 Dolby Laboratories Licensing Corporation Methods And Systems For Image Intra-Prediction Mode Management
US9756338B2 (en) 2010-09-30 2017-09-05 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, program, and integrated circuit
US9860539B2 (en) 2011-06-13 2018-01-02 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, and image coding and decoding apparatus
US9930331B2 (en) 2010-07-15 2018-03-27 Sharp Kabushiki Kaisha Decoding and encoding devices using intra-frame prediction based on prediction modes of neighbor regions
US20190124328A1 (en) * 2011-01-12 2019-04-25 Mitsubishi Electric Corporation Image encoding device, image decoding device, image encoding method, and image decoding method for generating a prediction image
US11770550B2 (en) * 2010-12-07 2023-09-26 Electronics And Telecommunications Research Institute Method and device for simplifying the encoding and decoding of ultra-high definition images

Families Citing this family (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6765964B1 (en) 2000-12-06 2004-07-20 Realnetworks, Inc. System and method for intracoding video data
US7236524B2 (en) * 2002-05-28 2007-06-26 Sharp Laboratories Of America, Inc. Methods and systems for image intra-prediction mode communication
US9330060B1 (en) 2003-04-15 2016-05-03 Nvidia Corporation Method and device for encoding and decoding video image data
US8660182B2 (en) 2003-06-09 2014-02-25 Nvidia Corporation MPEG motion estimation based on dual start points
US8467447B2 (en) * 2004-05-07 2013-06-18 International Business Machines Corporation Method and apparatus to determine prediction modes to achieve fast video encoding
US8369402B2 (en) * 2004-06-17 2013-02-05 Canon Kabushiki Kaisha Apparatus and method for prediction modes selection based on image formation
US7565020B2 (en) * 2004-07-03 2009-07-21 Microsoft Corp. System and method for image coding employing a hybrid directional prediction and wavelet lifting
JP4501631B2 (en) * 2004-10-26 2010-07-14 日本電気株式会社 Image coding apparatus and method, computer program for image coding apparatus, and portable terminal
JP4763422B2 (en) * 2004-12-03 2011-08-31 パナソニック株式会社 Intra prediction device
KR101108681B1 (en) * 2005-01-19 2012-01-25 삼성전자주식회사 Frequency transform coefficient prediction method and apparatus in video codec, and video encoder and decoder therewith
JP2006246431A (en) 2005-02-07 2006-09-14 Matsushita Electric Ind Co Ltd Image coding apparatus and method
US7706442B2 (en) * 2005-02-15 2010-04-27 Industrial Technology Research Institute Method for coding mode selection of intra prediction in video compression
KR100678911B1 (en) * 2005-07-21 2007-02-05 삼성전자주식회사 Method and apparatus for video signal encoding and decoding with extending directional intra prediction
US8731071B1 (en) 2005-12-15 2014-05-20 Nvidia Corporation System for performing finite input response (FIR) filtering in motion estimation
US8724702B1 (en) 2006-03-29 2014-05-13 Nvidia Corporation Methods and systems for motion estimation used in video coding
US8660380B2 (en) 2006-08-25 2014-02-25 Nvidia Corporation Method and system for performing two-dimensional transform on data value array with reduced power consumption
US8111756B2 (en) * 2006-08-30 2012-02-07 Jiun-In Guo Method for reducing computational complexity of video compression standard
KR101411315B1 (en) * 2007-01-22 2014-06-26 삼성전자주식회사 Method and apparatus for intra/inter prediction
TW200845758A (en) * 2007-05-03 2008-11-16 Nat Univ Chung Cheng Fast intra coding algorithms for dynamic definition adjustment
US8756482B2 (en) 2007-05-25 2014-06-17 Nvidia Corporation Efficient encoding/decoding of a sequence of data frames
US9118927B2 (en) 2007-06-13 2015-08-25 Nvidia Corporation Sub-pixel interpolation and its application in motion compensated encoding of a video signal
CN103338365B (en) * 2007-06-29 2017-04-12 夏普株式会社 Image encoding device, image encoding method, image decoding device, image decoding method
US8873625B2 (en) 2007-07-18 2014-10-28 Nvidia Corporation Enhanced compression in representing non-frame-edge blocks of image frames
TWI364222B (en) * 2007-09-05 2012-05-11 Via Tech Inc Method and device for generating prediction mode parameter
JP5212373B2 (en) 2007-09-12 2013-06-19 ソニー株式会社 Image processing apparatus and image processing method
TWI392372B (en) * 2007-09-12 2013-04-01 Sony Corp Image processing apparatus and image processing method
CN101637026B (en) 2007-09-12 2012-03-28 索尼株式会社 Image processing apparatus and method
JP2009094828A (en) * 2007-10-10 2009-04-30 Hitachi Ltd Device and method for encoding image, and device and method for decoding image
ES2752001T3 (en) * 2008-04-15 2020-04-02 Orange Image prediction by pre-determining a family of reference, encoding and decoding pixels using such prediction
US20090274211A1 (en) * 2008-04-30 2009-11-05 Omnivision Technologies, Inc. Apparatus and method for high quality intra mode prediction in a video coder
US20090274213A1 (en) * 2008-04-30 2009-11-05 Omnivision Technologies, Inc. Apparatus and method for computationally efficient intra prediction in a video coder
KR100949917B1 (en) * 2008-05-28 2010-03-30 한국산업기술대학교산학협력단 Fast Encoding Method and System Via Adaptive Intra Prediction
US8761253B2 (en) * 2008-05-28 2014-06-24 Nvidia Corporation Intra prediction mode search scheme
KR101306834B1 (en) * 2008-09-22 2013-09-10 에스케이텔레콤 주식회사 Video Encoding/Decoding Apparatus and Method by Using Prediction Possibility of Intra Prediction Mode
US8666181B2 (en) 2008-12-10 2014-03-04 Nvidia Corporation Adaptive multiple engine image motion detection system and method
US8831099B2 (en) * 2008-12-17 2014-09-09 Nvidia Corporation Selecting a macroblock encoding mode by using raw data to compute intra cost
US9432674B2 (en) * 2009-02-02 2016-08-30 Nvidia Corporation Dual stage intra-prediction video encoding system and method
CN101505428B (en) * 2009-03-24 2012-10-03 威盛电子股份有限公司 Intra-frame prediction method and apparatus thereof
TWI407797B (en) * 2009-03-27 2013-09-01 Via Tech Inc Intra-frame prediction method and prediction apparatus using the same
JP5158003B2 (en) * 2009-04-14 2013-03-06 ソニー株式会社 Image coding apparatus, image coding method, and computer program
US8917769B2 (en) 2009-07-03 2014-12-23 Intel Corporation Methods and systems to estimate motion based on reconstructed reference frames at a video decoder
US8462852B2 (en) 2009-10-20 2013-06-11 Intel Corporation Methods and apparatus for adaptively choosing a search range for motion estimation
US9654792B2 (en) 2009-07-03 2017-05-16 Intel Corporation Methods and systems for motion vector derivation at a video decoder
KR101702553B1 (en) * 2009-07-04 2017-02-03 에스케이 텔레콤주식회사 Video Coding Method and Apparatus
KR20110068792A (en) * 2009-12-16 2011-06-22 한국전자통신연구원 Adaptive image coding apparatus and method
US8743957B2 (en) * 2010-04-12 2014-06-03 Sony Corporation Context adaptive directional intra prediction
MY186093A (en) * 2010-07-15 2021-06-21 Velos Media Int Ltd Image encoding device, and image decoding device
US10136130B2 (en) * 2010-08-17 2018-11-20 M&K Holdings Inc. Apparatus for decoding an image
US11284072B2 (en) 2010-08-17 2022-03-22 M&K Holdings Inc. Apparatus for decoding an image
KR20120016991A (en) * 2010-08-17 2012-02-27 오수미 Inter prediction process
PL3125553T3 (en) * 2010-08-17 2018-09-28 M&K Holdings Inc. Method for encoding an intra prediction mode
CN102986224B (en) 2010-12-21 2017-05-24 英特尔公司 System and method for enhanced dmvd processing
WO2012167418A1 (en) 2011-06-07 2012-12-13 Technicolor (China) Technology Co., Ltd. Method for encoding and/or decoding images on macroblock level using intra-prediction
TWI552583B (en) * 2011-06-28 2016-10-01 三星電子股份有限公司 Method and apparatus for intra prediction encoding of image, and method and apparatus for intra prediction decoding of image
CN106851300B (en) 2011-09-13 2020-05-26 寰发股份有限公司 Method and device for intra-frame prediction mode based on most probable mode
US8811760B2 (en) * 2011-10-25 2014-08-19 Mitsubishi Electric Research Laboratories, Inc. Coding images using intra prediction modes
CN108174205A (en) * 2011-12-05 2018-06-15 Lg 电子株式会社 The method and apparatus of intra prediction
KR101702551B1 (en) * 2015-06-05 2017-02-06 에스케이 텔레콤주식회사 Video Coding Method and Apparatus

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05183758A (en) 1991-05-17 1993-07-23 Internatl Business Mach Corp <Ibm> System and method of masking quantization having adaptability for transferring image
JPH08186823A (en) 1994-12-28 1996-07-16 Canon Inc Coding device and method
WO2000033583A1 (en) 1998-11-30 2000-06-08 Microsoft Corporation Efficient macroblock header coding for video compression
JP2000295627A (en) 1996-11-07 2000-10-20 Matsushita Electric Ind Co Ltd Image coding method, image coder, and recording medium storing image coding program
US6345121B1 (en) 1996-11-07 2002-02-05 Matsushita Electric Industrial Co., Ltd. Image encoding apparatus and an image decoding apparatus
US20020031276A1 (en) 1995-06-02 2002-03-14 Takahiro Yagishita Image encoding method and apparatus, image decoding method and apparatus, image processing apparatus, image formation apparatus, and computer-executable programs
WO2002054779A2 (en) 2001-01-08 2002-07-11 Siemens Aktiengesellschaft Method for header compression in a video coding
US20030081838A1 (en) 2001-05-31 2003-05-01 Chia-Yuan Teng Header compression for image data stream
US6668095B2 (en) 1998-12-03 2003-12-23 Koninklijke Philips Electronics N.V. Systems and methods for compressing and decompressing images
US6765964B1 (en) 2000-12-06 2004-07-20 Realnetworks, Inc. System and method for intracoding video data
US6907142B2 (en) 2000-01-21 2005-06-14 Nokia Mobile Phones Ltd. Method for encoding images, and an image coder

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05183758A (en) 1991-05-17 1993-07-23 Internatl Business Mach Corp <Ibm> System and method of masking quantization having adaptability for transferring image
JPH08186823A (en) 1994-12-28 1996-07-16 Canon Inc Coding device and method
US20020031276A1 (en) 1995-06-02 2002-03-14 Takahiro Yagishita Image encoding method and apparatus, image decoding method and apparatus, image processing apparatus, image formation apparatus, and computer-executable programs
JP2000295627A (en) 1996-11-07 2000-10-20 Matsushita Electric Ind Co Ltd Image coding method, image coder, and recording medium storing image coding program
US6345121B1 (en) 1996-11-07 2002-02-05 Matsushita Electric Industrial Co., Ltd. Image encoding apparatus and an image decoding apparatus
WO2000033583A1 (en) 1998-11-30 2000-06-08 Microsoft Corporation Efficient macroblock header coding for video compression
US6668095B2 (en) 1998-12-03 2003-12-23 Koninklijke Philips Electronics N.V. Systems and methods for compressing and decompressing images
US6907142B2 (en) 2000-01-21 2005-06-14 Nokia Mobile Phones Ltd. Method for encoding images, and an image coder
US6765964B1 (en) 2000-12-06 2004-07-20 Realnetworks, Inc. System and method for intracoding video data
WO2002054779A2 (en) 2001-01-08 2002-07-11 Siemens Aktiengesellschaft Method for header compression in a video coding
US20030081838A1 (en) 2001-05-31 2003-05-01 Chia-Yuan Teng Header compression for image data stream

Non-Patent Citations (24)

* Cited by examiner, † Cited by third party
Title
"H.26L Test Model Long Term No. 6 (TML-6) Draft O"ITU-T Telecommunication Standarization Sector of ITU Geneva, CH. Jan. 9, 2001, pp. 1-35, X00108915 *paragraph '3.4 Intra prediction mode (Intra-pred-mode)!* *tables 1, 2*.
Bojun Mend et al; "Fast intra-prediction mode selection for 4×4 blocks in H.264" 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. (ICASSP). Hong Kong, Apr. 6-10, 2003,IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), New York, NY: IEEE, US, vol. vol. 1 of 6, Apr. 6, 2003, pp. 11389-111392, XPO1O639091 ISBN: 0-7803-7666-3 * the whole document*.
Bojun Meng et al: "Fast intra prediction mode selection for 4×4 blocks in H.264" 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing Proceedings (ICASSP), Hong Kong, Apr. 6-10, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), New York, NY: IEEE, US, vol. 1 of 6, Apr. 6, 2003 (Apr. 6, 2003), pp. 111389-111392, XPO1O639091 ISBN: 0/7803-7663-3 *the whole document*.
Bojun Meng, Oscar C. Au, "Fast Intra-Prediction Mode Selection for 4×4 Blocks in H.264", ICASSP 2003 IEEE International Conference, (Apr. 6-10, 2003), IEEE, pp. III-389-III-392.
European Search Report for EP Application No. 08 00 7745.6, Jan. 28, 2011.
European Search Report for EP Application No. 10 17 9562.03, Jan. 28, 2011.
European Search Report for EP Application No. 10 17 9572.2, Jan. 28, 2011.
Greg Conklin, "New Intra Prediction Modes," document VCEG-N54 doc, ITU-T Video Coding Experts Group (VCEG) Meeting Santa Barbara, CA USA 24-28, May 2001.
Greg Conklin, "New Intra Prediction Modes," document VCEG-N54 doc, ITU-T Video Coding Experts Group (VCEG) Meeting Santa Barbara, CA, USA 24-28.
H.26L Test Model Long Term No. 6 (TML-6) Draft O. ITU-T Telecommunication Standardization Sector of ITU Geneva, CH.9 Jan. 2001 (Jan. 9, 2001, pp. 1-35, XP00108915 *paragraph '3.4 Intra prediction mode (Intra-pred-mode)!* *tables 1.2*.
H.26L Test Model Long Term No. 6 (TML-6) draft0. [online], ITU-Telecommunications Standardization Sector Study Group 16 Video Coding Experts Group (VCEG), 2001, pp. 11-14 [retrieved on May 28, 2003].
Iain E G Richardson, H.264/MPEG-4 Part 10:Intra Prediction, pp. 1 of 1-7 of 7, [online], 07.10.02, H.264/MPEG-4 Part 10 White Paper, [retrieved on Sep. 22, 2003], Retrieved from Internet:,//218.30.21.30/sc/jywd/h264-intrapred.pdf.
Joint committe Draft (CD), Document JVT-C167, Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG.
Joint Committee Draft (CD), Document JVT-C167, Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG, May 2002.
Karczewicz: "Analysis and Simplification of Intra Prediction", ITU Study Group 16-Video Coding Experts Group-ISO/IEC MPEG & ITU-T VCEG(ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q6), XX, XX, No. JVT-D025, Jul. 26, 2002 (Jul. 26, 2002), XP030005299.
lain E G Richardson, "H.264/MPEG-4 part 10 Tutorials", H.264/MPEG-4 part 10: Intra Prediction, pp. 1 of 6-6 of 6, [online], Apr. 30, 2008.
lain E G Richardson, "H.264/MPEG-4 Part 10:Intra Prediction", pp. 1 of 1-7 of 7, [online], 07.10.02, H.264/MPEG-4 Part 10 White Paper, [retrieved on Sep. 22, 2003], Retrieved from Internet:,//218.30.21.30/sc/jywd/h264-intrapred.pdf.
Minhua Zhou, "Scalable Intra Prediction," document JVT-C033 doc, Joint Vide Team (JVT) of ISO/IEC MPEG & ITU-T VCEG Meeting Fairfax, Virginia, USA 6-10.
Minhua Zhou, "Scalable Intra Prediction," document JVT-C033 doc, Joint Vide Team (JVT) of ISO-IEC MPEG & ITU-T VCEG Meeting Fairfax, Virginia, USA 6-10, Dec. 2001.
Pan F et al: "Fast Mode Decision for Infra Prediction" Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG-16 Q.6), Document JVT-G013, 'Online! Mar. 7, 2003 (Mar. 7, 2003), pp. 1-22, XP002325066 Retrieved from the Internet: URL:http://ftp3.itu.int/av-arch/jvt-site/2003 03 Pattaya/JVT-GO13.doc>1retrieved on Mar. 23, 2005! "the whole document".
Pan F et al: "Fast Mode Decision for Intra Prediction" Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6), Document JVT-GO13, 'Online! Mar. 7, 2003 pp. 1-22, XP002325066 Retrieved from the Internet: URL;http://ftp3.Itu.int/av-arch/jvl-site/2003-03-Pattaya/JVT-G013.doc> retrieved on Mar. 23, 2005! *the whole document*.
Sun: "Intra-Prediction Mode Ordering and Coding", ITU Study Group 16-Video Coding Experts Group ISO/IEC MPEG & ITU-T VCEG(ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q6), XX, XX, No. JVT-D027, Jul. 26, 2002 (Jul. 26, 2002) XP030005301.
Weigland T: "Joint Model No. 1, Revision 1(JM-IRL)" ITU Study Group 16-Video Coding Experts Group, Dec. 3, 2001 (Mar. 12, 2001), pp. 1, 3-75, XP001086627 *paragraph '3.4.3 Intra Prediction Mode (Intra-pre-mode)!*.
Weigland T: "Joint Model No. 1, Revision 1(JM-IRL)" ITU Study Group 16-Video Coding Experts Group, Dec. 3, 2001, pp. 1, 3-75, XP001086627 *paragraph '3.4.3 Intra Prediction Mode (Intra-pred-mode)!*.

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10368066B2 (en) 2002-05-28 2019-07-30 Dolby Laboratories Licensing Corporation Methods and systems for image intra-prediction mode management
US20170142423A1 (en) * 2002-05-28 2017-05-18 Dolby Laboratories Licensing Corporation Methods And Systems For Image Intra-Prediction Mode Management
US10715799B2 (en) 2002-05-28 2020-07-14 Dolby Laboratories Licensing Corporation Methods and systems for image intra-prediction mode management
US9774868B2 (en) * 2002-05-28 2017-09-26 Dolby Laboratories Licensing Corporation Methods and systems for image intra-prediction mode management
US9930331B2 (en) 2010-07-15 2018-03-27 Sharp Kabushiki Kaisha Decoding and encoding devices using intra-frame prediction based on prediction modes of neighbor regions
US10313689B2 (en) 2010-07-15 2019-06-04 Sharp Kabushiki Kaisha Decoding device, coding device, and method
US11405624B2 (en) 2010-07-15 2022-08-02 Sharp Kabushiki Kaisha Decoding device, coding device, and method
US11032557B2 (en) 2010-07-15 2021-06-08 Sharp Kabushiki Kaisha Decoding device, coding device, and method
US10887599B2 (en) 2010-09-30 2021-01-05 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, program, and integrated circuit
US10306234B2 (en) 2010-09-30 2019-05-28 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, program, and integrated circuit
US11206409B2 (en) 2010-09-30 2021-12-21 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, program, and integrated circuit
US9756338B2 (en) 2010-09-30 2017-09-05 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, program, and integrated circuit
US11770550B2 (en) * 2010-12-07 2023-09-26 Electronics And Telecommunications Research Institute Method and device for simplifying the encoding and decoding of ultra-high definition images
US10931946B2 (en) * 2011-01-12 2021-02-23 Mitsubishi Electric Corporation Image encoding device, image decoding device, image encoding method, and image decoding method for generating a prediction image
US20190124328A1 (en) * 2011-01-12 2019-04-25 Mitsubishi Electric Corporation Image encoding device, image decoding device, image encoding method, and image decoding method for generating a prediction image
US10887606B2 (en) 2011-06-13 2021-01-05 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, and image coding and decoding apparatus
US10484692B2 (en) 2011-06-13 2019-11-19 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, and image coding and decoding apparatus
US10250887B2 (en) 2011-06-13 2019-04-02 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, and image coding and decoding apparatus
US11431989B2 (en) 2011-06-13 2022-08-30 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, and image coding and decoding apparatus
US11758155B2 (en) 2011-06-13 2023-09-12 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, and image coding and decoding apparatus
US9860539B2 (en) 2011-06-13 2018-01-02 Sun Patent Trust Image decoding method, image coding method, image decoding apparatus, image coding apparatus, and image coding and decoding apparatus
US20150334389A1 (en) * 2012-09-06 2015-11-19 Sony Corporation Image processing device and image processing method

Also Published As

Publication number Publication date
US20030223645A1 (en) 2003-12-04
US7289672B2 (en) 2007-10-30

Similar Documents

Publication Publication Date Title
USRE43567E1 (en) Methods and systems for image intra-prediction mode estimation
US10368066B2 (en) Methods and systems for image intra-prediction mode management
US7236524B2 (en) Methods and systems for image intra-prediction mode communication
EP1510078B1 (en) Methods and systems for image intra-prediction mode estimation, communication, and organization

Legal Events

Date Code Title Description
FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 8

AS Assignment

Owner name: DOLBY LABORATORIES LICENSING CORPORATION, CALIFORN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SHARP KABUSHIKI KAISHA;REEL/FRAME:036724/0111

Effective date: 20150929

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 12TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1553); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 12