WO2010111841A1 - Predicting method and apparatus for frequency domain pulse decoding and decoder - Google Patents

Info

Publication number
WO2010111841A1
WO2010111841A1 PCT/CN2009/071161 CN2009071161W WO2010111841A1 WO 2010111841 A1 WO2010111841 A1 WO 2010111841A1 CN 2009071161 W CN2009071161 W CN 2009071161W WO 2010111841 A1 WO2010111841 A1 WO 2010111841A1
Authority
WO
WIPO (PCT)
Prior art keywords
current frame
frame
block
spectrum
decoded
Prior art date
Application number
PCT/CN2009/071161
Other languages
French (fr)
Chinese (zh)
Inventor
苗磊
刘泽新
齐峰岩
胡晨
陈龙吟
郎玥
吴文海
塔迪·哈维·米希尔
张清
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Priority to PCT/CN2009/071161 (WO2010111841A1)
Priority to CN2009801486921A (CN102246229B)
Publication of WO2010111841A1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques

Abstract

A prediction method and a prediction apparatus for frequency domain pulse decoding, and a decoder, are provided. The method includes: dividing the current frame and a previous frame into spectrum blocks according to the spectral coefficients of the previous frame (11); determining, according to the correlation between the spectrum blocks of the current frame and those of the previous frame, whether a spectrum block of the current frame requires inter-frame prediction (12); and, for each spectrum block of the current frame determined to require inter-frame prediction, predicting the undecoded spectral coefficients of that block using the decoded spectral coefficients in the corresponding spectrum block of the previous frame and the decoded spectral coefficients of the current frame (13).

Description

Prediction method, prediction apparatus, and decoder for frequency domain pulse decoding

Technical field

The present invention relates to the field of audio coding and decoding, and in particular to a prediction method, a prediction apparatus, and a decoder for frequency domain pulse decoding.

Background
In current frequency domain pulse coding, when the number of available bits is insufficient, a subset of relatively important spectral coefficients is generally selected for encoding, according to spectral energy or to auditory characteristics. Spectral coefficients that were not encoded are generally filled with 0 when the spectrum is recovered at the decoder.

Because the relative energy and the auditory characteristics of the spectrum within a frame differ from frame to frame, there is no guarantee, at a given bit rate, that the spectral coefficients of the same frequency band are encoded in each of several consecutive frames. As a result, among the spectral coefficients recovered at the decoder, a given band yields decoded coefficients in some frames and only zero-filled values in others. This makes the spectral coefficients of that band discontinuous over time and leads to unsatisfactory auditory quality, especially for strongly harmonic signals.

In the prior art, the similarity between spectral coefficients of different bands in the frequency domain is exploited to perform intra-frame prediction using few or no bits, so that spectral coefficients that were not decoded are predicted from the spectral coefficients of other bands or frequency bins. For example, the widely used bandwidth extension (BWE) algorithm relies on the correlation between low and high frequencies to predict high-band spectral coefficients from low-band spectral coefficients, which widens the bandwidth of the output signal and thereby improves its auditory quality.

In the course of implementing the present invention, the inventors found that the spectral energy obtained by intra-frame prediction often deviates considerably in some bands. In particular, when both the low-frequency and the high-frequency harmonicity are strong, the predicted peak positions often deviate considerably from the true peak positions. This introduces additional noise into the audio signal and degrades its auditory quality.

Summary of the invention
In view of this, embodiments of the present invention provide a prediction method, a prediction apparatus, and a decoder for frequency domain pulse decoding that can better improve the auditory quality of the audio output signal.

The prediction method for frequency domain pulse decoding provided by an embodiment of the present invention includes the following steps:

dividing the current frame and a previous frame into spectrum blocks according to the spectral coefficients of the previous frame;

determining, according to the correlation between the spectrum blocks of the current frame and those of the previous frame, whether a spectrum block of the current frame requires inter-frame prediction;

for each spectrum block of the current frame determined to require inter-frame prediction, predicting the undecoded spectral coefficients of that block using the decoded spectral coefficients in the corresponding spectrum block of the previous frame and the decoded spectral coefficients of the current frame.

The prediction apparatus for frequency domain pulse decoding provided by an embodiment of the present invention includes:

a block dividing unit, configured to divide the current frame and a previous frame into spectrum blocks according to the spectral coefficients of the previous frame;

a determining unit, configured to determine, according to the correlation between the spectrum blocks of the current frame and those of the previous frame produced by the block dividing unit, whether a spectrum block of the current frame requires inter-frame prediction;

a prediction unit, configured to predict, for each spectrum block of the current frame that the determining unit has determined to require inter-frame prediction, the undecoded spectral coefficients of that block using the decoded spectral coefficients in the corresponding spectrum block of the previous frame and the decoded spectral coefficients of the current frame.

The decoder provided by an embodiment of the present invention includes the above prediction apparatus for frequency domain pulse decoding and a transformer. The prediction apparatus is configured to predict, for each spectrum block of the current frame determined to require inter-frame prediction, the undecoded spectral coefficients of that block using the decoded spectral coefficients in the corresponding spectrum block of the previous frame and the decoded spectral coefficients of the current frame. The transformer is configured to perform a frequency domain to time domain transform on the frame spectral coefficients output by the prediction apparatus and to output a time domain audio signal.

According to the technical solutions above, the current frame and the previous frame are first divided into spectrum blocks according to the spectral coefficients of the previous frame; whether each spectrum block of the current frame requires inter-frame prediction is then determined from the correlation between the spectrum blocks of the two frames; finally, for each spectrum block of the current frame determined to require inter-frame prediction, the undecoded spectral coefficients are predicted using the decoded spectral coefficients in the corresponding spectrum block of the previous frame and the decoded spectral coefficients of the current frame. Compared with prior-art intra-frame prediction, this makes the spectrum of the current frame smoother for strongly harmonic signals and reduces discontinuities, so that the result is closer to the true spectrum and the auditory quality of the audio output signal, especially of strongly harmonic output signals, is better improved.

Brief description of the drawings

The drawings needed for the description of the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
FIG. 1 is a flowchart of a prediction method for frequency domain pulse decoding according to Embodiment 1 of the present invention;
FIG. 2 is a flowchart of a prediction method for frequency domain pulse decoding according to Embodiment 2 of the present invention;
FIG. 3 is an example diagram of the spectrum information of a current frame and the immediately preceding frame according to an embodiment of the present invention;
FIG. 4 is a block diagram of an algorithm according to Embodiment 2 of the present invention;
FIG. 5 is a block diagram of another algorithm according to Embodiment 2 of the present invention;
FIG. 6 is a block diagram of an algorithm according to Embodiment 3 of the present invention;
FIG. 7 is a schematic structural diagram of a prediction apparatus for frequency domain pulse decoding according to Embodiment 4 of the present invention;
FIG. 8 is a block diagram of a decoder according to Embodiment 5 of the present invention.

Detailed description

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the present invention.
Embodiment 1

Referring to FIG. 1, the prediction method for frequency domain pulse decoding provided by this embodiment includes the following steps:

Step 11: divide the current frame and the previous frame into spectrum blocks according to the spectral coefficients of the previous frame.

Step 12: determine, according to the correlation between the spectrum blocks of the current frame and those of the previous frame, whether a spectrum block of the current frame requires inter-frame prediction.

Step 13: for each spectrum block of the current frame determined to require inter-frame prediction, predict the undecoded spectral coefficients of that block using the decoded spectral coefficients in the corresponding spectrum block of the previous frame and the decoded spectral coefficients of the current frame.

Once prediction has been completed for every spectrum block of the current frame that requires it, the predicted current frame can be output; subsequent decoding can then be applied to the current frame, and the audio signal is finally output.

In this method, the current frame and the previous frame are first divided into spectrum blocks according to the spectral coefficients of the previous frame; whether each spectrum block of the current frame requires inter-frame prediction is then determined from the correlation between the spectrum blocks of the two frames; finally, the undecoded spectral coefficients of each block that requires prediction are predicted from the decoded spectral coefficients in the corresponding block of the previous frame and the decoded spectral coefficients of the current frame. Compared with prior-art intra-frame prediction, this makes the spectrum of the current frame smoother for strongly harmonic signals and reduces discontinuities, so that the result is closer to the true spectrum and the auditory quality of the audio output signal, especially of strongly harmonic output signals, is better improved.
Embodiment 2

Referring to FIG. 2, the prediction method for frequency domain pulse decoding provided by this embodiment includes the following steps:

Step 21: for each frequency bin at which a spectral coefficient was decoded in the previous frame, delimit one spectrum block in both the current frame and the previous frame, centered on that bin and extending N bins on either side, where N > 1.

A spectrum block consists of the spectral coefficients of L consecutive frequency bins. One spectrum block is delimited around each bin at which the previous frame decoded a spectral coefficient, extending N bins on either side, so the number of spectrum blocks in the current frame equals the number of spectral coefficients decoded in the previous frame. L is the length of a spectrum block, that is, the number of spectral coefficients in each block, and L = 2N + 1. Referring to FIG. 3, assume that the band to be processed contains 64 frequency bins per frame. At the decoder, the current frame yields 4 nonzero decoded spectral coefficients, at bins 4, 15, 40, and 50 (solid vertical lines); the immediately preceding frame also yields 4 nonzero decoded spectral coefficients, at bins 3, 27, 40, and 50 (solid vertical lines). With N = 2, so that each block contains L = 5 spectral coefficients, four spectrum blocks to be processed are delimited in both the current frame and the preceding frame, one for each of the 4 bins at which the preceding frame decoded a nonzero coefficient (dashed boxes). In FIG. 3, the number of spectrum blocks delimited in the current frame and the previous frame therefore equals the number of spectral coefficients decoded in the previous frame, namely 4.

Note that if the previous frame decodes two or more spectral coefficients within a range of N adjacent bins, only one spectrum block is delimited for them, centered on any one of the bins at which those coefficients are located and extending N bins on either side. For example, if the previous frame decodes nonzero spectral coefficients at both bin 3 and bin 4 and N = 2, only one spectrum block is delimited in the current frame and the previous frame for bins 3 and 4, and that block may be centered on either bin 3 or bin 4. In this case the delimited spectrum blocks do not overlap, and the number of spectral coefficients decoded in the previous frame is greater than the number of spectrum blocks delimited in the current frame and the previous frame.
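The block delimitation of step 21 can be sketched as follows. This is a minimal illustration rather than the patent's implementation: the function name `partition_blocks`, the 0-based bin indexing, the clipping at the band edges, and the choice to center a merged block on the first of the close peaks are all assumptions made here.

```python
def partition_blocks(prev_bins, n, num_bins):
    """Delimit one spectrum block per decoded peak of the previous frame.

    prev_bins: bins at which the previous frame decoded a coefficient.
    n:         half-width of a block, so a full block has L = 2*n + 1 bins.
    num_bins:  number of bins in the band (assumed 0-based indexing).
    Returns a list of inclusive (low, high) bin ranges.
    """
    blocks = []
    for center in sorted(prev_bins):
        # Assumed merge rule: a peak that already lies inside the previous
        # block (within n bins of its center) does not open a new block.
        if blocks and center <= blocks[-1][1]:
            continue
        low = max(0, center - n)             # clip at the lower band edge
        high = min(num_bins - 1, center + n)  # clip at the upper band edge
        blocks.append((low, high))
    return blocks


# FIG. 3 example: previous-frame peaks at bins 3, 27, 40, 50 with N = 2.
print(partition_blocks([3, 27, 40, 50], n=2, num_bins=64))
```

With peaks at bins 3 and 4 and N = 2, the same function returns a single block, which matches the merging case described in the note above.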
Step 22: count how many decoded spectral coefficients of the current frame and the previous frame lie at bins spaced N or fewer bins apart, and determine whether this count is greater than or equal to M. If it is, the current frame requires inter-frame prediction; otherwise it does not. M is a preset value.

Note that the value of M depends on the number of decoded spectral coefficients: the more spectral coefficients are decoded, the larger M should be.

Still referring to FIG. 3, take N = 2 and M = 3. There are 3 positions at which the current frame and the preceding frame differ by N or less (3/4, 40/40, 50/50); this count is greater than or equal to M, so the current frame requires inter-frame prediction. Because the spectrum blocks were already delimited in step 21, step 22 can equivalently be decided as follows: if the number of spectrum blocks of the current frame that contain a decoded spectral coefficient is greater than or equal to the preset value M, the current frame requires inter-frame prediction; otherwise it does not.

Two points should be noted. First, the processing of the current frame need not refer only to the immediately preceding frame; information from several earlier frames may also be used. Second, when a previous frame has been lost, the spectral coefficients of the previous frame can all be set to 0, which guarantees that the first frame after the loss is not inter-frame predicted and thus avoids adverse effects. Alternatively, the spectral coefficients of the previous frame held at the time of the loss can be kept unchanged, and robustness is then ensured by the condition for inter-frame prediction itself (the number of decoded spectral coefficients of the current frame and of the retained previous frame whose positions are spaced N or less apart must be greater than or equal to M), so that no adverse effect arises.
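The frame-level decision of step 22 can be illustrated with a short sketch. The helper names `count_close_peaks` and `frame_needs_prediction` are hypothetical, and counting is done per previous-frame peak, which is one reasonable reading of the text (it reproduces the count of 3 in the FIG. 3 example).

```python
def count_close_peaks(cur_bins, prev_bins, n):
    """Count previous-frame peaks that have a current-frame peak
    at most n bins away (assumed reading of the step 22 criterion)."""
    return sum(
        1 for p in prev_bins
        if any(abs(p - c) <= n for c in cur_bins)
    )


def frame_needs_prediction(cur_bins, prev_bins, n, m):
    """Frame-level test: predict only if at least m peaks line up."""
    return count_close_peaks(cur_bins, prev_bins, n) >= m


# FIG. 3 example: the pairs 3/4, 40/40 and 50/50 are within N = 2 bins.
cur = [4, 15, 40, 50]
prev = [3, 27, 40, 50]
print(count_close_peaks(cur, prev, n=2))            # 3
print(frame_needs_prediction(cur, prev, n=2, m=3))  # True

# Frame-loss handling by zeroing the previous frame: with no decoded
# peaks in the history, the test fails and no prediction is performed.
print(frame_needs_prediction(cur, [], n=2, m=3))    # False
```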
Step 23: determine, block by block, whether each delimited spectrum block requires inter-frame prediction. If a spectrum block of the current frame contains no decoded spectral coefficient while the corresponding spectrum block of the previous frame does contain a decoded spectral coefficient, that spectrum block of the current frame requires inter-frame prediction.

Still referring to FIG. 3, consider the four spectrum blocks to be processed, delimited in the current frame and the previous frame around the 4 bins at which the preceding frame decoded a nonzero spectral coefficient. Only in the block around bin 27 did the preceding frame decode a spectral coefficient while the current frame decoded none in the corresponding range (27 ± 2). In the other blocks (those around bins 4, 40, and 50), the current frame has a decoded spectral coefficient. For FIG. 3, the spectrum block around bin 27 is therefore the one that requires inter-frame prediction.
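The per-block test of step 23 reduces to checking whether the current frame has any decoded coefficient inside each block. The sketch below is an illustration under assumptions: the function name `blocks_to_predict` is hypothetical, and each block is identified by the previous-frame peak at its center (merging of close peaks is omitted).

```python
def blocks_to_predict(cur_bins, prev_bins, n):
    """Return the previous-frame peak bins whose block (peak +/- n)
    contains no decoded coefficient in the current frame."""
    return [
        p for p in sorted(prev_bins)
        if not any(abs(p - c) <= n for c in cur_bins)
    ]


# FIG. 3 example: only the block around bin 27 is empty in the current frame.
print(blocks_to_predict([4, 15, 40, 50], [3, 27, 40, 50], n=2))  # [27]
```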
Step 24: for each spectrum block determined to require inter-frame prediction, compute a weighted sum of (a) the amplitude at the bin where the spectral coefficient was decoded in the corresponding block of the previous frame and (b) δ times the minimum amplitude among all decoded spectral coefficients of the current frame. The result is used as the amplitude of the undecoded spectral coefficient at the corresponding bin of the current frame's block, and the sign of that coefficient is the same as the sign of the previous frame's coefficient at the corresponding bin, where 0 ≤ δ ≤ 1.

The inter-frame prediction for the current spectrum block is given by equation (1):

$$cur\_spec = \alpha \cdot pre\_spec + \beta \cdot (\delta \cdot cur\_spec_{min}), \quad \alpha + \beta = 1, \quad 0 \le \delta \le 1 \qquad (1)$$

Here cur_spec is the predicted amplitude, cur_spec_min is the minimum amplitude among all decoded spectral coefficients of the current frame, and pre_spec is the amplitude at the bin where the spectral coefficient was decoded in the corresponding block of the previous frame. α and β are weighting coefficients that can be chosen according to the actual situation; for strongly harmonic signals, α = β = 0.5 and δ = 0.8 are preferred.

Still referring to FIG. 3, the spectral coefficient at bin 27 is predicted as the weighted sum of the amplitude of the preceding frame's coefficient at bin 27 and 0.8 times the minimum amplitude among all decoded spectral coefficients of the current frame; its sign is the same as the sign at bin 27 of the preceding frame. For example, if the amplitude at bin 27 of the preceding frame is 30, the amplitudes at the four decoded bins of the current frame are 25, 15, 20, and 21 in order, and α = β = 0.5, δ = 0.8, then equation (1) gives 0.5 × 30 + 0.5 × (0.8 × 15) = 21 as the amplitude at bin 27 of the current frame, with a positive sign, the same as at bin 27 of the preceding frame.
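Equation (1) and the worked example can be checked with a few lines of code. This is a minimal sketch: the function name `predict_coefficient` and the representation of coefficients as signed numbers are assumptions, and the default weights are the preferred values stated above.

```python
import math


def predict_coefficient(prev_coeff, cur_decoded, alpha=0.5, beta=0.5, delta=0.8):
    """Predict one undecoded coefficient following equation (1).

    prev_coeff:  signed coefficient decoded in the previous frame's block.
    cur_decoded: all signed coefficients decoded in the current frame.
    The predicted amplitude is alpha*|prev| + beta*(delta*min|cur|),
    and the sign is copied from the previous frame's coefficient.
    """
    cur_min = min(abs(c) for c in cur_decoded)
    amplitude = alpha * abs(prev_coeff) + beta * (delta * cur_min)
    return math.copysign(amplitude, prev_coeff)


# Worked example from the text: 0.5*30 + 0.5*(0.8*15) = 21, positive sign.
print(predict_coefficient(30, [25, 15, 20, 21]))
```

Using the minimum decoded amplitude of the current frame as the second term keeps the predicted value tied to the level of the current frame, so a strong peak in the previous frame is attenuated rather than copied unchanged.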
Note that in this embodiment the decision on whether processing is needed is made per spectrum block, not per frequency bin. The previous frame may have decoded a spectral coefficient at some bin where the current frame decoded none, while the current frame did decode a spectral coefficient at a nearby position; in that case no prediction is performed. In FIG. 3, for example, the preceding frame decoded a spectral coefficient at bin 3 and the current frame did not, but the current frame decoded a spectral coefficient at bin 4, so no inter-frame prediction is needed for bin 3 of the current frame.

Note also that if two or more spectral coefficients were decoded in the same spectrum block of the previous frame and no spectral coefficient was decoded in the corresponding block of the current frame, a spectral prediction is performed for the current frame at each of the bins at which those coefficients of the previous frame are located.
Step 25: output the spectral coefficients of the current frame after prediction, and save the spectral coefficients of the current frame before prediction as the previous-frame information for the next frame.

After step 24, the current frame has a predicted spectral coefficient at bin 27 and therefore 5 nonzero spectral coefficients, at bins 4, 15, 27, 40, and 50. These post-prediction spectral coefficients are output, while the pre-prediction spectral coefficients of the current frame (at bins 4, 15, 40, and 50) are saved as the previous-frame information for the next frame.

This completes the processing flow for predicting the current frame in frequency domain pulse decoding.
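The bookkeeping of step 25 can be sketched as follows, assuming each frame is held as a dictionary mapping a bin index to its coefficient. The function name `finish_frame` and this data layout are illustrative assumptions, not part of the source.

```python
def finish_frame(decoded, predicted):
    """Combine decoded and predicted coefficients for output, and keep
    only the decoded ones as history for the next frame.

    decoded:   {bin: coefficient} actually decoded in the current frame.
    predicted: {bin: coefficient} filled in by inter-frame prediction.
    Returns (output, history).
    """
    output = {**decoded, **predicted}  # what is passed on for synthesis
    history = dict(decoded)            # pre-prediction state for next frame
    return output, history


decoded = {4: 25, 15: 15, 40: 20, 50: 21}
output, history = finish_frame(decoded, {27: 21.0})
print(sorted(output))   # [4, 15, 27, 40, 50]
print(sorted(history))  # [4, 15, 40, 50]
```

One plausible reason for saving the pre-prediction state is that a predicted coefficient then cannot itself become the source of a prediction in the next frame; the source states the rule but not this rationale.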
The block diagram of the algorithm of this embodiment is shown in FIG. 4. From the spectral coefficients decoded in the current frame and the previous frame, the number of spectrum blocks to be processed is determined first; k denotes the index of the spectrum block and is initialized to 0. For the frame information shown in FIG. 3, the number of spectrum blocks to be processed equals the number of spectral coefficients decoded in the preceding frame, namely 4. It is then determined whether the number of positions, over all corresponding blocks, at which the decoded spectral coefficients of the current frame and the previous frame are spaced N or less apart is greater than or equal to M. If this condition is met, inter-frame prediction is performed on the current frame and the pre-prediction spectral coefficients of the current frame are saved as the previous-frame information for the next frame. Otherwise the current frame is left unprocessed, and only its pre-prediction spectral coefficients are saved as the previous-frame information for the next frame.

For FIG. 3, take N = 2 and M = 3. The previous frame decoded a spectral coefficient at bin 27, and the current frame decoded none within the corresponding spectrum block (27 ± N). In addition, there are 3 positions at which the current frame and the previous frame differ by N or less (3/4, 40/40, 50/50), which is greater than or equal to M = 3. The amplitude of the previous frame's spectral coefficient at bin 27 and δ = 0.8 times the minimum amplitude among all decoded spectral coefficients of the current frame are therefore combined in a weighted sum, and the result is used as the amplitude of the current frame's spectral coefficient at bin 27, with the same sign as at bin 27 of the previous frame.
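The flow of FIG. 4 (frame-level test first, block-level prediction second) can be put together in one function. This is a sketch under stated assumptions: frames are dictionaries mapping bin index to signed coefficient, each previous-frame peak defines its own block (merging of close peaks is omitted), and the name `predict_frame` as well as the previous-frame values at bins 3, 40, and 50 in the example are hypothetical.

```python
import math


def predict_frame(cur, prev, n=2, m=3, alpha=0.5, beta=0.5, delta=0.8):
    """Decide at frame level first, then predict block by block.

    cur, prev: {bin: signed coefficient} decoded in each frame.
    Returns (output, history); history is always the pre-prediction frame.
    """
    history = dict(cur)
    if not cur or not prev:
        return dict(cur), history

    # Frame-level test: enough previous peaks have a close current peak.
    close = sum(1 for p in prev if any(abs(p - c) <= n for c in cur))
    if close < m:
        return dict(cur), history

    cur_min = min(abs(v) for v in cur.values())
    output = dict(cur)
    for p, coeff in prev.items():
        if any(abs(p - c) <= n for c in cur):
            continue  # block already has a decoded coefficient
        amplitude = alpha * abs(coeff) + beta * (delta * cur_min)
        output[p] = math.copysign(amplitude, coeff)
    return output, history


cur = {4: 25, 15: 15, 40: 20, 50: 21}
prev = {3: 18, 27: 30, 40: 22, 50: 19}  # values at 3, 40, 50 are made up
output, history = predict_frame(cur, prev)
print(sorted(output), output[27])
```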
需要说明的是, 当大多数帧都能满足帧间预测处理条件时, 在判断当前 帧是否做帧间预测时, 可以在处理完所有频谱块后再进行判断, 而不是在进 行频谱块帧间预测处理之前判断, 这样可以节省算法的平均复杂度。  It should be noted that when most frames can satisfy the inter prediction processing condition, when judging whether the current frame is inter-predicted, it may be determined after all the spectrum blocks are processed, instead of being between the spectrum block frames. Judging before the prediction process, this saves the average complexity of the algorithm.
As shown in FIG. 5, this embodiment may alternatively use another algorithm flow, in which whether the current frame undergoes inter-frame prediction is known only after all spectral blocks have been processed. The specific processing is as follows:
First, a counter i, which counts the blocks in which the positions of the spectral coefficients decoded in the current and previous frames differ by at most N, is initialized to 0; the index k of the spectral block to be processed is initialized to 0; and the maximum number of spectral blocks to be processed is determined. Regardless of whether the current frame satisfies the condition for inter-frame prediction, the current frame is provisionally passed through the algorithm. For each spectral block to be processed: if a spectral coefficient was decoded in the block of the previous frame but none was decoded in the corresponding block of the current frame (the block containing bin 27 in FIG. 3 satisfies this condition), the weighted sum of the amplitude of the spectral coefficient at bin 27 of the previous frame and δ = 0.8 times the minimum amplitude among all decoded spectral coefficients of the current frame is used as the amplitude of the spectral coefficient at bin 27 of the current frame, with the sign kept the same as at bin 27 of the previous frame. If spectral coefficients were decoded in both the current-frame block and the corresponding previous-frame block, it is determined whether the spacing between the current-frame position and the previous-frame position within the block is at most N = 2; if so, i is incremented by one, otherwise i is unchanged. This continues until all spectral blocks have been processed.
After all spectral blocks have been processed, it is determined whether i is greater than or equal to M. If so, the current frame satisfies the processing condition: the predicted spectral coefficients of the current frame are output, and the spectral coefficients before prediction are saved as the previous-frame spectral coefficients for the next frame. Otherwise, the current frame does not satisfy the processing condition: the current-frame spectral coefficients are restored to their values before prediction and output, and those pre-prediction coefficients are saved as the previous-frame spectral coefficients for the next frame.
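The predict-first, decide-afterwards flow of FIG. 5 can be sketched in Python as follows. This is only an illustrative sketch: frames are represented as dictionaries from bin index to coefficient, one block of ±N bins is assumed per previous-frame pulse, and the weights `w_prev` and `w_cur` of the weighted sum are hypothetical parameters, since this passage does not state their values.

```python
import math


def predict_with_rollback(prev, cur, n=2, m=3, delta=0.8, w_prev=0.5, w_cur=0.5):
    """Predict provisionally, then keep or discard the result (FIG. 5 style flow).

    prev, cur: dicts mapping bin index -> decoded coefficient.
    w_prev, w_cur: hypothetical weights of the weighted sum (assumed values).
    Returns (output_frame, saved_frame).
    """
    saved = dict(cur)                 # pre-prediction spectrum, kept for the next frame
    out = dict(cur)
    min_amp = min((abs(v) for v in cur.values()), default=0.0)
    i = 0                             # number of blocks with closely spaced pulses
    for p, prev_val in prev.items():  # one block (p - n .. p + n) per previous pulse
        if any(abs(c - p) <= n for c in cur):
            i += 1                    # both frames have a pulse in this block
        else:
            amp = w_prev * abs(prev_val) + w_cur * delta * min_amp
            out[p] = math.copysign(amp, prev_val)  # sign follows the previous frame
    if i < m:
        out = dict(saved)             # condition not met: restore the original spectrum
    return out, saved


prev_frame = {3: 0.9, 27: -0.6, 40: 1.2, 50: 0.7}   # amplitudes are placeholders
cur_frame = {4: 1.0, 40: 1.1, 50: -0.8}
output, saved = predict_with_rollback(prev_frame, cur_frame)
print(sorted(output))  # [4, 27, 40, 50]
```

In both branches the saved frame is the spectrum before prediction, so a predicted coefficient is never fed into the prediction of the following frame.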
Embodiment 3
This embodiment of the present invention provides another inter-frame prediction method for frequency-domain pulse decoding. It differs from Embodiment 2 in how it decides whether the current frame undergoes inter-frame prediction: this embodiment bases the decision on the correlation of the energy of the spectral coefficients decoded in corresponding spectral blocks of the current and previous frames, whereas Embodiment 2 bases it on the number of blocks in which the positions of the spectral coefficients decoded in the current and previous frames differ by at most N (that is, on the correlation of the coefficient positions).
Specifically, when determining whether the current frame needs inter-frame prediction, this embodiment uses the energy information of all spectral coefficients decoded in each spectral block of the current frame and the previous frame: if the number of spectral blocks with comparable energy is greater than or equal to a preset value, the current frame is judged to need inter-frame prediction. The energy of a spectral block may be expressed as the sum of the squared amplitudes of the spectral coefficients in the block, as the root mean square of those amplitudes, as the sum of the amplitudes, and so on. One formula for the block energy, based on the sum of squared amplitudes, is:
$$\mathrm{ener}[k] = \sum_{j=0}^{L-1} \mathrm{spec}[k+j] \cdot \mathrm{spec}[k+j] \qquad (2)$$
where k is the index of the spectral block and L is the number of spectral coefficients in the spectral block.
Comparable energy means that the energy ratio between the spectral block of the previous frame and that of the current frame lies within the range [1/E, E] (for E < 1, the interval between E and 1/E). For example, E may be 0.8, which corresponds to a ratio between 0.8 and 1.25; in general, E is chosen fairly close to 1 to ensure the accuracy of the prediction.
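A short Python sketch of the block energy and of the comparability test may make the definition concrete. It is written under stated assumptions: `start` is taken to be the first bin index of the block, a block with zero energy is treated as not comparable, and the interval is evaluated as the range between E and 1/E regardless of which is larger. The function names are hypothetical.

```python
def block_energy(spec, start, length):
    """Sum of squared coefficients over `length` bins beginning at bin `start`."""
    return sum(spec[start + j] ** 2 for j in range(length))


def energies_comparable(e_prev, e_cur, e=0.8):
    """True if e_cur / e_prev lies between e and 1/e (order-independent)."""
    if e_prev <= 0 or e_cur <= 0:
        return False                  # assumption: an empty block is never comparable
    lo, hi = sorted((e, 1.0 / e))
    return lo <= e_cur / e_prev <= hi


print(block_energy([1.0, 2.0, 3.0, 4.0], start=1, length=2))  # 13.0
print(energies_comparable(1.0, 1.2))                          # True
print(energies_comparable(1.0, 2.0))                          # False
```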
In this embodiment, the flow for deciding whether the current frame undergoes inter-frame prediction may be similar to the algorithm flow of FIG. 4 or FIG. 5. When most frames satisfy the condition for inter-frame prediction, the decision may be made after all spectral blocks have been processed, which reduces the average complexity of the algorithm. Referring to FIG. 6, the specific processing is as follows:
First, a counter i, which counts the spectral blocks in which the energies of the spectral coefficients decoded in the current and previous frames are comparable, is initialized to 0; the index k of the spectral block to be processed is initialized to 0; and the maximum number of spectral blocks to be processed is determined. Then, regardless of whether the current frame satisfies the condition for inter-frame prediction, the current frame is provisionally passed through the algorithm. For each spectral block: if a spectral coefficient was decoded in the block of the previous frame but none in the block of the current frame (as for the block containing bin 27 in FIG. 3), the weighted sum of the amplitude of the spectral coefficient at bin 27 of the previous frame and δ = 0.8 times the minimum amplitude among all decoded spectral coefficients of the current frame is used as the amplitude of the spectral coefficient at bin 27 of the current frame, with the sign kept the same as at bin 27 of the previous frame. If spectral coefficients were decoded in both the current-frame block and the previous-frame block, it is determined whether the energies of the spectral coefficients decoded in that block in the current and previous frames are comparable; if so, i is incremented by one, otherwise i is unchanged. This continues until all spectral blocks have been processed.
After all spectral blocks have been processed, it is determined whether i is greater than or equal to M, where M is a preset value. If so, the current frame satisfies the processing condition: the predicted spectral coefficients of the current frame are output, and the spectral coefficients before prediction are saved as the previous-frame spectral coefficients for the next frame. Otherwise, the current frame does not satisfy the processing condition: the current-frame spectral coefficients are restored to their values before prediction and output, and those pre-prediction coefficients are saved as the previous-frame spectral coefficients for the next frame.
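The energy-based counting step can be sketched in Python as below. The sketch assumes that each frame is stored as a dense list in which undecoded bins hold 0.0, that each block spans ±n bins around a given centre (so the block length is 2n + 1), and that blocks are clipped at the spectrum edges. These choices and all names are illustrative assumptions rather than details taken from the text.

```python
def count_comparable_blocks(prev_spec, cur_spec, centers, n=1, e=0.8):
    """Count blocks whose current/previous energy ratio lies between e and 1/e."""
    lo, hi = sorted((e, 1.0 / e))
    count = 0
    for c in centers:
        a, b = max(0, c - n), min(len(prev_spec), c + n + 1)  # clipped block bounds
        e_prev = sum(x * x for x in prev_spec[a:b])
        e_cur = sum(x * x for x in cur_spec[a:b])
        # Only blocks with decoded coefficients in both frames are compared.
        if e_prev > 0 and e_cur > 0 and lo <= e_cur / e_prev <= hi:
            count += 1
    return count


# Placeholder spectra: three previous-frame pulses at bins 1, 5 and 9.
prev_spec = [0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.5, 0.0]
cur_spec = [0.0, 0.9, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.6, 0.0, 0.0]
i = count_comparable_blocks(prev_spec, cur_spec, centers=[1, 5, 9])
print(i)  # 1: only the block around bin 1 has comparable energy
```

The frame would then be judged to need inter-frame prediction when this count reaches the preset value M.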
It should be noted that the number L of spectral coefficients per spectral block in this embodiment is preferably smaller than the number L used in Embodiment 2. When energy-comparable blocks are counted and the blocks are large, the correlation between two consecutive frames may be low even though the energies of the spectral coefficients in the blocks are comparable. Therefore, if the block length in Embodiment 2 is L = 5, the block length in this embodiment is preferably at most 3, that is, L ≤ 3.
It can be understood that, when determining from the inter-frame correlation whether the current frame needs inter-frame prediction, the decision based on the correlation of the energy of the spectral coefficients decoded in the spectral blocks of the current and previous frames may also be combined with the decision based on the number of spectral blocks of the current frame in which spectral coefficients were decoded. For the specific inter-frame prediction flow, refer to the description of the above embodiments; details are not repeated here.
In the above method embodiments of the present invention, the current frame and the previous frame are first divided into spectral blocks according to the spectral coefficients of the previous frame, and whether the current frame needs inter-frame prediction is determined from the spectral blocks of the two frames. For a current frame judged to need inter-frame prediction, it is then determined for each spectral block whether that block needs inter-frame prediction: if no spectral coefficient was decoded in a block of the current frame while a spectral coefficient was decoded in the corresponding block of the previous frame, that block of the current frame is judged to need inter-frame prediction. Finally, for each such block, a weighted sum is formed of the amplitude at the frequency bin of the spectral coefficient decoded in the corresponding previous-frame block and δ (0 ≤ δ ≤ 1) times the minimum amplitude among all spectral coefficients decoded in the current frame. The result is used as the amplitude at the corresponding frequency bin of the undecoded spectral coefficient in the current-frame block, and the sign of the spectral coefficient is the same as that at the corresponding bin of the previous frame. Whether the current frame needs inter-frame prediction may be determined from the spacing of the frequency bins of all spectral coefficients decoded in the current and previous frames, and/or from the energy of the spectral blocks containing those coefficients. Compared with prior-art intra-frame prediction methods, for strongly harmonic signals this makes the spectrum of the current frame smoother and reduces discontinuities, so that the spectrum is closer to the true spectrum and the perceived quality of the audio output, especially of strongly harmonic signals, is improved.
Embodiment 4
As shown in FIG. 7, corresponding to the above method embodiments, an embodiment of the present invention provides a prediction apparatus for frequency-domain pulse decoding, comprising:
a block division unit 71, configured to divide the current frame and the previous frame into spectral blocks according to the spectral coefficients of the previous frame;
a judging unit 70, configured to determine, from the spectral blocks of the current frame and the previous frame divided by the block division unit 71, whether a spectral block of the current frame needs inter-frame prediction;
a prediction unit 74, configured to predict, for a current-frame spectral block that the judging unit 70 has judged to need inter-frame prediction, the undecoded spectral coefficients in that block by using the spectral coefficients decoded in the corresponding spectral block of the previous frame and the spectral coefficients already decoded in the current frame.
The block division unit 71 comprises:
a first division module 711, configured to, for the frequency bin of each spectral coefficient decoded in the previous frame, mark out one spectral block in each of the current frame and the previous frame, centred on that bin and extending N bins on either side, where N ≥ 1. In this case the number of spectral blocks into which the current frame is divided equals the number of spectral coefficients decoded in the previous frame. And/or,
a second division module 712, configured to, if the previous frame has two or more spectral coefficients decoded within a range of N adjacent bins, select any one of the bins of those coefficients and mark out one spectral block in each of the current frame and the previous frame, centred on the selected bin and extending N bins on either side, where N ≥ 1. In this case the spectral blocks do not overlap one another, and the number of spectral coefficients decoded in the previous frame is greater than the number of spectral blocks into which the current frame is divided.
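The two division modes can be sketched in Python as follows. This is a minimal sketch under assumptions: blocks are returned as inclusive (first_bin, last_bin) pairs, they are clipped to the valid bin range, and in the second mode the lowest bin of each cluster is the one selected (the text allows any bin of the cluster). All names are hypothetical.

```python
def partition_per_pulse(prev_positions, n, num_bins):
    """First mode: one block of +/- n bins around every previous-frame pulse."""
    return [(max(0, p - n), min(num_bins - 1, p + n)) for p in sorted(prev_positions)]


def partition_clustered(prev_positions, n, num_bins):
    """Second mode: pulses within n bins of a chosen centre share one block."""
    centers = []
    for p in sorted(prev_positions):
        if not centers or p - centers[-1] > n:   # start a new cluster
            centers.append(p)                    # assumption: keep the lowest bin
    return partition_per_pulse(centers, n, num_bins)


print(partition_per_pulse([3, 27, 40, 50], n=2, num_bins=64))
# [(1, 5), (25, 29), (38, 42), (48, 52)]
print(partition_clustered([10, 11, 30], n=2, num_bins=64))
# [(8, 12), (28, 32)]
```

In the second call, three decoded pulses yield only two blocks, which matches the remark that the number of decoded coefficients then exceeds the number of blocks.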
The judging unit 70 comprises:
a frame judging subunit 72, configured to determine, from the spectral blocks of the current frame and the previous frame divided by the block division unit 71, whether the current frame needs inter-frame prediction;
a block judging subunit 73, configured to determine, for a current frame that the frame judging subunit 72 has judged to need inter-frame prediction, whether a spectral block of the current frame needs inter-frame prediction.
The frame judging subunit 72 comprises:
a position judging module 721, configured to make the determination from the number of spectral blocks of the current frame in which spectral coefficients were decoded: if that number is greater than or equal to a preset value, the current frame is judged to need inter-frame prediction; and/or,
an energy judging module 722, configured to make the determination from the energy information of the spectral blocks of the current frame in which spectral coefficients were decoded and of the corresponding spectral blocks of the previous frame: if the number of corresponding spectral blocks with comparable energy is greater than or equal to a preset value, the current frame is judged to need inter-frame prediction.
The block judging subunit 73 may be specifically configured to make the determination from whether spectral coefficients were decoded in corresponding spectral blocks of the current frame and the previous frame: if no spectral coefficient was decoded in a spectral block of the current frame while a spectral coefficient was decoded in the corresponding spectral block of the previous frame, that spectral block of the current frame is judged to need inter-frame prediction.
The prediction unit 74 may be specifically configured to form a weighted sum of the amplitude at the frequency bin of the spectral coefficient decoded in the corresponding spectral block of the previous frame and δ times the minimum amplitude among all spectral coefficients already decoded in the current frame, and to use the result as the amplitude at the corresponding frequency bin of the undecoded spectral coefficient in the current-frame spectral block; the sign of the spectral coefficient is the same as that at the corresponding bin of the previous frame, where 0 ≤ δ ≤ 1.
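The computation attributed to the prediction unit can be sketched as a single Python function. The weights `w_prev` and `w_cur` of the weighted sum are hypothetical parameters (this passage does not give them), and the function name is likewise an assumption; only δ and its bounds come from the text.

```python
import math


def predict_coefficient(prev_value, cur_values, delta=0.8, w_prev=0.5, w_cur=0.5):
    """Predict one undecoded coefficient of the current frame.

    prev_value: coefficient decoded in the corresponding previous-frame block.
    cur_values: all coefficients already decoded in the current frame (non-empty).
    w_prev, w_cur: hypothetical weights of the weighted sum (assumed values).
    """
    if not 0.0 <= delta <= 1.0:
        raise ValueError("delta must satisfy 0 <= delta <= 1")
    min_amp = min(abs(v) for v in cur_values)      # smallest decoded amplitude
    amp = w_prev * abs(prev_value) + w_cur * delta * min_amp
    return math.copysign(amp, prev_value)          # sign follows the previous frame


# Placeholder values: previous-frame coefficient -0.6, current-frame pulses as listed.
predicted = predict_coefficient(-0.6, [1.0, 1.1, -0.8])
print(predicted < 0)  # True: the sign is inherited from the previous frame
```

Scaling the minimum current-frame amplitude by δ ≤ 1 keeps the predicted amplitude from being dominated by the current frame's decoded pulses; the exact balance depends on the weights, which are assumed here.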
It can be understood that the apparatus provided by this embodiment of the present invention may further comprise:
an output and saving unit 75, configured to output the spectral coefficients of the current frame after prediction, and to save the spectral coefficients of the current frame before prediction as the previous-frame information for the next frame.
In the prediction apparatus for frequency-domain pulse decoding provided by this embodiment of the present invention, the frame judging subunit first determines whether the current frame needs inter-frame prediction. For a current frame judged to need inter-frame prediction, the block judging subunit then determines whether each divided spectral block needs inter-frame prediction. Finally, for a spectral block judged to need inter-frame prediction, the prediction unit predicts the undecoded spectral coefficients in the current-frame block by using the spectral coefficients decoded in the corresponding block of the previous frame and the spectral coefficients already decoded in the current frame. Compared with prior-art intra-frame prediction methods, for strongly harmonic signals this makes the spectrum of the current frame smoother and reduces discontinuities, so that the spectrum is closer to the true spectrum and the perceived quality of the audio output, especially of strongly harmonic signals, is improved.
Embodiment 5
As shown in FIG. 8, an embodiment of the present invention further provides a decoder, comprising a prediction apparatus 81 for frequency-domain pulse decoding and a transformer 82.
The prediction apparatus 81 for frequency-domain pulse decoding is configured to predict, for a current-frame spectral block judged to need inter-frame prediction, the undecoded spectral coefficients in that block by using the spectral coefficients decoded in the corresponding spectral block of the previous frame and the spectral coefficients already decoded in the current frame.
The transformer 82 is configured to perform a frequency-domain to time-domain transform on the frame spectral coefficients predicted by the prediction apparatus 81, and to output a time-domain audio signal.
For further details of the prediction apparatus 81, refer to the above method embodiments and to the prediction apparatus for frequency-domain pulse decoding in Embodiment 4; they are not repeated here.
Those skilled in the art will further appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two. To illustrate the interchangeability of hardware and software clearly, the composition and steps of each example have been described above in general terms of their functions. Whether these functions are performed in hardware or in software depends on the particular application and the design constraints of the technical solution. A skilled person may implement the described functions in different ways for each particular application, but such implementations should not be considered to go beyond the scope of the present invention.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be implemented in hardware, in a software module executed by a processor, or in a combination of the two. The software module may reside in random access memory (RAM), main memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, a hard disk, a removable disk, a CD-ROM, or any other form of storage medium.
The above are merely preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims

1. A prediction method for frequency-domain pulse decoding, characterized by comprising:
dividing a current frame and a previous frame into spectral blocks according to the spectral coefficients of the previous frame;
determining, according to the correlation between the spectral blocks of the current frame and of the previous frame, whether a spectral block of the current frame needs inter-frame prediction;
for a current-frame spectral block judged to need inter-frame prediction, predicting the undecoded spectral coefficients in the current-frame spectral block by using the spectral coefficients decoded in the corresponding spectral block of the previous frame and the spectral coefficients already decoded in the current frame.
2. The method according to claim 1, characterized in that the step of dividing the current frame and the previous frame into spectral blocks according to the spectral coefficients of the previous frame comprises:
for the frequency bin of a spectral coefficient decoded in the previous frame, marking out one spectral block in each of the current frame and the previous frame, centred on that bin and extending N bins on either side; and/or,
if the previous frame has two or more spectral coefficients decoded within a range of N adjacent bins, selecting any one of the bins of the two or more spectral coefficients, and marking out one spectral block in each of the current frame and the previous frame, centred on the selected bin and extending N bins on either side;
where N ≥ 1.
3. The method according to claim 1 or 2, characterized in that the step of determining, according to the correlation between the spectral blocks of the current frame and of the previous frame, whether a spectral block of the current frame needs inter-frame prediction comprises:
determining, according to the correlation between the spectral blocks of the current frame and of the previous frame, whether the current frame needs inter-frame prediction;
for a current frame judged to need inter-frame prediction, determining one by one whether each spectral block of the current frame needs inter-frame prediction.
4. The method according to claim 3, characterized in that the step of determining, according to the correlation between the spectral blocks of the current frame and of the previous frame, whether a spectral block of the current frame needs inter-frame prediction comprises:
if the number of spectral blocks of the current frame in which spectral coefficients were decoded is greater than or equal to a preset value, judging that the current frame needs inter-frame prediction; and/or,
if the number of spectral blocks of the current frame in which spectral coefficients were decoded and whose energy is comparable to that of the corresponding spectral blocks of the previous frame is greater than or equal to a preset value, judging that the current frame needs inter-frame prediction.
5. The method according to claim 3, characterized in that the step of determining whether a spectral block of the current frame needs inter-frame prediction comprises:
if no spectral coefficient was decoded in a spectral block of the current frame while a spectral coefficient was decoded in the corresponding spectral block of the previous frame, judging that this spectral block of the current frame needs inter-frame prediction.
6. The method according to claim 5, characterized in that the step of predicting, for a current-frame spectral block judged to need inter-frame prediction, the undecoded spectral coefficients in the current-frame spectral block by using the spectral coefficients decoded in the corresponding spectral block of the previous frame and the spectral coefficients already decoded in the current frame comprises:
forming a weighted sum of the amplitude at the frequency bin of the spectral coefficient decoded in the corresponding spectral block of the previous frame and δ times the minimum amplitude among all spectral coefficients already decoded in the current frame, and using the result of the weighted sum as the amplitude at the corresponding frequency bin of the undecoded spectral coefficient in the current-frame spectral block, the sign of the spectral coefficient being the same as that at the corresponding bin of the previous frame, where 0 ≤ δ ≤ 1.
7. The method according to claim 1, characterized in that the method further comprises:
outputting the spectral coefficients of the current frame after prediction, and saving the spectral coefficients of the current frame before prediction as the previous-frame information for the next frame.
8. A prediction apparatus for frequency-domain pulse decoding, characterized by comprising:
a block division unit, configured to divide a current frame and a previous frame into spectral blocks according to the spectral coefficients of the previous frame;
a judging unit, configured to determine, according to the correlation between the spectral blocks of the current frame and of the previous frame divided by the block division unit, whether a spectral block of the current frame needs inter-frame prediction;
a prediction unit, configured to predict, for a current-frame spectral block that the judging unit has judged to need inter-frame prediction, the undecoded spectral coefficients in the current-frame spectral block by using the spectral coefficients decoded in the corresponding spectral block of the previous frame and the spectral coefficients already decoded in the current frame.
9. The apparatus according to claim 8, wherein the block dividing unit comprises:
a first dividing module, configured to, for each frequency bin at which the previous frame has a decoded spectral coefficient, delimit one spectral block in each of the current frame and the previous frame, centered on that frequency bin and spanning N frequency bins on each side, where N > 1; and/or
a second dividing module, configured to, if the previous frame has two or more decoded spectral coefficients within a range of N adjacent frequency bins, select any one of the frequency bins at which those coefficients are located, and delimit one spectral block in each of the current frame and the previous frame, centered on the selected frequency bin and spanning N frequency bins on each side, where N > 1.
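The block division described in this claim can be sketched roughly as follows. This is a minimal illustration, not the applicant's implementation. It assumes that an undecoded coefficient is stored as 0, that blocks are clipped at the frame edges, that "within a range of N adjacent frequency bins" means a bin distance smaller than N, and that the first pulse of a cluster is taken as the center (the claim allows any of them). The name `divide_blocks` is hypothetical.

```python
def divide_blocks(prev_coeffs, n):
    """Return inclusive (start, end) bin ranges, one per spectral block.

    prev_coeffs: previous-frame coefficients, 0 meaning "not decoded" (assumption).
    n: number of bins kept on each side of the block center.
    The same ranges are meant to be applied to both the previous and the current frame.
    """
    length = len(prev_coeffs)
    pulses = [k for k, c in enumerate(prev_coeffs) if c != 0]
    blocks = []
    last_center = None
    for k in pulses:
        # Pulses closer than n bins to the last chosen center share its block
        # (second dividing module); the first pulse of the cluster is the center.
        if last_center is not None and k - last_center < n:
            continue
        last_center = k
        blocks.append((max(0, k - n), min(length - 1, k + n)))
    return blocks
```

Because the ranges are derived only from the previous frame, the decoder can compute them without any side information from the encoder.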
10. The apparatus according to claim 8 or 9, wherein the determining unit comprises:
a frame determining subunit, configured to determine, according to the correlation between the spectral blocks of the current frame and of the previous frame divided by the block dividing unit, whether the current frame requires inter-frame prediction; and
a block determining subunit, configured to, for a current frame that the frame determining subunit has determined to require inter-frame prediction, determine block by block whether each spectral block of the current frame requires inter-frame prediction.
11. The apparatus according to claim 10, wherein the frame determining subunit comprises:
a position determining module, configured to make the determination according to the number of spectral blocks in the current frame that contain decoded spectral coefficients, wherein if that number is greater than or equal to a preset value, the current frame is determined to require inter-frame prediction; and/or
an energy determining module, configured to make the determination according to energy information of the spectral blocks in the current frame that contain decoded spectral coefficients and of the corresponding spectral blocks in the previous frame, wherein if the number of corresponding spectral blocks with comparable energy is greater than or equal to a preset value, the current frame is determined to require inter-frame prediction.
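The two frame-level criteria in this claim might be sketched as below. The claim does not define "comparable energy", so the ratio test used here is purely an assumption, as are the function names, the `min_blocks` thresholds, and the convention that an undecoded coefficient is stored as 0.

```python
def block_energy(coeffs, block):
    """Sum of squared coefficients over an inclusive (start, end) bin range."""
    start, end = block
    return sum(c * c for c in coeffs[start:end + 1])


def position_criterion(cur, blocks, min_blocks):
    """True if enough current-frame blocks contain a decoded coefficient."""
    hits = sum(1 for b in blocks if block_energy(cur, b) > 0)
    return hits >= min_blocks


def energy_criterion(cur, prev, blocks, min_blocks, ratio=2.0):
    """True if enough block pairs have comparable energy.

    "Comparable" is taken here to mean that the energy ratio lies within
    [1/ratio, ratio]; this definition is an assumption, not from the claim.
    """
    similar = 0
    for b in blocks:
        e_cur, e_prev = block_energy(cur, b), block_energy(prev, b)
        if e_cur > 0 and e_prev > 0 and 1.0 / ratio <= e_cur / e_prev <= ratio:
            similar += 1
    return similar >= min_blocks
```

Since the claim joins the two modules with "and/or", a caller could use either test alone or combine them; the sketch leaves that choice open.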
12. The apparatus according to claim 11, wherein the block determining subunit is specifically configured to make the determination according to whether the corresponding spectral blocks of the current frame and the previous frame contain decoded spectral coefficients, wherein if a spectral block of the current frame contains no decoded spectral coefficient while the corresponding spectral block of the previous frame does, that spectral block of the current frame is determined to require inter-frame prediction.
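The block-level test in this claim reduces to a simple presence check. A minimal sketch, assuming that an undecoded coefficient is stored as 0 and that blocks are inclusive (start, end) bin ranges; the name `blocks_to_predict` is hypothetical:

```python
def blocks_to_predict(cur, prev, blocks):
    """Return the blocks that are empty in the current frame but not in the previous one."""
    selected = []
    for start, end in blocks:
        cur_has = any(c != 0 for c in cur[start:end + 1])
        prev_has = any(c != 0 for c in prev[start:end + 1])
        # Predict only where the previous frame had a pulse and the
        # current frame decoded nothing in the same bin range.
        if prev_has and not cur_has:
            selected.append((start, end))
    return selected
```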
13. The apparatus according to claim 12, wherein the prediction unit is specifically configured to compute a weighted sum of the amplitude at the frequency bin of a decoded spectral coefficient in the corresponding spectral block of the previous frame and a multiple of the minimum amplitude among all decoded spectral coefficients of the current frame, and to use the result of the weighted sum as the amplitude at the corresponding frequency bin of the undecoded spectral coefficient in the spectral block of the current frame, the sign of the spectral coefficient being the same as the sign at the corresponding frequency bin of the previous frame, where the multiplier lies between 0 and 1 inclusive.
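The weighted-sum prediction in this claim could look roughly like the following. The claim does not state the weights, so `w_prev` and `w_min` are hypothetical parameters with arbitrary defaults; `factor` stands for the multiplier that the claim constrains to the range 0 to 1. Undecoded coefficients are again assumed to be stored as 0.

```python
import math


def predict_block(cur, prev, block, factor, w_prev=0.5, w_min=0.5):
    """Fill undecoded bins of one current-frame block from the previous frame.

    amplitude = w_prev * |prev[k]| + w_min * factor * (min decoded amplitude in cur)
    The sign is copied from the previous frame's coefficient at the same bin.
    """
    if not 0.0 <= factor <= 1.0:
        raise ValueError("factor must lie in [0, 1]")
    decoded = [abs(c) for c in cur if c != 0]
    if not decoded:
        # No decoded coefficient in the current frame: nothing to scale by.
        return list(cur)
    min_amp = min(decoded)
    out = list(cur)
    start, end = block
    for k in range(start, end + 1):
        if prev[k] != 0 and out[k] == 0:
            amp = w_prev * abs(prev[k]) + w_min * factor * min_amp
            out[k] = math.copysign(amp, prev[k])
    return out
```

Scaling by the smallest decoded amplitude keeps the predicted pulses no louder than what the encoder actually chose to transmit, which is one plausible reason for the bound on the multiplier.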
14. The apparatus according to claim 8, further comprising: an output and saving unit, configured to output the spectral coefficients of the current frame after prediction processing, and to save the spectral coefficients of the current frame before prediction processing as the previous-frame information for the next frame.
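The output and saving behavior in this claim can be illustrated with a small state holder. This is a sketch under assumptions: the class name `PredictorState` and the `predict` callback are hypothetical, and the first frame is passed through unchanged because no previous frame exists yet.

```python
class PredictorState:
    """Keeps the pre-prediction coefficients of the last frame."""

    def __init__(self):
        self.prev = None

    def process(self, decoded, predict):
        """Return the predicted frame; remember the frame as it was decoded.

        decoded: coefficients of the current frame before prediction.
        predict: callable (cur, prev) -> predicted coefficients (hypothetical).
        """
        if self.prev is None:
            output = list(decoded)
        else:
            output = predict(decoded, self.prev)
        # Store the frame as decoded, not as predicted, so that predicted
        # values are never used as the reference for the next frame.
        self.prev = list(decoded)
        return output
```

Saving the pre-prediction coefficients means a predicted pulse cannot be carried forward indefinitely through a run of frames that contain no decoded pulse at that position.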
15. A decoder, comprising the prediction apparatus for frequency domain pulse decoding according to any one of claims 8 to 14, and a transformer, wherein:
the prediction apparatus for frequency domain pulse decoding is configured to, for a spectral block of the current frame determined to require inter-frame prediction, predict the undecoded spectral coefficients in that spectral block by using the decoded spectral coefficients in the corresponding spectral block of the previous frame and the decoded spectral coefficients of the current frame; and
the transformer is configured to perform a frequency-domain to time-domain transform on the frame spectral coefficients predicted by the prediction apparatus, and to output a time-domain audio signal.
PCT/CN2009/071161 2009-04-03 2009-04-03 Predicting method and apparatus for frequency domain pulse decoding and decoder WO2010111841A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2009/071161 WO2010111841A1 (en) 2009-04-03 2009-04-03 Predicting method and apparatus for frequency domain pulse decoding and decoder
CN2009801486921A CN102246229B (en) 2009-04-03 2009-04-03 Predicting method and apparatus for frequency domain pulse decoding and decoder

Publications (1)

Publication Number Publication Date
WO2010111841A1 (en) 2010-10-07

Family

ID=42827473

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2009/071161 WO2010111841A1 (en) 2009-04-03 2009-04-03 Predicting method and apparatus for frequency domain pulse decoding and decoder

Country Status (2)

Country Link
CN (1) CN102246229B (en)
WO (1) WO2010111841A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003150191A (en) * 2001-11-14 2003-05-23 Nippon Telegr & Teleph Corp <Ntt> Method, device and program for estimating voice spectrum, and recording medium therefor
CN1504993A (en) * 2002-11-29 2004-06-16 三星电子株式会社 Audio decoding method and apparatus for reconstructing high frequency components with less computation
US7003448B1 (en) * 1999-05-07 2006-02-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and device for error concealment in an encoded audio-signal and method and device for decoding an encoded audio signal
CN1813286A (en) * 2004-01-23 2006-08-02 微软公司 Efficient coding of digital media spectral data using wide-sense perceptual similarity
US20070016415A1 (en) * 2005-07-15 2007-01-18 Microsoft Corporation Prediction of spectral coefficients in waveform coding and decoding

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3747492B2 (en) * 1995-06-20 2006-02-22 ソニー株式会社 Audio signal reproduction method and apparatus

Also Published As

Publication number Publication date
CN102246229A (en) 2011-11-16
CN102246229B (en) 2013-03-27

Similar Documents

Publication Publication Date Title
US9779749B2 (en) Audio signal coding method and apparatus
WO2013060223A1 (en) Frame loss compensation method and apparatus for voice frame signal
RU2665889C2 (en) Selection of procedure for masking packet losses
TWI332193B (en) Method and apparatus of processing time-varying signals coding and decoding and computer program product
JP2019215545A (en) Systems and methods of communicating redundant frame information
RU2765985C2 (en) Classification and encoding of audio signals
WO2013127364A1 (en) Voice frequency signal processing method and device
WO2011110031A1 (en) Method and device for encoding high frequency signal, and method and device for decoding high frequency signal
RU2705458C2 (en) Masking errors in frames
WO2009092309A1 (en) A control method and apparatus for quantizing noise leakage
JP2020204778A5 (en)
WO2017044245A1 (en) Audio signal classification and post-processing following a decoder
US20130317831A1 (en) Bandwidth expansion method and apparatus
JP2013084002A (en) Device and method for enhancing quality of speech codec
RU2651184C1 (en) Method of processing a speech/audio signal and apparatus
RU2644078C1 (en) Method, device and coding / decoding system
JP2006018023A (en) Audio signal coding device, and coding program
WO2022012629A1 (en) Method and apparatus for estimating time delay of stereo audio signal
WO2010111841A1 (en) Predicting method and apparatus for frequency domain pulse decoding and decoder
WO2014000559A1 (en) Processing method for speech or audio signals and encoding apparatus thereof
WO2012159370A1 (en) Voice enhancement method and device
US20150334501A1 (en) Method and Apparatus for Generating Sideband Residual Signal
TW202103146A (en) Speech encoding method and electronic device
JP2002162982A (en) Device and method for voiced/voiceless decision
JPH0364800A (en) Voice encoding and decoding system

Legal Events

Date Code Title Description
WWE WIPO information: entry into national phase (Ref document number: 200980148692.1; Country of ref document: CN)
121 EP: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 09842498; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 EP: PCT application non-entry in European phase (Ref document number: 09842498; Country of ref document: EP; Kind code of ref document: A1)