CN104217725A

CN104217725A - Audio watermarking method based on multi-echo core

Info

Publication number: CN104217725A
Application number: CN201410507604.5A
Authority: CN
Inventors: 谢湘; 张学君; 赵军辉
Original assignee: Beijing Institute of Technology BIT
Current assignee: Beijing Institute of Technology BIT
Priority date: 2014-09-29
Filing date: 2014-09-29
Publication date: 2014-12-17

Abstract

The invention discloses an audio watermarking method based on a multi-echo core and belongs to the field of audio signal processing. The method includes first processing binary original watermarking signals, determining the echo core according to the attenuation coefficient and the echo delay point position, embedding the multiple ways of echo into the original audio signals, adopting an energy reversed spectral analysis method to decode the signals during watermarking extraction and extracting the corresponding watermarking information. The echo cache means is adopted in the embedding process, the synchronous performance is good, the problem of watermarking damage caused by the fact that the audio signals are shorn randomly is solved, the problem of echo interference in the multiple echo is solved, the data capacity is improved, the odd-even check information is added, the system hostile attack resistance is improved, and the method has high robustness.

Description

A kind of audio-frequency water mark method based on many echoes core

Technical field

The present invention relates to a kind of multimedia messages hidden method, particularly relate to a kind of audio-frequency water mark method based on many echoes core, can be used as audio-frequency information label mark, or for Information hiding, belong to Audio Signal Processing field and information security field.

Background technology

The copyright protection of audio multimedia and content retrieval are all concerned problems all the time; digital watermark technology is the approach that this problem provides solution; using label as watermark embedment audio multimedia; label can mark the source of audio frequency, copyright, synopsis or other attributes; simultaneously; the embedding of watermark label should be pursued and improve robustness and watermark capacity under the prerequisite not reducing audio quality, and can realize the embedding of watermark quick, convenient, steadily and extract detection.

For example, when employing FM audio broadcasting is as carrier, carry the satellite information (as copyright, source, Checking label etc.) relevant with broadcasted content, these information as watermark embedment in audio frequency, and the embedding of watermark can not destroy the audio quality of broadcast itself, together transmit with audio-frequency information during transmission.Audio watermarking technique is exactly under the prerequisite not affecting the quality of audio frequency own, will hide Info and be embedded in host signal, and extracts a kind of technology of watermark by corresponding decoding algorithm.Originally, as long as its function is limited in the aspects such as copyright protection, secret communication, tampering detection, now, along with the development of technology, algorithms of digital audio watermarking can be divided into following four classes: phase encoding, band spectrum modulation, joining method, echo hiding.Echo hiding technology, based on the masking effect of people's ear, artificial adds echo signal to audio frequency, under people's perception is less than the prerequisite of echo by information insertion in voice signal, in guarantee voice quality, there is natural advantage.In addition, the advantages such as it also has low complex degree, do not have noise to superpose, busyly detect, the good performance of net synchronization capability is good, so range of application is more extensive.At present, there are some related invention, they discuss by be applied to based on the audio frequency watermark of echo hiding be similar to FM broadcast media program in.

Such as, the title of Bender is the U.S. Patent No. 5,893 of " Method and Apparatus for Echo Data Hiding in Audio Signals ", and one or more echo is embedded in host audio signal by 067.The title of Yoiti Suzuki is that US Patent No. 2003/0172277 A1 of " Digital Watermark System " discloses and a kind ofly the echo signal of generation is inserted into data waterprint embedded method in original audio signal by expanding echo signal on a timeline.The title of Chen Ning is that " multi-watermarking based on echo hiding embeds and extraction algorithm " proposes and be embedded in same host signal by multi-watermarking signal.

But the echo hiding method capacity of Bender is very low and decode procedure is dangerous.The methods combining of Yoiti echo hiding and spread-spectrum, to improve capacity and security.But consider that the down-sampling in media interaction scenarios is attacked, the length of the embedding echo array allowed is limited, thus it can not provide length to be enough to ensure the PN sequence of good statistical attribute.On the other hand, this method is fragile for echo jitter attack.Chen Ning improves the embedded mobile GIS of echo hiding, but does not carry out rational theoretical validation to selected parameter, once the quantity of watermark is much, system performance can become very poor, and it does not consider the impact of FM modulation /demodulation on system yet.The behaviors such as the random editing especially often occurred in FM broadcast, require that system has good anti-lock-out attacking ability.

Therefore, prior art fails to provide a kind of effective ways by obtaining media program relevant information based on audio frequency watermark.

For the audio frequency watermark system being applied to FM broadcast, meet the following conditions:

(1) related content FM can broadcasted as watermark embedment in original broadcast, and can not affect the voice quality of broadcast itself.

(2) just can obtain by simple method, step the relevant information hidden in broadcast, to synchronous not requirement.

(3) compression, resampling, re-quantization, the attack maliciously such as wave filter, neighbourhood noise, synchronization attack can be resisted.

(4) after FM modulation, demodulation, watermark information is not destroyed, the information of our needs of decoding out normally.

(5) because FM broadcasted content enriches, require that this power system capacity is high.

Summary of the invention

The object of the invention is overcome the deficiency of above system and meet the above condition of system requirements, a kind of audio-frequency water mark method based on many echoes core is provided, the watermark label that can embed in quick, convenient, sane acquisition audio content, and do not affect the quality of broadcast own, the attack of multiple malice can be resisted.

A kind of audio-frequency water mark method based on many echoes core of the present invention, comprises the steps:

The telescopiny of watermark:

Step one: be scale-of-two watermark sequence { a by information coding to be embedded ₀a ₁a ₂a _(J*U)-1, wherein a _n0 or 1,0≤n≤(J*U)-1; Described scale-of-two watermark sequence length is M=U*J bit, and wherein J, U are the positive integer being more than or equal to 1; J is the way of default echo, and U is the binary sequence bit number that every road echo embeds;

Such as, if No. ID of to be embedded is each audio frequency, distribute unique No. ID (one-to-one relationship) so first to each audio multimedia data, this No. ID is converted to scale-of-two watermark sequence (such as by mapping table or be directly converted to scale-of-two);

Step 2: scale-of-two watermark sequence step one obtained is embedded in corresponding sound signal, and concrete grammar is as follows:

1, first original audio signal X (n) is carried out segmentation, split time is second, wherein F _sfor sampling rate, the i-th section audio signal writing X after segmentation _i(n);

2, M-bit scale-of-two watermark information { a step one obtained ₀a ₁a ₂a _(J*U)-1change into the matrix that size is U × J:

[\begin{matrix} a_{0} & a_{U} & . . . & a_{U (J - 1)} \\ . \\ a_{1} & a_{U + 2} & . . . & . \\ . \\ . & . \\ . & . . . & . . . & . \\ . & . \\ a_{U - 1} & a_{2 U - 1} & . . . & a_{(J * U) - 1} \end{matrix}];

Then will often arrange the binary number that writing one is new from top to bottom, and be translated into decimal number, obtain J decimal value { b ₁b ₂b _jand preserve;

3, the delay of J road echo signal is determined according to this J decimal value: d ₁, d ₂..., d _j;

d ₁＝A+F×b ₁

d ₂＝d ₁+F×b ₂+K

.

d _J＝d _J-1+F×b _J+K

Wherein parameter A represents initial delay point value; K is intersegmental boundary belt length, namely minimum between different delay interval, and preferred value is 6; The F section of being intrinsic resolution, namely same road postpones the interval between different value, the optimal value of preferred value to be the preferred value of 3, K and F be experimentally result verification;

Then, to each section audio X _in (), by X _in () postpones d ₁individual delay point, is then multiplied by attenuation coefficient α, namely obtains the 1st road echo signal α X _i(n-d ₁); In like manner, X is obtained successively _iall the other echo signals α X of (n) _i(n-d ₂) ..., α X _i(n-d _j), namely obtain J road echo signal; As preferably, α=0.4;

4, the parity information of 1 bit is determined according to the number of " 1 " in M-bit scale-of-two watermark sequence in step one; Scale-of-two watermark sequence is embedded into as follows each section audio X _iin (n), until whole audio frequency all embeds complete:

If parity information is ' 0 ', the number representing " 1 " in M-bit scale-of-two watermark sequence is odd number, then i-th section containing watermark signal:

Y _i(n)＝X _i(n)+αX _i(n-d ₁)+αX _i(n-d ₂)+…+αX _i(n-d _J).

That is now:

h(n)＝δ(n)+αδ(n-d ₁)+…+αδ(n-d _J)

Y _i(n)＝X _i(n)*h(n)

If parity information is ' 0 ', the number representing " 1 " in M-bit scale-of-two watermark sequence is even number, then i-th section containing watermark signal:

Y _i(n)＝X _i(n)-αX _i(n-d ₁)-αX _i(n-d ₂)-…-αX _i(n-d _J)

That is now:

h(n)＝δ(n)-αδ(n-d ₁)-…-αδ(n-d _J)；

Y _i(n)＝X _i(n)*h(n)

5, by every section audio signal Y of embed watermark _in () head and the tail couple together and obtain containing watermark audio signal Y (n);

The leaching process of watermark:

Step one: carry out segmentation to sound signal Y ' (n) to be detected, section length is second, the i-th section audio signal after the section of scoring is Y ' _i(n);

Step 2: the energy cepstrum calculating every section audio signal to be detected, the i-th section audio signal Y ' _ithe energy cepstrum P of (n) _{y ' i}(n) be:

P _y′i(n)＝{IFFT(log(|FFT(Y _i′(n))| ²))} ²

Defined can be obtained by energy cepstrum

P _yi(n)＝{IFFT(log(|FFT(Y _i(n))| ²))} ²＝P _xi(n)+P _h(n)，

The end be arbitrary value can, choose as single 10 or e be the end; Thus obtain:

P _y(n)≈P _x(n)+n ²α ²δ(n)

+α ²{[δ(n-d ₁)+δ(n+d ₁)]+…+[δ(n-d _J)+δ(n+d _J)]}

+α ⁴{[δ(n-(d ₂-d ₁))+δ(n+(d ₂-d ₁))]+[δ(n-(d ₃-d ₁))+δ(n+(d ₃-d ₁))]+…+[δ(n-(d _J-d ₁))+δ(n+(d _J-d ₁))]}

+α ⁴{[δ(n-(d ₃-d ₂))+δ(n+(d ₃-d ₂))]+[δ(n-(d ₄-d ₂))+δ(n+(d ₄-d ₂))]+…+[δ(n-(d _J-d ₂))+δ(n+(d _J-d ₂))]}

+

.

+α ⁴{[δ(n-(d _J-1-d _J-2))+δ(n+(d _J-1-d _J-2))]+[δ(n-(d _J-d _J-2))+δ(n+(d _J-d _J-2))]+…+[δ(n-(d _J-d ₂))+δ(n+(d _J-d ₂))]}

+α ⁴[δ(n-(d _J-d _J-1))+δ(n+(d _J-d _J-1))]

Can find out according to formula, the sound signal containing watermark will at delay point d ₁, d ₂..., d _jthere is peak value in place;

Step 3: extract watermark information; Concrete grammar is as follows:

(1) to each segmentation sound signal Y ' to be detected _in (), detects its energy cepstrum P _{y ' i}each peak point position of (n), and the peak point position detected is recorded as d ₁', d ₂' ..., d _j'

(2) { b is calculated according to the inverse operation of formula in the 3rd small step embedded in step 2 ₁' b ₂' ... b _j', that is:

b ₁′＝(d ₁′-A)/F

b ₂′＝(d ₂′-d ₁′-K)/F

.

b _J′＝(d _J′-d _J-1′-K)/F

(3) by { b ₁' b ₂' ... b _j' in each element round up be taken as integer after be converted to U position scale-of-two respectively, and the new binary number of the end to end composition of binary number one after these being changed, think that this binary number is extracted watermark;

Further, also comprise:

(4) by extracted watermark and the scale-of-two watermark sequence { a embedded in step ₀a ₁a ₂α _(J*U)-1contrast, to judge that whether this sound signal is containing watermark.

Below be namely the audio-frequency water mark method based on many echoes core proposed by the invention.

Contrast prior art, beneficial effect of the present invention is:

1) the present invention is of many uses, such as, people can be made from media program, to obtain relevant information easily and efficiently when broadcast listening.

2) present invention employs many echoes core, and derived the upper limit of echo quantity in theory, improve the capacity of data, only need the broadcast of recording 1s, can 2 be identified ¹⁶individual FM broadcast program.

3) the present invention with the addition of parity information, enhances the ability of the anti-malicious attack of system, has very high robustness.

4) the present invention well solves in multiple echo, the problem that echo is disturbed mutually, and this postpones dot matrix to determine the delay point of several roads echo owing to adopting.

5) the present invention analyzes the performance of the band audio frequency watermark signal after FM modulation /demodulation, makes it to be more suitable for being applied to FM live broadcast.

6) echo telescopiny of the present invention takes echo buffer memory means, makes net synchronization capability good, solves the problem that FM broadcasts the destruction watermark that random shearing brings thus.

Accompanying drawing explanation

Fig. 1 is audio frequency watermark corresponding relation schematic diagram of the present invention;

Fig. 2 is the inventive method implementing procedure figure;

Fig. 3 is that embodiment adopts the inventive method to embed the process flow diagram of audio frequency watermark to sound signal;

Fig. 4 is that embodiment adopts the inventive method from sound signal, extract the process flow diagram of watermark;

Fig. 5 is the analogous diagram of the present invention's detected energy cepstrum peak point when detecting watermark;

Fig. 6 is multichannel echo delay signal schematic representation of the present invention.

Embodiment

Below in conjunction with drawings and Examples, the present invention is described in detail, also describe technical matters and the beneficial effect of technical solution of the present invention solution simultaneously, it is pointed out that described embodiment is only intended to be convenient to the understanding of the present invention, and any restriction effect is not play to it.

A kind of method to sound signal embed digital watermark of the present invention, see accompanying drawing 2, comprising:

Binary original watermark signal is processed, obtains the watermark signal after processing.According to the value of watermark information determination parity check bit, determine attenuation coefficient, determine the delay point position of echo.By the echo core that the delay point position of attenuation coefficient and echo determines, multichannel echo is embedded in original sound signal.Comprise when extracting watermark: adopt the method for energy cepstrum analysis to decode to signal, extract watermark information.

The present invention has also derived the upper limit of echo quantity in multiple echo core theoretically.

The following describes the theoretical foundation by the theoretical foundation of four tunnel inhibit signals and the value of parameter A.

1, by the theoretical foundation of multipath delay signal

Be provided with J road to postpone, that is

H (n)=δ (n)+α δ (n-d ₁)+... + α δ (n-d _j), calculating energy cepstrum is:

P _y(n)≈P _x(n)+n ²α ²δ(n)

+α ²{[δ(n-d ₁)+δ(n+d ₁)]+…+[δ(n-d _J)+δ(n+d _J)]}

+

.

+α ⁴[δ(n-(d _J-d _J-1))+δ(n+(d _J-d _J-1))]

Analyze above formula can obtain: containing in the energy cepstrum of watermark signal, have two quasi-peak values.The first kind is the peak value produced in the position postponing point, and amplitude is α ², we are referred to as to postpone some peak value, and we can utilize this peak value to determine to postpone the value of point, thus extract watermark information.Equations of The Second Kind peak value is at d _i-d _jthe peak value that (1 <=j < i <=J) place produces, amplitude is α ⁴, total (J-1) J/2 is individual.They can produce interference to delay point peak value, and then cause occurring decoding error, and we are referred to as to disturb peak value.Supposing a kind of limiting case, if all interference peak values all appear at same position, if amplitude now reaches with postponing some peak amplitude suitable, so just will inevitably occur decoding error, that is: and α=0.4, we can calculate J=4.07.Also the theoretical upper limit of many echoes core is just said.Based on this theory deduction, we get J=4.

2, the theoretical foundation of the value of parameter A

Have above-mentioned analysis to obtain, during Dang You tetra-road peak value, the position of interference peak value is

(d ₂-d ₁)，(d ₃-d ₁)，…，(d _J-d ₁)；

(d ₃-d ₂)，(d ₄-d ₂)，…，(d _J-d ₂)；

…….

(d _J-1-d _J-2)，(d _J-d _J-2)

(d _J-d _J-1)

Known accordingly:

d ₂-d ₁＝3×(b ₁+2)＝interp1

d ₃-d ₁＝3×(b ₂+b ₃+4)＝interp2

d ₄-d ₁＝3×(b ₂+b ₃+b ₄+6)＝interp3

.

d _J-d _J-1＝3×(b _J-1+2)＝interp[(J-1)J/2]

Thus the total size calculating interference peak value is [min (interp1, interp2, interp3, interp4, interp [(J-1) J/2]), max (interp1, interp2, interp3, interp4 ..., interp [(J-1) J/2])].In order to make interference peak value can not affect postpone some a peak value, we get A=max (interp1, interp2, interp3, interp4 ..., interp [(J-1) J/2]) and+1 be optimal value.

Below in conjunction with embodiment, the specific embodiment of the present invention is made an explanation.Take watermark as the ID of audio data be example:

First multimedia database is set up, as Fig. 1.

Specific practice: compile unique No. ID (one-to-one relationship) to each multimedia document

Watermark embed step (accompanying drawing 3):

Step one: be embedded into No. ID in sound signal, if No. ID is the binary message of 16bits;

Specific practice is as follows:

1, first original complete audio frequency is carried out segmentation, split time for (44100/fs) s wherein fs=441.00 be sampling rate.If originally audio frequency is X (n), then after segmentation i-th section be X _i(n).

2,16bits binary message is divided into 4 parts, every part is 4bits information, then every part of 4bits binary number is converted into corresponding decimal number, obtains four decimal value [b ₁b ₂b ₃b ₄], wherein b _i∈ [0,15]

3, the delay d of four road echo signals is determined according to four decimal values ₁, d ₂, d ₃, d ₄.(F＝3，K＝6)

D ₁=A+F × b ₁, by X _in () postpones d ₁individual delay point, is then multiplied by attenuation coefficient α=0.4, obtains first via echo signal α X _i(n-d ₁). in like manner, we calculate other delay: d ₂=d ₁+ F × b ₂+ K; d ₃=d ₂+ F × b ₃+ K; d ₄=d ₃+ F × b ₄+ K; Thus obtain remaining 3 road echo signal α X _i(n-d ₂), α X _i(n-d ₃), α X _i(n-d ₄).

4, the parity information of 1bits is determined according to 16bits binary message, if be ' 0 ', the band watermark signal after embedding:

Y _i(n)＝X _i(n)+αX _i(n-d ₁)+αX _i(n-d ₂)+αX _i(n-d ₃)+αX _i(n-d ₄).

That is:

H (n)=δ (n)+α δ (n-d ₁)+... + α δ (n-d ₄), Y _i(n)=X _i(n) * h (n); Otherwise:

Y _i(n)＝X _i(n)-αX _i(n-d ₁)-αX _i(n-d ₂)-αX _i(n-d ₃)-αX _i(n-d ₄)。

That is h (n)=δ (n)-α δ (n-d ₁)-...-α δ (n-d ₄), Y _i(n)=X _i(n) * h (n); See accompanying drawing 6;

5, by id information loop embedding to next section audio X _i+1n (), until whole audio frequency all embeds complete.Every section audio signal of embed watermark is coupled together.

Introduce the extraction step (accompanying drawing 4) of watermark below: from audio signal decoding out watermark information (No. ID)

Calculate the energy cepstrum of band watermark signal

1, carry out segmentation to sound signal Y ' (n) to be detected, split time is (44100/fs) s;

2, defined can be obtained by energy cepstrum:

P _y(n)＝{IFFT(log(|FFT(y(n))| ²))} ²＝P _x(n)+P _h(n)

, thus obtain:

P _y(n)≈P _x(n)+n ²α ²δ(n)+{α ²[δ(n-d ₁)+δ(n+d ₁)]+…+α ²[δ(n-

d ₄)+δ(n+d ₄)]}+α ⁴{[δ(n-(d ₂-d ₁))+δ(n+(d ₂-d ₁))]+…+[δ(n-

(d _J-d ₁))+δ(n+(d _J-d ₁))]}+…+α ⁴[δ(n-(d ₄-d ₃))+δ(n+(d ₄-

d ₃))]

3, according to above-mentioned formula, we can find out, and band watermark signal is at delay point d ₁, d ₂, d ₃, d ₄there is peak value in place, thus the position of detection peak point can obtain d ₁, d ₂, d ₃, d ₄value (accompanying drawing 5).

Calculate by the value postponing some the watermark information (No. ID) embedded, according to No. ID, in data bank, search corresponding multimedia document, multimedia document is fed back to user

Illustrate below in conjunction with embodiment: the theoretical foundation of the value of parameter A:

Have above-mentioned analysis to obtain, during Dang You tetra-road peak value, the position of interference peak value is (d ₂-d ₁), (d ₃-d ₁), (d ₄-d ₁), (d ₃-d ₂), (d ₄-d ₂), (d ₄-d ₃):

d ₂-d ₁＝3×(b ₁+2)∈[6，51]

d ₃-d ₁＝3×(b ₂+b ₃+4)∈[12，102]

d ₄-d ₁＝3×(b ₂+b ₃+b ₄+6)∈[18，153]

d ₃-d ₂＝3×(b ₃+2)∈[6，51]

d ₄-d ₂＝3×(b ₃+b ₄+4)∈[12，102]

d ₄-d ₃＝3×(b ₄+2)∈[6，51]

The total size that therefrom we can calculate interference peak value is [6,153].Can not affect to make interference peak value and postpone some peak value, we get A=154 is optimal value.

Further, based on this method, the present invention can be used for obtaining FM broadcast related information, comprising:

Embed step: in the sound signal that this step broadcast to FM, embedding comprises the sequence number watermark information of the relevant information of media program;

FM step of transmitting: the broadcast singal comprising watermark information is launched by this step after FM modulation;

Transmission step: this step F M broadcast singal transmits in the channel;

FM demodulation step: in this step, FM receiver receives FM broadcast singal and signal receiving out;

Recording step: the part in the broadcasted content of this step user record FM demodulator solution band watermark out;

Extraction step: this step extracts corresponding watermark information from the part sound signal of record, namely obtains the sequence number of this section of broadcast associated content;

Relevant information forwarding step: this step according to the sequence number of the related content extracted, by the content of sequence number sending to user.

During transmission of audio watermark signal, comprise FM transmitter device, this equipment has been used and has been launched after FM modulation by the voice signal that embedded in audio frequency watermark.And FM demodulated equipment, this equipment is used for receiving FM broadcast singal, and by signal receiving out, shows etc.

Below by the performance of experimental evaluation this method.First by changing the length of segmentation, changing transfer rate, thus contrasting the watermarking algorithm based on many echoes core of the present invention and existing monokaryon watermarking algorithm performance when different transmission rates.Some are attacked, such as wave filter, resampling, re-quantization, add white noise etc. and be used for the robustness of testing algorithm; Subjective test results shows, and the audio frequency watermark system based on many echoes core has well not sentience.Finally, we whole method is applied to FM broadcast in and test.Here is the experimental result of embodiment:

This experiment tests as carrier audio frequency with 20 dissimilar audio clips, and these 20 audio clips adopt the sampling of 44.1Khz, and 16-bit quantizes, and can be specifically a symbol with reference to table 1 watermark content, wherein comprise the binary message of 16bits.

The carrier audio frequency of table 1. for testing

The accuracy of symbol is defined as:

{AR}_{S} = \frac{number of symbol correctly decoded}{number of symbols encoded} \times 100 %

One of them symbol contains the binary message of 16bits.

1, many echoes core audio frequency watermark system performance

(1) transfer rate

Adopting conventional single-core water mark method as a comparison in the present embodiment, by changing the length of segmentation, thus testing the performance of two kinds of methods at different rates.By result, we can find out, the water mark method based on many echoes core that we propose has obvious advantage, and especially when we improve transfer rate time, this advantage becomes more obvious.

Table 2.Result of transmission rate test

(2) robustness test

For FM broadcast system, evitable scarcely exist the behaviors such as resampling, quantification, compression, filtering, editing, so our system must have resistivity to the attack of these malice.In this experiment, what we thought adds attack, tests under difference is attacked, the performance of this system.Concrete attack can with reference to table 3 in this experiment, and section length is 1 second.As can be seen from the result of table 4 we, in contrast to traditional audio frequency watermark system, our system has higher accuracy, except resampling, the accuracy AR under other attack _sall exceed 90%.

Table 3.Detail of attack

Table 4.Result of attack test

(3) subjective testing

Take AB method for subjective testing to survey the quality of the audio frequency containing watermark.Wherein have 10 people to participate in test, everyone listens to 20 audio clips.In test process, they need to elect one from two options (raw tone and band watermark voice) and think that audio quality is better themselves.By this experiment, we can test the degree of injury increased to audio quality after watermark.Concrete outcome can with reference to table 5.

As can be seen from almost reciprocity result, we, add watermarking process and produce damage to audio frequency, is namely with watermarked audio to have good not sentience.

Table 5.Result of subjective test

prefer	Host?audio	Watermarked?audio
			number	97	103
rate	0.485	0.515

2, performance test

In this experiment, the audio frequency watermark system of design is applied in FM broadcast by we, and split time is 1 second, and watermark is 1 symbol (16bits scale-of-two), adopt the mode record FM receiver decoding sound signal out of recording, test result can with reference to table 6.We know, along with the increase of recording distance, signal to noise ratio (S/N ratio) (SNR) can decline, accuracy AR _snature can decline, and result shows, and our system has showed performance well, and recording distance can reach 5 meters.

Table6.Test?result?of?the?whole?system

The above; be only the specific embodiment of the present invention; but protection scope of the present invention is not limited thereto; any people being familiar with this technology is in the technical scope disclosed by the present invention; the conversion and replacement expected can be understood; all should be encompassed in and of the present inventionly comprise within scope, therefore, protection scope of the present invention should be as the criterion with the protection domain of claims.

Claims

1. based on an audio-frequency water mark method for many echoes core, it is characterized in that, comprise the steps:

The telescopiny of watermark:

(1) first original audio signal X (n) is carried out segmentation, split time is second, wherein F _sfor sampling rate, the i-th section audio signal writing X after segmentation _i(n);

(2) M-bit scale-of-two watermark information { a step one obtained ₀a ₁a ₂a _(J*U)-1change into the matrix that size is U × J:

[\begin{matrix} a_{0} & a_{U} & . . . & a_{U (J - 1)} \\ . \\ a_{1} & a_{U + 2} & . . . & . \\ . \\ . & . \\ . & . . . & . . . & . \\ . & . \\ a_{U - 1} & a_{2 U - 1} & . . . & a_{(J * U) - 1} \end{matrix}];

(3) delay of J road echo signal is determined according to this J decimal value: d ₁, d ₂..., d _j;

d ₁＝A+F×b ₁

d ₂＝d ₁+F×b ₂+K

·

d _J＝d _J-1+F×b _J+K

Wherein parameter A represents initial delay point value; K is intersegmental boundary belt length, namely minimum between different delay interval, the F section of being intrinsic resolution, and namely same road postpones the interval between different value;

Then, to each section audio X _in (), by X _in () postpones d ₁individual delay point, is then multiplied by attenuation coefficient α, namely obtains the 1st road echo signal α X _i(n-d ₁); In like manner, X is obtained successively _iall the other echo signals α X of (n) _i(n-d ₂) ..., α X _i(n-d _j), namely obtain J road echo signal;

(4) parity information of 1 bit is determined according to the number of " 1 " in M-bit scale-of-two watermark sequence in step one; Scale-of-two watermark sequence is embedded into as follows each section audio X _iin (n), until whole audio frequency all embeds complete:

Y _i(n)＝X _i(n)+αX _i(n-d ₁)+αX _i(n-d ₂)+…+αX _i(n-d _J).

Y _i(n)＝X _i(n)-αX _i(n-d ₁)-αX _i(n-d ₂)-…-αX _i(n-d _J)

(5) by every section audio signal Y of embed watermark _in () head and the tail couple together and obtain containing watermark audio signal Y (n);

The leaching process of watermark:

P _y′i(n)＝{IFFT(log(|FFT(Y _i′(n))| ²))} ²

Step 3: extract watermark information; Method is as follows:

(2) { b is calculated ₁' b ₂' ... b _j', that is:

\begin{matrix} {b_{1}}^{'} = ({d_{1}}^{'} - A) / F \\ {b_{2}}^{'} = ({d_{2}}^{'} - {d_{1}}^{'} - K) / F \\ . \\ . \\ . \\ {b_{J}}^{'} = ({d_{J}}^{'} - {d_{J - 1}}^{'} - K) / F \end{matrix}

(3) by { b ₁' b ₂' ... b _j' in each element round up be taken as integer after be converted to U position scale-of-two respectively, and the new binary number of the end to end composition of binary number one after these being changed, think that this binary number is extracted watermark.

2. according to a kind of audio-frequency water mark method based on many echoes core described in claim 1, it is characterized in that, the step 3 of described extraction step, after extracting watermark, also comprises step: (4) are by extracted watermark and the scale-of-two watermark sequence { a embedded in step ₀a ₁a ₂a _(J*U)-1contrast, to judge that whether this sound signal is containing watermark.

3., according to a kind of audio-frequency water mark method based on many echoes core described in claim 1, it is characterized in that, α=0.4.

4., according to a kind of audio-frequency water mark method based on many echoes core described in claim 3, it is characterized in that, M=4.

5., according to a kind of audio-frequency water mark method based on many echoes core described in claim 3, it is characterized in that, U=4.

6., according to a kind of audio-frequency water mark method based on many echoes core described in claim 1, it is characterized in that, K=6.

7., according to a kind of audio-frequency water mark method based on many echoes core described in claim 1, its feature exists. in, F=3.

8., according to a kind of audio-frequency water mark method based on many echoes core described in claim 4, it is characterized in that, A=154.