WO2007131445A1 - A method, a system and a apparatus for censoring video code stream - Google Patents

A method, a system and a apparatus for censoring video code stream Download PDF

Info

Publication number
WO2007131445A1
WO2007131445A1 PCT/CN2007/001548 CN2007001548W WO2007131445A1 WO 2007131445 A1 WO2007131445 A1 WO 2007131445A1 CN 2007001548 W CN2007001548 W CN 2007001548W WO 2007131445 A1 WO2007131445 A1 WO 2007131445A1
Authority
WO
WIPO (PCT)
Prior art keywords
code stream
module
review
layer code
content
Prior art date
Application number
PCT/CN2007/001548
Other languages
French (fr)
Chinese (zh)
Inventor
Zhong Luo
Original Assignee
Huawei Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co., Ltd. filed Critical Huawei Technologies Co., Ltd.
Publication of WO2007131445A1 publication Critical patent/WO2007131445A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/16Analogue secrecy systems; Analogue subscription systems
    • H04N7/166Passage/non-passage of the television signal, e.g. jamming, band suppression
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234327Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by decomposing into layers, e.g. base layer and one or more enhancement layers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/2662Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/454Content or additional data filtering, e.g. blocking advertisements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/454Content or additional data filtering, e.g. blocking advertisements
    • H04N21/4542Blocking scenes or portions of the received content, e.g. censoring scenes

Definitions

  • IPTV and V0D One type of service on streaming media, such as IPTV and V0D, is designed to provide video and audio content.
  • the content is very broad, including film and television programs, news, sports competitions, concerts, etc.
  • operators/ISPs Internet Service Providers
  • content providers There is such a need at operators/ISPs (Internet Service Providers) and content providers.
  • the IPTV operation will be launched on a large scale in China. So the first question is how to ensure effective content monitoring and filtering to achieve filtering of harmful content. If this problem is not solved, the operation of IPTV in China will not be discussed, and the relevant state departments will not be able to issue licenses. Therefore, the solution to this problem is of great significance for promoting the development of the IPTV industry.
  • the usual understanding includes two aspects:
  • the present invention also provides a video stream review system, including: a video input device, a video encoding device, and a streaming server, and a review device;
  • the decision principle storage submodule is connected between the decision module and the control module, and stores the decision principle input through the control module;

Abstract

A method, a system and a relative apparatus are provided to solve the problem of censoring video code stream based on layered video coding technique. The correlation between base layer and enhancement layer in the layered coding technique is used to generate a censoring code stream.The censoring code stream includes base layer code stream, or may further include part of enhancement layer code stream. A specifically designed censoring apparatus censors and transfers the censoring code stream. When the code stream includes inappropriate content, the censoring apparatus will cut off the transfer of the censoring code stream such that the user receiving the stream can't decode and reconstruct the original video stream anymore. In this way, inappropriate content can be prevented from broadcasting. The method, system and apparatus disclosed can be applied in live mode or storage mode in the streaming media service.

Description

一种视频码流审查方法、 系统和设备 技术领域  Video code stream review method, system and device
本发明涉及多媒体通信技术, 特别涉及一种多媒体通信过程中, 一种基 于分层编码实现的视频码流审查方法、 系统和相关设备。 背景技术  The present invention relates to multimedia communication technologies, and in particular to a method, system and related device for video code stream review based on layered coding in a multimedia communication process. Background technique
流媒体 ( Streaming Media )作为一种基本的多媒体通信形式, 派生出了众 多的多媒体通信业务形式: 会议电视 /可视电话 , IPTV ( Internet Protocol Television, IP电视), VOD ( Video on Demand, 视频点播), 即时通信等。 因 此流媒体将成为 NGN ( Next Generation Network, 下一代网络)上的基本通信 形式。 尤其是近几年来国内外 IPTV业务的快速兴起, 流媒体在网络上的应用 也在急速发展。  As a basic form of multimedia communication, Streaming Media has spawned many forms of multimedia communication services: conference television/visual telephone, IPTV (Internet Protocol Television, IP TV), VOD (Video on Demand, video on demand). ), instant messaging, etc. Therefore, streaming media will become the basic form of communication on the NGN (Next Generation Network). Especially in recent years, the rapid rise of IPTV services at home and abroad, the application of streaming media on the network is also developing rapidly.
流媒体上的一类业务比如 IPTV和 V0D都是以提供视频音频内容为功能 的。 内容的范围非常广阔, 包括影视节目、 新闻、 体育比赛、 演唱会等。 各 个国家, 尤其是我国, 对于内容的安全和监控一直是高度重视的, 都有相关 的法律。从保护未成年人角度出发,各国也有相关的规定。同时,在运营商 /ISP ( Internet Service Provider, 因特网服务提供商) 和内容提供商那里也有这样 的需求。 国内即将大规模开展 IPTV的运营, 那么首先一个问题是如何保证有 效的内容监控和过滤, 实现有害内容的过滤。 不解决这个问题, IPTV在国内 的运营将无从谈起, 国家相关部门也不可能发放牌照。 因此, 这个问题的解 决对于推动 IPTV产业的发展有着重要的意义。对于内容安全, 通常的理解包 括两个方面:  One type of service on streaming media, such as IPTV and V0D, is designed to provide video and audio content. The content is very broad, including film and television programs, news, sports competitions, concerts, etc. Various countries, especially China, have always attached great importance to the safety and monitoring of content, and all have relevant laws. From the perspective of protecting minors, countries also have relevant regulations. At the same time, there is such a need at operators/ISPs (Internet Service Providers) and content providers. The IPTV operation will be launched on a large scale in China. So the first question is how to ensure effective content monitoring and filtering to achieve filtering of harmful content. If this problem is not solved, the operation of IPTV in China will not be discussed, and the relevant state departments will not be able to issue licenses. Therefore, the solution to this problem is of great significance for promoting the development of the IPTV industry. For content security, the usual understanding includes two aspects:
1、 对于内容的保护, 防止内容被没有权限的用户接收;  1. For content protection, prevent content from being received by users without permission;
比如防止盗看电视节目等。 对于这类入侵, 有很多成熟技术, 比如加密 For example, to prevent theft of TV shows. For this type of intrusion, there are many mature technologies, such as encryption.
( Encryption ) 和加扰 (Scrambling ), 认证鉴权、 以及数字版权管理 DRM ( Digital Right Management )等; 2、 对于有害和非法内容的入侵的防范, 保护的对象是内容攻击的对象, 通常是受众。 (Encryption) and Scrambling, authentication and authentication, and Digital Right Management (DRM); 2. For the prevention of intrusion of harmful and illegal content, the object of protection is the object of content attack, usually the audience.
这需要对于内容进行实时的审查, 借鉴目前在广播电视行业的做法, 主 要是 4巴电视节 目 码流 ( 一般都是按照 DVB-T , Digital Video Broadcasting-Terrestrial 传送的, 视频音频的压缩格式是 MPEG-2、 MPEG=Moving Picture Experts Group, 一个国际标准组织)在从节目源(比如 卫星)传送到用户电视机 /STB的路径上设置检查节点, 进行人工审查。 当然 随着技术的进步, 有些内容的审查可以有系统自动完成或者半自动 (人机结 合) 来完成。 人工审查一旦发现节目内容有问题, 就要采取措施, 停止该节 目码流的传送, 并且(多数情况下) 用一个临时替代的节目来替代被中断的 内容有害的节目, 比如公益广告或者字幕公告等等。 当然人工判断需要集合 内容的上下文, 并且反应和处置需要一定的时间, 因此必须有一个延迟装置 来提供这个延时, 比如 5秒钟。  This requires real-time review of the content, drawing on the current practice in the broadcast and television industry, mainly the 4 bar TV program stream (generally transmitted according to DVB-T, Digital Video Broadcasting-Terrestrial, the compression format of video and audio is MPEG). - 2, MPEG = Moving Picture Experts Group, an international standards organization) set up inspection nodes on the path from the program source (such as satellite) to the user's TV / STB for manual review. Of course, with the advancement of technology, some content reviews can be done automatically by the system or semi-automatically (human-machine combination). Manual review, once it is found that there is a problem with the content of the program, measures must be taken to stop the transmission of the program stream, and (in most cases) replace the program with harmful content, such as public service advertisements or subtitle announcements, with a temporarily replaced program. and many more. Of course, manual judgment requires the context of the collection content, and it takes a certain amount of time to react and dispose, so there must be a delay device to provide this delay, such as 5 seconds.
所谓内容过滤, 就是对于内容的某些属性进行处理和判断, 这些内容属 性可以包括: 内容提供商的名字、 内容的 URL ( Universal Resource Locator 通用资源定位器, 网址是一类重要的 U L )、 内容提供服务器的 IP地址等, 以及媒体流以数据包封装情况下的数据包的包头(packet header )信息、 包中 的信息等。 可以看出, 这种处理和过滤也是按照由浅入深的层次进行的。  Content filtering is the processing and judgment of certain attributes of content. These content attributes can include: the name of the content provider, the URL of the content (Universal Resource Locator, the URL is an important type of UL), content The IP address of the server, etc., and the packet header information of the packet in the case of the packet encapsulation in the case of the packet, the information in the packet, and the like. It can be seen that this processing and filtering is also carried out in a hierarchy from shallow to deep.
现有技术一主要是依据内容的外部特征, 或者叫做浅层特征来进行内容 过滤。 其中最典型的例子是 U L过滤, 其原理如图 1所示: 内容过滤设备位 于网络上的核心网和边缘接入网之间, 那么是来自内容源的媒体流到达接收 端之间的必由之路关口, 在实际中, 可以和企业网的代理、 NAT ( Network Address Translator, 网络地址翻译设备) /F ( Firewall, 防火墙)放在同一个 网络位置, 对于宽带家庭用户的情况, 可以和 BAS ( Broadband Administration System,宽带管理系统 ) /BRAS ( Broadband Registration and Admission System, 宽带注册和准入系统:)、 DISLAM放在同一个位置,或者放在 ISP的 POP( Point of Presence, 存在点 )上。 过滤设备自己有内部数据库, 存有多个内容源 URL的信息, 根据这个数 据库就可以判断一部分内容源是否有害, 并屏蔽有害内容源、 放行无害内容 源。 同时, 还有很多提供第三方服务的内容分级服务商, 他们的数据库更加 丰富和专业, 内容过滤设备也可以和这种第三方服务商连接, 使用他们的服 务来进行 URL过滤。 The prior art one mainly performs content filtering according to external features of the content, or shallow features. The most typical example is UL filtering. The principle is shown in Figure 1. The content filtering device is located between the core network and the edge access network on the network, so that the media stream from the content source reaches the necessary gateway between the receiving ends. In practice, it can be placed in the same network location as the agent of the enterprise network, NAT (Network Address Translator) / F (Firewall, firewall), and for the case of broadband home users, and BAS (Broadband Administration) System, Broadband Management and Admission System, DISLAM is placed in the same location, or placed on the POP's Point of Presence. The filtering device itself has an internal database with information about multiple content source URLs. According to this database, it is possible to determine whether a part of the content source is harmful, and to block harmful content sources and release harmless content sources. At the same time, there are many content rating service providers that provide third-party services. Their databases are more abundant and professional. Content filtering devices can also connect with such third-party service providers to use their services for URL filtering.
现有技术一存在如下问题:  The prior art 1 has the following problems:
1、 错杀问题: 根据 URL过滤, 可能把无害的内容过滤掉了, 比如有的 网站提供视频节目点播, 其中有些节目是有害的, 但是有些是健康的电影, 仅根据 URL无法区分;  1. Mistakes: According to URL filtering, it is possible to filter out harmless content. For example, some websites provide video program on demand, some of which are harmful, but some are healthy movies, which cannot be distinguished only according to URLs;
2、 错放问题: 有些 U L可能因为在分级体制中被认为是资质优良的网 站, 可能也会出问题(被黑客攻击冒充其网址, 或者自己有违法的企图等);  2, misplacement problem: Some U L may be considered as a qualified website in the grading system, may also have problems (being hacked to impersonate its website, or their own illegal attempts, etc.);
3、 采用 U L过滤, 通常也还需要第三方的评级体系, 这样的评级体系 是有的, 有些收费的评级服务商专门提供评级服务。 但是他们的结果也不能 完全准确和穷尽网络上所有的内容。 并且网络上的内容也是经常变化的, 任 何一个评级体系也不可能及时跟上这些变化。  3, U L filtering, usually also requires a third-party rating system, such a rating system is available, some paid rating service providers specialize in providing rating services. But their results are not completely accurate and exhaustive of all the content on the network. And the content on the web is constantly changing, and it is impossible for any rating system to keep up with these changes in a timely manner.
对于要求非常高的应用场景, 比如面向全国公众的 IPTV, 如果一旦有有 害内容尤其是政治敏感内容入侵成功, 造成的危害是巨大的。 必须做到万无 一失, 因此采用浅层次的过滤都是不可靠的。 必须采用最深层次的内容过滤, 即视频音频数据本身的过滤, 比如对于图像的识别,识别其中的有害场景(暴 力、 色情等)、 有害文字信息 (字幕)、 特定人物的面孔等等。  For very demanding application scenarios, such as IPTV for the national public, the harm caused by the successful invasion of harmful content, especially politically sensitive content, is enormous. It must be foolproof, so shallow filtering is unreliable. The deepest level of content filtering must be used, that is, the filtering of video and audio data itself, such as the recognition of images, the identification of harmful scenes (violent, pornographic, etc.), harmful text information (subtitles), faces of specific people, and so on.
要达到很高的过滤正确率, 必须深入到最深的层次, 即内容数据本身。 这个方面属于目前研究热点, 深度包过滤 DPF ( Deep Packet Filtering )0 To achieve a high rate of filtering accuracy, you must go deep into the deepest level, the content data itself. Belonging to this research focus, deep packet filter DPF (Deep Packet Filtering) 0
现有技术二深度 DPF基于人工的深度内容设别, 这种情况下, 内容过滤 设备能够对于媒体流进行解码并将内容播放出来(假设加密不是问题, 因为 加密的问题可以通过通信设备的合法监听要求解决), 供人工监控者审查。 如 果发现有问题, 监控者立即采取措施, 切断有害内容, 同时切换到一段无害 的内容比如公益广告等。 当然在内容过滤设备之后必须有一个相当大容量的 延迟设备, 来延迟有害内容, 给监控人员一定的判断和反应处置时间(比如 5 秒)。 当然, 这个过程目前也可以釆用自动或者人机结合半自动的方法实现, 其实现原理如图 2所示。 The prior art 2 deep DPF is based on manual deep content setting. In this case, the content filtering device can decode the media stream and play the content (assuming that encryption is not a problem, because the encryption problem can be legally monitored by the communication device. Request for resolution), for review by manual monitors. If a problem is found, the monitor immediately takes action to cut off harmful content and switch to a harmless content such as a public service advertisement. Of course, there must be a considerable capacity after the content filtering device. Delay the device to delay harmful content and give the supervisor a certain amount of judgment and reaction time (for example, 5 seconds). Of course, this process can also be implemented by automatic or human-machine combined semi-automatic methods. The implementation principle is shown in Figure 2.
现有技术二的基本思想是对的, 并且在广播电视领域实际应用了多年, 效果还是不错的。但是要用于 IP ( Internet Protocol )网络上的流媒体内容服务 的内容审查, 还需要进行相当多的改进和完善。 问题主要有:  The basic idea of the prior art 2 is correct, and has been practically applied for many years in the field of broadcast television, and the effect is good. However, the content review for streaming media content services on the IP (Internet Protocol) network requires considerable improvement and improvement. The main problems are:
1、 IP网络的情况比广播电视网絡在结构和网络拓朴上要复杂得多, 节目 也要多得多, 被审查的节目如果都传送到网络中心来, 则占用过多通信资源; 1. The situation of IP networks is much more complicated than that of broadcast TV networks in terms of structure and network topology. There are many more programs. If the programs being reviewed are transmitted to the network center, they will occupy too much communication resources.
2、 IP网络的内容源多、 节目多, 如果在进行审查时都进行集中的解码处 理, 则解码的处理量太大, 对审查设备的容量要求过高, 现有设备无法满足。 2. The IP network has many content sources and many programs. If the centralized decoding process is performed during the review, the processing capacity of the decoding is too large, and the capacity of the review device is too high, and the existing equipment cannot be satisfied.
分层视频编码( layered video coding )是一种 4巴视频数据流进行分层压缩 编码的方法, 主要思想是输出多个编码层, 最主要的部分是基本层 (base layer ), 基本层之上有多个增强层( enhancement layer ), 基本层和增强层可以 分开发送(可以走不同的网络路径)。 在接收端, 基本层可以独立解码重构出 基本层视频, 但是增强层必须依赖于基本层和 /或者其下面的增强层, 才能解 码重构出各自对应的视频。 在接收端, 解码重构出来的基本层和各个增强层 视频码流按照由具体分层编码方法规定的规则进行叠加, 从而得到总的视频 码流。 分层编码有^ ί艮多好处, 最显著的是能够增强流媒体业务对于各种网络 条件(静态的差别和动态的变化) 的适应性, 比如节目源可以输出一个基本 层其比特率为 384kpbs,和一个增强层其比特率为 768kbps, 那么如果用户的接 入带宽大于 1024kbps,就可以接收基本层加上增强层, 如果小于 1024kbps, 只 能接收基本层。 这是异质(heterogeneous )网络存在静态差异的情况。 网络更 加普遍的情况是动态变化, 比如因为网络拥塞, 造成带宽的降低, 原来用户 可以接收基本层和增强层, 但是带宽降低后, 只能接收基本层。 但是用户仍 然能够看, 如果不采用分层编码, 只有一个层, 有可能带宽降低后, 用户就 无法收看了。  Layered video coding is a method of layered compression coding of a 4 bar video data stream. The main idea is to output multiple coding layers. The most important part is the base layer, above the basic layer. There are multiple enhancement layers, and the base layer and the enhancement layer can be sent separately (you can take different network paths). At the receiving end, the base layer can independently decode and reconstruct the base layer video, but the enhancement layer must rely on the base layer and/or the enhancement layer below it to decode the corresponding video. At the receiving end, the reconstructed base layer and each enhancement layer video code stream are superimposed according to rules specified by a specific layered coding method, thereby obtaining a total video stream. Hierarchical coding has many advantages. The most significant is the ability to enhance the adaptability of streaming services to various network conditions (static differences and dynamic changes). For example, the program source can output a basic layer with a bit rate of 384kpbs. And an enhancement layer whose bit rate is 768 kbps, then if the user's access bandwidth is greater than 1024 kbps, the base layer plus the enhancement layer can be received, and if it is less than 1024 kbps, only the base layer can be received. This is the case where there is a static difference in the heterogeneous network. The more common situation in the network is dynamic changes. For example, because of network congestion, the bandwidth is reduced. The original user can receive the base layer and the enhancement layer, but after the bandwidth is reduced, only the base layer can be received. However, the user can still see that if there is no layered coding, there is only one layer, and the bandwidth may be reduced, and the user cannot watch it.
如图 3所示, 图 3给出了分层视频编码的原理示意图, 该图概括了当前 各种主要的分层编码技术。 一般来说, 有三种不同的分层编码机制:As shown in Figure 3, Figure 3 shows a schematic diagram of the principle of layered video coding, which summarizes the current A variety of major layered coding techniques. In general, there are three different hierarchical coding mechanisms:
1、 时间分层编码 1, time layered coding
丢弃一些编码帧, 降低单位时间内的编码帧数, 也就是所谓的帧率, 从 而达到降低比特率的目的。 如果这些帧是所谓的双向预测帧 (也叫做 B帧) 的话, 那么这些 B帧可以组成一个增强层, 因为 B帧的编码和解码依赖其它 帧, 比如帧内编码帧 ( I帧)和预测编码帧 ( P帧), 但是其它帧并不依赖 B 帧。 因此刚好符合基本层和增强层之间的依赖关系。 因此形成时间分层编码 的方法是: 基本层编码只包含 I帧和 P帧; 然后在这些 I帧和 P帧之间通过 双向预测的方法产生 B帧, 从而形成增强层, 根据需要可以加入任意数量的 B帧。  Discard some coded frames and reduce the number of coded frames per unit time, which is called the frame rate, so as to reduce the bit rate. If these frames are so-called bi-predictive frames (also called B-frames), then these B-frames can form an enhancement layer because the encoding and decoding of B-frames depends on other frames, such as intra-coded frames (I-frames) and predictive coding. Frame (P frame), but other frames do not depend on B frame. Therefore, it just meets the dependency between the base layer and the enhancement layer. Therefore, the method for forming the time layer coding is: the base layer coding only includes the I frame and the P frame; then the B frame is generated by the bidirectional prediction method between the I frame and the P frame, thereby forming an enhancement layer, and may be added as needed The number of B frames.
2、 SNR分层编码  2. SNR layered coding
基本层的编码质量可以比较低, 从而导致比较低的重够图像的信号噪声 比 ( SNR=Signal-to-Noise Ratio )。 一般来说, 通过增大量化参数 ( QP=Quantization Parameter )进行粗量化, 可以降低基本层的 SNR, 得到较 低的比特率。 而基本层的重构残差被送入增强层进行编码。 在增强层中, 如 果进行帧内编码, 得到 EI帧, 如果进行帧间预测编码, 得到 EP帧。  The coding quality of the base layer can be relatively low, resulting in a relatively low signal-to-noise ratio (SNR = Signal-to-Noise Ratio). In general, coarse quantization by increasing the quantization parameter (QP = Quantization Parameter) can lower the SNR of the base layer and obtain a lower bit rate. The reconstructed residual of the base layer is sent to the enhancement layer for encoding. In the enhancement layer, if intra coding is performed, an EI frame is obtained, and if inter prediction coding is performed, an EP frame is obtained.
3、 空间分层编码  3, spatial layered coding
空间分层编码和 SNR分层编码类似, 只是在由基本层重构残查形成增强 层编码输入数据之前, 要进行空间上的放大, 在视频编码的技术语言中叫做 上采样( Upsampling )。 一般情况,导致图像在水平和垂直方向上各自放大 kh, kv倍。 当然, 一般情况下, kh, kv相等。 否则图像变形, 比例失调。 但是在特 殊应用中, 可以不相等。 典型的应用是把基本层的图像在水平和垂直方向各 自放大 2倍, 如果基本层是 QCIF图像, 那么增强层就是 CIF图像。 Spatial layered coding is similar to SNR layered coding, except that the spatial amplification is performed before the enhancement of the enhancement layer coded input data by the base layer reconstruction. In the technical language of video coding, it is called Upsampling. In general, the image is magnified k h , k v times in the horizontal and vertical directions. Of course, in general, k h , k v are equal. Otherwise the image is distorted and the scale is out of balance. But in special applications, it can be unequal. A typical application is to magnify the image of the base layer by a factor of two in the horizontal and vertical directions. If the base layer is a QCIF image, the enhancement layer is a CIF image.
对应于图 3 , 为了概括以上三种机制, 引入了编码端的分解变换 T, 和解 码端的合成变换1。 只要定义合适的变换 T和 R, 不同的分层编码都适用于 这个基本框架。 比如对于空间分层编码, 分解变换 T就是求差获得重构残差, 然后再进行上采样; 而合成变换 R首先把低层的重构帧进行上采样再和高层 的重构帧进行叠加。 Corresponding to Fig. 3, in order to summarize the above three mechanisms, a decomposition transform T of the encoding end and a synthetic transform 1 of the decoding end are introduced. As long as the appropriate transforms T and R are defined, different layered codes apply to this basic framework. For example, for spatial layered coding, the decomposition transform T is the difference to obtain the reconstructed residual, and then upsampling; and the synthetic transform R first upsamples the low-level reconstructed frame and then the upper layer. The reconstructed frames are superimposed.
以上三种基本的机制, 广泛应用于 ITU-T H .263/H.263+、 H.264、 MPEG-2、 MPEG-4等主要的视频压缩编码标准中。 在这三种基本机制之上, 还有其它变种技术, 比如所谓的 FGS ( Fine Granularity Scalability, 精细粒度 分层编码), 就是在增强层不是按照常规的编码方法, 而是对于运动预测残差 进行 DCT ( Discrete Cosine Transform离散余弦变换 )后, 按照比特平面 ( bit plane ) 方式进行编码, 得到可以为任意位置进行截断都能正常解码的所谓嵌 入式码流, 从而可以为增强层提供非常细的带宽粒度和更好的网络条件适应 性。 但是因为精细粒度分层编码需要计算复杂, 因此其实用性较低。 发明内容  The above three basic mechanisms are widely used in major video compression coding standards such as ITU-T H.263/H.263+, H.264, MPEG-2, and MPEG-4. On top of these three basic mechanisms, there are other variant techniques, such as the so-called FGS (Fine Granularity Scalability), in which the enhancement layer is not in accordance with the conventional coding method, but on the motion prediction residual. After DCT (Discrete Cosine Transform), encoding is performed according to the bit plane method, and a so-called embedded code stream that can be decoded normally for truncation at any position is obtained, thereby providing a very thin bandwidth for the enhancement layer. Granularity and better adaptability to network conditions. However, because fine-grained hierarchical coding requires computational complexity, its practicability is low. Summary of the invention
本发明提供一种视频码流审查方法、 系统和相关设备, 以解决现有技术 中如何基于分层编码技术实现视频码流深度内容审查的问题。  The present invention provides a video code stream review method, system and related device to solve the problem of how to implement video code stream deep content review based on layered coding technology in the prior art.
为解决上述问题, 本发明提供如下技术方案:  In order to solve the above problems, the present invention provides the following technical solutions:
一种视频码流审查方法, 内容源将原始视频码流进行分层编码为基本层 码流和至少一层增强层码流, 在传输所述视频码流时, 所述方法包括如下步 骤:  A video stream review method, the content source layer-encoding the original video code stream into a base layer code stream and at least one layer of enhancement layer code stream. When transmitting the video code stream, the method includes the following steps:
A、将所述基本层码流转发给审查设备, 将所述增强层码流进行延迟处理 后发送给用户接收端;  A, forwarding the basic layer code stream to the review device, delaying the code stream of the enhancement layer, and transmitting the code stream to the user receiving end;
B、 审查设备审查所述基本层码流中是否包含有害内容, 如果否再将该基 本层码流转发给用户接收端, 否则切断向用户接收端转发该基本层码流。  B. The review device checks whether the basic layer code stream contains harmful content, and if the basic layer code stream is forwarded to the user receiving end, the basic layer code stream is forwarded to the user receiving end.
其中: 直播模式时, 所述步骤 A中, 内容源直接发送或转发分层编码的 相应码流; 所述步骤 B中, 审查设备根据需要的视频质量, 通知内容源相应 提高 /降低所述基本层码流的编码质量, 或者在转发该基本层码流的同时, 增 加 /减少转发用于增强审查效果和准确性的增强层码流。  In the live broadcast mode, in the step A, the content source directly sends or forwards the corresponding code stream of the layered code; in the step B, the review device notifies the content source to raise/lower the basic according to the required video quality. The coding quality of the layer code stream, or the forwarding of the base layer code stream, while increasing/decreasing the enhancement layer code stream for enhancing the review effect and accuracy.
存储内容播放模式时, 所述步骤 A中, 所述内容源先将产生的所述基本 层码流和增强层码流分别存储到对应的基本层码流轨道和增强层轨道; 流媒 体服务器从所述基本层码流轨道中读出所述基本层码流并转发给审查设备, 读出所述增强层码流并进行延迟处理后发送给用户终端; 所述步骤 B中, 审 查设备根据需要的视频质量, 通知流媒体服务器在转发该基本层码流的同时, 增加 /减少转发用于增强审查效果和准确性的增强层码流。 When the content play mode is stored, in the step A, the content source first stores the generated base layer code stream and the enhancement layer code stream respectively to the corresponding base layer code stream track and the enhancement layer track; The base server reads the base layer code stream from the base layer code stream track and forwards it to the review device, reads the enhancement layer code stream and performs delay processing, and then sends the result to the user terminal; in step B, the review The device notifies the streaming media server to increase/decrease the enhanced layer code stream for enhancing the review effect and accuracy while forwarding the base layer code stream according to the required video quality.
审查设备的审查方法包括: 解码所述基本层码流图像并输入自动识别装 置, 自动识别装置将预存的有害内容数据库中的有害内容与所述基本层码流 图像中包含的相关内容进行比较分析来进行有害内容的自动识别; 和 /或将所 述基本层码流图像显示给监控者来进行有害内容的人工识别。  The review method of the review device includes: decoding the base layer code stream image and inputting an automatic identification device, and the automatic identification device compares and analyzes the harmful content in the pre-stored harmful content database with the related content included in the base layer code stream image Performing automatic identification of harmful content; and/or displaying the base layer code stream image to a monitor for manual identification of harmful content.
当人工识别和自动识别同时进行时, 如果二者的识别结果出现不一致, 则优先执行自动识别装置或监控者的判决结果。  When the manual recognition and the automatic recognition are simultaneously performed, if the recognition results of the two are inconsistent, the judgment result of the automatic identification device or the monitor is preferentially executed.
或者, 当人工识别和自动识别同时进行时, 由自动识别装置和监控者分 别根据预先设定的规则, 为识别出的有害内容给出相应的有害程度分值, 然 后对两个分值进行加权处理后得到最终执行的判决结果, 当仅收到一方为认 定的有害内容给出的分值时, 默认另一方为该内容给出的分值为零。  Alternatively, when the manual identification and the automatic recognition are simultaneously performed, the automatic identification device and the monitor respectively give corresponding harmful degree scores for the identified harmful content according to a preset rule, and then weight the two scores. After processing, the final execution result is obtained. When only the score given by one party for the identified harmful content is received, the default value given by the other party for the content is zero.
具体所述的加权处理方法为:  The specific weighting processing method is as follows:
S尸 ( WMXSM+WHXSH ) I ( WM+WH ) S corpse ( W M XSM+W H XSH ) I ( WM+WH )
其中, WM和 WH表示自动识别装置和监控者的权值, WM和 WH之间的 相对大小表示了对识别结果的信任度, SM和 SH分别自动识别装置和监控者给 出的分数, 如果 大于一个给定值, 则判决结果为有害, 否则判决结果为无 害, WM 、 WH和给定值分别根据经验值设定。 Wherein, W M and W H represent the weights of the automatic identification device and the monitor, and the relative size between W M and W H represents the degree of trust in the recognition result, and the S M and the S H respectively automatically identify the device and the monitor to give If the score is greater than a given value, the judgment result is harmful, otherwise the judgment result is harmless, and W M , W H and the given value are respectively set according to the empirical value.
所述方法中, 审查设备切断转发所述基本层码流的同时, 启动转发备用 无害视频码流。  In the method, the reviewing device turns off and forwards the basic layer code stream, and starts forwarding the standby harmless video code stream.
所述方法还同时包括: 审查设备录制并保存指定时段的基本层码流和用 于增强审查效果和准确性的增强层码流。  The method also includes: reviewing the device recording and saving the base layer code stream for the specified time period and the enhancement layer code stream for enhancing the review effect and accuracy.
所述方法还同时包括: 将有害内容的识别情况记录在日志中并生成曰志 报告。  The method also includes: recording the identification of the harmful content in a log and generating a report.
所述方法中, 用户接收端接收到所有增强层码流和基本层码流时, 解码 重构所述原始视频码流。 In the method, when the user receiving end receives all the enhancement layer code streams and the base layer code stream, decoding Reconstructing the original video stream.
当基于空间分层视频编码方法时, 内容源先 居设定比例对原始视频码 流进行缩小处理后 , 再进行分层编码。  When the spatial layered video coding method is based on the content source, the original video code stream is reduced by a set ratio, and then layered coding is performed.
本发明还提供一种视频码流审查系统, 包括: 视频输入设备、 视频编码 设备和流媒体服务器, 还包括审查设备;  The present invention also provides a video stream review system, including: a video input device, a video encoding device, and a streaming server, and a review device;
所述视频编码设备将视频输入设备采集的原始视频码流的分层编码为基 本层码流和至少一个增强层码流, 并将所述基本层码流通过通信网络转发给 审查设备, 同时将所有增强层码流经进行延迟处理后通过通信网络发送给用 户接收端; 审查设备审查所述基本层码流中是否包含有害内容, 如果否再通 过通信网络将该基本层码流转发给用户接收端, 否则切断向用户接收端转发 该基本层码流。  The video encoding device hierarchically encodes the original video code stream collected by the video input device into a base layer code stream and at least one enhancement layer code stream, and forwards the basic layer code stream to the review device through the communication network, and simultaneously All the enhancement layer code streams are sent to the user receiving end through the communication network after the delay processing; the reviewing device checks whether the basic layer code stream contains harmful content, and if not, forwards the basic layer code stream to the user through the communication network. End, otherwise cut off to forward the base layer code stream to the user receiver.
其中, 所述视频编码设备包括:  The video encoding device includes:
编码器, 编码所述基本层码流和增强层码流, 将所述基本层码流直接转 发给审查设备;  An encoder that encodes the base layer code stream and the enhancement layer code stream, and forwards the base layer code stream directly to the reviewing device;
第一延迟模块, 将所述增强层码流进行延迟处理后发送给用户接收端。 所述审查设备还包括: 通信模块, 通信连接视频编码设备, 审查设备根 据需要的视频质量, 通知视频编码设备相应提高 /降低所述基本层码流的编码 质量, 或者在转发该基本层码流的同时, 增加 /减少转发用于增强审查效果和 准确性的增强层码流, 并同时将用于增强审查效果和准确性的增强层码流进 行延迟处理后转发给用户接收端。  The first delay module delays the enhancement layer code stream and sends the stream to the user receiving end. The review device further includes: a communication module, the communication connection video coding device, the review device notifying the video coding device to correspondingly improve/reduce the coding quality of the basic layer code stream, or forwarding the basic layer code stream according to the required video quality At the same time, the enhancement layer code stream for enhancing the review effect and accuracy is added/reduced, and the enhancement layer code stream for enhancing the review effect and accuracy is delayed and forwarded to the user receiving end.
所述视频码流审查系统还包括: 视频内容数据库, 保存所述基本层码流 和增强层码流, 其中, 所述基本层码流保存在设定的基本层码流轨道中, 所 述增强层码流保存在设定的增强层码流轨道中; 所述流媒体服务器包括: 码 流读取模块和第二延迟模块, 码流读取模块从所述基本层码流轨道中读出所 述基本层码流并转发给审查设备, 从所述增强层码流轨道中读取增强层码流 并通过所述第二延迟模块进行延迟处理后发送给用户接收端。  The video stream review system further includes: a video content database, where the base layer code stream and the enhancement layer code stream are saved, wherein the base layer code stream is saved in a set base layer code stream track, and the enhancement The layer code stream is stored in the set enhancement layer code stream track; the streaming media server comprises: a code stream reading module and a second delay module, wherein the code stream reading module reads out from the base layer code stream track The base layer code stream is forwarded to the reviewing device, and the enhancement layer code stream is read from the enhancement layer code stream track and sent to the user receiving end by performing delay processing by the second delay module.
所述审查设备还包括: 通信模块, 通信连接视频编码设备流媒体服务器, 审查设备根据需要的视频质量, 通知流媒体服务器在转发该基本层码流的同 时, 增加 /减少转发用于增强审查效果和准确性的增强层码流, 并同时将用于 增强审查效果和准确性的增强层码流进行延迟处理后转发给用户接收端。 The review device further includes: a communication module, a communication connection video encoding device streaming media server, The reviewing device notifies the streaming media server to increase/decrease the enhanced layer code stream for enhancing the review effect and accuracy according to the required video quality, and at the same time, is used to enhance the review effect and accuracy. The enhancement layer code stream is delayed and forwarded to the user receiver.
所述审查设备包括:  The review device includes:
第三延迟模块, 对接收到的所有码流进行延迟处理后转发给用户接收端; 审查模块, 审查接收到的所有码流, 在审查出该码流中包含有害内容时 输出相应的控制信号;  The third delay module delays processing all the received code streams and forwards them to the user receiving end; the review module reviews all the received code streams, and outputs corresponding control signals when it is detected that the code stream contains harmful content;
开关模块, 连接在所述第三延迟模块之后;  a switch module connected behind the third delay module;
控制模块, 连接在所述审查模块和开关模块之间, 根据所述控制信号触 发断开所述开关模块。  And a control module, connected between the review module and the switch module, triggering disconnection of the switch module according to the control signal.
所述审查模块中具体包括人工识别子模块和 /或自动识别子模块, 其中: 所述人工识别子模块中具体包括: 解码单元、 增强处理单元、 显示单元 和指令接收单元, 所述解码单元解码接收到的所有码流图像, 所述图像通过 增强处理单元处理后在显示单元上显示给监控者, 监控者人工识别出该图像 中包含有害内容时, 通过所述指令接收单元触发所述主控单元输出所述控制 信号;  The review module specifically includes a manual identification sub-module and/or an automatic identification sub-module, where: the manual identification sub-module specifically includes: a decoding unit, an enhancement processing unit, a display unit, and an instruction receiving unit, where the decoding unit decodes All the code stream images received, the image is processed by the enhancement processing unit and displayed on the display unit to the monitor, and when the monitor manually recognizes that the image contains harmful content, the instruction is triggered by the instruction receiving unit. The unit outputs the control signal;
所述自动识别子模块中具体包括: 自动识别单元和有害内容数据库, 自 动识别单元根据所述有害内容数据库中的有害内容与接收到的所有码流中包 含的相关内容进行对比分析来进行有害内容的自动识别, 并在识别出有害内 容时自动触发所述主控单元输出所述控制信号。  The automatic identification sub-module specifically includes: an automatic identification unit and a harmful content database, and the automatic identification unit performs comparative analysis according to the harmful content in the harmful content database and the related content included in all the received code streams to perform harmful content. Automatic identification, and automatically triggering the main control unit to output the control signal when identifying harmful content.
当所述审查模块中同时包括人工识别子模块和自动识别子模块时, 该审 查模块中还包括: 审查方式切换模子块, 根据控制模块的控制选择启动所述 人工识别子模块和 /或自动识别子模块; 以及判决子模块, 同时接收所述指令 接收单元和自动识别单元的输出信号, 并根据设定的判决原则判定是否触发 所述主控单元输出所述控制信号。  When the review module includes both the manual identification sub-module and the automatic identification sub-module, the review module further includes: a review mode switching module block, and the manual identification sub-module and/or automatic identification is started according to the control option of the control module. a sub-module; and a decision sub-module, receiving an output signal of the instruction receiving unit and the automatic identification unit at the same time, and determining whether to trigger the main control unit to output the control signal according to a set decision rule.
所述审查模块中还包括以下主要结构:  The review module also includes the following main structures:
判决原则存储子模块, 连接在所述判决子模块和控制模块之间, 存储通 过控制模块输入的判决原则。 a decision principle storage submodule connected between the decision submodule and the control module The decision principle entered by the control module.
内容录制模块, 连接所述第三延迟模块, 用于录制指定时段该第三延迟 模块输出的所有码流。  The content recording module is connected to the third delay module, and is configured to record all the code streams output by the third delay module in a specified time period.
所述审查设备还包括:  The review device further includes:
日志记录模块, 分别连接其它模块或子模块, 用于生成并输出所述审查 设备的运行状态曰志;  The logging module is respectively connected to other modules or sub-modules for generating and outputting the running status of the reviewing device;
决策支持知识库, 连接控制模块, 用于存储特定或临时的有害内容文字、 人脸图像以及相关的法律法规文件。  Decision support knowledge base, connection control module, used to store specific or temporary harmful content text, face images and related legal and regulatory documents.
替换片源库, 连接在所述开关模块, 存储无害备用视频内容。  The chip source library is replaced, connected to the switch module, and the harmless spare video content is stored.
本发明同时还提供用于上述系统的一种视频编码设备、 流媒体服务器和 审查设备。  The present invention also provides a video encoding apparatus, a streaming server, and a reviewing apparatus for the above system.
本发明的有益效果如下:  The beneficial effects of the present invention are as follows:
本发明技术方案利用了分层编码机制的码流多层结构中基本层和增强层 的关系, 产生审查用码流, 审查用码流包括基本层码流, 或者还可以包括部 分增强层码流, 通过专用的审查设备审查并转发审查用码流, 该专用审查设 备在审查出审查用码流中包含有害内容时, 切断向用户转发审查用码流, 用 户端的无法解码重构原始视频流, 从而达到控制有害内容传播的目的; 基于 分层编码技术形成的审查用码流的数据量较少, 不会因为转发过程对系统造 成负担;  The technical solution of the present invention utilizes the relationship between the base layer and the enhancement layer in the code stream multi-layer structure of the layered coding mechanism to generate a code stream for review, and the code stream for review includes a base layer code stream, or may further include a part of the enhancement layer code stream. Examining and forwarding the code stream for review through a dedicated review device. When the private review device includes reviewing the code stream for review, the dedicated review device cuts off the code stream for review to the user, and the client cannot decode and reconstruct the original video stream. Therefore, the purpose of controlling the transmission of harmful content is achieved; the data volume of the review code stream formed based on the layered coding technology is small, and the system does not burden the forwarding process;
进一步, 可以在分层编码中, 灵活设置各层的比特率, 只要总的比特率 达到一定水平, 重构的视频码流的质量就能达到较高水平。 因此, 为了降低 审查用码流造成的开销, 可以将基本层码流设置得较低, 已达到进一步减轻 系统负担的效果;  Further, in the layered coding, the bit rate of each layer can be flexibly set, and the quality of the reconstructed video bitstream can reach a higher level as long as the total bit rate reaches a certain level. Therefore, in order to reduce the overhead caused by the code stream for review, the base layer code stream can be set lower, and the effect of further reducing the system load has been achieved;
本发明技术方案可以分别用于直播模式和存储内容播放模式, 为现有基 于分层编码技术实现的流媒体业务提供了通用的深度内容审查方案, 保证了 目前 IPTV、 数字电视等流媒体业务的内容安全, 有效防止了有害内容通过流 媒体业务的扩散。 附图说明 The technical solution of the present invention can be used for the live broadcast mode and the stored content play mode, respectively, and provides a general deep content review scheme for the existing streaming media service based on the layered coding technology, thereby ensuring the current streaming media services such as IPTV and digital television. Content security, effectively preventing the spread of harmful content through streaming media services. DRAWINGS
图 1为现有基于内容的 URL进行过滤的原理示意图;  FIG. 1 is a schematic diagram showing the principle of filtering based on a content-based URL;
图 2为现有流媒体内容的深度审查原理示意图;  2 is a schematic diagram of the principle of deep review of existing streaming media content;
图 3为分层视频编码原理示意图;  3 is a schematic diagram of a layered video coding principle;
图 4为实现本发明技术构思的审查系统结构示意图;  4 is a schematic structural diagram of a review system for implementing the technical idea of the present invention;
图 5和图 6为本发明所述审查设备的一种结构示意图;  FIG. 5 and FIG. 6 are schematic diagrams showing a structure of the review device according to the present invention;
图 7为本发明技术方法基于空间分层编码技术实现时的分层编码原理示 意图。 具体实施方式  FIG. 7 is a schematic diagram of a layered coding principle when the technical method of the present invention is implemented based on a spatial layered coding technique. detailed description
本发明基于现有分层编码技术。 通过基本层来产生审查码流, 由审查设 备对于审查码流进行审查, 审查没有问题时, 审查设备再向接收用户接收端 转发该基本层码流, 增强层码流则不经过审查设备, 直接发送给用户接收端, 用户终端接收到所有基本层加上增强层后, 可以进行解码重构总的视频码流 并收看节目, 用户终端收不到基本层, 就无法解码重构收看节目。  The present invention is based on existing layered coding techniques. The review code stream is generated by the basic layer, and the review device checks the review code stream. When there is no problem in the review, the review device forwards the base layer code stream to the receiving user receiving end, and the enhancement layer code stream is not subjected to the review device. After being sent to the receiving end of the user, after receiving all the base layers and the enhancement layer, the user terminal can decode and reconstruct the total video code stream and watch the program. If the user terminal does not receive the basic layer, the reconstructed viewing program cannot be decoded.
实现本发明技术构思的审查系统结构示意图如图 4所示, 主要包括内容 源的相关设备、 内容数据库、 流媒体服务器。 审查设备和用户接收端设备, 其中:  The schematic diagram of the structure of the review system for implementing the technical idea of the present invention is as shown in FIG. 4, and mainly includes a related device of a content source, a content database, and a streaming media server. Review devices and user sink devices, where:
实线部分为直播模式的审查实现流程, 直播模式下, 内容源直接将审查 用的基本码流直接发送给通过流媒体服务器发送给内容审查设备, 剩余增强 码流延迟发送给用户接收端, 审查设备审查基本码流中不包含有害内容时, 再将基本码流转发给用户接收端, 如果基本码流中不包含有害内容, 则不将 基本码流转发给用户接收端, 这样, 用户接收端收不到基本码流时, 无法解 码重构原始视频码流。 这一过程中, 审查设备可以根据需要的视频质量, 通 知内容源相应提高 /降低基本层码流的编码质量, 或者在转发该基本层码流的 同时,增加 /减少转发用于增强审查效果和准确性的增强层码流。如果是后者, 相应用于审查的码流中还包括一部分增强码流。 The solid line part is the review implementation process of the live broadcast mode. In the live broadcast mode, the content source directly sends the basic code stream for review directly to the content review device through the streaming media server, and the remaining enhanced code stream is delayed and sent to the user receiving end for review. When the device review basic code stream does not contain harmful content, the basic code stream is forwarded to the user receiving end. If the basic code stream does not contain harmful content, the basic code stream is not forwarded to the user receiving end, so that the user receiving end When the elementary stream is not received, the reconstructed original video stream cannot be decoded. In this process, the reviewing device can notify the content source to increase/decrease the encoding quality of the base layer code stream according to the required video quality, or increase/decrease forwarding for enhancing the review effect and while forwarding the base layer code stream. Accuracy layer code stream. if the latter one, A portion of the enhanced code stream is also included in the code stream for review accordingly.
虚线部分为存储内容播放模式的审查实现流程, 存储内容播放模式下, 内容源先将产生的基本层码流和增强层码流分別存储到对应的基本层码流轨 道和增强层轨道; 流媒体服务器在播放该视频内容时, 从基本层码流轨道中 读出基本层码流并转发给审查设备, 读出增强层码流并进行延迟处理后发送 给用户终端; 同样, 这一过程中, 审查设备根据需要的视频盾量, 通知流媒 体服务器在转发该基本层码流的同时, 增加 /减少转发用于增强审查效果和准 确性的增强层码流, 相应用于审查的码流中还包括一部分增强码流。  The dotted line part is a review implementation process for storing the content play mode. In the storage content play mode, the content source first stores the generated base layer code stream and the enhancement layer code stream respectively to the corresponding base layer code stream track and the enhancement layer track; When playing the video content, the server reads the basic layer code stream from the base layer code stream track and forwards it to the reviewing device, reads the enhancement layer code stream and performs delay processing, and then sends the result to the user terminal; likewise, in the process, The reviewing device notifies the streaming media server to forward/decrease the enhancement layer code stream for enhancing the review effect and accuracy according to the required video shield quantity, and correspondingly used in the code stream for reviewing Includes a portion of the enhanced code stream.
下面详细说明该系统主要包括的如下几部分设备的具体结构:  The following is a detailed description of the specific structure of the following parts of the system:
一、 内容源  First, the content source
内容源设备包括采集视频码流的视频输入设备和视频编码设备, 该视频 输入设备一般为摄像机, 本发明的技术构思基于分层编码技术实现, 因此视 频编码设备的基于分层编码技术对原始视频码流编码为基本码流和至少一层 增强码流。  The content source device includes a video input device and a video encoding device for collecting a video code stream, and the video input device is generally a camera. The technical idea of the present invention is implemented based on a layered coding technology, so the video encoding device is based on a layered coding technology for the original video. The code stream is encoded as a base stream and at least one layer of enhanced code.
仍参阅图 4所示, 视频编码设备的主要包括以下结构:  Still referring to FIG. 4, the video encoding device mainly includes the following structure:
编码器, 编码器将原始视频流分层编码为基本层码流和至少一层增强层 码流;  An encoder, the encoder layering the original video stream into a base layer code stream and at least one layer of enhancement layer code stream;
直播模式时, 直接通过通信网络向流媒体服务器输出基本层码流, 通过 第一延迟模块经进行延迟处理后输出增强层码流; 或者向流媒体服务器直接 输出基本层码流和部分增强层码流, 通过第一延迟模块进行延迟处理后输出 剩余的增强层码流。  In the live broadcast mode, the basic layer code stream is directly outputted to the streaming media server through the communication network, and the enhancement layer code stream is output after the delay processing is performed by the first delay module; or the base layer code stream and the partial enhancement layer code are directly output to the streaming media server. The stream is subjected to delay processing by the first delay module to output the remaining enhancement layer code stream.
流媒体服务器将审查用的码流转发给审查设备, 将非审查用的码流直接 发送给用户接收端。  The streaming server forwards the code stream for review to the review device, and sends the code stream for non-review directly to the user receiver.
当然, 直播模式下, 可以同时将视频内容存储到内容数据库中, 存放方 法为: 将基本码流存放在设定的基本层码流轨道中, 将增强层码流存放在设 定的增强层层码流轨道中。  Of course, in the live mode, the video content can be stored in the content database at the same time, and the storage method is as follows: storing the basic code stream in the set basic layer code stream track, and storing the enhancement layer code stream in the set enhancement layer layer. In the code stream track.
二、 流媒体服务器 在直播模式下, 流媒体服务器仅承担转发功能, 不进行延迟处理, 在播 放存储内容模式下, 流媒体服务器需要对非审查用的进行延迟处理。 Second, streaming media server In the live broadcast mode, the streaming media server only performs the forwarding function and does not perform the delay processing. In the playback storage content mode, the streaming media server needs to delay the non-reviewing.
仍参阅图 4所示, 流媒体服务器包括视频码流读取模块, 还包括连接视 频码流读取模块的第二延迟模块, 视频码流读取模块从外部连接的内容数据 库中读取基于分层编码的基本层码流和至少一层增强层码流, 然后:  Still referring to FIG. 4, the streaming media server includes a video stream reading module, and further includes a second delay module connected to the video stream reading module, and the video stream reading module reads the score based on the externally connected content database. Layer-coded base layer code stream and at least one layer of enhancement layer code stream, then:
直接输出基本层码流, 通过第二延迟模块经进行延迟处理后输出增强层 码流; 或者  Directly outputting the base layer code stream, and performing delay processing on the second delay module to output the enhancement layer code stream; or
直接输出基本层码流和部分增强层码流, 通过第二延迟模块经进行延迟 处理后输出剩余的增强层码流。  The base layer code stream and the part of the enhancement layer code stream are directly output, and the remaining enhancement layer code stream is output after the delay processing is performed by the second delay module.
三、 审查设备  Third, review equipment
如图 5 , 基于分层编码实现视频码流内容审查的审查设备, 包括: 通信模 块, 通信连接其他网络设备; 还包括:  As shown in FIG. 5, a review device for implementing video stream content review based on layered coding includes: a communication module, and a communication connection with other network devices;
第三延迟模块, 接收基本层码流、 或者基本层码流和部分增强层码流, 并对接收到的所有码流进行延迟处理后转发给用户接收端;  The third delay module receives the base layer code stream, or the base layer code stream and the part of the enhancement layer code stream, and delays processing all the received code streams and forwards them to the user receiving end;
审查模块, 审查接所有延迟转发的码流, 在审查出码流中包含有害内容 时输出相应的控制信号;  The review module, which examines all delayed forwarded code streams, and outputs corresponding control signals when the detected code stream contains harmful content;
开关模块, 连接在第三延迟模块之后;  a switch module connected behind the third delay module;
控制模块, 连接在审查模块和开关模块之间, 根据控制信号触发断开开 关模块。  The control module is connected between the review module and the switch module, and triggers the disconnection switch module according to the control signal.
如图 6所示,审查模块中具体包括人工识别子模块和 /或自动识别子模块: 人工识别子模块中具体包括: 解码单元、 增强处理单元、 显示单元和指 令接收单元, 解码单元解码接收到的所有码流图像。 通过增强处理单元处理 后在显示单元上显示给监控者, 监控者人工识别出该图像中包含有害内容时, 通过指令接收单元触发主控单元输出控制信号;  As shown in FIG. 6, the review module specifically includes a manual identification submodule and/or an automatic identification submodule: the manual identification submodule specifically includes: a decoding unit, an enhancement processing unit, a display unit, and an instruction receiving unit, and the decoding unit decodes and receives All codestream images. After being processed by the enhancement processing unit and displayed on the display unit to the monitor, when the monitor manually recognizes that the image contains harmful content, the instruction receiving unit triggers the main control unit to output a control signal;
自动识别子模块中具体包括: 自动识别单元和有害内容数据库, 自动识 别单元根据有害内容数据库中的有害内容与接收到的所有码流中包含的相关 内容进行对比分析来进行有害内容的自动识别, 并在识别出有害内容时自动 触发主控单元输出控制信号。 The automatic identification sub-module specifically includes: an automatic identification unit and a harmful content database, and the automatic identification unit performs automatic identification of the harmful content according to the comparative analysis of the harmful content in the harmful content database and the related content included in all the received code streams, And automatically identify harmful content Trigger the main control unit to output a control signal.
当审查模块中同时包括人工识别子模块和自动识别子模块时, 该审查模 块中还包括:  When the review module includes both the manual identification submodule and the automatic identification submodule, the review module further includes:
审查方式切换子模块, 根据控制模块的控制选择启动人工识别子模块和 / 或自动识别子模块;  The review mode switching sub-module, and starting the manual identification sub-module and/or the automatic identification sub-module according to the control of the control module;
判决子模块, 同时接收指令接收单元和自动识别单元的输出信号, 并根 据设定的判决原则判定是否触发控制模块输出控制信号;  The decision sub-module receives the output signals of the instruction receiving unit and the automatic identification unit at the same time, and determines whether to trigger the control module to output the control signal according to the set decision principle;
判决原则存储子模块, 连接在判决模块和控制模块之间, 存储通过控制 模块输入的判决原则;  The decision principle storage submodule is connected between the decision module and the control module, and stores the decision principle input through the control module;
决策支持知识库, 连接控制模块, 用于存储特定或临时的有害内容文字、 人脸图像, 例如刚收到的恐怖分子图像等, 以及相关的发了法规文件。  Decision support knowledge base, connection control module, used to store specific or temporary harmful content text, face images, such as terrorist images just received, and related regulatory documents.
审查设备还包括: 内容录制模块, 连接第三延迟模块的输出端, 用于录 制指定时段该第三延迟模块输出的所有码流。  The review device further includes: a content recording module, connected to the output end of the third delay module, for recording all the code streams output by the third delay module in a specified time period.
审查设备还包括: 日志记录模块, 分別连接其它模块或子模块, 用于生 成并输出审查设备的运行状态日志。  The review device further includes: a logging module, which is respectively connected to other modules or sub-modules for generating and outputting an operation status log of the review device.
审查设备还包括: 替换片源库, 连接在开关模块, 存储无害备用视频内 容。  The review device also includes: replacing the source library, connected to the switch module, and storing the harmless spare video content.
上述视频编码设备、 流媒体服务器和审查设备组成的审查系统如图 4所 示, 内容源将原始视频码流进行分层编码为基本层码流和至少一层增强层码 流, 在直播模式下, 本发明技术方案实施例的流程包括如下步骤:  The review system consisting of the above video coding device, streaming media server and review device is shown in FIG. 4, and the content source hierarchically encodes the original video code stream into a base layer code stream and at least one layer of enhancement layer code stream, in a live mode. The process of the embodiment of the technical solution of the present invention includes the following steps:
1、 内容源将基本层码流转发给审查设备, 将增强层码流进行延迟处理后 发送给用户接收端;  1. The content source forwards the basic layer code stream to the reviewing device, delays processing the enhancement layer code stream, and sends the stream to the user receiving end;
2、 审查设备审查基本层码流中是否包含有害内容, 如果否再将该基本层 码流转发给用户接收端, 否则切断向用户接收端转发该基本层码流。  2. Examine the device to check whether the basic layer code stream contains harmful content. If the base layer code stream is forwarded to the user receiving end, the basic layer code stream is forwarded to the user receiving end.
并且, 直播模式时, 审查设备根据需要的视频质量, 通知内容源相应提 高 /降低基本层码流的编码质量, 或者在转发该基本层码流的同时, 增加 /减少 转发用于增强审查效果和准确性的增强层码流。 在存储内容播放模式下, 内容源先将产生的基本层码流和增强层码流分 别存储到对应的基本层码流轨道和增强层轨道; 流媒体服务器从基本层码流 轨道中读出基本层码流并转发给审查设备, 读出增强层码流并进行延迟处理 后发送给用户终端; Moreover, in the live mode, the review device notifies the content source to increase/decrease the encoding quality of the base layer code stream according to the required video quality, or increases/decreases the forwarding for enhancing the review effect while forwarding the base layer code stream. Accuracy layer code stream. In the storage content play mode, the content source first stores the generated base layer code stream and the enhancement layer code stream respectively to the corresponding base layer code stream track and the enhancement layer track; the streaming media server reads the basic from the base layer code stream track. Layer code stream is forwarded to the review device, and the enhancement layer code stream is read out and delayed processing is sent to the user terminal;
审查设备可以根据需要的视频质量, 通知流媒体服务器在转发该基本层 码流的同时, 增加 /减少转发用于增强审查效果和准确性的增强层码流。  The reviewing device can notify the streaming server to forward/decrease the layer code stream, while increasing/decreasing the enhancement layer stream for enhancing the review effect and accuracy.
其中, 审查设备的审查方法包括:  Among them, the review methods for reviewing equipment include:
解码基本层码流图像并输入自动识别装置, 自动识别装置将预存的有害 内容数据库中的有害内容与基本层码流图像中包含的相关内容进行比较分析 来进行有害内容的自动识别; 和 /或  Decoding the base layer code stream image and inputting the automatic identification device, the automatic identification device compares and analyzes the harmful content in the pre-stored harmful content database with the related content contained in the base layer code stream image to perform automatic identification of the harmful content; and/or
将基本层码流图像显示给监控者来进行有害内容的人工识别。  The base layer stream image is displayed to the monitor for manual identification of the harmful content.
当人工识别和自动识别同时进行时, 如果二者的识别结果出现不一致, 则优先执行自动识别装置或监控者的判决结果。  When the manual recognition and the automatic recognition are simultaneously performed, if the recognition results of the two are inconsistent, the judgment result of the automatic identification device or the monitor is preferentially executed.
当人工识別和自动识别同时进行时, 由自动识别装置和监控者分别根据 预先设定的规则, 为识别出的有害内容给出相应的有害程度分值, 然后对两 个分值进行加权处理后得到最终执行的判决结果, 当仅收到一方为认定的有 害内容给出的分值时, 默认另一方为该内容给出的分值为零。  When the manual identification and the automatic recognition are simultaneously performed, the automatic identification device and the monitor respectively give corresponding harmful degree scores for the identified harmful content according to a preset rule, and then weight the two scores. After the final execution of the judgment result, when only the score given by one party for the identified harmful content is received, the default score given by the other party for the content is zero.
上述加权处理方法为:  The above weighting method is:
S = ( WMXSM+WHXSH ) I ( WM+WH ) S = ( W M XS M +W H XSH ) I ( WM+WH )
其中, WM和 WH表示自动识别装置和监控者的权值, WM和 WH之间的 相对大小表示了对识别结果的信任度, SM和 SH分别自动识别装置和监控者给 出的分数, 如果 8Ϊ大于一个给定值, 则判决结果为有害, 否则判决结果为无 害, WM 、 WH和给定值分别根据经验值设定。 Wherein, W M and W H represent the weights of the automatic identification device and the monitor, and the relative size between W M and W H represents the degree of trust in the recognition result, and the S M and the S H respectively automatically identify the device and the monitor to give If the score is greater than a given value, the judgment result is harmful, otherwise the judgment result is harmless, and W M , W H and the given value are respectively set according to the empirical value.
本发明中, 有害内容至少包括如下之一: 有害图像、 有害叠加文字或符 号、 特定人脸图像。  In the present invention, the harmful content includes at least one of the following: harmful images, harmful superimposed characters or symbols, and specific face images.
本发明中, 审查设备切断转发基本层码流的同时, 启动转发备用无害视 频码流。 审查设备还可以录制并保存指定时段的基本层码流和用于增强审查效果 和准确性的增强层码流; 并将有害内容的识别情况记录在日志中并生成曰志 报告。 In the present invention, when the reviewing device cuts off and forwards the base layer code stream, it starts forwarding the standby harmless video code stream. The review device can also record and save the base layer stream for a specified time period and an enhancement layer stream for enhanced review effectiveness and accuracy; and record the identification of harmful content in a log and generate a report.
当基于空间分层视频编码方法时, 内容源先根据设定比例对原始视频码 流进行缩小处理后, 再进行分层编码。  When the spatial layered video coding method is used, the content source first reduces the original video code stream according to the set ratio, and then performs layered coding.
本发明技术方案主要基于现有三种主流分层编码技术实现:  The technical solution of the present invention is mainly implemented based on the existing three mainstream layered coding technologies:
1、 基于时间分层视频编码方法  1. Time-based layered video coding method
对于基本层设置比较低的帧率, 比如 5帧每秒(5fps ), 按照帧内编码模 式和帧间预测编码模式进行编码产生基本层, 然后再按照双向预测模式编码, 产生增强层。 使得增强层加上基本层达到 25φ3 (对于 PAL制式), 30φ3 (对 于 NTSC制式)。 For the base layer, a relatively low frame rate is set, for example, 5 frames per second (5 fps), and the base layer is encoded according to the intra coding mode and the inter prediction coding mode, and then encoded in the bidirectional prediction mode to generate an enhancement layer. The enhancement layer plus the base layer is 25φ 3 (for PAL), 30φ3 (for NTSC).
2、 基于 SNR分层视频编码方法  2, based on SNR layered video coding method
控制基本层编码的量化参数等影响因素,使得基本层的比特率低于 Rb(比 如 128kbps ), 再形成第一增强层, 或者叫做主增强层 (Prime Enhancement Layer )。 比特率为 Rpe (比如 384kbps ), 然后在主增强层基础上, 在形成若干 个辅助增强层 ( Secondary Enhancement Layer ) , 起比特率分别为 RselInfluencing factors such as quantization parameters of the base layer coding are controlled such that the bit rate of the base layer is lower than R b (for example, 128 kbps), and then the first enhancement layer is formed, or is called a Prime Enhancement Layer. The bit rate is R pe (for example, 384 kbps), and then based on the main enhancement layer, a plurality of Secondary Enhancement Layers are formed, and the bit rates are respectively R sel ,
Rse2、 、 Rsek 。 Rse2, Rsek.
因为基本层在本方面中, 仅仅是为了提供该节目的审查, 只要能够看清 图像轮廓等基本信息就可以了, 比特率要尽可能低, 用户单独接收到基本层 无法满足收看节目要求, 因此必须结合主增强层才能正常收看节目。 当然如 果网络带宽充裕, 用户可以接受更多甚至全部增强层, 观看高清晰度的节目。 在以上的例子中, 可以设置 Rb=128kbps, Rpe=384kbps, Rsel=512kbps。 那么如 果用户是 ADSL ( Asymmetric Digital Subscriber Line )接入宽带用户, 可以保 证 512kbps, 则可以接收到基本层和主增强层。 如果用户是 Ethernet接入宽带 用户, 则可以接收基本层和全部增强层, 得到最好的视频质量。 Because the basic layer is in this aspect only to provide review of the program, as long as the basic information such as the outline of the image can be seen, the bit rate should be as low as possible, and the user simply receives the basic layer and cannot meet the program requirements. The main enhancement layer must be combined to allow normal viewing of the program. Of course, if the network bandwidth is sufficient, users can accept more or even all enhancement layers to watch high-definition programs. In the above example, R b = 128 kbps, R pe = 384 kbps, and R sel = 512 kbps can be set. Then, if the user is an ADSL (Asymmetric Digital Subscriber Line) access broadband user, and can guarantee 512 kbps, the base layer and the main enhancement layer can be received. If the user is an Ethernet access broadband user, the base layer and all enhancement layers can be received for the best video quality.
3、 基于空间分层视频编码方法  3, based on spatial layered video coding method
为了降低基本层的比特率, 在基于空间分层编码的审查码流产生方案中, 本发明首先对于原始视频数据流进行缩小, 技术术语叫做下采样In order to reduce the bit rate of the base layer, in the scheme of generating a code stream based on spatial layered coding, The invention first reduces the original video data stream, and the technical term is called downsampling.
( Down-sampling ), 导致图像在水平和垂直方向上各自缩小 kh, kv倍。 这样 做的好处是进一步降低审查码流的比特率, 比如在 S R分层情况下, 如果 128kbps是能够提供审查需要的视频质量的比特率,如果我们在空间分层编码 中, 首先把图像降低到原来 1/4大小 (水平和垂直方向各自降低 2倍), 那么 应该在 32kbps左右就可以提供审查需要的视频质量了。 原理如图 7所示。 其 中变换 U表示求差后再进行上釆样。 变换 D表示下采样。 (Down-sampling), causing the image to shrink by k h , k v times in the horizontal and vertical directions. The advantage of this is to further reduce the bit rate of the review code stream. For example, in the case of SR layering, if 128kbps is the bit rate capable of providing the video quality required for the review, if we are in spatial layered coding, first reduce the image to Originally 1/4 size (two times horizontally and vertically), then the video quality required for review should be available at around 32 kbps. The principle is shown in Figure 7. The transformation U represents the difference and then the upper sample. Transform D represents downsampling.
在流媒体业务中, 两种最重要的业务服务模式是存储内容( storage )的播 放和现场直播(live )。  In the streaming media business, the two most important business service modes are storage and live broadcast of storage.
1、 存储内容播放情况  1, storage content playback
视频内容事先经过压縮编码以视频流媒体文件形式存储在内容数据库 中, 在视频文件内部, 基本层码流和增强层码流可以分别放置在两个不同的 媒体轨道上, 原理参阅图 4所示, 首先由视频编码设备产生基本层编码码流 和各个增强层编码码流, 然后写入到视频文件并存储到内容数据库中。  The video content is compressed and encoded in the form of a video streaming file in a content database. Within the video file, the base layer stream and the enhancement layer stream can be respectively placed on two different media tracks. For the principle, see FIG. It is shown that the base layer coded code stream and the respective enhancement layer coded code streams are first generated by the video encoding device, then written to the video file and stored in the content database.
在需要播放时, 流媒体服务器从视频文件中分别读取审查轨道和增强层 轨道的视频数据, 然后将增强层数据通过一个延迟时间可以设定的延迟环节, 在发送到网络上去, 而审查码流直接发送, 不需要经过延迟。  When playback is required, the streaming server reads the video data of the review track and the enhancement layer track from the video file, and then transmits the enhancement layer data to the network through a delay link that can be set by the delay time, and the review code Streams are sent directly without delay.
流媒体服务器( streaming server )从文件中读出数据, 按照文件中的辅助 信息进行打包和发送这些包, 辅助信息一般提供关于打包的规则, 和文件中 多个媒体轨道( media track, 比如视频是一个轨道, 音频一个轨道, 文本一个 轨道等等)之间的同步信息,从而流媒体服务器知道在任何一个特定时间段 [t, t+At]内应该如何从哪些轨道上读出哪些和多少数据进行打包和发送。 这种情 况下, 视频编码器负责进行分层编码, 然后生成视频文件。 流媒体服务器负 责从视频文件中读取数据, 按照辅助信息的指示来进行打包发送:  A streaming server reads data from a file, packages and sends the packets according to the auxiliary information in the file. The auxiliary information generally provides rules about packaging, and multiple media tracks (such as video is in the file). Synchronization information between a track, an audio track, a text track, etc., so that the streaming server knows which tracks and how much data should be read from which tracks in any particular time period [t, t+At] Pack and send. In this case, the video encoder is responsible for layered encoding and then generating a video file. The streaming server is responsible for reading the data from the video file and packaging it according to the instructions of the auxiliary information:
( 1 )提取审查用码流的数据进行打包发送, 发送到审查设备;  (1) extracting the data of the review code stream for packet transmission and sending to the review device;
这里, 审查用码流可能仅包括基本层码流、 或者包括基本层码流和部分 增强审查效果和准确性的增强层码流, 根据审查设备的指示增加或减少增强 层码流的具体数量。 Here, the review code stream may include only the base layer code stream, or an enhancement layer code stream including the base layer code stream and a part of the enhanced review effect and accuracy, increasing or decreasing the enhancement according to the indication of the review device. The specific number of layer streams.
( 2 )提取非审查用的增强层码流进行打包发送, 直接发送到用户终端; ( 3 )保持各个媒体轨道之间的时间同步。  (2) extracting the non-examination enhancement layer code stream for packet transmission and directly transmitting to the user terminal; (3) maintaining time synchronization between the respective media tracks.
( 4 )要求对于审查用码流打包发送在时间上领先增强层码流一个提前 量, 比如 5秒或者 10秒。 这是因为审查码流比增强层码流要多经过一个审查 过程进行审查, 尤其是人工审查需要一定的提前量, 以便一旦审查到有害内 容, 有足够多时间来采取措施。 这种提前量可以通过把增强层码流延迟相应 的时间来实现, 因为所谓提前和落后都是相对的。  (4) It is required to send an advance amount of the enhancement layer code stream in time for the packet stream for review, such as 5 seconds or 10 seconds. This is because the review code stream is reviewed more often than the enhancement layer code stream, especially if the manual review requires a certain amount of advancement so that there is enough time to take action once the harmful content is reviewed. This amount of advance can be achieved by delaying the enhancement layer code stream by a corresponding amount of time because both the advance and the backward are relative.
2、 直播情况  2. Live broadcast situation
视频编码器直接将编码后的视频流按照审查用码流(基本码流或基本码 流和部分增强码流)和非审査用码流(全部或剩余部分增强码流) 区分开来 打包发送给流媒体服务器, 流媒体服务器只要进行转发就可以了。 编码器在 输出非审查视频编码码流时通过一个延迟环节延迟设定的时间, 再发送到网 络上去, 而审查用码流直接发送, 不需要经过延迟。  The video encoder directly packs and transmits the encoded video stream according to the code stream for review (basic code stream or elementary stream and partial enhanced code stream) and the non-review code stream (all or the remaining portion of the enhanced code stream). To the streaming server, the streaming server can just forward it. When the encoder outputs the unreviewed video coded stream, it delays the set time through a delay link and then sends it to the network, and the review is sent directly by the code stream without delay.
内容审查设备一旦收到来自流媒体服务器的审查视频流 , 就要对其进行 审查, 审查可以有两种基本方式:  Once the content review device receives the review video stream from the streaming server, it is subject to review. There are two basic ways to review it:
1、 人工审查: 就是通过人工对于解码并且显示在屏幕上的视频审查内容 进行审查, 凭借审查人员的经验, 按照国家的法律法规或者社会约定的一些 原则来判断被审查内容是否有害。  1. Manual review: It is to manually review the video review content decoded and displayed on the screen. Based on the experience of the reviewer, it is judged whether the content under review is harmful according to the national laws and regulations or some principles agreed by the society.
2、 自动审查: 通过机器进行自动的内容审查。 有很多种方法, 比如通过 视频镜头的识别处理, 识别特定的有害场景, 比如暴力色情等; 也可以通过 提取视频中的叠加文字比如字幕等来进行文字的识别, 来判定是否存在内容 有害的文字信息; 也可以进行人脸的识别来发现是否有特定人物出现在画面 中, 比如恐怖分子等。 这些自动审查, 在最新的技术进步推动下, 已经日趋 成熟和实用, 并且^ [艮多方法, 比如叠加文字的定位和识别不需要对于审查码 流进行解码就能够进行, 因此可以大大提高效率, 其实用性进一步提高。  2. Automated review: Automated content review through the machine. There are many ways, such as recognizing specific harmful scenes, such as violent pornography, through the recognition process of video footage. It is also possible to identify the text by superimposing superimposed characters such as subtitles in the video to determine whether there is harmful text. Information; face recognition can also be performed to find out if a specific person appears in the picture, such as a terrorist. These automated reviews, with the latest technological advances, have become increasingly sophisticated and practical, and [multiple methods, such as overlay text positioning and recognition, do not require decoding of the code stream, so they can greatly improve efficiency. Its practicality is further improved.
当然有的时候, 也可以进行人工审查和机器审查同时进行, 再把其结果 进行融合。 比如一种识别结果不可靠的时候, 可能需要两者同时进行来融合。 比如, 如果审查码流质量较差, 显示图像模糊, 人眼无法清晰辨别的时候, 可以借助机器; 还有一种情况, 比如出现了一个人工审查人员认为可以的人 脸, 但是无法判定是否为需要控制的对象, 借助自动审查来识别, 如果决策 支持知识库中有该人的脸, 就可以准确识别其姓名属性等。 Of course, sometimes you can perform manual review and machine review at the same time, and then put the results. Convergence. For example, when a recognition result is unreliable, it may be necessary to combine the two at the same time. For example, if the quality of the code stream is poor, the display image is blurred, and the human eye cannot clearly distinguish it, the machine can be used. In another case, for example, a face that the human reviewer thinks is possible, but it is impossible to determine whether it is necessary. The object of control is identified by automatic review. If there is a face of the person in the decision support knowledge base, the name attribute and the like can be accurately identified.
对于两种审查的结果要进行融合, 可以采用多种方式来进行信息的融合。 比如加权平均等方法。  For the results of the two reviews to be fused, there are many ways to integrate the information. Such as weighted average methods.
在进行了审查后, 如果发现内容没有问题, 就让审查码流通过, 转发到 用户终端, 用户终端结合收到的多个增强层进行解码重构, 得到可以观看的 内容视频。  After the review, if the content is found to be okay, the review code stream is passed and forwarded to the user terminal, and the user terminal performs decoding and reconstruction in combination with the received enhancement layers to obtain a content video that can be viewed.
如果审查发现有问题, 那么需要对于有害内容卡断, 从而用户终端的节 目收视就会中断, 为了弥补 "黑屏" 的问题。 内容审查设备可以采发送替代 视频内容给用户终端, 比如公益广告等, 来自内容审查设备中的替换片源库。 同时对于有害的内容还要进行录制, 以备后期追查, 法律取证等。  If the review finds a problem, it is necessary to break the harmful content, so that the user terminal's program viewing will be interrupted, in order to make up for the "black screen" problem. The content review device can send alternative video content to the user terminal, such as a public service advertisement, from a replacement source library in the content review device. At the same time, harmful content should be recorded for later tracing, legal forensics, etc.
本发明实施例提供的技术方案还可以用于对其它视频内容的识别, 例如: 体育比赛精彩镜头, 比如足球的射门镜头, 篮球的远投命中, 扣篮等精彩动 作, 目的是对于识别出来的视频片断进行存储和录制; 从新闻节目中识别特 定人物相关的镜头, 进行存档; 对于交通系统中使用的电子眼(即安装在各 主要路口的摄像机) 自动记录的录像进行识别, 寻找违章行为, 并且识别违 章车辆的号码; 识别电视节目中的特定故事, 比如哈里波特电影, 一旦识别 到可以通知 IPTV用户进行观看等。  The technical solution provided by the embodiment of the present invention can also be used for recognizing other video content, for example, a sports game highlight shot, such as a soccer shot lens, a basketball long-range hit, a dunk, and the like, for the purpose of identifying the video. Fragments are stored and recorded; images related to specific people are identified from news programs for archiving; videos automatically recorded by electronic eyes (ie cameras installed at major intersections) used in the transportation system are identified, and violations are identified and identified The number of the illegal vehicle; identifies a specific story in the TV program, such as a Harry Potter movie, once it is recognized that the IPTV user can be notified to watch.
显然, 本领域的技术人员可以对本发明进行各种改动和变型而不脱离本 发明的精神和范围。 这样, 倘若本发明的这些修改和变型属于本发明权利要 求及其等同技术的范围之内, 则本发明也意图包含这些改动和变型在内。  It is apparent that those skilled in the art can make various modifications and variations to the invention without departing from the spirit and scope of the invention. Thus, it is intended that the present invention cover the modifications and variations of the inventions

Claims

权 利 要 求 Rights request
1、 一种视频码流审查方法, 内容源将原始视频码流进行分层编码为基本 层码流和至少一层增强层码流, 在传输所述视频码流时, 所述方法包括如下 步骤: A method for reviewing a video stream, the content source layer-coding the original video stream into a base layer stream and at least one layer of enhancement layer code. When the video stream is transmitted, the method includes the following steps. :
A、将所述基本层码流转发给审查设备, 将所述增强层码流进行延迟处理 后发送给用户接收端;  A, forwarding the basic layer code stream to the review device, delaying the code stream of the enhancement layer, and transmitting the code stream to the user receiving end;
B、 审查设备审查所述基本层码流中是否包含有害内容, 如果否再将该基 本层码流转发给用户接收端, 否则切断向用户接收端转发该基本层码流。  B. The review device checks whether the basic layer code stream contains harmful content, and if the basic layer code stream is forwarded to the user receiving end, the basic layer code stream is forwarded to the user receiving end.
2、 如权利要求 1所述的方法, 其特征在于, 直播模式时,  2. The method according to claim 1, wherein, in the live mode,
所述步骤 A中, 内容源直接发送或转发分层编码的相应码流;  In the step A, the content source directly sends or forwards the corresponding code stream of the layered code;
所述步驟 B中, 审查设备根据需要的视频质量, 通知内容源相应提高 /降 低所述基本层码流的编码质量, 或者在转发该基本层码流的同时, 增加 /减少 转发用于增强审查效果和准确性的部分增强层码流。  In the step B, the review device notifies the content source to increase/decrease the coding quality of the base layer code stream according to the required video quality, or increases/decreases the forwarding for enhanced review while forwarding the base layer code stream. Partial enhancement layer code stream for effect and accuracy.
3、 如权利要求 1所述的方法, 其特征在于, 存储内容播放模式时, 所述步骤 A中, 所述内容源先将产生的所述基本层码流和增强层码流分 别存储到对应的基本层码流轨道和增强层轨道; 流媒体服务器从所述基本层 码流轨道中读出所述基本层码流并转发给审查设备, '读出所述增强层码流并 进行延迟处理后发送给用户终端;  The method according to claim 1, wherein, when the content playing mode is stored, in the step A, the content source first stores the generated base layer code stream and the enhancement layer code stream respectively. Base layer stream track and enhancement layer track; the streaming server reads the base layer code stream from the base layer code stream track and forwards it to the review device, 'reads the enhancement layer code stream and performs delay processing After being sent to the user terminal;
所述步骤 B中, 审查设备根据需要的视频质量, 通知流媒体服务器在转 发该基本层码流的同时, 增加 /减少转发用于增强审查效果和准确性的增强层 码流。  In the step B, the reviewing device notifies the streaming media server to increase/decrease the enhanced layer code stream for enhancing the review effect and accuracy while forwarding the base layer code stream according to the required video quality.
4、 如权利要求 1所述的方法, 其特征在于, 步骤 B中, 审查设备的审查 方法包括:  4. The method according to claim 1, wherein in step B, the review method of the review device comprises:
解码所述基本层码流图像并输入自动识别装置, 自动识别装置将预存的 有害内容数据库中的有害内容与所述基本层码流图像中包含的相关内容进行 比较分析来进行有害内容的自动识别; 和 /或 将所述基本层码流图像显示给监控者来进行有害内容的人工识别。 Decoding the base layer code stream image and inputting the automatic identification device, and the automatic identification device compares and analyzes the harmful content in the pre-stored harmful content database with the related content included in the base layer code stream image to perform automatic identification of the harmful content. ; and / or The base layer code stream image is displayed to the monitor for manual identification of the harmful content.
5、 如权利要求 4所述的方法, 其特征在于, 当人工识别和自动识别同时 进行时, 如果二者的识别结果出现不一致, 则优先执行自动识别装置或监控 者的判决结果。  The method according to claim 4, wherein when the manual recognition and the automatic recognition are simultaneously performed, if the recognition results of the two are inconsistent, the judgment result of the automatic identification device or the monitor is preferentially executed.
6、 如权利要求 4所述的方法, 其特征在于, 当人工识别和自动识别同时 进行时, 由自动识别装置和监控者分别根据预先设定的规则, 为识别出的有 害内容给出相应的有害程度分值, 然后对两个分值进行加权处理后得到最终 执行的判决结果, 当仅收到一方为认定的有害内容给出的分值时, 默认另一 方为该内容给出的分值为零。  6. The method according to claim 4, wherein when the manual identification and the automatic recognition are simultaneously performed, the automatic identification device and the monitor respectively give corresponding contents for the identified harmful content according to a preset rule. The degree of harmfulness is scored, and then the two scores are weighted to obtain the final executed judgment result. When only the score given by one party for the identified harmful content is received, the other party gives the score given by the content. Zero.
7、 如权利要求 6所述的方法, 其特征在于, 所述的加权处理方法为:  7. The method according to claim 6, wherein the weighting processing method is:
S ( WMXSM+WHXSH ) I ( WM+WH ) S ( W M XSM+W H XSH ) I ( WM+W H )
其中, WM和 WH表示自动识别装置和监控者的权值, WM和 WH之间的 相对大小表示了对识别结果的信任度, SM和 SH分别自动识别装置和监控者给 出的分数, 如果 Si大于一个给定值, 则判决结果为有害, 否则判决结果为无 害, WM 、 WH和给定值分别根据经验值设定。 Wherein, W M and W H represent the weights of the automatic identification device and the monitor, and the relative size between W M and W H represents the degree of trust in the recognition result, and the S M and the S H respectively automatically identify the device and the monitor to give The score, if Si is greater than a given value, the judgment result is harmful, otherwise the judgment result is harmless, W M , W H and the given value are respectively set according to the empirical value.
8、 如权利要求 1所述的方法, 其特征在于, 所述的有害内容至少包括如 下之一: 有害图像、 有害叠加文字或符号、 特定人脸图像。  8. The method according to claim 1, wherein the harmful content comprises at least one of the following: a harmful image, a harmful superimposed text or symbol, a specific facial image.
9、 如权利要求 1所述的方法, 其特征在于, 所述步骤 B中, 审查设备切 断转发所述基本层码流的同时, 启动转发备用无害视频码流。  The method according to claim 1, wherein in the step B, the reviewing device cuts and forwards the base layer code stream, and starts forwarding the standby harmless video code stream.
10、 如权利要求 2或 3所述的方法, 其特征在于, 所述方法还同时包括: 审查设备录制并保存指定时段的基本层码流和用于增强审查效果和准确性的 增强层码流。  The method according to claim 2 or 3, wherein the method further comprises: reviewing the device to record and save the base layer code stream of the specified time period and the enhancement layer code stream for enhancing the review effect and accuracy .
11、 如权利要求 1 所述的方法, 其特征在于, 所述方法还同时包括: 将 有害内容的识别情况记录在日志中并生成日志报告。  11. The method according to claim 1, wherein the method further comprises: recording the identification of the harmful content in a log and generating a log report.
12、 如权利要求 1所述的方法, 其特征在于, 步骤 B之后还包括如下步 骤: 用户接收端接收到所有增强层码流和基本层码流时, 解码重构所述原始 视频码流。 The method according to claim 1, wherein the step B further comprises the following steps: decoding, when the user receiving end receives all the enhancement layer code streams and the base layer code stream, decoding and reconstructing the original video code stream.
13、 如权利要求 1 所述的方法, 其特征在于, 当基于空间分层视频编码 方法时, 内容源先根据设定比例对原始视频码流进行缩小处理后, 再进行分 层编码。 The method according to claim 1, wherein when the spatial layered video coding method is used, the content source first performs reduction processing on the original video code stream according to the set ratio, and then performs layer coding.
14、 一种视频码流内容审查的系统, 包括: 视频输入设备、 视频编码设 备和流媒体服务器, 其特征在于, 所述视频码流审查系统还包括审查设备; 所述视频编码设备将视频输入设备采集的原始视频码流的分层编码为基 本层码流和至少一个增强层码流, 并将所述基本层码流通过通信网络转发给 审查设备, 同时将所有增强层码流进行延迟处理后通过通信网络发送给用户 接收端; 审查设备审查所述基本层码流中是否包含有害内容, 如果否再通过 通信网络将该基本层码流转发给用户接收端, 否则切断向用户接收端转发该 基本层码流。  A video stream content review system, comprising: a video input device, a video encoding device, and a streaming media server, wherein the video stream review system further includes a review device; the video encoding device inputs a video The layered code of the original video code stream collected by the device is coded into a base layer code stream and at least one enhancement layer code stream, and the base layer code stream is forwarded to the reviewing device through the communication network, and all enhancement layer code streams are deferred. And then sent to the user receiving end through the communication network; the reviewing device checks whether the basic layer code stream contains harmful content, and if not, forwards the basic layer code stream to the user receiving end through the communication network, otherwise cuts off and forwards to the user receiving end. The base layer code stream.
15、 如权利要求 14所述的系统, 其特征在于, 所述视频编码设备包括: 编码器, 编码所述基本层码流和增强层码流, 将所述基本层码流直接转 发给审查设备;  The system according to claim 14, wherein the video encoding device comprises: an encoder that encodes the base layer code stream and the enhancement layer code stream, and forwards the base layer code stream directly to the review device ;
第一延迟模块, 将所述增强层码流进行延迟处理后发送给用户接收端。 The first delay module delays the enhancement layer code stream and sends the stream to the user receiving end.
16、 如权利要求 15所述的系统, 其特征在于, 所述审查设备还包括: 通 信模块, 通信连接视频编码设备, 审查设备根据需要的视频质量, 通知视频 编码设备相应提高 /降低所述基本层码流的编码质量, 或者在转发该基本层码 流的同时, 增加 /减少转发用于增强审查效果和准确性的增强层码流, 并同时 将用于增强审查效果和准确性的增强层码流进行延迟处理后转发给用户接收 端。 The system according to claim 15, wherein the reviewing device further comprises: a communication module, the communication connection video encoding device, the reviewing device notifying the video encoding device to increase/decrease the basic according to the required video quality The coding quality of the layer code stream, or the forwarding of the base layer code stream, while increasing/decreasing the enhancement layer code stream for enhancing the review effect and accuracy, and at the same time enhancing layer for enhancing the review effect and accuracy. The code stream is delayed and forwarded to the user receiver.
17、 如权利要求 14所述的系统, 其特征在于,  17. The system of claim 14 wherein:
所述视频码流审查系统还包括: 视频内容数据库, 保存所述基本层码流 和增强层码流, 其中, 所述基本层码流保存在设定的基本层码流轨道中, 所 述增强层码流保存在设定的增强层码流轨道中;  The video stream review system further includes: a video content database, where the base layer code stream and the enhancement layer code stream are saved, wherein the base layer code stream is saved in a set base layer code stream track, and the enhancement The layer code stream is saved in the set enhancement layer code stream track;
所述流媒体服务器包括: 码流读取模块和第二延迟模块, 码流读取模块 从所述基本层码流轨道中读出所述基本层码流并转发给审查设备, 从所述增 强层码流轨道中读取增强层码流并通过所述第二延迟模块进行延迟处理后发 送给用户接收端。 The streaming media server includes: a code stream reading module and a second delay module, wherein the code stream reading module reads the base layer code stream from the base layer code stream track and forwards the code stream to the review device, The enhancement layer code stream is read in the strong layer code stream track and subjected to delay processing by the second delay module, and then sent to the user receiving end.
18、 如权利要求 17所述的系统, 其特征在于, 所述审查设备还包括: 通 信模块, 通信连接视频编码设备流媒体服务器, 审查设备根据需要的视频质 量, 通知流媒体服务器在转发该基本层码流的同时, 增加 /减少转发用于增强 审查效果和准确性的增强层码流, 并同时将用于增强审查效果和准确性的增 强层码流进行延迟处理后转发给用户接收端。  The system according to claim 17, wherein the reviewing device further comprises: a communication module, the communication connection video encoding device streaming media server, the reviewing device notifying the streaming media server to forward the basic according to the required video quality At the same time as the layer code stream, the enhancement layer code stream for enhancing the review effect and accuracy is added/reduced, and the enhancement layer code stream for enhancing the review effect and accuracy is delayed and forwarded to the user receiving end.
19、如权利要求 14-18任意之一所述的系统, 其特征在于, 所述审查设备 包括:  The system of any of claims 14-18, wherein the reviewing device comprises:
第三延迟模块, 对接收到的所有码流进行延迟处理后转发给用户接收端; 审查模块, 审查接收到的所有码流, 在审查出该码流中包含有害内容时 输出相应的控制信号;  The third delay module delays processing all the received code streams and forwards them to the user receiving end; the review module reviews all the received code streams, and outputs corresponding control signals when it is detected that the code stream contains harmful content;
开关模块, 连接在所述第三延迟模块之后;  a switch module connected behind the third delay module;
控制模块, 连接在所述审查模块和开关模块之间, 根据所述控制信号触 发断开所述开关模块。  And a control module, connected between the review module and the switch module, triggering disconnection of the switch module according to the control signal.
20、 如权利要求 19所述的系统, 其特征在于, 所述审查模块中具体包括 人工识别子模块和 /或自动识别子模块, 其中:  The system according to claim 19, wherein the review module specifically includes a manual identification submodule and/or an automatic identification submodule, wherein:
所述人工识别子模块中具体包括: 解码单元、 增强处理单元、 显示单元 和指令接收单元, 所述解码单元解码接收到的所有码流图像, 所述图像通过 增强处理单元处理后在显示单元上显示给监控者, 监控者人工识别出该图像 中包含有害内容时, 通过所述指令接收单元触发所述主控单元输出所述控制 信号;  Specifically, the manual identification sub-module includes: a decoding unit, an enhancement processing unit, a display unit, and an instruction receiving unit, where the decoding unit decodes all received code stream images, and the image is processed by the enhancement processing unit on the display unit. Displayed to the monitor, when the monitor manually recognizes that the image contains harmful content, the instruction receiving unit triggers the main control unit to output the control signal;
所述自动识别子模块中具体包括: 自动识别单元和有害内容数据库, 自 动识别单元根据所述有害内容数据库中的有害内容与接收到的所有码流中包 含的相关内容进行对比分析来进行有害内容的自动识别, 并在识别出有害内 容时自动触发所述主控单元输出所述控制信号。  The automatic identification sub-module specifically includes: an automatic identification unit and a harmful content database, and the automatic identification unit performs comparative analysis according to the harmful content in the harmful content database and the related content included in all the received code streams to perform harmful content. Automatic identification, and automatically triggering the main control unit to output the control signal when identifying harmful content.
21、 如权利要求 20所述的系统, 其特征在于, 当所述审查模块中同时包 括人工识别子模块和自动识别子模块时, 该审查模块中还包括: 21. The system of claim 20, wherein when the review module is simultaneously packaged Including the manual identification sub-module and the automatic identification sub-module, the review module further includes:
审查方式切换模子块, 根据控制模块的控制选择启动所述人工识别子模 块和 /或自动识别子模块; 以及  The mode of review switches the module block, and the manual identification sub-module and/or the automatic identification sub-module are activated according to the control of the control module;
判决子模块, 同时接收所述指令接收单元和自动识别单元的输出信号, 并根据设定的判决原则判定是否触发所述主控单元输出所述控制信号。  The decision sub-module receives the output signals of the instruction receiving unit and the automatic identification unit at the same time, and determines whether to trigger the main control unit to output the control signal according to the set decision principle.
22、 如权利要求 20所述的系统, 其特征在于, 所述审查模块中还包括判 决原则存储子模块, 连接在所述判决子模块和控制模块之间, 存储通过控制 模块输入的判决原则。  The system according to claim 20, wherein the review module further comprises a decision principle storage submodule connected between the decision submodule and the control module to store a decision principle input through the control module.
23、 如权利要求 14-18任意之一所述的系统, 其特征在于, 所述审查设备 还包括如下之一或任意组合:  The system according to any one of claims 14-18, wherein the review device further comprises one or any combination of the following:
内容录制模块, 连接所述第三延迟模块, 用于录制指定时段该第三延迟 模块输出的所有码流;  a content recording module, connected to the third delay module, configured to record all code streams output by the third delay module in a specified time period;
日志记录模块, 分别连接其它模块或子模块, 用于生成并输出所述审查 设备的运行状态曰志;  The logging module is respectively connected to other modules or sub-modules for generating and outputting the running status of the reviewing device;
决策支持知识库, 连接控制模块, 用于存储特定或临时的有害内容文字、 人脸图像以及相关的法律法规文件;  Decision support knowledge base, connection control module, for storing specific or temporary harmful content words, face images and related legal and regulatory documents;
替换片源库, 连接在所述开关模块, 存储无害备用视频内容。  The chip source library is replaced, connected to the switch module, and the harmless spare video content is stored.
24、 一种视频编码设备, 包括编码器, 其特征在于, 还包括: 所述视频 编码设备还包括连接所述编码器的第一延迟模块, 所述编码器基于将原始视 频流分层编码为基本层码流和至少一层增强层码流, 然后:  A video encoding device, comprising: an encoder, further comprising: the video encoding device further comprising a first delay module connected to the encoder, the encoder layering the original video stream into The base layer stream and at least one layer of enhancement layer code, then:
直接输出所述基本层码流, 通过所述第一延迟模块经进行延迟处理后输 出所述增强层码流; 或者  Directly outputting the base layer code stream, and performing delay processing by the first delay module to output the enhancement layer code stream; or
直接输出所述基本层码流和部分增强层码流, 通过所述第一延迟模块进 行延迟处理后输出剩余的增强层码流。  The base layer code stream and the partial enhancement layer code stream are directly output, and the remaining enhancement layer code stream is output after the delay processing by the first delay module.
25、 一种流媒体服务器, 包括视频码流读取模块, 其特征在于, 所述流 媒体服务器还包括连接所述视频码流读取模块的第二延迟模块, 所述视频码 流读取模块从外部数据库中读取基于分层编码的基本层码流和至少一层增强 层码流, 然后: A streaming media server, comprising a video stream reading module, wherein the streaming media server further comprises a second delay module connected to the video stream reading module, the video stream reading module Reading a layered code based base layer code stream and at least one layer of enhancement from an external database Layer code stream, then:
直接输出所述基本层码流, 通过所述第二延迟模块经进行延迟处理后输 . 出所述增强层码流; 或者  Directly outputting the base layer code stream, and performing delay processing by the second delay module to output the enhancement layer code stream; or
直接输出所述基本层码流和部分增强层码流, 通过所述第二延迟模块经 进行延迟处理后输出剩余的增强层码流。  The base layer code stream and the partial enhancement layer code stream are directly output, and the remaining enhancement layer code stream is output after the delay processing by the second delay module.
26、 一种审查设备, 被审查的原始视频码流由内容源进行分层编码为基 本层码流和至少一层增强层码流, 所述审查设备包括: 通信模块, 通信连接 其他网络设备; 其特征在于,  26, a review device, the original video code stream to be examined is hierarchically encoded by the content source into a base layer code stream and at least one layer of enhancement layer code stream, the review device includes: a communication module, and a communication connection with other network devices; It is characterized in that
第三延迟模块, 接收所述基本层码流、 或者基本层码流和部分增强层码 流, 并对接收到的所有码流进行延迟处理后转发给用户接收端;  The third delay module receives the base layer code stream, or the base layer code stream and the part of the enhancement layer code stream, and delays processing all the received code streams and forwards them to the user receiving end;
审查模块, 审查接所有延迟转发的码流, 在审查出所述码流中包含有害 内容时输出相应的控制信号;  a review module, which examines all delayed forwarded code streams, and outputs a corresponding control signal when it is detected that the code stream contains harmful content;
开关模块, 连接在所述第三延迟模块之后;  a switch module connected behind the third delay module;
控制模块, 连接在所述审查模块和开关模块之间, 根据所述控制信号触 发断开所述开关模块。  And a control module, connected between the review module and the switch module, triggering disconnection of the switch module according to the control signal.
27、 如权利要求 26所述的审查设备, 其特征在于, 所述审查模块中具体 包括人工识别子模块和 /或自动识别子模块, 其中:  The review device according to claim 26, wherein the review module specifically includes a manual identification submodule and/or an automatic identification submodule, wherein:
所述人工识别子模块中具体包括: 解码单元、 增强处理单元、 显示单元 和指令接收单元, 所述解码单元解码接收到的所有码流图像, 所述图像通过 增强处理单元处理后在显示单元上显示给监控者, 监控者人工识别出该图像 中包含有害内容时, 通过所述指令接收单元触发所述主控单元输出所述控制 信号;  Specifically, the manual identification sub-module includes: a decoding unit, an enhancement processing unit, a display unit, and an instruction receiving unit, where the decoding unit decodes all received code stream images, and the image is processed by the enhancement processing unit on the display unit. Displayed to the monitor, when the monitor manually recognizes that the image contains harmful content, the instruction receiving unit triggers the main control unit to output the control signal;
所述自动识别子模块中具体包括: 自动识别单元和有害内容数据库, 自 动识别单元根据所述有害内容数据库中的有害内容与接收到的所有码流中包 含的相关内容进行对比分析来进行有害内容的自动识别 , 并在识别出有害内 容时自动触发所述主控单元输出所述控制信号。  The automatic identification sub-module specifically includes: an automatic identification unit and a harmful content database, and the automatic identification unit performs comparative analysis according to the harmful content in the harmful content database and the related content included in all the received code streams to perform harmful content. Automatic identification, and automatically triggering the main control unit to output the control signal when identifying harmful content.
28、 如权利要求 27所述的审查设备, 其特征在于, 当所述审查模块中同 时包括人工识别子模块和自动识别子模块时, 该审查模块中还包括: 审查方式切换子模块, 根据控制模块的控制选择启动所述人工识别子模 块和 /或自动识别子模块; 28. The review apparatus according to claim 27, wherein when said review module is the same When the manual identification sub-module and the automatic identification sub-module are included, the review module further includes: a review mode switching sub-module, and the manual identification sub-module and/or the automatic identification sub-module are activated according to the control of the control module;
判决子模块, 同时接收所述指令接收单元和自动识别单元的输出信号, 并根据设定的判决原则判定是否触发所述主控单元输出所述控制信号。  The decision sub-module receives the output signals of the instruction receiving unit and the automatic identification unit at the same time, and determines whether to trigger the main control unit to output the control signal according to the set decision principle.
29、 如权利要求 28所述的审查设备, 其特征在于, 所述审查模块中还包 括判决原则存储子模块, 连接在所述判决模块和控制模块之间, 存储通过控 制模块输入的判决原则。  The review apparatus according to claim 28, wherein the review module further comprises a decision principle storage submodule connected between the decision module and the control module to store a decision principle input through the control module.
30、 如权利要求 26-29任意之一所述的审查设备, 其特征在于, 所述审查 设备还包括如下之一或任意组合:  The review apparatus according to any one of claims 26 to 29, wherein the review apparatus further comprises one or any combination of the following:
内容录制模块, 连接所述第三延迟模块, 用于录制指定时段该第三延迟 模块输出的所有码流;  a content recording module, connected to the third delay module, configured to record all code streams output by the third delay module in a specified time period;
日志记录模块, 分别连接其它模块或子模块, 用于生成并输出所述审查 设备的运行状态日志;  a logging module, which is respectively connected to other modules or submodules for generating and outputting an operation status log of the reviewing device;
决策支持知识库, 连接控制模块, 用于存储特定或临时的有害内容文字、 人脸图像以及相关的法律法规文件;  Decision support knowledge base, connection control module, for storing specific or temporary harmful content words, face images and related legal and regulatory documents;
替换片源库, 连接在所述开关模块, 存储无害备用视频内容。  The chip source library is replaced, connected to the switch module, and the harmless spare video content is stored.
PCT/CN2007/001548 2006-05-16 2007-05-14 A method, a system and a apparatus for censoring video code stream WO2007131445A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2006100813110A CN1968250B (en) 2006-05-16 2006-05-16 Video code stream checking method, system and device
CN200610081311.0 2006-05-16

Publications (1)

Publication Number Publication Date
WO2007131445A1 true WO2007131445A1 (en) 2007-11-22

Family

ID=38076795

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2007/001548 WO2007131445A1 (en) 2006-05-16 2007-05-14 A method, a system and a apparatus for censoring video code stream

Country Status (2)

Country Link
CN (1) CN1968250B (en)
WO (1) WO2007131445A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101557510A (en) * 2008-04-09 2009-10-14 华为技术有限公司 Method, system and device for processing video coding
CN102523441B (en) * 2011-12-22 2014-04-02 北京工业大学 H.264 compressed domain bit plane encryption method based on bit sensitivity
CN107659538A (en) * 2016-07-25 2018-02-02 北京优朋普乐科技有限公司 A kind of method and apparatus of Video processing
CN108184167A (en) * 2017-12-20 2018-06-19 东莞市纽格力信息技术有限公司 A kind of management-control method and system based on TV set-top box

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09275548A (en) * 1996-04-05 1997-10-21 Nippon Telegr & Teleph Corp <Ntt> Digital image separation and delivery method and system
US5706507A (en) * 1995-07-05 1998-01-06 International Business Machines Corporation System and method for controlling access to data located on a content server
JP2001268505A (en) * 2000-03-17 2001-09-28 Sony Tektronix Corp Image edition system
US20030195977A1 (en) * 2002-04-11 2003-10-16 Tianming Liu Streaming methods and systems
CN1835581A (en) * 2006-03-08 2006-09-20 北京北大方正电子有限公司 Method of inspecting video material in low digital codes stream

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1442602A1 (en) * 2001-10-26 2004-08-04 Koninklijke Philips Electronics N.V. Spatial scalable compression scheme using adaptive content filtering

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5706507A (en) * 1995-07-05 1998-01-06 International Business Machines Corporation System and method for controlling access to data located on a content server
JPH09275548A (en) * 1996-04-05 1997-10-21 Nippon Telegr & Teleph Corp <Ntt> Digital image separation and delivery method and system
JP2001268505A (en) * 2000-03-17 2001-09-28 Sony Tektronix Corp Image edition system
US20030195977A1 (en) * 2002-04-11 2003-10-16 Tianming Liu Streaming methods and systems
CN1835581A (en) * 2006-03-08 2006-09-20 北京北大方正电子有限公司 Method of inspecting video material in low digital codes stream

Also Published As

Publication number Publication date
CN1968250A (en) 2007-05-23
CN1968250B (en) 2010-04-21

Similar Documents

Publication Publication Date Title
WO2007128234A1 (en) Method and node for filtering the video stream
US10582201B2 (en) Most-interested region in an image
TWI458340B (en) Signaling data for multiplexing video components
CN103843301B (en) The switching between expression during the network crossfire of decoded multi-medium data
JP4943513B2 (en) Video data loss recovery system using low bit rate stream of IPTV
JP4874343B2 (en) Aggregation of backward-compatible pictures in scalable video coding
KR101396628B1 (en) Streaming encoded video data
CN110324580B (en) Monitoring video playing method and device based on video network
JP2003533953A (en) Video coding method
EP2174500A2 (en) Video indexing method, and video indexing device
WO2007128185A1 (en) A system and method of media stream censorship and a node apparatus for generating censorship code stream
KR101145782B1 (en) Method for lightweight video contents encryption and decryption to provide mobile contents service
Kunkelmann Applying encryption to video communication
WO2007131445A1 (en) A method, a system and a apparatus for censoring video code stream
US8565083B2 (en) Thinning of packet-switched video data
US11893090B2 (en) Synchronization of digital rights management data
CN108924088A (en) A kind of 4K web camera transmission realizing method
Nightingale et al. Priority-based methods for reducing the impact of packet loss on HEVC encoded video streams
US20240080426A1 (en) Methods and systems for managing content items
Guo et al. Adaptive transmission of split-screen video over wireless networks
Otsuki et al. Development of System for Simultaneously Present Multiple Videos That Enables Search by Absolute Time
Otsuki et al. A trial implementation of an MMT-receiving application to enable HTTP access by designating absolute time
Nicholson et al. Technologies for multimedia and video surveillance convergence
CN114189686A (en) Video encoding method, apparatus, device, and computer-readable storage medium
Deshpande Adaptive HTTP Streaming with Temporal Scalability in High Efficiency Video Coding (HEVC)

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07721121

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07721121

Country of ref document: EP

Kind code of ref document: A1