WO2012006828A1 - Method and device for presenting web pages - Google Patents

Method and device for presenting web pages Download PDF

Info

Publication number
WO2012006828A1
WO2012006828A1 PCT/CN2010/077881 CN2010077881W WO2012006828A1 WO 2012006828 A1 WO2012006828 A1 WO 2012006828A1 CN 2010077881 W CN2010077881 W CN 2010077881W WO 2012006828 A1 WO2012006828 A1 WO 2012006828A1
Authority
WO
WIPO (PCT)
Prior art keywords
interest
webpage
link
association rule
degree
Prior art date
Application number
PCT/CN2010/077881
Other languages
French (fr)
Chinese (zh)
Inventor
阚光远
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2012006828A1 publication Critical patent/WO2012006828A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Definitions

  • the present invention relates to the field of mobile communications, and in particular, to a web page presentation method and apparatus.
  • BACKGROUND With the advent of the 3G era, the number of mobile Internet users has increased, and the requirements for mobile browsers have become higher and higher. However, because mobile browsers are limited by screen size and hardware configuration, mobile screens can only display web pages. Part of the information. And now the mobile browser's scrolling of web pages is in the order of the web links. If the user wants to see the content of interest, he or she needs to constantly drag the scroll bar to find the web link that the user is interested in.
  • a primary object of the present invention is to provide a method and apparatus for rendering a web page to at least solve the problem that the content search time is large when the web page is large.
  • a webpage presentation method including: calculating interest levels according to interest association rules for each link in a webpage; and displaying a new webpage by indicating a link whose interest degree is higher than a specified value; presenting a new webpage
  • the interest association rule is determined by performing data mining on the user's historical access record.
  • Determining the interest association rule by performing data mining on the user's historical access includes: reading the historical webpage data in the browser cache after the browser is opened; performing data mining on the historical webpage data to obtain the interest association rule.
  • the interest association rule is stored in the designated storage area, and the interest association rule of the specified storage area is updated every specified duration or the set number of times of opening the webpage; and the interest degree is calculated according to the interest association rule for each link in the webpage:
  • the specified storage area reads the user's interest association rule; and the interest degree is calculated according to the interest association rule for each link in the webpage.
  • the above-mentioned links with a higher degree of interest than the specified value are obtained by the new webpage including: The link is sorted; the webpage link whose interest degree is higher than the specified value is extracted from the sorted result of the link, and the extracted webpage link is marked with a specified color to obtain a new webpage.
  • the above web page rendering method is applied to a mobile terminal.
  • a webpage presentation apparatus including: an interest degree calculation module, configured to calculate an interest degree according to an interest association rule for each link in a webpage; and a webpage labeling module, configured to indicate a high degree of interest A new webpage is obtained by the link of the specified value; a rendering module is configured to present the new webpage; wherein the interest association rule is determined by performing data mining on the historical access record of the user.
  • the device further includes: a rule obtaining module, configured to read historical webpage data in the browser cache after the browser is opened; perform data mining on the historical webpage data to obtain an interest association rule.
  • the foregoing interest association rule is stored in a designated storage area of the device, and the device further includes: an update module, configured to update an interest association rule of the specified storage area every specified duration or a set number of times of opening the webpage; the interest degree calculation module
  • the method includes: an obtaining unit, configured to read a user's interest association rule from the specified storage area; and a calculating unit, configured to calculate an interest degree according to the interest association rule for each link in the webpage.
  • the webpage labeling module includes: a sorting unit, configured to sort each link according to the interest degree; and a new webpage obtaining unit, configured to extract, from the sorted result of the link, a webpage link whose interest degree is higher than the specified value, in a specified color Mark the extracted web page link to get a new web page.
  • the above device is a mobile terminal.
  • the interest degree of each link is determined according to the historical access record of the user, and the link with high interest is marked, so that the user can quickly browse the content of interest, thereby solving the problem that the speed of browsing the large webpage is slow, and further Achieve the effect of improving the user experience.
  • FIG. 1 is a flowchart of a webpage presentation method according to Embodiment 1 of the present invention
  • FIG. 2 is a flowchart of a webpage presentation method according to Embodiment 2 of the present invention
  • 3 is a block diagram showing a structure of a webpage presenting apparatus according to a third embodiment of the present invention
  • FIG. 1 is a flowchart of a webpage presentation method according to Embodiment 1 of the present invention
  • FIG. 2 is a flowchart of a webpage presentation method according to Embodiment 2 of the present invention
  • 3 is a block diagram showing a structure of a webpage presenting apparatus according to a third embodiment of the present invention
  • FIG. 4 is a block diagram showing a specific structure of a webpage presenting apparatus according to a third embodiment of the present invention.
  • BEST MODE FOR CARRYING OUT THE INVENTION the present invention will be described in detail with reference to the accompanying drawings. It should be noted that the embodiments in the present application and the features in the embodiments may be combined with each other without conflict.
  • the display interface of a mobile terminal (such as a mobile phone) is limited, and the content in the webpage is more and more rich, which makes it impossible to display the entire webpage once on the display interface of the device, and needs to be displayed in multiple pages.
  • an embodiment of the present invention provides a webpage presentation method and apparatus. The following embodiments are described by taking a webpage presentation on a mobile terminal as an example.
  • FIG. 1 is a flowchart of a webpage presentation method according to an embodiment of the present invention.
  • the method includes the following steps: Step S102: Calculate an interest degree according to an interest association rule for each link in a webpage, where Performing data mining on the user's historical access record to determine the interest association rule; the above interest association rule may be determined each time the user opens the browser, and may specifically read the historical webpage data in the browser cache after opening the browser. Data mining of historical web page data to obtain interest association rules. It can also be read to the specified storage area when the user opens the browser.
  • the interest association rule in the specified storage area can be updated according to a certain time, and the number of times the user opens the webpage can be counted.
  • Step S104 indicating that the link with the interest level higher than the specified value obtains a new webpage; for example, sorting each link according to the degree of interest; extracting a webpage link whose interest degree is higher than a specified value from the sorted result of the link, using the specified color Mark the extracted web page link to get a new web page.
  • Step 4 gathers S 106 to present the new web page.
  • the webpage is presented, the original content of the webpage is directly displayed.
  • the screen of the device is relatively small and the webpage is relatively large, the user can browse the content of interest for a long time.
  • Embodiment 2 This embodiment provides a webpage presentation method, which is described by taking an implementation on a mobile phone as an example. Referring to FIG.
  • Step S202 The user opens a browser to save historical webpage data in the cache of the mobile browser; for example: reading historical webpage data in the browser cache, and saving historical webpage data to Corresponding storage area; Step S204, performing data preprocessing on the saved historical webpage data; the data preprocessing process of the embodiment can complete data analysis, data extraction, data processing, and data transformation. For example, the data in the Cache (cache) represented by the WWW data model is processed, and the processing of the word thousand extraction, the term segmentation, and the like is mainly completed; in step S206, the data obtained by the data pre-processing is data-extracted, and the data is obtained.
  • the user's interest association rule; the mining algorithm used in the data mining process of this embodiment is as follows:
  • the data mining process includes the following stages:
  • Freshness reflects the length of time that interest terms exist. The recent freshness of interest terms in the recently visited page is relatively high. In the prediction process, the more recently the interest terms in the recently visited pages play a greater role in forecasting. Freshness can be equal to the time the item was saved, or it can have a linear relationship with the save time.
  • K ( Y k ) represents the set of all interest terms that appear in the Y k page, and t is one of the terms.
  • the historical webpage data in the cache (Cache) is usually represented by the WWW data model.
  • the historical webpage data of the WWW data model may be converted into a data format and converted into a required data format.
  • the extraction and segmentation of the word thousands can be referred to the IEEE (Institute of Electrical and Electronics Engineers) data application (application of data mining in Web pre-fetching).
  • the link word of the link point is obtained, and the link word is segmented, and the link term set Q ( lk, .. stnng ) of the link point l k , , in the page is obtained.
  • Q ( l k , L stnng ) represents a link to a link l k in Y k , which is obtained by segmenting the link word 1000
  • the collection of the above data is obtained by the following data processing: page collection, collection of words of the page, collection of link points of the page, and collection of linked terms of the link points in the page.
  • the four sets are obtained for the following calculations.
  • Interest association rules [ Node ( ti ), support, Node ( tj ) ], the possibility of moving from one entry to another, and then calculating the possibility of moving from one page to one of the links.
  • the specific process of generating the interest association rule may include the following methods: traversing the page set C, traversing the link set L ( Y k ) in the page for the page Y k , and determining the source page of the link point one by one (the page where the link point is located) ) ⁇ " belongs to page set C, if it belongs, it traverses the set of words of page Y k and ⁇ ", combines the terms in Y k with ⁇ ", and calculates the transfer from one entry to another in the combination of terms
  • the transfer support rate of a term, the transfer support rate is equal to the sum of the weights of the two terms.
  • the weight of the entry is accumulated in the support rate; if the source page of the link point If Yj does not belong to page set C, it traverses the link term set of page Y k and the link point, and combines Y k with the term in the link term set of the link point to calculate the transfer from one entry to the entry combination.
  • the transfer support rate of another term, the transfer support rate is equal to the weight of the entry in the page Y k , and when the link term appears in the link term set of the plurality of link points, the transfer support rate is accumulated in the page Y k Entry the weight of.
  • the pseudo code that generates the affinity association rule is as follows: For each page in the saved page collection C
  • the target page of ⁇ set 1] ⁇ r is Yj; if Yj e C then ⁇ for each entry in the set of terms K ( Y k ) in page Y k ( , weightp )
  • the method for calculating the link interest degree may be: searching the interest association rule database for the interest association rule of the term and the link term in the current access page, and calculating the interest degree, which is equal to the weight of the term in the current access page multiplied by The degree of support in the found interest association rule, after completing the calculation of the interest degree, sorts all the obtained links according to the degree of interest.
  • step S208 the result of the data mining and the webpage accessed by the current user are used to mark the link with high interest level, and a new webpage is obtained.
  • Step S210 browsing the new webpage according to the marked link.
  • the interest degree of each link may be sorted, and the webpage link with high interest degree is extracted. Use prominent colors for labeling; and scroll through and focus on the already marked links as the page scrolls through.
  • the historical webpage data saved in the browser cache is obtained. The data implied the user's hobbies and access habits, and the interest association rules are used to mine the interest association rules reflecting the user's interests and habits. According to the interest association rule and the webpage currently browsed by the user, the link with high user interest in the current webpage is marked. And when the webpage is scrolled, the user can choose to browse the webpage according to the already marked webpage link.
  • the next webpage link to be browsed by the user is the webpage link marked on the screen.
  • the next link of the webpage to be browsed by the user is not in the current screen of the mobile phone.
  • the browser will first page through the page, and then scroll to the webpage link indicated after the page turning.
  • the interest degree of each link is determined according to the history record accessed by the user, and the link with high interest is marked, so that the user can quickly browse the content of interest, and the mobile browser is improved in browsing.
  • the speed of the web page which in turn improves the user experience of using the browser.
  • Embodiment 3 Referring to FIG. 3, the embodiment provides a webpage presentation apparatus, where the apparatus includes: an interest degree calculation module 32, configured to calculate an interest degree according to an interest association rule for each link in a webpage, where The history access record performs data mining to determine the interest association rule; the webpage labeling module 34 is connected to the interest degree calculation module 32, and is configured to mark the link with the interest degree higher than the specified value to obtain a new webpage; the presentation module 36 is connected to the webpage labeling module 34. , used to render the above new web page.
  • the interest degree calculation module 32 reference may be made to the algorithm in Embodiment 2, It will not be detailed in detail.
  • the above-mentioned interest association rule may be determined each time the user opens the browser.
  • the device further includes: a rule obtaining module, configured to read historical webpage data in the browser cache after the browser is opened; Data is used for data mining to obtain interest association rules.
  • the interest association rule may be stored in a specified storage area for use in subsequent rendering of the webpage. Therefore, when the user opens the browser, the interest association rule can be read in the specified storage area, and the interest association rule in the specified storage area can be updated according to a certain time, and the number of times the user opens the webpage can be counted, when the webpage is opened. Update when the number of times reaches the set number of times.
  • the device further includes: an update module, configured to update the interest association rule of the specified storage area every specified duration or a set number of times of opening the webpage; correspondingly, the interest degree calculation module 32 includes: an obtaining unit, Reading the user's interest association rule from the specified storage area; the calculating unit is configured to calculate the interest degree according to the interest association rule for each link in the webpage.
  • 4 is a block diagram of a specific structure of a webpage presentation apparatus provided by the embodiment.
  • the apparatus includes: an interest degree calculation module 32, a webpage labeling module 34, and a presentation module 36.
  • the webpage labeling module 34 includes: a sorting unit 342.
  • the plurality of links are sorted according to the degree of interest; the new webpage obtaining unit 344 is configured to extract a webpage link whose interest degree is higher than a specified value from the sorted result of the link, and mark the extracted webpage link with a specified color to obtain a new webpage.
  • the device provided in this embodiment may be a mobile terminal or other device.
  • the interest degree of each link is determined according to the history record accessed by the user, and the link with high interest is marked, so that the user can quickly browse to the content of interest, and the browser is improved. The speed at which a large web page is viewed, which in turn improves the user experience of using the browser.
  • the present invention achieves the following technical effects:
  • the above embodiment processes a webpage by a method based on data mining, and obtains a new webpage with a link identifier, and presents the new webpage. It can speed up the browsing speed of the browser for large web pages and improve the user's body-risk.
  • modules or steps of the present invention may be Implemented by a general-purpose computing device, which may be centralized on a single computing device or distributed over a network of computing devices, optionally, they may be implemented by program code executable by the computing device, such that They may be stored in a storage device by a computing device, and in some cases, the steps shown or described may be performed in an order different than that herein, or separately fabricated into individual integrated circuit modules. Alternatively, multiple modules or steps of them can be implemented as a single integrated circuit module. Thus, the invention is not limited to any specific combination of hardware and software.
  • the above is only the preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes can be made to the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the scope of the present invention are intended to be included within the scope of the present invention.

Abstract

A method and a device for presenting web pages are disclosed. The method includes: interest degrees are calculated for each of links of web pages according to interest associated rules (S102), the link whose interest degree is higher than a designated value is indicated to obtain a new web page (S104); the new web page is presented (S106); wherein the data mining is performed to history access records of users to determine the interest associated rules. According to the disclosed solution, it can resolve the problem of slowly displaying the web pages and improve the user experience satisfaction.

Description

网页呈现方法和装置 技术领域 本发明涉及移动通讯领域, 尤其涉及一种网页呈现方法和装置。 背景技术 随着 3G时代的到来, 手机互联网用户的增加, 用户对手机浏览器的要 求也越来越高, 但是由于手机浏览器受到屏幕大小和硬件配置等限制, 手机 屏幕只能显示网页上的一部分信息。 并且现在的手机浏览器对于网页的滚动 浏览,都是按照网页链接的先后顺序进行的。如果用户想看到感兴趣的内容, 需要不断地拖动滚动条, 才能找到用户感兴趣的网页链接。 特别是在浏览一 个比较大的网页时, 需要的时间将会比较长, 而且找到用户感兴趣的内容将 会更费时间, 影响了用户的使用, 降低了用户体验。 发明内容 本发明的主要目的在于提供一种网页呈现方法和装置, 以至少解决上述 网页较大时内容查找耗时较大的问题。 才艮据本发明的一个方面, 提供了一种网页呈现方法, 包括: 对网页中的 各条链接按照兴趣关联规则计算兴趣度; 标示兴趣度高于指定值的链接得到 新网页; 呈现新网页; 其中, 通过对用户的历史访问记录进行数据挖掘确定 兴趣关联规则。 通过对用户的历史访问进行数据挖掘确定所述兴趣关联规则包括: 浏览 器打开后, 读取该浏览器緩存中的历史网页数据; 对该历史网页数据进行数 据挖掘, 得到上述兴趣关联规则。 上述兴趣关联规则存储在指定存储区, 并每隔指定时长或打开网页的设 定次数对指定存储区的兴趣关联规则进行更新; 对网页中的各条链接按照兴 趣关联规则计算兴趣度包括: 从该指定存储区读取用户的兴趣关联规则; 对 网页中的各条链接按照该兴趣关联规则计算兴趣度。 上述标示兴趣度高于指定值的链接得到新网页包括: 按照兴趣度对各条 链接进行排序; 从链接的排序结果中提取出兴趣度高于指定值的网页链接 , 用指定颜色标示提取出的网页链接得到新网页。 上述网页呈现方法应用于移动终端。 根据本发明的另一方面, 提供了一 种网页呈现装置, 包括: 兴趣度计算模块, 用于对网页中的各条链接按照兴 趣关联规则计算兴趣度; 网页标示模块, 用于标示兴趣度高于指定值的链接 得到新网页; 呈现模块, 用于呈现所述新网页; 其中, 通过对用户的历史访 问记录进行数据挖掘确定所述兴趣关联规则。 该装置还包括: 规则获取模块, 用于浏览器打开后, 读取该浏览器緩存 中的历史网页数据; 对历史网页数据进行数据挖掘, 得到兴趣关联规则。 上述兴趣关联规则存储在该装置的指定存储区, 该装置还包括: 更新模 块, 用于每隔指定时长或打开网页的设定次数对指定存储区的兴趣关联规则 进行更新; 上述兴趣度计算模块包括: 获取单元, 用于从该指定存储区读取 用户的兴趣关联规则; 计算单元, 用于对网页中的各条链接按照兴趣关联规 则计算兴趣度。 网页标示模块包括: 排序单元, 用于按照该兴趣度对各条链接进行排序; 新网页获取单元, 用于从链接的排序结果中提取出兴趣度高于该指定值的网 页链接, 用指定颜色标示提取出的网页链接得到新网页。 上述装置为移动终端。 通过本发明, 釆用根据用户的历史访问记录确定各个链接的兴趣度, 对 兴趣度高的链接进行标示, 使用户能够快速浏览到感兴趣的内容, 解决了浏 览大网页速度慢的问题, 进而达到了提升用户体验的效果。 附图说明 此处所说明的附图用来提供对本发明的进一步理解, 构成本申请的一部 分, 本发明的示意性实施例及其说明用于解释本发明, 并不构成对本发明的 不当限定。 在附图中: 图 1是才艮据本发明实施例 1的网页呈现方法的流程图; 图 2是才艮据本发明实施例 2的网页呈现方法的流程图; 图 3是 居本发明实施例 3的网页呈现装置的结构框图; 以及 图 4是 居本发明实施例 3的网页呈现装置的具体结构框图。 具体实施方式 下文中将参考附图并结合实施例来详细说明本发明。 需要说明的是, 在 不冲突的情况下, 本申请中的实施例及实施例中的特征可以相互组合。 移动终端 (例如手机) 等设备的显示界面有限, 而网页中的内容越来越 丰富, 导致不能在设备的显示界面上一次显示整个网页, 需要分多页显示。 基于此, 本发明实施例提供了一种网页呈现方法和装置, 以下实施例以在移 动终端上实现网页呈现为例进行说明。 实施例 1 图 1示出了才艮据本发明实施例的网页呈现方法流程图, 该方法包括以下 步骤: 步骤 S 102 , 对网页中的各条链接按照兴趣关联规则计算兴趣度, 其中, 通过对用户的历史访问记录进行数据挖掘确定兴趣关联规则; 上述兴趣关联规则可以在每次用户打开浏览器时进行确定, 具体可以釆 用浏览器打开后, 读取该浏览器緩存中的历史网页数据; 对历史网页数据进 行数据挖掘, 得到兴趣关联规则。 也可以在用户打开浏览器时到指定存储区 读取, 该指定存储区中兴趣关联规则可以按照一定的时间进行更新, 也可以 统计用户打开网页的次数, 当打开网页的次数达到设定次数时进行更新。 步骤 S 104, 标示兴趣度高于指定值的链接得到新网页; 例如, 按照兴趣度对各条链接进行排序; 从链接的排序结果中提取出兴 趣度高于指定值的网页链接,用指定颜色标示提取出的网页链接得到新网页。 步 4聚 S 106, 呈现该新网页。 相关技术中在呈现网页时, 直接按照网页原有的内容进行显示, 当设备 的屏幕比较小而网页又比较大时, 用户能够浏览到感兴趣的内容将会耗时较 长。 本实施例在呈现网页时, 根据用户访问的历史记录确定各个链接的兴趣 度, 对兴趣度高的链接进行标示, 使用户能够快速浏览到感兴趣的内容, 提 高了用户浏览网页的速度, 进而提高了用户体验的满意度。 实施例 2 本实施例提供了一种网页呈现方法, 该方法以在手机上实现为例进行说 明。 参见图 2, 该方法包括以下步 4聚: 步骤 S202, 用户打开浏览器, 保存手机浏览器緩存中的历史网页数据; 例如: 读取浏览器緩存中的历史网页数据, 把历史网页数据保存到相应 存储区; 步骤 S204 , 对保存的历史网页数据进行数据预处理; 本实施例的数据预处理过程可以完成数据分析及数据抽取、 数据处理、 数据变换。 例如, 对 WWW数据模型表示的 Cache (緩存) 中的数据进行处 理, 主要完成词千抽取、 词条切分等类似的处理; 步骤 S206, 对数据预处理后得到的数据进行数据挖掘, 得到该用户的兴 趣关联规则; 本实施例数据挖掘过程中使用的挖掘算法如下所述, 该数据挖掘过程包 括以下阶段: The present invention relates to the field of mobile communications, and in particular, to a web page presentation method and apparatus. BACKGROUND With the advent of the 3G era, the number of mobile Internet users has increased, and the requirements for mobile browsers have become higher and higher. However, because mobile browsers are limited by screen size and hardware configuration, mobile screens can only display web pages. Part of the information. And now the mobile browser's scrolling of web pages is in the order of the web links. If the user wants to see the content of interest, he or she needs to constantly drag the scroll bar to find the web link that the user is interested in. Especially when browsing a relatively large webpage, it takes a long time, and it will take more time to find the content that the user is interested in, which affects the user's use and reduces the user experience. SUMMARY OF THE INVENTION A primary object of the present invention is to provide a method and apparatus for rendering a web page to at least solve the problem that the content search time is large when the web page is large. According to an aspect of the present invention, a webpage presentation method is provided, including: calculating interest levels according to interest association rules for each link in a webpage; and displaying a new webpage by indicating a link whose interest degree is higher than a specified value; presenting a new webpage Wherein, the interest association rule is determined by performing data mining on the user's historical access record. Determining the interest association rule by performing data mining on the user's historical access includes: reading the historical webpage data in the browser cache after the browser is opened; performing data mining on the historical webpage data to obtain the interest association rule. The interest association rule is stored in the designated storage area, and the interest association rule of the specified storage area is updated every specified duration or the set number of times of opening the webpage; and the interest degree is calculated according to the interest association rule for each link in the webpage: The specified storage area reads the user's interest association rule; and the interest degree is calculated according to the interest association rule for each link in the webpage. The above-mentioned links with a higher degree of interest than the specified value are obtained by the new webpage including: The link is sorted; the webpage link whose interest degree is higher than the specified value is extracted from the sorted result of the link, and the extracted webpage link is marked with a specified color to obtain a new webpage. The above web page rendering method is applied to a mobile terminal. According to another aspect of the present invention, a webpage presentation apparatus is provided, including: an interest degree calculation module, configured to calculate an interest degree according to an interest association rule for each link in a webpage; and a webpage labeling module, configured to indicate a high degree of interest A new webpage is obtained by the link of the specified value; a rendering module is configured to present the new webpage; wherein the interest association rule is determined by performing data mining on the historical access record of the user. The device further includes: a rule obtaining module, configured to read historical webpage data in the browser cache after the browser is opened; perform data mining on the historical webpage data to obtain an interest association rule. The foregoing interest association rule is stored in a designated storage area of the device, and the device further includes: an update module, configured to update an interest association rule of the specified storage area every specified duration or a set number of times of opening the webpage; the interest degree calculation module The method includes: an obtaining unit, configured to read a user's interest association rule from the specified storage area; and a calculating unit, configured to calculate an interest degree according to the interest association rule for each link in the webpage. The webpage labeling module includes: a sorting unit, configured to sort each link according to the interest degree; and a new webpage obtaining unit, configured to extract, from the sorted result of the link, a webpage link whose interest degree is higher than the specified value, in a specified color Mark the extracted web page link to get a new web page. The above device is a mobile terminal. Through the invention, the interest degree of each link is determined according to the historical access record of the user, and the link with high interest is marked, so that the user can quickly browse the content of interest, thereby solving the problem that the speed of browsing the large webpage is slow, and further Achieve the effect of improving the user experience. BRIEF DESCRIPTION OF THE DRAWINGS The accompanying drawings, which are set to illustrate,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, In the drawings: FIG. 1 is a flowchart of a webpage presentation method according to Embodiment 1 of the present invention; FIG. 2 is a flowchart of a webpage presentation method according to Embodiment 2 of the present invention; 3 is a block diagram showing a structure of a webpage presenting apparatus according to a third embodiment of the present invention; and FIG. 4 is a block diagram showing a specific structure of a webpage presenting apparatus according to a third embodiment of the present invention. BEST MODE FOR CARRYING OUT THE INVENTION Hereinafter, the present invention will be described in detail with reference to the accompanying drawings. It should be noted that the embodiments in the present application and the features in the embodiments may be combined with each other without conflict. The display interface of a mobile terminal (such as a mobile phone) is limited, and the content in the webpage is more and more rich, which makes it impossible to display the entire webpage once on the display interface of the device, and needs to be displayed in multiple pages. Based on this, an embodiment of the present invention provides a webpage presentation method and apparatus. The following embodiments are described by taking a webpage presentation on a mobile terminal as an example. Embodiment 1 FIG. 1 is a flowchart of a webpage presentation method according to an embodiment of the present invention. The method includes the following steps: Step S102: Calculate an interest degree according to an interest association rule for each link in a webpage, where Performing data mining on the user's historical access record to determine the interest association rule; the above interest association rule may be determined each time the user opens the browser, and may specifically read the historical webpage data in the browser cache after opening the browser. Data mining of historical web page data to obtain interest association rules. It can also be read to the specified storage area when the user opens the browser. The interest association rule in the specified storage area can be updated according to a certain time, and the number of times the user opens the webpage can be counted. When the number of times the webpage is opened reaches the set number of times Update. Step S104, indicating that the link with the interest level higher than the specified value obtains a new webpage; for example, sorting each link according to the degree of interest; extracting a webpage link whose interest degree is higher than a specified value from the sorted result of the link, using the specified color Mark the extracted web page link to get a new web page. Step 4 gathers S 106 to present the new web page. In the related art, when the webpage is presented, the original content of the webpage is directly displayed. When the screen of the device is relatively small and the webpage is relatively large, the user can browse the content of interest for a long time. In the embodiment, when the webpage is presented, the interest degree of each link is determined according to the history record accessed by the user, and the link with high interest is marked, so that the user can quickly browse to the content of interest, and It increases the speed at which users browse the web, which in turn increases the satisfaction of the user experience. Embodiment 2 This embodiment provides a webpage presentation method, which is described by taking an implementation on a mobile phone as an example. Referring to FIG. 2, the method includes the following steps: Step S202: The user opens a browser to save historical webpage data in the cache of the mobile browser; for example: reading historical webpage data in the browser cache, and saving historical webpage data to Corresponding storage area; Step S204, performing data preprocessing on the saved historical webpage data; the data preprocessing process of the embodiment can complete data analysis, data extraction, data processing, and data transformation. For example, the data in the Cache (cache) represented by the WWW data model is processed, and the processing of the word thousand extraction, the term segmentation, and the like is mainly completed; in step S206, the data obtained by the data pre-processing is data-extracted, and the data is obtained. The user's interest association rule; the mining algorithm used in the data mining process of this embodiment is as follows: The data mining process includes the following stages:
( 1 ) 将兴趣词条定义为节点, 节点以二元组 (t, weight ) 表示, 简记 为 Node ( t ), 其中, weight为词条 t的权重; weight=新鲜度 X频度 ( f;)。 新鲜度反映兴趣词条存在时间的长短, 最近访问页面中的兴趣词条的新 鲜度相对较高, 在预测过程中, 越是最近访问的页面中的兴趣词条对预测起 的作用越大。 新鲜度可以等于保存该词条的时间, 也可以与保存时间具有一 定的线性关系。 fl为词条在页面中出现的频度, 例如, 某个词条在一个页面中出现了 8 次, 该页面中总的词条数为 100 (包括重复), 则 ¾=8/100。 其中, 兴趣词条可以是娱乐、 体育、 新闻、 天气、 咨询和财经等。 (1) The interest term is defined as a node, and the node is represented by a two-tuple (t, weight), abbreviated as Node ( t ), where weight is the weight of the term t; weight = freshness X frequency (f ;). Freshness reflects the length of time that interest terms exist. The recent freshness of interest terms in the recently visited page is relatively high. In the prediction process, the more recently the interest terms in the recently visited pages play a greater role in forecasting. Freshness can be equal to the time the item was saved, or it can have a linear relationship with the save time. Fl is the frequency at which the term appears on the page. For example, if an entry appears 8 times in a page, the total number of entries in the page is 100 (including repetitions), then 3⁄4=8/100. Among them, interest terms can be entertainment, sports, news, weather, consulting and finance.
( 2 )定义节点之间的联系为兴趣关联规则,用三元组 [ Node( ¾ ), support, Node ( tj ) ]表示, 简 ΐ己为 Rule [ Node ( t; ), Node ( tj ) ], 其中, support称为 关联支持度, 表示由节点 Node ) 转到节点 Node ( t} ) 的可能性; (2) Define the relationship between the nodes as the interest association rule, using the triplet [ Node( 3⁄4 ), support, Node ( tj ) ] indicates that simply [Node ( t ; ), Node ( tj ) ], where support is called association support, indicating the possibility of going to node Node ( t } ) by node Node ) ;
( 3 ) 数据预处理, 对页面集合 C 中的各页面抽取词千, 并进行词千切 分, 对应地得到页面 Yk的词条集合 K ( Yk ) ={ ( ti' , weight ) I ti' T (汉 语词汇;), i ( 自然数) }; (3) Data preprocessing, extracting thousands of words for each page in page set C, and performing word segmentation, correspondingly obtaining a set of words of page Y k K ( Y k ) = { ( ti ' , weight ) I ti 'T (Chinese vocabulary;), i (natural number) };
K ( Yk ) 表示在 Yk页面中出现的所有的兴趣词条的集合, t 为其中一 个词条。 緩存 ( Cache ) 中的历史网页数据通常釆用 WWW数据模型表示, 根据 具体实现还可能对 WWW数据模型的历史网页数据进行数据格式转换,转换 为所需要的数据格式。 词千的抽取和切分可以参考 IEEE (美国电气和电子工程师协会 )的数据 ¾;掘在网页预耳又中的应用 ( application of data mining in Web pre-fetching )。 K ( Y k ) represents the set of all interest terms that appear in the Y k page, and t is one of the terms. The historical webpage data in the cache (Cache) is usually represented by the WWW data model. According to the specific implementation, the historical webpage data of the WWW data model may be converted into a data format and converted into a required data format. The extraction and segmentation of the word thousands can be referred to the IEEE (Institute of Electrical and Electronics Engineers) data application (application of data mining in Web pre-fetching).
( 4 )从页面集合 C中的各页面 Yk中提取该页面的链接点, 得到页面的 链接点集合 L ( Yk ) ={lk, i I lk, i为页面 Yk中的链接点 }; 链接集合表示 Yk页面中所有的可以链接进入的链接点的集合,通过点击(4) extracting the link points of the page from each page Yk in the page set C, and obtaining the link point set L (Y k ) of the page = {l k , i I l k , i is the link in the page Y k Point}; a collection of links represents a collection of all link points that can be linked into the Yk page, by clicking
Yk页面里的链接就可以进入下一个页面。 The link in the Y k page will take you to the next page.
( 5 ) 提取页面的链接点的同时, 获取链接点的链接词千, 对链接词千 进行切分, 得到页面中链接点 lk, ,的链接词条集合 Q ( lk, .. stnng )
Figure imgf000007_0001
I tj" 在 lk, i- string中, j N} ; Q ( lk, L stnng ) 表示对 Yk中的某个链接 lk, ,对其链接词千进行切分后得到 的词条的集合。 通过以上的数据处理得到了四种集合, 分别为: 页面集合、 页面的词条 集合、 页面的链接点集合以及页面中链接点的链接词条集合。 得到四种集合 是为了下面计算兴趣关联规则 [ Node ( ti ), support, Node ( tj ) ], 即从一个 词条转移到另一个词条的可能性, 进而在计算出从一个页面转移到其中某个 链接的可能性。
(5) At the same time as extracting the link point of the page, the link word of the link point is obtained, and the link word is segmented, and the link term set Q ( lk, .. stnng ) of the link point l k , , in the page is obtained.
Figure imgf000007_0001
I tj" in l k , i- string, j N} ; Q ( l k , L stnng ) represents a link to a link l k in Y k , which is obtained by segmenting the link word 1000 The collection of the above data is obtained by the following data processing: page collection, collection of words of the page, collection of link points of the page, and collection of linked terms of the link points in the page. The four sets are obtained for the following calculations. Interest association rules [ Node ( ti ), support, Node ( tj ) ], the possibility of moving from one entry to another, and then calculating the possibility of moving from one page to one of the links.
( 6 ) 生成兴趣关联规则, 兴趣关联规则的集合构成兴趣关联知识库; 生成兴趣关联规则的具体过程可以包括下述方法: 遍历页面集合 C, 对于页面 Yk遍历该页面中的链接集合 L ( Yk ), 逐一 判断其中的链接点的源页面 (链接点所在的页面) Υ」是否属于页面集合 C, 如果属于, 则遍历页面 Yk和 Υ」的词条集合, 将 Yk与 Υ」中的词条进行组合, 计算词条组合中从一个词条转移到另一个词条的转移支持率, 该转移支持率 等于两个词条权重之和, 当词条在多个页面中重复出现时, 则在支持率中累 加词条的权重; 如果链接点的源页面 Yj不属于页面集合 C , 则遍历页面 Yk和链接点的 链接词条集合,将 Yk与链接点的链接词条集合中的词条进行组合, 计算词条 组合中从一个词条转移到另一个词条的转移支持率, 该转移支持率等于页面 Yk中词条的权重, 当链接词条在多个链接点的链接词条集合中出现时, 则转 移支持率累加页面 Yk中词条的权重。 生成兴趣关联规则的伪代码如下: for保存的页面集合 C 中的每个页面(6) generating an interest association rule, and the set of interest association rules constitute an interest association knowledge base; The specific process of generating the interest association rule may include the following methods: traversing the page set C, traversing the link set L ( Y k ) in the page for the page Y k , and determining the source page of the link point one by one (the page where the link point is located) ) Υ" belongs to page set C, if it belongs, it traverses the set of words of page Y k and Υ", combines the terms in Y k with Υ", and calculates the transfer from one entry to another in the combination of terms The transfer support rate of a term, the transfer support rate is equal to the sum of the weights of the two terms. When the entry is repeated in multiple pages, the weight of the entry is accumulated in the support rate; if the source page of the link point If Yj does not belong to page set C, it traverses the link term set of page Y k and the link point, and combines Y k with the term in the link term set of the link point to calculate the transfer from one entry to the entry combination. The transfer support rate of another term, the transfer support rate is equal to the weight of the entry in the page Y k , and when the link term appears in the link term set of the plurality of link points, the transfer support rate is accumulated in the page Y k Entry the weight of. The pseudo code that generates the affinity association rule is as follows: For each page in the saved page collection C
Yk { for链接集合 L ( Yk ) 中的每个链接 lk, r Y k { for each link in the link set L ( Y k ) l k , r
{设1]^ r的目标页面为 Yj ; if Yj e C then { for 页面 Yk中的词条集 K ( Yk ) 中的每个词条 ( , weightp ) The target page of {set 1]^ r is Yj; if Yj e C then { for each entry in the set of terms K ( Y k ) in page Y k ( , weightp )
{ for 页面 Yj中的词条集 K ( Yj ) 中的每个词条 ( tq', weightq ) { { for each entry in the set of terms K ( Yj ) in page Yj ( t q ', weight q ) {
Rule [Node ( tp' ), Node ( tq' ) ]的支持度 +=g ( weightp, weightq ) ; ( tp', weightp ) ≡K ( Yk ), ( tq', weightq ) ≡K ( Yj ) Rule [Node (t p ' ), Node ( t q ' ) ] Support +=g (weightp, weightq ) ; ( t p ', weightp ) ≡K ( Y k ), ( t q ', weightq ) ≡ K ( Yj )
} } } }
} else  } else
{ for 页面 Yk中的词条集合 K (Yk) 中的每个词条 ( , weightp ) { for each entry in the set of terms K (Y k ) in page Y k ( , weightp )
{ for Q (1 k, r- string ) 中的每个词条 tq' { for Q (1 k, r- string ) for each entry t q '
{ {
Rule [ Node ( tp' ), Node ( tq' ) ] 的支持度 +=weightp; ( tp', weightp ) e Yi, tq'eQ ( lk, r.string ) Rule [ Node ( t p ' ), Node ( t q ' ) ] Support +=weight p ; ( t p ', weightp ) e Yi, t q 'eQ ( l k , r . string )
} }
}  }
}  }
} } 其中, g ( weightp, weightq ) 为函数, 令其为 ( weightp+weightq ), 表示 緩存中的页面的链接点及链接点所指向的页面对兴趣关联知识库中的兴趣关 联规则的影响。 使用上面的关联规则挖掘算法计算 Rule [Node (¾), Node (tj) ]的支持度, 反映了当前浏览器用户访问网页兴趣和习惯, 作为下一步 计算链接兴趣度的依据。 计算链接兴趣度的方法可以为: 在兴趣关联规则数据库中查找当前访问 页面中的词条与链接词条的兴趣关联规则, 计算兴趣度, 该兴趣度等于当前 访问页面中词条的权重乘以该查找到的兴趣关联规则中的支持度, 完成兴趣 度的计算后, 对得到的全部链接按照兴趣度进行排序。 步骤 S208 , 通过数据挖掘给出的结果和当前用户访问的网页, 对计算出 兴趣度高的链接进行标示, 得到新网页。 步骤 S210, 按照标示的链接浏览新网页。 本实施例根据用户当前访问的网页和上述兴趣关联规则进行计算得到当 前网页链接集合中各个链接的兴趣度后,可以对各个链接的兴趣度进行排序, 并提取出兴趣度较高的网页链接, 使用突出的颜色进行标示; 并在网页滚动 浏览的时候按照已经标示的链接来滚动浏览和聚焦。 本实施例通过获取浏览器緩存中保存的历史网页数据, 这些数据中隐含 着用户的兴趣爱好和访问习惯, 使用兴趣关联规则挖掘, 挖掘出反映用户兴 趣和习惯的兴趣关联规则。 根据兴趣关联规则和用户当前所浏览的网页, 对 当前网页中用户兴趣度高的链接进行标示。 并且在网页滚动浏览的时候, 用 户可以选择按照已经标示网页链接来浏览网页。 如果用户下一个要浏览的网 页链接 (已经被标示出来的) 的位置在手机当前屏幕内, 则用户下一个要浏 览的网页链接就是屏幕内标示出的网页链接。 用户下一个要浏览的网页链接 位置不在手机当前屏幕内, 在手机的下一个刷新页面, 则浏览器会进行先翻 页, 然后滚动浏览到翻页后标示出的网页链接。 本实施例在呈现网页时, 根据用户访问的历史记录确定各个链接的兴趣 度, 对兴趣度高的链接进行标示, 使用户能够快速浏览到感兴趣的内容, 提 高了手机浏览器在浏览一个大网页时的速度, 进而提升使用浏览器的用户体 验。 实施例 3 参见图 3 , 本实施例提供了一种网页呈现装置, 该装置包括: 兴趣度计算模块 32 , 用于对网页中的各条链接按照兴趣关联规则计算兴 趣度, 其中, 通过对用户的历史访问记录进行数据挖掘确定兴趣关联规则; 网页标示模块 34 , 连接至兴趣度计算模块 32 , 用于标示兴趣度高于指 定值的链接得到新网页; 呈现模块 36 , 连接至网页标示模块 34 , 用于呈现上述新网页。 其中, 兴趣度计算模块 32 的具体实现可以参考实施例 2 中的算法, 这 里不再详述。 上述兴趣关联规则可以在每次用户打开浏览器时进行确定, 基于此, 该 装置还包括: 规则获取模块, 用于浏览器打开后, 读取浏览器緩存中的历史 网页数据; 对该历史网页数据进行数据挖掘, 得到兴趣关联规则。 得到上述兴趣关联规则后, 可以将该兴趣关联规则存储到指定存储区, 用于后续呈现网页时使用。 因此在用户打开浏览器时, 可以到该指定存储区 读取该兴趣关联规则, 该指定存储区中兴趣关联规则可以按照一定的时间进 行更新, 也可以统计用户打开网页的次数, 当打开网页的次数达到设定次数 时进行更新。 基于此, 该装置还包括: 更新模块, 用于每隔指定时长或打开 网页的设定次数对该指定存储区的兴趣关联规则进行更新; 相应地, 兴趣度 计算模块 32包括: 获取单元, 用于从指定存储区读取用户的兴趣关联规则; 计算单元, 用于对网页中的各条链接按照兴趣关联规则计算兴趣度。 参见图 4 , 为本实施例提供的网页呈现装置的具体结构框图, 该装置包 括: 兴趣度计算模块 32、 网页标示模块 34和呈现模块 36 , 其中, 网页标示 模块 34包括: 排序单元 342 , 用于按照兴趣度对各条链接进行排序; 新网页获取单元 344 , 用于从链接的排序结果中提取出兴趣度高于指定 值的网页链接, 用指定颜色标示提取出的网页链接得到新网页。 本实施例提供的装置可以是移动终端, 也可以是其它设备。 本实施例的装置在呈现网页时, 才艮据用户访问的历史记录确定各个链接 的兴趣度, 对兴趣度高的链接进行标示, 使用户能够快速浏览到感兴趣的内 容, 提高了浏览器在浏览一个大网页时的速度, 进而提升使用浏览器的用户 体验。 从以上的描述中可以看出, 本发明实现了如下技术效果: 以上实施例通过基于数据挖掘的方法对网页进行处理, 得到带有链接标 示的新网页, 呈现该新网页。 可以加快浏览器对于大网页的浏览速度, 并提 高用户体 -险。 显然, 本领域的技术人员应该明白, 上述的本发明的各模块或各步骤可 以用通用的计算装置来实现, 它们可以集中在单个的计算装置上, 或者分布 在多个计算装置所组成的网络上, 可选地, 它们可以用计算装置可执行的程 序代码来实现, 从而, 可以将它们存储在存储装置中由计算装置来执行, 并 且在某些情况下, 可以以不同于此处的顺序执行所示出或描述的步骤, 或者 将它们分别制作成各个集成电路模块, 或者将它们中的多个模块或步骤制作 成单个集成电路模块来实现。 这样, 本发明不限制于任何特定的硬件和软件 结合。 以上所述仅为本发明的优选实施例而已, 并不用于限制本发明, 对于本 领域的技术人员来说, 本发明可以有各种更改和变化。 凡在本发明的 ^"神和 原则之内, 所作的任何修改、 等同替换、 改进等, 均应包含在本发明的保护 范围之内。 } } where g (weightp, weightq ) is a function, let it be ( weight p + weight q ), which means that the link point of the page in the cache and the page pointed to by the link point are related to the interest association rule in the interest-related knowledge base. influences. The above association rule mining algorithm is used to calculate the support degree of Rule [Node (3⁄4), Node (tj)], which reflects the current browser user's access to webpage interests and habits, as the basis for calculating the link interest degree in the next step. The method for calculating the link interest degree may be: searching the interest association rule database for the interest association rule of the term and the link term in the current access page, and calculating the interest degree, which is equal to the weight of the term in the current access page multiplied by The degree of support in the found interest association rule, after completing the calculation of the interest degree, sorts all the obtained links according to the degree of interest. In step S208, the result of the data mining and the webpage accessed by the current user are used to mark the link with high interest level, and a new webpage is obtained. Step S210, browsing the new webpage according to the marked link. In this embodiment, after calculating the interest degree of each link in the current webpage link set according to the webpage currently accessed by the user and the above-mentioned interest association rule, the interest degree of each link may be sorted, and the webpage link with high interest degree is extracted. Use prominent colors for labeling; and scroll through and focus on the already marked links as the page scrolls through. In this embodiment, the historical webpage data saved in the browser cache is obtained. The data implied the user's hobbies and access habits, and the interest association rules are used to mine the interest association rules reflecting the user's interests and habits. According to the interest association rule and the webpage currently browsed by the user, the link with high user interest in the current webpage is marked. And when the webpage is scrolled, the user can choose to browse the webpage according to the already marked webpage link. If the location of the next webpage link (which has already been marked) of the user to browse is in the current screen of the mobile phone, the next webpage link to be browsed by the user is the webpage link marked on the screen. The next link of the webpage to be browsed by the user is not in the current screen of the mobile phone. On the next refreshing page of the mobile phone, the browser will first page through the page, and then scroll to the webpage link indicated after the page turning. In the embodiment, when the webpage is presented, the interest degree of each link is determined according to the history record accessed by the user, and the link with high interest is marked, so that the user can quickly browse the content of interest, and the mobile browser is improved in browsing. The speed of the web page, which in turn improves the user experience of using the browser. Embodiment 3 Referring to FIG. 3, the embodiment provides a webpage presentation apparatus, where the apparatus includes: an interest degree calculation module 32, configured to calculate an interest degree according to an interest association rule for each link in a webpage, where The history access record performs data mining to determine the interest association rule; the webpage labeling module 34 is connected to the interest degree calculation module 32, and is configured to mark the link with the interest degree higher than the specified value to obtain a new webpage; the presentation module 36 is connected to the webpage labeling module 34. , used to render the above new web page. For a specific implementation of the interest degree calculation module 32, reference may be made to the algorithm in Embodiment 2, It will not be detailed in detail. The above-mentioned interest association rule may be determined each time the user opens the browser. Based on this, the device further includes: a rule obtaining module, configured to read historical webpage data in the browser cache after the browser is opened; Data is used for data mining to obtain interest association rules. After the foregoing interest association rule is obtained, the interest association rule may be stored in a specified storage area for use in subsequent rendering of the webpage. Therefore, when the user opens the browser, the interest association rule can be read in the specified storage area, and the interest association rule in the specified storage area can be updated according to a certain time, and the number of times the user opens the webpage can be counted, when the webpage is opened. Update when the number of times reaches the set number of times. Based on this, the device further includes: an update module, configured to update the interest association rule of the specified storage area every specified duration or a set number of times of opening the webpage; correspondingly, the interest degree calculation module 32 includes: an obtaining unit, Reading the user's interest association rule from the specified storage area; the calculating unit is configured to calculate the interest degree according to the interest association rule for each link in the webpage. 4 is a block diagram of a specific structure of a webpage presentation apparatus provided by the embodiment. The apparatus includes: an interest degree calculation module 32, a webpage labeling module 34, and a presentation module 36. The webpage labeling module 34 includes: a sorting unit 342. The plurality of links are sorted according to the degree of interest; the new webpage obtaining unit 344 is configured to extract a webpage link whose interest degree is higher than a specified value from the sorted result of the link, and mark the extracted webpage link with a specified color to obtain a new webpage. The device provided in this embodiment may be a mobile terminal or other device. When the device of the embodiment presents the webpage, the interest degree of each link is determined according to the history record accessed by the user, and the link with high interest is marked, so that the user can quickly browse to the content of interest, and the browser is improved. The speed at which a large web page is viewed, which in turn improves the user experience of using the browser. As can be seen from the above description, the present invention achieves the following technical effects: The above embodiment processes a webpage by a method based on data mining, and obtains a new webpage with a link identifier, and presents the new webpage. It can speed up the browsing speed of the browser for large web pages and improve the user's body-risk. Obviously, those skilled in the art should understand that the above modules or steps of the present invention may be Implemented by a general-purpose computing device, which may be centralized on a single computing device or distributed over a network of computing devices, optionally, they may be implemented by program code executable by the computing device, such that They may be stored in a storage device by a computing device, and in some cases, the steps shown or described may be performed in an order different than that herein, or separately fabricated into individual integrated circuit modules. Alternatively, multiple modules or steps of them can be implemented as a single integrated circuit module. Thus, the invention is not limited to any specific combination of hardware and software. The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes can be made to the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the scope of the present invention are intended to be included within the scope of the present invention.

Claims

权 利 要 求 书 Claim
1. 一种网页呈现方法, 包括: A method for rendering a web page, comprising:
对网页中的各条链接按照兴趣关联规则计算兴趣度;  Calculate the interest level according to the interest association rule for each link in the webpage;
标示兴趣度高于指定值的链接得到新网页;  A link indicating that the degree of interest is higher than the specified value results in a new web page;
呈现所述新网页;  Presenting the new web page;
其中, 通过对用户的历史访问记录进行数据挖掘确定所述兴趣关联 规则。  The interest association rule is determined by performing data mining on the historical access record of the user.
2. 根据权利要求 1所述的方法, 其中, 通过对用户的历史访问进行数据挖 掘确定所述兴趣关联规则包括: 2. The method according to claim 1, wherein determining the interest association rule by performing data mining on a historical access of the user comprises:
浏览器打开后, 读取所述浏览器緩存中的历史网页数据; 对所述历史网页数据进行数据挖掘, 得到所述兴趣关联规则。  After the browser is opened, the historical webpage data in the browser cache is read; data mining is performed on the historical webpage data to obtain the interest association rule.
3. 根据权利要求 1所述的方法, 其中, 所述兴趣关联规则存储在指定存储 区, 并每隔指定时长或打开网页的设定次数对所述指定存储区的兴趣关 联规则进行更新; The method according to claim 1, wherein the interest association rule is stored in a designated storage area, and an interest association rule of the specified storage area is updated every specified duration or a set number of times of opening a webpage;
对网页中的各条链接按照兴趣关联规则计算兴趣度包括: 从所述指定存储区读取用户的兴趣关联规则;  Calculating the degree of interest according to the interest association rule for each link in the webpage includes: reading the user's interest association rule from the specified storage area;
对网页中的各条链接按照所述兴趣关联规则计算兴趣度。  The interest degree is calculated for each link in the web page according to the interest association rule.
4. 根据权利要求 1所述的方法, 其中, 标示兴趣度高于指定值的链接得到 新网页包括: 4. The method according to claim 1, wherein the link indicating that the degree of interest is higher than the specified value is obtained.
按照所述兴趣度对所述各条链接进行排序;  Sorting the links according to the degree of interest;
从链接的排序结果中提取出兴趣度高于所述指定值的网页链接, 用 指定颜色标示提取出的网页链接得到新网页。  A webpage link whose interest degree is higher than the specified value is extracted from the sorted result of the link, and the extracted webpage link is marked with a specified color to obtain a new webpage.
5. 才艮据权利要求 1至 4任一项所述的方法, 其中, 所述网页呈现方法应用 于移动终端。 The method according to any one of claims 1 to 4, wherein the web page presentation method is applied to a mobile terminal.
6. —种网页呈现装置, 包括: 6. A web page rendering device, comprising:
兴趣度计算模块, 用于对网页中的各条链接按照兴趣关联规则计算 兴趣度;  a degree of interest calculation module, configured to calculate a degree of interest according to an interest association rule for each link in the webpage;
网页标示模块, 用于标示兴趣度高于指定值的链接得到新网页; 呈现模块, 用于呈现所述新网页;  a webpage labeling module, configured to mark a link whose interest is higher than a specified value to obtain a new webpage; and a rendering module, configured to present the new webpage;
其中, 通过对用户的历史访问记录进行数据挖掘确定所述兴趣关联 规则。  The interest association rule is determined by performing data mining on the historical access record of the user.
7. 根据权利要求 6所述的装置, 其中, 所述装置还包括: The device according to claim 6, wherein the device further comprises:
规则获取模块, 用于浏览器打开后, 读取所述浏览器緩存中的历史 网页数据; 对所述历史网页数据进行数据挖掘, 得到所述兴趣关联规则。  a rule obtaining module, configured to read historical webpage data in the browser cache after the browser is opened, and perform data mining on the historical webpage data to obtain the interest association rule.
8. 根据权利要求 6所述的装置, 其中, 所述兴趣关联规则存储在所述装置 的指定存储区, 所述装置还包括: The device according to claim 6, wherein the interest association rule is stored in a designated storage area of the device, and the device further includes:
更新模块, 用于每隔指定时长或打开网页的设定次数对所述指定存 储区的兴趣关联规则进行更新;  An update module, configured to update an interest association rule of the specified storage area every specified duration or a set number of times of opening a webpage;
所述兴趣度计算模块包括: 获取单元, 用于从所述指定存储区读取 用户的兴趣关联规则; 计算单元, 用于对网页中的各条链接按照所述兴 趣关联规则计算兴趣度。  The interest degree calculation module includes: an obtaining unit, configured to read a user's interest association rule from the specified storage area; and a calculating unit, configured to calculate an interest degree according to the interest association rule for each link in the webpage.
9. 根据权利要求 6所述的装置, 其中, 所述网页标示模块包括: 9. The device according to claim 6, wherein the webpage labeling module comprises:
排序单元, 用于按照所述兴趣度对所述各条链接进行排序; 新网页获取单元, 用于从链接的排序结果中提取出兴趣度高于所述 指定值的网页链接, 用指定颜色标示提取出的网页链接得到新网页。  a sorting unit, configured to sort the links according to the interest degree; a new webpage obtaining unit, configured to extract, from the sorted result of the link, a webpage link whose interest degree is higher than the specified value, and is marked with a specified color The extracted web page link gets a new web page.
10. 居权利要求 6至 9任一项所述的装置, 其中, 所述装置为移动终端。 10. The device of any one of claims 6 to 9, wherein the device is a mobile terminal.
PCT/CN2010/077881 2010-07-14 2010-10-19 Method and device for presenting web pages WO2012006828A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010230248.9 2010-07-14
CN2010102302489A CN101894157A (en) 2010-07-14 2010-07-14 Webpage display method and device

Publications (1)

Publication Number Publication Date
WO2012006828A1 true WO2012006828A1 (en) 2012-01-19

Family

ID=43103349

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2010/077881 WO2012006828A1 (en) 2010-07-14 2010-10-19 Method and device for presenting web pages

Country Status (2)

Country Link
CN (1) CN101894157A (en)
WO (1) WO2012006828A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10346496B2 (en) 2014-06-06 2019-07-09 Tencent Technology (Shenzhen) Company Limited Information category obtaining method and apparatus

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013154542A1 (en) * 2012-04-10 2013-10-17 Empire Technology Development Llc Distilling popular information of a web page
CN103425670B (en) * 2012-05-16 2018-11-13 百度在线网络技术(北京)有限公司 A kind of method, apparatus and equipment providing a user content recommendation information
CN103678306A (en) * 2012-08-31 2014-03-26 腾讯科技(深圳)有限公司 Method and device for displaying access link
CN103885968B (en) * 2012-12-20 2019-04-12 北京百度网讯科技有限公司 It is a kind of for providing the method and apparatus of recommendation information
CN104636374A (en) * 2013-11-11 2015-05-20 腾讯科技(深圳)有限公司 Browser webpage displaying method and browser
CN104331260A (en) * 2014-03-05 2015-02-04 广州三星通信技术研究有限公司 Method and equipment for displaying web pages on screen
CN104991935B (en) * 2015-07-06 2019-03-12 无锡天脉聚源传媒科技有限公司 A kind for the treatment of method and apparatus of website attention rate
CN107798095A (en) * 2017-10-25 2018-03-13 星潮闪耀移动网络科技(中国)有限公司 Update the methods, devices and systems of column purpose output order
CN111767101A (en) * 2020-03-05 2020-10-13 北京沃东天骏信息技术有限公司 Page generation method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6055542A (en) * 1997-10-29 2000-04-25 International Business Machines Corporation System and method for displaying the contents of a web page based on a user's interests
CN101071424A (en) * 2006-06-23 2007-11-14 腾讯科技(深圳)有限公司 Personalized information push system and method
CN101739402A (en) * 2008-11-07 2010-06-16 华为技术有限公司 Method and device for interest analysis
CN101770520A (en) * 2010-03-05 2010-07-07 南京邮电大学 User interest modeling method based on user browsing behavior

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101071426A (en) * 2006-05-10 2007-11-14 北京锐科天智科技有限责任公司 Personalized webpage generating method and device
CN101661477A (en) * 2008-08-26 2010-03-03 华为技术有限公司 Search method and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6055542A (en) * 1997-10-29 2000-04-25 International Business Machines Corporation System and method for displaying the contents of a web page based on a user's interests
CN101071424A (en) * 2006-06-23 2007-11-14 腾讯科技(深圳)有限公司 Personalized information push system and method
CN101739402A (en) * 2008-11-07 2010-06-16 华为技术有限公司 Method and device for interest analysis
CN101770520A (en) * 2010-03-05 2010-07-07 南京邮电大学 User interest modeling method based on user browsing behavior

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10346496B2 (en) 2014-06-06 2019-07-09 Tencent Technology (Shenzhen) Company Limited Information category obtaining method and apparatus

Also Published As

Publication number Publication date
CN101894157A (en) 2010-11-24

Similar Documents

Publication Publication Date Title
WO2012006828A1 (en) Method and device for presenting web pages
CN103886017B (en) A kind of for providing the method and apparatus of related sub links in Search Results
CN107609152B (en) Method and apparatus for expanding query expressions
CN102708174B (en) Method and device for displaying rich media information in browser
WO2011109957A1 (en) Method and apparatus for improving web page access speed
CN107766399B (en) Method and system for matching images to content items and machine-readable medium
EP2557511B1 (en) Information processing device, information processing method, information processing programme, and recording medium
CN101930475A (en) Web page display method and browser
CN106339398A (en) Pre-reading method and device for webpage and intelligent terminal device
WO2017059800A1 (en) Web crawler scheduling method and web crawler system applying same
KR20140012664A (en) Method for rearranging web page
US20180285331A1 (en) Method, server, browser, and system for recommending text information
US20150161278A1 (en) Method and apparatus for identifying webpage type
WO2014190265A1 (en) Community detection in weighted graphs
KR100983003B1 (en) Target advertisement system and method of mobile communication terminal
CN107562939A (en) Vertical field news recommends method, apparatus and readable storage medium
US20130305131A1 (en) Method, system and computer storage medium for pre-reading network data
CN107273393B (en) Image searching method and device for mobile equipment and data processing system
WO2014183544A1 (en) Method and device for generating a personalized navigation webpage
CN104090923A (en) Method and device for displaying rich media information in browser
CN107315753B (en) Paging method and device across multiple databases
JP5435731B2 (en) Concierge device, concierge service providing method, and concierge program
CN108647312A (en) A kind of user preference analysis method and its device
JP5827874B2 (en) Keyword acquiring apparatus, content providing system, keyword acquiring method, program, and content providing method
CN105653724A (en) Page exposure monitoring method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10854620

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10854620

Country of ref document: EP

Kind code of ref document: A1