US8126891B2 - Future data event prediction using a generative model
- Publication number
- US8126891B2 (application US12/255,556)
- Authority
- US
- United States
- Prior art keywords
- user
- search
- events
- search engine
- episode
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related, expires
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
Abstract
Description
where Γ is determined as follows:
may be used as the frequency threshold to obtain significant N-node episodes.
if it is less than
and to 1 otherwise.
for all j. By regarding $\theta_j^g$ as the prior probability corresponding to the jth mixture component $\Lambda_{\alpha_j}$, the posterior probability for the lth mixture component, with respect to the ith sequence $X_i \in D_Y$, can be written using Bayes' Rule:
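The Bayes' Rule step described here can be sketched numerically. The following is a minimal illustration, not the patent's own formula: it assumes the standard mixture-model form, in which the posterior of component l is its prior multiplied by the likelihood of the sequence under that component, normalized over all components. The function name `mixture_posteriors` and the sample numbers are hypothetical.

```python
from typing import List, Sequence


def mixture_posteriors(priors: Sequence[float],
                       likelihoods: Sequence[float]) -> List[float]:
    """Posterior probability of each mixture component for one sequence.

    priors[j]      -- prior weight of component j (assumed to sum to 1)
    likelihoods[j] -- likelihood of the sequence under component j
    Both inputs are illustrative assumptions, not values from the patent.
    """
    if len(priors) != len(likelihoods):
        raise ValueError("priors and likelihoods must have the same length")
    # Numerator of Bayes' rule: prior times likelihood, per component.
    joint = [p * lk for p, lk in zip(priors, likelihoods)]
    # Denominator: total probability of the sequence under the mixture.
    evidence = sum(joint)
    if evidence <= 0.0:
        raise ValueError("sequence has zero probability under every component")
    return [j / evidence for j in joint]


# Hypothetical two-component example with equal priors.
post = mixture_posteriors([0.5, 0.5], [0.2, 0.6])
print(post)
```

With equal priors, the posterior is simply proportional to the likelihoods, so the second component receives about three times the weight of the first.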
then,
the joint probability of the data $D_Y$ and the most likely state sequence $q_\alpha^*$ is given by
Claims (20)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/255,556 US8126891B2 (en) | 2008-10-21 | 2008-10-21 | Future data event prediction using a generative model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/255,556 US8126891B2 (en) | 2008-10-21 | 2008-10-21 | Future data event prediction using a generative model |
Publications (2)
Publication Number | Publication Date |
---|---|
US20100100517A1 (en) | 2010-04-22 |
US8126891B2 (en) | 2012-02-28 |
Family
ID=42109458
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/255,556 (Expired - Fee Related) US8126891B2 (en) | 2008-10-21 | 2008-10-21 | Future data event prediction using a generative model |
Country Status (1)
Country | Link |
---|---|
US (1) | US8126891B2 (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11132748B2 (en) * | 2009-12-01 | 2021-09-28 | Refinitiv Us Organization Llc | Method and apparatus for risk mining |
US20120221485A1 (en) * | 2009-12-01 | 2012-08-30 | Leidner Jochen L | Methods and systems for risk mining and for generating entity risk profiles |
US20130124496A1 (en) * | 2011-11-11 | 2013-05-16 | Microsoft Corporation | Contextual promotion of alternative search results |
US20130191310A1 (en) * | 2012-01-23 | 2013-07-25 | Microsoft Corporation | Prediction model refinement for information retrieval system |
CN103631807A (en) * | 2012-08-24 | 2014-03-12 | 腾讯科技(深圳)有限公司 | Method and device for switching engines so as to conduct searching again |
WO2014149827A1 (en) * | 2013-03-15 | 2014-09-25 | REMTCS Inc. | Artificial neural network interface and methods of training the same for various use cases |
JP5486116B1 (en) * | 2013-06-11 | 2014-05-07 | ヤフー株式会社 | User information providing apparatus, user information providing method, user information providing program, and advertisement distribution system |
US10872023B2 (en) * | 2017-09-24 | 2020-12-22 | Microsoft Technology Licensing, Llc | System and method for application session monitoring and control |
US20220179838A1 (en) * | 2020-12-03 | 2022-06-09 | Servicenow, Inc. | System and Method for Modification of Storage Engine of Database Table |
US20230161779A1 (en) * | 2021-11-22 | 2023-05-25 | Yandex Europe Ag | Multi-phase training of machine learning models for search results ranking |
Patent Citations (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6910003B1 (en) * | 1999-09-17 | 2005-06-21 | Discern Communications, Inc. | System, method and article of manufacture for concept based information searching |
US20030078766A1 (en) * | 1999-09-17 | 2003-04-24 | Douglas E. Appelt | Information retrieval by natural language querying |
US6687696B2 (en) * | 2000-07-26 | 2004-02-03 | Recommind Inc. | System and method for personalized search, information filtering, and for generating recommendations utilizing statistical latent class models |
US7363308B2 (en) * | 2000-12-28 | 2008-04-22 | Fair Isaac Corporation | System and method for obtaining keyword descriptions of records from a large database |
US20030101449A1 (en) * | 2001-01-09 | 2003-05-29 | Isaac Bentolila | System and method for behavioral model clustering in television usage, targeted advertising via model clustering, and preference programming based on behavioral model clusters |
US20040153288A1 (en) | 2001-01-23 | 2004-08-05 | Intel Corporation | Method and system for detecting semantic events |
US6826569B2 (en) | 2001-04-10 | 2004-11-30 | 12 Limited | Method for identifying patterns |
US7233933B2 (en) * | 2001-06-28 | 2007-06-19 | Microsoft Corporation | Methods and architecture for cross-device activity monitoring, reasoning, and visualization for providing status and forecasts of a users' presence and availability |
US20030101187A1 (en) * | 2001-10-19 | 2003-05-29 | Xerox Corporation | Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects |
US20040024773A1 (en) * | 2002-04-29 | 2004-02-05 | Kilian Stoffel | Sequence miner |
US20040002994A1 (en) * | 2002-06-27 | 2004-01-01 | Brill Eric D. | Automated error checking system and method |
US20040158469A1 (en) * | 2003-02-05 | 2004-08-12 | Verint Systems, Inc. | Augmentation and calibration of output from non-deterministic text generators by modeling its characteristics in specific environments |
US20060106743A1 (en) * | 2004-11-16 | 2006-05-18 | Microsoft Corporation | Building and using predictive models of current and future surprises |
US20060224579A1 (en) * | 2005-03-31 | 2006-10-05 | Microsoft Corporation | Data mining techniques for improving search engine relevance |
US7899761B2 (en) * | 2005-04-25 | 2011-03-01 | GM Global Technology Operations LLC | System and method for signal prediction |
US20070005646A1 (en) | 2005-06-30 | 2007-01-04 | Microsoft Corporation | Analysis of topic dynamics of web search |
US20070136264A1 (en) | 2005-12-13 | 2007-06-14 | Tran Bao Q | Intelligent data retrieval system |
US20080002791A1 (en) | 2006-06-21 | 2008-01-03 | Sam Gratrix | Likelihood detector apparatus and method |
US20080071721A1 (en) * | 2006-08-18 | 2008-03-20 | Haixun Wang | System and method for learning models from scarce and skewed training data |
US20080154821A1 (en) | 2006-12-11 | 2008-06-26 | Poulin Christian D | Collaborative Predictive Model Building |
Non-Patent Citations (8)
Title |
---|
Downey, et al., "Models of Searching and Browsing: Language, Studies and Application", (IJCAI 2007), 8 pages. |
Heath, et al., "Defection Detection: Predicting Search Engine Switching", ACM, WWW, Apr. 21-25, 2008, 2 pages. |
Juan, et al., "An Analysis of Search Engine Switching Behavior Using Click Streams", ACM, WWW 2005, May 10-14, 2005, pp. 1050-1051. |
Li, et al., "A Probabilistic Model for Retrospective News Event Detection", ACM, SIGIR'05, Aug. 15-19, 2005, pp. 106-113. |
Mannila, et al., "Discovery of Frequent Episodes in Event Sequences", Data Mining and Knowledge Discovery 1, 1997, pp. 259-289. |
Montgomery, et al., "Modeling Online Browsing and Path Analysis Using Clickstream Data", Marketing Science, vol. 23, No. 4, Fall 2004, 36 pages. |
Schatten, et al., "Efficient Indexing and Searching in Correlated Business Event Streams", Apr. 25, 2006, pp. 1-127. |
White, et al., "Investigating Behavioral Variability in Web Search", ACM, WWW 2007, May 8-12, 2007, pp. 21-30. |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10114869B2 (en) | 2010-09-24 | 2018-10-30 | Oath Inc. | Systems and methods for customized electronic communications |
US20120102064A1 (en) * | 2010-09-24 | 2012-04-26 | Marcel Becker | Systems and methods for customized electronic communications |
US9081824B2 (en) | 2010-09-24 | 2015-07-14 | Aol Inc. | Systems and methods for customized electronic communications |
US11120028B2 (en) | 2010-09-24 | 2021-09-14 | Verizon Media Inc. | Systems and methods for customized electronic communications |
US11714817B2 (en) | 2010-09-24 | 2023-08-01 | Yahoo Ad Tech Llc | Systems and methods for customized electronic communications |
US8612477B2 (en) * | 2010-09-24 | 2013-12-17 | Aol Inc. | Systems and methods for customized electronic communications |
US10432720B1 (en) | 2014-06-25 | 2019-10-01 | Symantec Corporation | Systems and methods for strong information about transmission control protocol connections |
US11194821B2 (en) | 2014-08-15 | 2021-12-07 | Groupon, Inc. | Enforcing diversity in ranked relevance results returned from a universal relevance service framework |
US11216843B1 (en) | 2014-08-15 | 2022-01-04 | Groupon, Inc. | Ranked relevance results using multi-feature scoring returned from a universal relevance service framework |
US9843594B1 (en) | 2014-10-28 | 2017-12-12 | Symantec Corporation | Systems and methods for detecting anomalous messages in automobile networks |
US10146893B1 (en) | 2015-03-27 | 2018-12-04 | Symantec Corporation | Systems and methods for evaluating electronic control units within vehicle emulations |
US9967274B2 (en) | 2015-11-25 | 2018-05-08 | Symantec Corporation | Systems and methods for identifying compromised devices within industrial control systems |
US11442945B1 (en) * | 2015-12-31 | 2022-09-13 | Groupon, Inc. | Dynamic freshness for relevance rankings |
US10104100B1 (en) | 2016-03-03 | 2018-10-16 | Symantec Corporation | Systems and methods for detecting anomalies that are potentially indicative of malicious attacks |
US10193903B1 (en) | 2016-04-29 | 2019-01-29 | Symantec Corporation | Systems and methods for detecting suspicious microcontroller messages |
US11341446B2 (en) | 2016-06-14 | 2022-05-24 | International Business Machines Corporation | Personalized behavior-driven dynamic risk management with constrained service capacity |
US10091077B1 (en) | 2016-06-27 | 2018-10-02 | Symantec Corporation | Systems and methods for detecting transactional message sequences that are obscured in multicast communications |
US10200259B1 (en) | 2016-09-21 | 2019-02-05 | Symantec Corporation | Systems and methods for detecting obscure cyclic application-layer message sequences in transport-layer message sequences |
US9906545B1 (en) | 2016-11-22 | 2018-02-27 | Symantec Corporation | Systems and methods for identifying message payload bit fields in electronic communications |
US10326788B1 (en) | 2017-05-05 | 2019-06-18 | Symantec Corporation | Systems and methods for identifying suspicious controller area network messages |
US11934417B2 (en) | 2017-09-23 | 2024-03-19 | Splunk Inc. | Dynamically monitoring an information technology networked entity |
US11843528B2 (en) | 2017-09-25 | 2023-12-12 | Splunk Inc. | Lower-tier application deployment for higher-tier system |
WO2020077871A1 (en) * | 2018-10-15 | 2020-04-23 | 平安科技(深圳)有限公司 | Event prediction method and apparatus based on big data, computer device, and storage medium |
US11675816B1 (en) * | 2021-01-29 | 2023-06-13 | Splunk Inc. | Grouping evens into episodes using a streaming data processor |
US11676072B1 (en) | 2021-01-29 | 2023-06-13 | Splunk Inc. | Interface for incorporating user feedback into training of clustering model |
Also Published As
Publication number | Publication date |
---|---|
US20100100517A1 (en) | 2010-04-22 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US8126891B2 (en) | Future data event prediction using a generative model | |
Laxman et al. | Stream prediction using a generative model based on frequent episodes in event sequences | |
Chandrasekaran et al. | Exploring connections between active learning and model extraction | |
US11082438B2 (en) | Malicious activity detection by cross-trace analysis and deep learning | |
US11218498B2 (en) | Context-aware feature embedding and anomaly detection of sequential log data using deep recurrent neural networks | |
US11451565B2 (en) | Malicious activity detection by cross-trace analysis and deep learning | |
Wang et al. | Real-time influence maximization on dynamic social streams | |
Gomez-Rodriguez et al. | Influence estimation and maximization in continuous-time diffusion networks | |
Wang et al. | Improving regret bounds for combinatorial semi-bandits with probabilistically triggered arms and its applications | |
Wu et al. | Dual sequential prediction models linking sequential recommendation and information dissemination | |
US8429110B2 (en) | Pattern tree-based rule learning | |
Feldman et al. | On distributing symmetric streaming computations | |
Chen et al. | Generalized PageRank on directed configuration networks | |
US11057414B1 (en) | Asynchronous hidden markov models for internet metadata analytics | |
US20090030916A1 (en) | Local computation of rank contributions | |
Kamara et al. | SoK: Cryptanalysis of encrypted search with LEAKER-a framework for LEakage AttacK Evaluation on Real-world data | |
Karlaš et al. | Data debugging with shapley importance over end-to-end machine learning pipelines | |
Xu et al. | Generative models for evolutionary clustering | |
Lee et al. | Relational self-supervised learning on graphs | |
Huang et al. | Protocol reverse-engineering methods and tools: a survey | |
Rao et al. | An optimal machine learning model based on selective reinforced Markov decision to predict web browsing patterns | |
Sallinen et al. | Real-time pagerank on dynamic graphs | |
Oweis et al. | A novel Mapreduce lift association rule mining algorithm (MRLAR) for big data | |
Guan et al. | The design and implementation of a multidimensional and hierarchical web anomaly detection system | |
Brach et al. | Spreading rumours without the network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | AS | Assignment | Owner name: MICROSOFT CORPORATION, WASHINGTON. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: LAXMAN, SRIVATSAN; TANKASALI, VIKRAM; WHITE, RYEN; SIGNING DATES FROM 20081019 TO 20081020; REEL/FRAME: 021908/0312 |
 | ZAAA | Notice of allowance and fees due | Free format text: ORIGINAL CODE: NOA |
 | ZAAB | Notice of allowance mailed | Free format text: ORIGINAL CODE: MN/=. |
 | STCF | Information on status: patent grant | Free format text: PATENTED CASE |
 | AS | Assignment | Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: MICROSOFT CORPORATION; REEL/FRAME: 034564/0001. Effective date: 20141014 |
 | FPAY | Fee payment | Year of fee payment: 4 |
 | MAFP | Maintenance fee payment | Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY. Year of fee payment: 8 |
 | FEPP | Fee payment procedure | Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
 | LAPS | Lapse for failure to pay maintenance fees | Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
 | STCH | Information on status: patent discontinuation | Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |