The fundamentals of this algorithm could be use in large databases.
The problem: "As databases evolve the problem of maintaining sequential patterns over a significantly long period of time becomes essential, since a large number of new records may be added to a database. To reflect the current state of the database where previous sequential patterns would become irrelevant and new sequential patterns might appear, there is a need for efficient algorithms to update, maintain and manage the information discovered [12]. Several efficient algorithms for maintaining association rules have been developed [12–15]. Nevertheless, the problem of maintaining sequential patterns is much more complicated than maintaining association rules, since transaction cutting and sequence permutation have to be taken into account [16]."
The proposed solution: "This method is based on the discovery of frequent
sequences by only considering frequent sequences obtained by an earlier mining
step. By proposing an iterative approach based only on such frequent sequences
we are able to handle large databases without having to maintain negative border
information, which was proved to be very memory consuming [16]. Maintaining
such a border is well adapted to incremental association mining [26,19], where association rules are only intended to discover intra-transaction patterns (itemsets). Nevertheless, in sequence mining, we also have to discover inter-transaction patterns (sequences) and the set of all frequent sequences is an unbounded superset of the set of frequent itemsets (bounded) [16]. The main consequence is that such approaches are very limited by the negative border size."
The latest development in data mining, predictive modeling, marketing analytics, artificial intelligence, analytics, intelligent agents, semiconductors, distributing computing, and network security. SAS, Fair Isaac, Microsoft Analysis Services, SPSS, Cognos, Hyperion, Business Objects, Oracle, KXEN,or R. Healthcare, Pharmaceutical,Retail, CPG, Travel, Financial, Banking, Telecommunications, or Insurance. Unleashing the Power of the Mind©™
Tuesday, June 12, 2007
Subscribe to:
Post Comments (Atom)
Business Analytics
Labels
- advanced analytics (2)
- analytics (5)
- analytics tools (2)
- big data (3)
- buisness analytics (4)
- business analytics (4)
- business plan (1)
- center of excellence (1)
- classification (1)
- companies (2)
- data mining (3)
- framework (3)
- game theory (1)
- innovation (3)
- leverage (1)
- marketing analytics (1)
- predictive modeling (4)
- prioritization (1)
- priority (1)
- projects (1)
- recession (1)
- robotic surgery (1)
- segmentation (1)
- social media (2)
- trade promotion (1)
- trends (2)
- web analytics (2)
- what if scenarios (2)
Blog Archive
-
▼
2007
(55)
-
▼
June
(13)
- Web Analytics: Future Applications in Predicting M...
- Geovisual Analytics and Crisis Management
- NIH-NSF Visualization Research Challenges Report
- BioGRID version 2.0.29 release ( maintenance update )
- What Data Mining Can and Can't Do
- Evaluation of noise reduction techniques in the sp...
- A review of symbolic analysis of experimental data
- Enhancing Data Analysis with Noise Removal
- Incremental Mining of Sequential Patterns in Large...
- Molecular Staging for Survival Prediction of Color...
- The treatment of missing values and its effect in ...
- An Assessment of Accuracy, Error, and Conflict wit...
- Phase II Studies: Which is Worse, False Positive ...
-
▼
June
(13)
About Me
- alberto
- See my resume at: https://docs.google.com/document/d/1-IonTpDtAgZyp3Pz5GqTJ5NjY0PhvCfJsYAfL1rX8KU/edit?hl=en_USid=1gr_s5GAMafHRjwGbDG_sTWpsl3zybGrvu12il5lRaEw
No comments:
Post a Comment