Good paper on the effects of missing values on the accuracy of your model. The organization of this paper would improve if the authors had included their recommendation in the Summary.
Nevertheless, this is the crucial recommendation (p. 8): "We recommend that we can deal with datasets having up to 20 % of missing values. For the CD (Complete Deletion) method we have up to 60 % of instances containing missing values and still have a reasonable performance."
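To check a dataset against these two thresholds, a minimal sketch in Python follows. It is my own illustration, not code from the paper. The function names and the toy data are hypothetical, and I am assuming that "20 % of missing values" refers to the share of missing cells in the whole table, while the 60 % figure refers to the share of rows (instances) with at least one missing value.

```python
from typing import List, Optional, Sequence

Row = Sequence[Optional[float]]  # None marks a missing value (an assumption of this sketch)


def missing_cell_fraction(rows: Sequence[Row]) -> float:
    """Share of all cells in the table that are missing."""
    total = sum(len(row) for row in rows)
    missing = sum(1 for row in rows for value in row if value is None)
    return missing / total if total else 0.0


def incomplete_row_fraction(rows: Sequence[Row]) -> float:
    """Share of rows (instances) that contain at least one missing value."""
    if not rows:
        return 0.0
    incomplete = sum(1 for row in rows if any(value is None for value in row))
    return incomplete / len(rows)


def complete_deletion(rows: Sequence[Row]) -> List[Row]:
    """Complete Deletion (CD): keep only rows with no missing values."""
    return [row for row in rows if all(value is not None for value in row)]


# Hypothetical toy data: 5 instances, 3 attributes, 2 missing cells.
data = [
    [5.1, 3.5, None],
    [4.9, None, 1.4],
    [4.7, 3.2, 1.3],
    [4.6, 3.1, 1.5],
    [5.0, 3.6, 1.4],
]

cell_share = missing_cell_fraction(data)
row_share = incomplete_row_fraction(data)

# Thresholds taken from the quoted recommendation (20 % and 60 %).
print(f"missing cells: {cell_share:.1%} (within 20 %: {cell_share <= 0.20})")
print(f"incomplete rows: {row_share:.1%} (within 60 % for CD: {row_share <= 0.60})")
print(f"rows left after complete deletion: {len(complete_deletion(data))}")
```

Note how the two measures differ: a few missing cells spread over many rows can leave the cell share low while the row share, which is what Complete Deletion depends on, is much higher.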
This paper is important for healthcare, pharma, and biotech work because the data in those fields is complex and diverse.
Friday, June 08, 2007
About Me
- alberto