عنوان البحث(Papers / Research Title)
Miner for OACCR: Case of medical data analysis in knowledge discovery
الناشر \ المحرر \ الكاتب (Author / Editor / Publisher)
سماهر حسين علي الجنابي
Citation Information
سماهر,حسين,علي,الجنابي ,Miner for OACCR: Case of medical data analysis in knowledge discovery , Time 16/11/2016 10:45:27 : كلية العلوم للبنات
وصف الابستركت (Abstract)
IEEE_Sciences of Electronics, Technologies of Information and Telecommunications (SETIT2012)
الوصف الكامل (Full Abstract)
Modern scientific data consist of huge datasets which gathered by a very large number of techniques and stored in much diversified and often incompatible data repositories as data of bioinformatics, geoinformatics, astroinformatics and Scientific World Wide Web. At the other hand, lack of reference data is very often responsible for poor performance of learning where one of the key problems in supervised learning is due to the insufficient size of the training dataset. Therefore, we try to suggest a new development a theoretically and practically valid tool for analyzing small of sample data remains a critical and challenging issue for researches. This paper presents a methodology for Obtaining Accurate and Comprehensible Classification Rules (OACCR) of both small and huge datasets with the use of hybrid techniques represented by knowledge discovering. In this article the searching capability of a Genetic Programming Data Construction Method (GPDCM) has been exploited for automatically creating more visual samples from the original small dataset. Add to that, this paper attempts to developing Random Forest data mining algorithm to handle missing value problem. Then database which describes depending on their components were built by Principle Component Analysis (PCA), after that, association rule algorithm to the FP-Growth algorithm (FP-Tree) was used. At the last, TreeNet classifier determines the class under which each association rules belongs to was used. The proposed methodology provides fast, Accurate and comprehensible classification rules. Also, this methodology can be use to compression dataset in two dimensions (number of features, number of records).
تحميل الملف المرفق Download Attached File
|
|