Binary classification rule generation from decomposed data
AbstractLearning classification rules from data that do not fit in the available memory is a challenging task. The goal of this study is to develop an approach for generating binary classification rules from decomposed data that are equivalent in terms of quality to those found over the whole data. In the proposed approach, each class is divided into the same arbitrary small number of subtables. For each pair of subsets from different classes, rule sets are induced using any sequential covering algorithm. Rule sets generated from the same positive class subset and different negative class subsets are merged using an operator constructed on the basis of Cartesian product and conjunction operators. The rule sets obtained in this way are joined into one set. During the rule merging, unnecessary rules are removed. It is proven that for training data, the quality of the rule set generated using the approach is the same as that for the whole data. It is experimentally verified that for test data, the quality of classification is comparable with that obtained using a nondecomposed data approach.
|Journal series||International Journal of Intelligent Systems, ISSN 0884-8173, e-ISSN 1098-111X, (N/A 100 pkt)|
|Publication size in sheets||0.75|
|ASJC Classification||; ; ;|
|Internal identifier||ROC 19-20|
|Score||= 100.0, 24-03-2020, ArticleFromJournal|
|Publication indicators||: 2018 = 2.027; : 2018 = 7.229 (2) - 2018=5.861 (5)|
* presented citation count is obtained through Internet information analysis and it is close to the number calculated by the Publish or Perish system.