Computing Reviews, the leading online review service for computing literature.

Search

Rule induction with extension matrices
Wu X. (ed) Journal of the American Society for Information Science49 (5):435-454,1998.Type:Article

Date Reviewed: Jul 1 1998

Wu describes a heuristic, attribute-based, noise-tolerant data mining program based on the extension matrix approach. Given n negative examples and p positive examples, where each example has a attributes, one can define the negative example matrix NEM to be the matrix whose rows are the negative examples and whose columns give the attribute values. For each positive example P, one constructs an extension matrix whose elements are * (a so-called dead element) when P has the same value as the corresponding attribute of the negative example, and whose elements have the value of the attribute of the negative example otherwise. If one can find a set of n non-dead elements, one in each row, this defines a conjunctive formula that separates the given example P from the negative example. The algorithm HCV uses heuristics to find, in polynomial time ( O ( p n a³ + p² n a ) ), a disjunctive formula separating all the positive examples from the negative examples whenever at least one such formula exists. The paper contains a brief discussion of how the algorithm can be modified to cope with noisy data. It concludes with the results of experimental comparisons of HCV with other data mining algorithms, including C4.5. The author observes that HCV is much faster than C4.5 on some sets and slower on other sets.

Reviewer: J. P. E. Hodgson	Review #: CR121757 (9807-0535)

Data Mining (H.2.8 ... )

Algorithm Design And Analysis (G.4 ... )

Heuristic Methods (I.2.8 ... )

Induction (I.2.6 ... )

Would you recommend this review?

yes

Other reviews under "Data Mining":	Date

Feature selection and effective classifiers Deogun J. (ed), Choubey S., Raghavan V. (ed), Sever H. (ed) Journal of the American Society for Information Science 49(5): 423-434, 1998. Type: Article	May 1 1999

Predictive data mining Weiss S., Indurkhya N., Morgan Kaufmann Publishers Inc., San Francisco, CA, 1998. Type: Book (9781558604032)	Feb 1 1999

Data mining solutions Westphal C., Blaxton T., John Wiley & Sons, Inc., New York, NY, 1998. Type: Book (9780471253846)	Jun 1 1999

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy