Evolutionary Decision Trees in Large-Scale Data Mining (Studies in Big Data, 59) - Softcover

Kretowski, Marek

 
9783030218539: Evolutionary Decision Trees in Large-Scale Data Mining (Studies in Big Data, 59)

Synopsis

This book presents a unified framework, based on specialized evolutionary algorithms, for the global induction of various types of classification and regression trees from data. The resulting univariate or oblique trees are significantly smaller than those produced by standard top-down methods, an aspect that is critical for the interpretation of mined patterns by domain analysts. The approach presented here is extremely flexible and can easily be adapted to specific data mining applications, e.g. cost-sensitive model trees for financial data or multi-test trees for gene expression data. The global induction can be efficiently applied to large-scale data without the need for extraordinary resources. With a simple GPU-based acceleration, datasets composed of millions of instances can be mined in minutes. In the event that the size of the datasets makes the fastest memory computing impossible, the Spark-based implementation on computer clusters, which offers impressive fault tolerance and scalability potential, can be applied.


"synopsis" may belong to another edition of this title.

From the Back Cover

This book presents a unified framework, based on specialized evolutionary algorithms, for the global induction of various types of classification and regression trees from data. The resulting univariate or oblique trees are significantly smaller than those produced by standard top-down methods, an aspect that is critical for the interpretation of mined patterns by domain analysts. The approach presented here is extremely flexible and can easily be adapted to specific data mining applications, e.g. cost-sensitive model trees for financial data or multi-test trees for gene expression data. The global induction can be efficiently applied to large-scale data without the need for extraordinary resources. With a simple GPU-based acceleration, datasets composed of millions of instances can be mined in minutes. In the event that the size of the datasets makes the fastest memory computing impossible, the Spark-based implementation on computer clusters, which offers impressive fault tolerance and scalability potential, can be applied.


"About this title" may belong to another edition of this title.

Other Popular Editions of the Same Title

9783030218508: Evolutionary Decision Trees in Large-Scale Data Mining (Studies in Big Data, 59)

Featured Edition

ISBN 10:  3030218503 ISBN 13:  9783030218508
Publisher: Springer, 2019
Hardcover