AnyBook4Less.com | Order from a Major Online Bookstore |
![]() |
Home |  Store List |  FAQ |  Contact Us |   | ||
Ultimate Book Price Comparison Engine Save Your Time And Money |
![]() |
Title: Principles of Data Mining (Adaptive Computation and Machine Learning) by David J. Hand, Heikki Mannila, Padhraic Smyth ISBN: 0-262-08290-X Publisher: MIT Press Pub. Date: 01 August, 2001 Format: Hardcover Volumes: 1 List Price(USD): $58.00 |
Average Customer Rating: 3.69 (13 reviews)
Rating: 5
Summary: A welcome addition to data mining
Comment: This is a welcome addition to the canon of books on data mining. It is an interdisciplinary book, drawing together the differing views of the statistician and the computer scientist, but with an emphasis on the principles underlying data mining. Many data mining books are written from a specialist computer scientist's viewpoint or from a similarly specialist business users one. In the former, the emphasis tends to be on algorithms and computational efficiency, while in the latter business applications of a small number of techniques are the main thrust.
Data mining requires an understanding of concepts from statistics and computer science, and the authors illustrate this with many examples. The first third of the book covers fundamentals of data analysis, which is appropriate, because some deep statistical ideas can arise in data mining problems, and those without training in statistics may not be aware of their consequences. The next third covers components of data mining algorithms, and the final part of the book draws the two strands together in a unified whole, with descriptions of typical data mining tasks and suitable algorithms.
It is a well written and easy to understand book, and will be an ideal reference for researchers and practitioners from either discipline, who may be seeking a greater understanding of the other.
Rating: 5
Summary: nice treatment of data mining and underlying methodology
Comment: This book is not an introductory text. Anyone interested in a particular topic should consult the preface of the text to find out what it is about. The negative reviewers were not fair to the authors on that score. Had they read the preface they would have found out (1) how the authors define data mining, (2) that they see it as a subject with an important mix of statistical methodology and computer science and (3) that it is intended as an advanced undergraduate or first year graduate text on the topic.
They also provide a very well organized structure for the text that is well described in the preface. It consists of three parts. Chapter 1 is an essential introduction that is informative to everyone. Chapters 2 through 4 go through basic statistical ideas that statisticians would be very familiar with and others could view as a refresher. The authors have experience teaching this course to engineering and science majors and have found that many of these students unfortunately do not have the prerequisite statistical inference ideas and need this material covered in the course.
Chapters 5 through 8 cover the components of data mining algorithms and the remaining chapters deal with the details of the tasks and algorithms.
The book features a further reading section at the end of each chapter that provides a very nice guide to the useful and most significant relevant literature. The author's have done a very good job at this. One mistake I found was a reference to Miller (1980). I think this was intended to be a reference to the seocnd edition fo Rupert Miller's text "Simultaneous Statistical Inference" which was published in 1981 by Springer-Verlag but the full citation is missing from the list of references in the back of the book.
This book deserves 5 stars because it does what it intends to do. It presents the field of data mining in a clear way covering topics on classfication and kernel methods expertly. David Hand has published a great deal on these techniques including many fine books.
Mannila and Smyth bring to the text the computer science perspective. There is much useful material on optimization methods and computational complexity.
Statistical modeling and issues of the "curse of dimensionality" and the "overfitting problem" are key issues that this text emphasizes and expertly addresses.
The only thing the text misses is details on specific algorithms. But I do not grade them down for that because it was not their intention. They emphasize methodology and issues and that is the most critical thing a practitioner needs to know first before embarking on his own attack at mining data.
The text does provide most of the current important methods. Although Vapnik's work is mentioned and his two books are referenced there is very little discussion of support vector machines and the use of Vapnik-Chervonenkis classes and dimension
in data mining. The new book by Hastie, Tibshirani and Friedman goes into much greater detail on specific algorithms include some only briefly discussed in this text (e.g. support vector machines). The support vector approach is also nicely treated in "Learning with Kernels" by Scholkopf and Smola.
I highly recommend this book for anyone interested in data mining. It is a great reference source and an eloquent text to remind you of the pitfalls of thoughtless mining or "data-dredging". It also has many nice practical examples and some interesting success stories on the application of data mining to specific problems.
Rating: 5
Summary: Great book with a great layout!
Comment: I'd been struggling with the seemingly infinite ways to approach data mining and this book cleared it all up for me. It is absolutely full of information and is a great base reference. It does not contain complete algorithms or step by step instructions (you can get those anywhere these days) but instead is a comprehensive survey of all the best known methods for data mining. I really like how the authors combined classical mining techniques with more modern ones (ex: Bayesian Networks). Other books try to stay in one camp or the other, all while denying that they use very similar sub-components.
This book is well worth it. I promise you will find more information than you could possibly retain.
![]() |
Title: The Elements of Statistical Learning by T. Hastie, R. Tibshirani, J. H. Friedman ISBN: 0387952845 Publisher: Springer Verlag Pub. Date: 09 August, 2001 List Price(USD): $82.95 |
![]() |
Title: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations by Ian H. Witten, Eibe Frank ISBN: 1558605525 Publisher: Morgan Kaufmann Pub. Date: 11 October, 1999 List Price(USD): $49.95 |
![]() |
Title: Data Mining: Concepts and Techniques by Jiawei Han, Micheline Kamber ISBN: 1558604898 Publisher: Morgan Kaufmann Pub. Date: August, 2000 List Price(USD): $54.95 |
![]() |
Title: Data Preparation for Data Mining by Dorian Pyle ISBN: 1558605290 Publisher: Morgan Kaufmann Pub. Date: 15 March, 1999 List Price(USD): $49.95 |
![]() |
Title: An Introduction to Support Vector Machines and Other Kernel-based Learning Methods by Nello Cristianini, John Shawe-Taylor ISBN: 0521780195 Publisher: Cambridge University Press Pub. Date: 23 March, 2000 List Price(USD): $53.00 |
Thank you for visiting www.AnyBook4Less.com and enjoy your savings!
Copyright� 2001-2021 Send your comments