Computing Reviews
Mathematics of data science: a computational approach to clustering and classification
Calvetti D., Somersalo E., SIAM, Philadelphia, PA, 2020. 189 pp.  Type: Book (978-1-611976-36-6)
Date Reviewed: Oct 7 2021

Mathematics is the foundation of data science techniques. With the democratization of data science, almost anyone has access to easy-to-use tools and platforms for getting started with data science applications. However, a serious professional or researcher will sooner or later need to understand the mathematics behind the wide variety of data science methods. To this end, the book provides a lightweight mathematical background for common machine learning models and techniques.

The book starts with a refresher on linear algebra and a concise overview of basic mathematical terms and operations, such as vectors, matrices, and eigenvalues and eigenvectors. However, the real meat of the book starts with chapter 2, where the authors introduce principal component analysis (PCA). In later chapters, the authors cover other well-known and commonly used techniques such as k-means, classification algorithms, and tree-based classifiers. For each technique, they first provide a brief description to establish the need for it, and then delve into the underlying mathematics. They typically include brief examples to show the transformations and applicable operations; these examples are welcome additions to the book. The figures and tables are clean and, in general, help readers better understand the concept being explained.

The book assumes a basic understanding of mathematical notation and operations, such as summation, and does a good job of raising the bar on the mathematical knowledge associated with machine learning models. However, the authors fail to include a much-desired piece: a high-level, intuitive explanation of data science operations that does not rely on Greek symbols.

Reviewer: Tushar Sharma
Review #: CR147369
Clustering (H.3.3 ... )
General (G.0 )
Other reviews under "Clustering": Date
Data analysis in bi-partial perspective: clustering and beyond
Owsinski J.,  Springer International Publishing, New York, NY, 2020. 153 pp. Type: Book (978-3-030133-88-7)
Jun 24 2020
 A rapid hybrid clustering algorithm for large volumes of high dimensional data
Rathore P., Kumar D., Bezdek J., Rajasegarar S., Palaniswami M.  IEEE Transactions on Knowledge and Data Engineering 31(4): 641-654, 2019. Type: Article
Mar 10 2020
Triclustering algorithms for three-dimensional data analysis: a comprehensive survey
Henriques R., Madeira S.  ACM Computing Surveys 51(5): 1-43, 2018. Type: Article
Jan 18 2019

Reproduction in whole or in part without permission is prohibited.   Copyright © 2000-2021 ThinkLoud, Inc.