This engaging and clearly written textbook/reference provides a must-have introduction to the rapidly emerging interdisciplinary field of data science. It focuses on the principles fundamental to becoming a good data scientist and the key skills needed to build systems for collecting, analyzing, and interpreting data.
The Data Science Design Manual is a source of practical insights that highlights what really matters in analyzing data, and provides an intuitive understanding of how these core concepts can be used. The book does not emphasize any particular programming language or suite of data-analysis tools, focusing instead on high-level discussion of important design principles.
This easy-to-read text ideally serves the needs of undergraduate and early graduate students embarking on an "Introduction to Data Science" course. It reveals how this discipline sits at the intersection of statistics, computer science, and machine learning, with a distinct heft and character of its own. Practitioners in these and related fields will find this book perfect for self-study as well.
Additional learning tools:
Contains "War Stories," offering perspectives on how data science applies in the real world
Includes "Homework Problems," providing a wide range of exercises and projects for self-study
Provides a complete set of lecture slides and online video lectures at www.data-manual.com
Provides "Take-Home Lessons," emphasizing the big-picture concepts to learn from each chapter
Recommends exciting "Kaggle Challenges" from the online platform Kaggle
Highlights "False Starts," revealing the subtle reasons why certain approaches fail
Offers examples taken from the data science television show "The Quant Shop" (www.quant-shop.com)
Dr. Steven S. Skiena is Distinguished Teaching Professor of Computer Science at Stony Brook University, with research interests in data science, natural language processing, and algorithms. He was awarded the IEEE Computer Science and Engineering Undergraduate Teaching Award "for outstanding contributions to undergraduate education ...and for influential textbooks and software." Dr. Skiena is the author of six books, including the popular Springer titles The Algorithm Design Manual and Programming Challenges: The Programming Contest Training Manual.
What is Data Science? Mathematical Preliminaries Data Munging Scores and Rankings Statistical Analysis Visualizing Data Mathematical Models Linear Algebra Linear and Logistic Regression Distance and Network Methods Machine Learning Big Data: Achieving Scale