Enjoy fast, free delivery, exclusive deals, and award-winning movies & TV shows.
Buy New
$57.27
FREE delivery Monday, June 15
Ships from: Amazon
Sold by: ROSE BOOK SELLER
$57.27
FREE delivery Monday, June 15
Or fastest delivery Saturday, June 13
Arrives before Father's Day
$$57.27 () Includes selected options. Includes initial monthly payment and selected options. Details
Price
Subtotal
$$57.27
Subtotal
Initial payment breakdown
Shipping cost, delivery date, and order total (including tax) shown at checkout.
Ships from
Amazon
Amazon
Ships from
Amazon
Returns
FREE 30-day refund/replacement
FREE 30-day refund/replacement
Quick refund
Usually issued within 24 hours. See exceptions
FREE return
At least one free return option available.
Convenient dropoff
At any of our 50,000 US locations.
See return policy
Gift options
Available at checkout
Available at checkout This item is a gift. Change
At checkout, you can add a custom message, a gift receipt for easy returns and have the item gift-wrapped
Payment
Secure transaction
Your transaction is secure
We work hard to protect your security and privacy. Our payment security system encrypts your information during transmission. We don’t share your credit card details with third-party sellers, and we don’t sell your information to others. Learn more
$15.65
Minimal or no signs of wear, writing, or notes. ~ BooksRun: cheaper books, every time! Minimal or no signs of wear, writing, or notes. ~ BooksRun: cheaper books, every time! See less
FREE delivery Monday, June 15. Details
Or fastest delivery Tomorrow, June 10. Order within 1 hr 36 mins. Details
Arrives before Father's Day
Only 1 left in stock - order soon.
$$57.27 () Includes selected options. Includes initial monthly payment and selected options. Details
Price
Subtotal
$$57.27
Subtotal
Initial payment breakdown
Shipping cost, delivery date, and order total (including tax) shown at checkout.
Access codes and supplements are not guaranteed with used items.
Ships from and sold by BooksRun.
Added to

Sorry, there was a problem.

There was an error retrieving your Wish Lists. Please try again.

Sorry, there was a problem.

List unavailable.
Kindle app logo image

Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer - no Kindle device required.

Read instantly on your browser with Kindle for Web.

Using your mobile phone camera - scan the code below and download the Kindle app.

QR code to download the Kindle App

  • Practical Statistics for Data Scientists: 50 Essential Concepts

Follow the authors

Get new release updates & improved recommendations
Something went wrong. Please try your request again later.

Practical Statistics for Data Scientists: 50 Essential Concepts

4.5 out of 5 stars (453)

{"desktop_buybox_group_1":[{"displayPrice":"$57.27","priceAmount":57.27,"currencySymbol":"$","integerValue":"57","decimalSeparator":".","fractionalValue":"27","symbolPosition":"left","hasSpace":false,"showFractionalPartIfEmpty":true,"offerListingId":"U0OmM9aNaIYuui27MfPey%2BDmkyzP4QmQflOaejO4Hnml2Eg%2BqLCZgA4UAFoRiIHQlemNpz2KLyJ20%2FAVRLSJphiYadkg%2FtfzSCIP%2FyrGNGtjOqYu68R58q3a89cIH%2BRCgCwcinjjyAH3wfQeJfunnxL6E%2FMwLsmaEuSVBEAI8c05OoyIIHLZ7PVRfI24vBc4","locale":"en-US","buyingOptionType":"NEW","aapiBuyingOptionIndex":0}, {"displayPrice":"$15.65","priceAmount":15.65,"currencySymbol":"$","integerValue":"15","decimalSeparator":".","fractionalValue":"65","symbolPosition":"left","hasSpace":false,"showFractionalPartIfEmpty":true,"offerListingId":"U0OmM9aNaIYuui27MfPey%2BDmkyzP4QmQLfRWEebbiWEsQMEGyyydtQzo84PFMy1H%2By0Y%2B78mEBgrYDLWb5rwvrxb6yI%2FonqXdK3pP5L3DOe67JopwXVmx79GqME6ZPVANHCJ5S3U0nNlbCYQA2QEjp46rZi8tNLOXe1OuxdIdHh7L4VbSDRqvgHFeKlx9J8h","locale":"en-US","buyingOptionType":"USED","aapiBuyingOptionIndex":1}]}

Purchase options and add-ons

Statistical methods are a key part of data science, yet very few data scientists have any formal statistics training. Courses and books on basic statistics rarely cover the topic from a data science perspective. This practical guide explains how to apply various statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what's important and what's not.

Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you’re familiar with the R programming language, and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.

With this book, you’ll learn:

  • Why exploratory data analysis is a key preliminary step in data science
  • How random sampling can reduce bias and yield a higher quality dataset, even with big data
  • How the principles of experimental design yield definitive answers to questions
  • How to use regression to estimate outcomes and detect anomalies
  • Key classification techniques for predicting which categories a record belongs to
  • Statistical machine learning methods that "learn" from data
  • Unsupervised learning methods for extracting meaning from unlabeled data

There is a newer edition of this item:

Frequently bought together

This item: Practical Statistics for Data Scientists: 50 Essential Concepts
$57.27
Get it as soon as Monday, Jun 15
Sold by ROSE BOOK SELLER and ships from Amazon Fulfillment.
+
$23.18
Get it as soon as Sunday, Jun 14
In Stock
Ships from and sold by Amazon.com.
+
$13.98
Get it as soon as Sunday, Jun 14
In Stock
Ships from and sold by Amazon.com.
Total price: $00
To see our price, add these items to your cart.
Details
Added to Cart
Some of these items ship sooner than the others.
Choose items to buy together.

Customers also bought or read

Loading...

From the Publisher


About this Book

Data science is a fusion of multiple disciplines, including statistics, computer science, information technology and domain specific fields. As a result, a several different terms could be used to reference a given concept. Key terms and their synonyms will be highlighted throughout the book in a sidebar within the text.

This book is aimed at the data scientist with some familiarity with the R programming language, and with some prior (perhaps spotty or ephemeral) exposure to statistics. Both of us came to the world of data science from the world of statistics, and have some appreciation of the contribution that statistics can make to the art of data science. At the same time, we are well aware of the limitations of traditional statistics instruction: statistics as a disciple is a century and a half old, and most statistics textbooks and courses are laden with the momentum and inertia worthy of an ocean liner.

Two goals underlie this book:

  • To lay out, in digestible, navigable and easily referenced form, key concepts from statistics that are relevant to data science.
  • To explain which concepts are important and useful from a data science perspective, which are less so, and why.

Editorial Reviews

Book Description

50 Essential Concepts

About the Author

Peter Bruce founded and grew the Institute for Statistics Education at Statistics.com, which now offers about 100 courses in statistics, roughly a third of which are aimed at the data scientist. In recruiting top authors as instructors and forging a marketing strategy to reach professional data scientists, Peter has developed both a broad view of the target market, and his own expertise to reach it.

Andrew Bruce has over 30 years of experience in statistics and data science in academia, government and business. He has a Ph.D. in statistics from the University of Washington and published numerous papers in refereed journals. He has developed statistical-based solutions to a wide range of problems faced by a variety of industries, from established financial firms to internet startups, and offers a deep understanding the practice of data science.

Product details

  • Publisher ‏ : ‎ O'Reilly Media
  • Publication date ‏ : ‎ June 27, 2017
  • Edition ‏ : ‎ 1st
  • Language ‏ : ‎ English
  • Print length ‏ : ‎ 315 pages
  • ISBN-10 ‏ : ‎ 1491952962
  • ISBN-13 ‏ : ‎ 978-1491952962
  • Item Weight ‏ : ‎ 1.12 pounds
  • Dimensions ‏ : ‎ 6.75 x 0.5 x 9 inches
  • Best Sellers Rank: #271,202 in Books (See Top 100 in Books)
  • Customer Reviews:
    4.5 out of 5 stars (453)

About the authors

Follow authors to get new release updates, plus improved recommendations.
Sponsored

Customer reviews

4.5 out of 5 stars
453 global ratings
Sponsored

Customers say

Customers find the book provides a good foundation for exploratory data analysis and serves as an excellent introduction to relevant topics for students and data scientists. Moreover, the writing style is well-received, with customers describing it as nice and easy to read. However, the detailed explanations receive mixed feedback - while some find the statistical concepts easy to understand and remember, others say the code isn't explained in enough detail.
AI Generated from the text of customer reviews

Select to learn more

49 customers mention informative, 43 positive, 6 negative
Customers find the book informative, providing a good foundation for exploratory data analysis and serving as an excellent introduction to relevant topics for students and data scientists.
It is great one ! It is practical ! I like itRead more
...Pros: * Decent review of core concepts * Good coverage of importance of distinguishing between sample and population statistics *...Read more
It is a great help for a data science programmer. The code is in R, and I hope there will be a Python version.Read more
very useful book! If you wanna be a data scientist, this is ur startRead more
29 customers mention content, 27 positive, 2 negative
Customers find the book great for learning statistics, with one customer particularly appreciating the R and Python programs included.
100 pages in and I’ve learned so much! Great book for anyone working in the data/business analyst world.Read more
Good book. But, good for people who has basic knowledge of statisticsRead more
Excellent book. I use it daily in my data science activities!!!Read more
I love this book as a reference. Clear, efficient but detailed explanations. It is not designed as a textbook but as a reference....Read more
11 customers mention writing style, 9 positive, 2 negative
Customers appreciate the writing style of the book.
Well written, and serves as a great refresher for those in research, mathematics, AI, machine learning, data analysis, etc..Read more
Excellently written book on those looking for modern statistical data applications. The book provides open source codes so it is accessible to all.Read more
Very well written and approachable. Highly recommend if interested in using statistics in a practical work setting.Read more
...Each section explains the concept clearly and concisely with the mathematical formula, then a programming example....Read more
9 customers mention readability, 9 positive, 0 negative
Customers find the book easy to read and clear.
I love this book as a reference. Clear, efficient but detailed explanations. It is not designed as a textbook but as a reference....Read more
Easy to read book to bring your stats up to speed with current best practices, without the weight of all the theory or massive code dumps....Read more
...I found this book a very engaging read: it sets itself apart from other books on statistics in clearly telling which concepts are not-so-relevant...Read more
This might be a good read if you have taken a stats 101 course a long time ago and need a refresher, but don’t expect to be an expert by the end....Read more
28 customers mention detailed explanations, 14 positive, 14 negative
Customers have mixed opinions about the book's explanations, with some finding them detailed and easy to understand, while others note that the code lacks sufficient detail and the math explanations are unclear or incomplete.
I love this book as a reference. Clear, efficient but detailed explanations. It is not designed as a textbook but as a reference....Read more
...Overall I like it, it covers a lot of topics, but not in great detail....Read more
...It is best used to get a survey and overview of many of the facets of the domain of data science....Read more
...The problem is that the author does a poor job explaining many of the topics clearly....Read more
7 customers mention code, 2 positive, 5 negative
Customers have mixed opinions about the code in the book, with one customer noting that it is incomplete and another mentioning that it assumes readers already know how to program in R.
...Cons * Assumes that you know R. Lots of code, no explanations of the code. * Inconsistent level of detail and depth....Read more
...Although this is an introductory book, it assumes you can already program in R. If you can't, either accept that you won't be able to follow the...Read more
It is a great help for a data science programmer. The code is in R, and I hope there will be a Python version.Read more
...Good luck when you get to Chapter 5. Some of the code is incomplete or contains a typo. (See image)...Read more
You will learn more from doing a Google search.
1 out of 5 stars
You will learn more from doing a Google search.
I am still trudging through this book (at page 200), and I will try write a more comprehensive review when I finish. However, with that said... The concepts discussed in this book are surface level at best. You end up learning more from Google as you try to grasp a better understanding of what concept is being talked about. An intuitive understanding will not be learned as math examples are replaced by steps and R scripts. Also, one big caveat to the R in this book: **** The R scripts in this book are not complete compared to the R scripts that you get from the GitHub page! *** - You will end up debugging a lot of the scripts to make the examples work. - Not all of the data you get from the Github page matches what you see in the graphs of the book. It ranges from small errors in percentage points, to entire directions being the opposite. - Good luck when you get to Chapter 5. Some of the code is incomplete or contains a typo. (See image) The only positive I can comment are the resources cited, though I'd avoid purchasing anything by the Bruces at this point. Save yourself the $18.
Thank you for your feedback
Sorry, there was an error
Sorry we couldn't load the review

Top reviews from the United States

  • 5 out of 5 stars
    A good start for those who are iffy with stats but don't want to dive too deep yet.
    Reviewed in the United States on June 20, 2017
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    There's always that one person who is unsatisfied, but it sure as hell isn't me, because I knew what this book was going to be like the moment I saw how many pages it was going to have & how the early release version looked. I still preordered a hard copy (for sharing) & a digital copy (for carrying), because I knew this was kind of what type of book I was looking for & then some.

    The concepts are not astronomically explained, but with just enough depth that I can also individually explain to people what they are. What really stands out for me so far is after each or so concept, there is a section labeled as further reading (well, in the digital copy) that is usually at the end of the book altogether & I found myself realizing I have a lot of those books so the authors really know where to look & guide those who wanted more depth.

    Yeah yeah yeah, the codes are missing (as of mid-June 2017) but if you really understood / know which packages to use, you wouldn't need the code. The first half of the book are two three liners of code concepts anyways; it's the explanations that matter the most. The second half of the book is the good part, which separates a white hat statistician from a grey hat data scientist, which is exactly what I wanted in a <300 page book.

    Thanks for keeping me waiting since November though, thought it would never come! The O`Reilly books always keep me in awe at how they always know what topic I want to have a brief book (probably data collecting on me :P) & simultaneously leave me in suspense because I never notice I am preordering the books! Sigh. My only request is to be able to preorder the Kindle editions rather than the physical editions; my data science book cubby is starting to overwhelm my statistics cubby (NOT FOR LONG MASTERS PROGRAM ~).

    27 people found this helpful
    Sending feedback...
    Thank you for your feedback.
    Sending feedback...
    Thanks, we'll investigate in the next few days.
  • 5 out of 5 stars
    Excellent, straightforward review of key concepts
    Reviewed in the United States on September 7, 2018
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    This book gets right to the point, and explains relevant statistical concepts in straightforward, easy to follow language. It is exactly what I was looking for. I was glad that the authors started at the literal beginning, but the pace and efficiency of writing allow the reader to come up to speed quickly. Well done.

    Sending feedback...
    Thank you for your feedback.
    Sending feedback...
    Thanks, we'll investigate in the next few days.
  • 4 out of 5 stars
    A modern and very readable book that nicely explains high-level concepts.
    Reviewed in the United States on November 13, 2018
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    First of all, this book is not for you if you want a deep and thorough explanation of statistical concepts. It serves a completely different purpose: to familiarize a reader with high-level concepts; to enable them to continue their statistics education elsewhere.

    I found this book a very engaging read: it sets itself apart from other books on statistics in clearly telling which concepts are not-so-relevant for the modern computerized explorative analysis toolset. Many concepts that are presented in classic books on the subjects are rooted in 20s and 30s where computing power wasn't available and researches resorted to various pre-calculated distributions and formulas to do their work. A modern data-scientist's approach would eschew some of the old ways and instead rely on randomization, resampling and computing power.

    This book not only tells what something is, but also why it is that way and if a concept is still relevant today.

    I can recommend this book if your statistics knowledge is spotty or ephemeral, it serves its purpose well and doesn't bog down the reader with (sometimes) unnecessary mathematical concepts to demonstrate an idea.

    Why the four stars:

    1. Lack of examples in programming languages.

    2. Complete lack of exercises (at least 1-2 exercises are necessary).

    3. All scarce examples that are available are in R. No Python. :(

    26 people found this helpful
    Sending feedback...
    Thank you for your feedback.
    Sending feedback...
    Thanks, we'll investigate in the next few days.
  • 5 out of 5 stars
    Interesting info, no practical applications without datasets
    Reviewed in the United States on June 11, 2017
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    Information seems plainly written and relevant. No link to datasets makes the "practical" code portion of the book unusable. Will happily update my review when the datasets are released.

    EDIT:

    Ok the datasets are up. There is a short R script to run to download the data, it will require some small modifications to get it working correctly.

    You need to create a folder named "data".

    and I changed the second line in the script from:

    PSDS_PATH <- file.path('~', 'statistics-for-data-scientists')

    to this:

    PSDS_PATH <- file.path('.')

    This will download the data into a folder named "data" in whatever directory you run the script. The script runs with no real feedback and some of the data sets are large, so just be patient. Once these were downloaded the examples in the book run great.

    15 people found this helpful
    Sending feedback...
    Thank you for your feedback.
    Sending feedback...
    Thanks, we'll investigate in the next few days.
  • 5 out of 5 stars
    I love this book as a reference
    Reviewed in the United States on January 5, 2018
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    I love this book as a reference. Clear, efficient but detailed explanations. It is not designed as a textbook but as a reference. When I wonder "what is that test used for again?" or "what was that formula?" this is the first thing I reach for. Sure, Google has become universal for that too, but I like having a single hard copy reference that I can get to know and that becomes a trustworthy old friend. This book is taking on that role for me.

    19 people found this helpful
    Sending feedback...
    Thank you for your feedback.
    Sending feedback...
    Thanks, we'll investigate in the next few days.
  • 5 out of 5 stars
    Excellent introductory textbook for data scientists (and students)
    Reviewed in the United States on July 14, 2017
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    Excellent introductory text for a comprehensive overview of statistics! The github repository augments the content very well and provides added value for the statistical topics covered in the book. Both of the Bruce brothers are statistical gurus and this fact is evident in the writing, which is both informative and witty. Peter is the president of Statistics.com and is well-versed in providing statistical instruction to students of all ages and levels. He is also a proponent of resampling and one of the developers of the excellent Resampling Stats software package for Excel.

    It is true that the textbook does not provide in-depth coverage for all topics, but I don't think that was the intent of the authors. However, the text DOES provide an excellent introduction to topics relevant to students and data scientists. After reading the text and working through the examples, you will be equipped to further your knowledge in whichever topic you require for you data analysis task.

    Highly recommended!

    21 people found this helpful
    Sending feedback...
    Thank you for your feedback.
    Sending feedback...
    Thanks, we'll investigate in the next few days.
  • 3 out of 5 stars
    Good Topics, Incomplete Explanations
    Reviewed in the United States on July 1, 2019
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    I think this book covers a great and useful set of topics for a data scientist to know. The problem is that the author does a poor job explaining many of the topics clearly. So, I often feel the need to read about every explained concept online or watch a youtube clip to understand it better. The definitions, explanations, and examples are sometimes good but often rushed. You need this book to know what you need to learn, but then you end up learning those things elsewhere, and not from this book.

    Sending feedback...
    Thank you for your feedback.
    Sending feedback...
    Thanks, we'll investigate in the next few days.
  • 5 out of 5 stars
    Excellent Pocket reference for Aspiring Data Scientists
    Reviewed in the United States on March 11, 2019
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    I bought this book for $13 an it has been a great read. Numerous major concepts required for a data scientist interview have been covered in this book. If you ask me, it's worth every cent spent on it. I gifted a second one to my friend who is in a Data Science program.

    Sending feedback...
    Thank you for your feedback.
    Sending feedback...
    Thanks, we'll investigate in the next few days.

Top reviews from other countries

    Translated by Amazon
    See original
  • 2 out of 5 stars
    Nicht das was das was man bei dem Titel erwartet
    Reviewed in Germany on August 18, 2019
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    Eine oberflächliche kurze Darstellung diverser statistischer Methoden ohne auf die Details/Formeln groß einzugehen sofern diese denn gegeben sind. Jedes Thema enthält zwar Referenzen auf weiterführende Bücher/Quellen, allerdings ist dieses Buch somit alles andere als Praktisch. Der R-Code ist auch nur obligatorisch und nicht mal sauber formatiert.

    Kurzum: Das Buch ist nicht mehr als ein Glossar, der die Methoden anreist und fast gar nicht gegeneinander Vergleicht. Mit Google wird man wesentlich besser informiert.

    Vor allem richtet sich dieses Buch an Data Scientists und Leute die schon mal mit R. gearbeitet haben. Wer das bereits hat, der braucht dieses Buch nicht!

    Schreibstil: Trocken, repetitiv und viele Vorwärts- und -Rückwärtsverweise.

    Von daher keine Empfehlung!

    Sending feedback...
    Thanks, we'll investigate in the next few days.
    Translated from German by Amazon
    See original
  • 4 out of 5 stars
    A good book to start the journey of data science
    Reviewed in the United Kingdom on March 10, 2020
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    The book very well covers the basics with special focus on data science. It also demonstrate the concepts using the R codes.

    Sending feedback...
    Thanks, we'll investigate in the next few days.
  • 1 out of 5 stars
    Mala impresión
    Reviewed in Mexico on March 3, 2020
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    Parece fotocopia de otro libro. No parece un libro de importación.

    Sending feedback...
    Thanks, we'll investigate in the next few days.
    Translated from Spanish by Amazon
    See original
  • 5 out of 5 stars
    Statistical Handbook for Students and Practitioners
    Reviewed in Canada on March 23, 2018
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    Practical Statistics for Data Scientists presents all of the statistical analysis techniques that students and pracitioners of data analytics projects data science would benefit from reading. From school to workplace this book will earn it's place on your bookshelf.

    Sending feedback...
    Thanks, we'll investigate in the next few days.
  • 5 out of 5 stars
    Gives readers a very good perspective of traditional statistics and how data science differs ...
    Reviewed in India on September 26, 2017
    Brief content visible, double tap to read full content.
    Full content visible, double tap to read brief content.

    Well organised and really lucid text! Gives readers a very good perspective of traditional statistics and how data science differs from that. Very learnable. It would have been very useful if they added some problem sets at the end of the chapters.

    Sending feedback...
    Thanks, we'll investigate in the next few days.