Embedded Projects: Think Stats: Exploratory Data Analysis by Allen B. Downey; O'Reilly Media

Wednesday, November 26, 2014

Think Stats: Exploratory Data Analysis by Allen B. Downey; O'Reilly Media

I recently finished reading Think Stats: Exploratory Data Analysis by Allen B. Downey, which is an introduction to using probability and statistics to perform analysis on data sets. This book uses Python to explore and perform statistical analysis on several example data sets.

I have a decent statistics background (several undergraduate and graduate level statistics courses), and this book definitely took a different approach than I have seen before. The focus is on an exploratory and computational approach to analyzing a data set. This approach is very valuable, and provides a much more easily applied skill set than a traditional statistics introduction.

This book is not a thorough reference (though it often provides links to Wikipedia or other external sources for more information), and it won't replace my other statistics textbooks. However, it is a good introduction to the field (including many more advanced topics) and is easy to follow. I would be very interested in seeing a class that used this book as the text and followed the approach presented here. The book flows logically, but the topics were presented in a very different order than I was originally exposed to them.

To get the most out of this book, I would definitely recommend working through the examples. An even better approach would be to work through the topics on a data set you have at hand that is of interest to you.

Most of the examples in the book use the author's "thinkstats2.py" module. You can get the thinkstats2.py module (along with other sample code) at the book's GitHub page, and all of the examples can be viewed in IPython Notebooks. The examples are fairly straightforward, but I have not used to module enough to know whether I would consider it a candidate for a general purpose tool beyond working through the book. The author assumes you are familiar with Python, and having the module available is a useful tool to allow the reader to focus on the data and analysis.

The non-core Python packages used in this book are: pandas, NumPy, SciPy, StatsModels, and matplotlib. Pandas, in particular, is used quite heavily in thinkstats2.py. The author recommends the Anaconda distribution, which gives you all of these packages and many more. I've been using Anaconda as my primary distribution for the past 6 months or so and am very happy with it.

The book is also available under a CC BY-NC 3.0 license at Green Tea Press.

Disclaimer: I received a free Ebook copy of this work under the O'Reilly Blogger Review Program.

8 comments:

NandhiniMay 18, 2016 at 2:33 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
AnonymousMarch 5, 2022 at 5:49 AM
Tithian Athletics | Tithian Athletics
› en-US › tithian-b titanium canteen › en-US › titanium exhaust tubing tithian-b Tithian Athletics - Tithian Athletics - titanium white Tithian Athletics - can titanium rings be resized Tithian Athletics - Tithian Athletics - Tithian babyliss pro nano titanium straightener Athletics - Tithian Athletics - Tithian Athletics
ReplyDelete
Replies
hithoughJuly 14, 2022 at 5:56 AM
l989c2votzv951 horse dildos,wolf dildo,male masturbator,realistic dildo,male masturbators,dildos,realistic dildo,dildos,realistic dildo y565d9qmrme592
ReplyDelete
Replies
AnonymousAugust 4, 2022 at 7:49 PM
o834u2vnjqh272 custom sex doll,realistic sex dolls,dildos,wholesale sex toys,cheap sex toys,vibrators,Male Masturbators,sex chair,couples sexy toys v975b4dszjl034
ReplyDelete
Replies
AnonymousAugust 21, 2022 at 3:24 AM
d031x3eojuv441 Panty Vibrators,dildo,horse dildo,love dolls,sex toys,dog dildo,sex chair,Male Masturbators,dildo l434e9zufvb388
ReplyDelete
Replies
thesunchronicleJune 24, 2024 at 11:16 PM
Its a wonderful post and very informative, thanks for all this information. You are included prodigious content regarding this topic in an effective way.
Joe Lemus
ReplyDelete
Replies
unclepharmacy.netDecember 3, 2025 at 6:45 AM
Krijg snelle, discrete en betaalbare toegang tot betrouwbare ED-medicatie wanneer je kiest voor Cenforce 100mg kopen in Virginia. Een vertrouwde online apotheek biedt originele kwaliteit, veilige betalingen en snelle levering. Deze handige optie zorgt voor privacy, gemak en een probleemloze ervaring, zonder dat je een fysieke winkel hoeft te bezoeken.
ReplyDelete
Replies
unclepharmacy.netJanuary 16, 2026 at 8:07 AM
Alaska men can buy Kamagra in Alaska through Uncle Pharmacy for sildenafil-driven results matching Viagra—strong erections in 30-60 minutes via better blood flow, lasting 4-6 hours in 50mg/100mg tablets or jellies. Skip local hassles and high prices; Uncle Pharmacy ships authentic stock discreetly to Houston, Dallas, Austin, or statewide (7-14 days US delivery), with secure payments and privacy assured. No prescription needed—trusted for potency without counterfeits. Buy Kamagra in Alaska
ReplyDelete
Replies

Add comment