Comments on: learn data science
http://ask.metafilter.com/217293/learn-data-science/
Comments on Ask MetaFilter post learn data scienceFri, 08 Jun 2012 06:18:01 -0800Fri, 08 Jun 2012 06:36:33 -0800en-ushttp://blogs.law.harvard.edu/tech/rss60Question: learn data science
http://ask.metafilter.com/217293/learn-data-science
I'd like to learn about data science. Things like predictive modelling, regression and classification and so on. What would be good books or online courses to start with?post:ask.metafilter.com,2012:site.217293Fri, 08 Jun 2012 06:18:01 -0800gwynpdatasciencestatisticsBy: kandinski
http://ask.metafilter.com/217293/learn-data-science#3136934
I am enjoying a lot <a href="http://www.coursera.org/course/ml">Coursera's Machine Learning course</a>.comment:ask.metafilter.com,2012:site.217293-3136934Fri, 08 Jun 2012 06:36:33 -0800kandinskiBy: russm
http://ask.metafilter.com/217293/learn-data-science#3136957
I came here to mention the Stanford ML course at coursera, so consider this a second.comment:ask.metafilter.com,2012:site.217293-3136957Fri, 08 Jun 2012 06:59:42 -0800russmBy: bessel functions seem unnecessarily complicated
http://ask.metafilter.com/217293/learn-data-science#3136995
Here's <a href="http://www-stat.stanford.edu/~tibs/ElemStatLearn/">a free textbook</a> that I have not yet read.comment:ask.metafilter.com,2012:site.217293-3136995Fri, 08 Jun 2012 07:54:32 -0800bessel functions seem unnecessarily complicatedBy: Tooty McTootsalot
http://ask.metafilter.com/217293/learn-data-science#3137003
There are some references in a recent closely related question of mine:<br>
<br>
<a href="http://ask.metafilter.com/214823/Im-a-data-scientist-What-does-that-mean">http://ask.metafilter.com/214823/Im-a-data-scientist-What-does-that-mean</a>comment:ask.metafilter.com,2012:site.217293-3137003Fri, 08 Jun 2012 08:03:20 -0800Tooty McTootsalotBy: Tooty McTootsalot
http://ask.metafilter.com/217293/learn-data-science#3137091
Some of the books I've seen most recommended for data scientists are:<br>
<br>
<a href="http://www.amazon.com/exec/obidos/ASIN/0123748569/metafilter-20/ref=nosim/">Data Mining: Practical Machine Learning Tools and Techniques</a><br>
<a href="http://www.amazon.com/exec/obidos/ASIN/0596802358/metafilter-20/ref=nosim/">Data Analysis with Open Source Tools</a><br>
<a href="http://www.amazon.com/exec/obidos/ASIN/1617290181/metafilter-20/ref=nosim/">Machine Learning in Action</a><br>
<a href="http://www.amazon.com/exec/obidos/ASIN/0961392142/metafilter-20/ref=nosim/">The Visual Display of Quantitative Information</a><br>
<a href="http://www.amazon.com/exec/obidos/ASIN/0198568320/metafilter-20/ref=nosim/">Data Analysis: A Bayesian Tutorial</a><br>
<a href="http://www.amazon.com/exec/obidos/ASIN/0387310738/metafilter-20/ref=nosim/">Pattern Recognition and Machine Learning</a><br>
<br>
I recently went through the exercises here: <a href="http://www.rdatamining.com/home">http://www.rdatamining.com/home</a>, which were useful, but the site (and text) is currently incomplete, so I can't recommend it too much yet. The author is working on a book describing Data Mining with R.<br>
<br>
Also, these aren't so much technical, but if you're interested in data science and haven't seen them, you should read them over:<br>
<br>
<a href="http://shop.oreilly.com/product/0636920025559.do">Planning for Big Data</a><br>
<a href="http://research.microsoft.com/en-us/collaboration/fourthparadigm/">The Fourth Paradigm: Data-Intensive Scientific Discovery</a>comment:ask.metafilter.com,2012:site.217293-3137091Fri, 08 Jun 2012 09:13:36 -0800Tooty McTootsalotBy: mgogol
http://ask.metafilter.com/217293/learn-data-science#3137406
<a href="http://www.amazon.com/exec/obidos/ASIN/1449303714/metafilter-20/ref=nosim/">Machine Learning for Hackers?<br>
</a>comment:ask.metafilter.com,2012:site.217293-3137406Fri, 08 Jun 2012 13:35:42 -0800mgogolBy: benthegirl
http://ask.metafilter.com/217293/learn-data-science#3137575
To learn the basics of regression and ANOVA "Applied Linear Statistical Models" by Neter & Kutner is a good book that is easy to read. This book will tell you about how to do linear regression and how linear regression works. <br>
<br>
For generalized linear models "Categorical Data Analysis" by Agresti is my favorite book. This talks about data such as binary data or count data. You need to know something about linear regression before this book. <br>
<br>
For data driven modeling "Statistical Learning: Data Mining, Inference, and Prediction." by Hastie & Tibshirani is nice, however, not as easy as the other two books. This book talks about regression and judging the fit of a model and data driven stuff. One good thing about it is that it is FREE. <br>
<br>
Regression and Data driven modeling are useful for different things. If you are interested in answering scientific questions, try to learn the basics of regression and the theory behind it. If you are interested in predictive modeling, data driven models are good.comment:ask.metafilter.com,2012:site.217293-3137575Fri, 08 Jun 2012 16:24:52 -0800benthegirlBy: pmb
http://ask.metafilter.com/217293/learn-data-science#3137811
What is your background? Math? CS? Stats? Physics? English?comment:ask.metafilter.com,2012:site.217293-3137811Fri, 08 Jun 2012 20:53:57 -0800pmbBy: town of cats
http://ask.metafilter.com/217293/learn-data-science#3137845
Another free textbook: <a href="http://infolab.stanford.edu/~ullman/mmds.html">Mining of Massive Datasets. This text is used for Stanford's "big data" course, CS 246.</a>comment:ask.metafilter.com,2012:site.217293-3137845Fri, 08 Jun 2012 22:07:05 -0800town of cats