Cathy O’Neil earned a Ph.D. in math from Harvard, was postdoc at the MIT math department, and a professor at Barnard College where she published a number of research papers in arithmetic algebraic geometry. She then chucked it and switched over to the private sector. She worked as a quant for the hedge fund D.E. Shaw in the middle of the credit crisis, and then for RiskMetrics, a risk software company that assesses risk for the holdings of hedge funds and banks. She is currently a data scientist on the New York start-up scene, writes a blog at mathbabe.org, and is involved with Occupy Wall Street.
Rachel Schutt is a Senior Research Scientist at Johnson Research Labs, and most recently was a Senior Statistician at Google Research in the New York office. She is also an adjunct assistant professor in the Department of Statistics at Columbia University where she taught Introduction to Data Science. She earned a PhD from Columbia University in statistics, and masters degrees in mathematics and operations research from the Courant Institute and Stanford University, respectively. Her statistical research interests include modeling and analyzing social networks, epidemiology, hierarchical modeling and Bayesian statistics. Her education-related research interests include curriculum design.
Rachel enjoys designing and creating complex, thought-provoking situations for other people. She won the Howard Levene Outstanding Teaching Award at Columbia and also taught probability and statistics at Cooper Union, and remedial math as a high school teacher in San Jose, CA. She was a mathematics curriculum expert for the Princeton Review, and won a game design award for best family game at the Come Out and Play Festival in New York.
Now that answering complex and compelling questions with data can make the difference in an election or a business model, data science is an attractive discipline. But how can you learn this wide-ranging, interdisciplinary field? With this book, you’ll get material from Columbia University’s "Introduction to Data Science" class in an easy-to-follow format.
Each chapter-long lecture features a guest data scientist from a prominent company such as Google, Microsoft, or eBay teaching new algorithms, methods, or models by sharing case studies and actual code they use. You’ll learn what’s involved in the lives of data scientists and be able to use the techniques they present.
Guest lectures focus on topics such as:
Machine learning and data mining algorithms
Statistical models and methods
Prediction vs. description
Exploratory data analysis
Communication and visualization
Data processing
Big data
Programming
Ethics
Asking good questions
If you’re familiar with linear algebra, probability and statistics, and have some programming experience, this book will get you started with data science.
Doing Data Science is collaboration between course instructor Rachel Schutt (also employed by Google) and data science consultant Cathy O’Neil (former quantitative analyst for D.E. Shaw) who attended and blogged about the course.
發表於2024-11-22
Doing Data Science 2024 pdf epub mobi 電子書 下載
這本書蠻不錯的,就是看的時候碰到一些小錯誤,記錄如下,如果本書的編者看到瞭,也方便勘誤。 P43 第11行 “事”改為“是” P45 第9行 “歌”改為“個” P52 圖3-6說明文字第2行 “直”改為“緻” P96 正文第6行 “Emprical”改為“Empirical” P103 倒數第4行 “...
評分這本書蠻不錯的,就是看的時候碰到一些小錯誤,記錄如下,如果本書的編者看到瞭,也方便勘誤。 P43 第11行 “事”改為“是” P45 第9行 “歌”改為“個” P52 圖3-6說明文字第2行 “直”改為“緻” P96 正文第6行 “Emprical”改為“Empirical” P103 倒數第4行 “...
評分我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看過瞭 我看...
評分這本書蠻不錯的,就是看的時候碰到一些小錯誤,記錄如下,如果本書的編者看到瞭,也方便勘誤。 P43 第11行 “事”改為“是” P45 第9行 “歌”改為“個” P52 圖3-6說明文字第2行 “直”改為“緻” P96 正文第6行 “Emprical”改為“Empirical” P103 倒數第4行 “...
評分這本書蠻不錯的,就是看的時候碰到一些小錯誤,記錄如下,如果本書的編者看到瞭,也方便勘誤。 P43 第11行 “事”改為“是” P45 第9行 “歌”改為“個” P52 圖3-6說明文字第2行 “直”改為“緻” P96 正文第6行 “Emprical”改為“Empirical” P103 倒數第4行 “...
圖書標籤: 數據挖掘 數據分析 數據科學 datascience 機器學習 計算機 統計 O'Reilly
很多很business的東西,看瞭很多隻講算法和扣定的覺得蠻新鮮
評分看這種書主要不是看算法吧,主要看看一個“流程性”的東西,拿到數據,怎麼explore,怎麼telling story,試model之類的。 裏麵有些和不同公司訪談性的東西還比較有趣。
評分很多地方都講到瞭,語言也很簡練,易理解
評分很好的入門書
評分使用R來學習數據科學,有算法,有實例,不錯
Doing Data Science 2024 pdf epub mobi 電子書 下載