Jure Leskovec is Assistant Professor of Computer Science at Stanford University. His research focuses on mining large social and information networks. Problems he investigates are motivated by large scale data, the Web and on-line media. This research has won several awards including a Microsoft Research Faculty Fellowship, the Alfred P. Sloan Fellowship, Okawa Foundation Fellowship, and numerous best paper awards. His research has also been featured in popular press outlets such as the New York Times, the Wall Street Journal, the Washington Post, MIT Technology Review, NBC, BBC, CBC and Wired. Leskovec has also authored the Stanford Network Analysis Platform (SNAP, http://snap.stanford.edu), a general purpose network analysis and graph mining library that easily scales to massive networks with hundreds of millions of nodes and billions of edges. You can follow him on Twitter at @jure.
Written by leading authorities in database and Web technologies, this book is essential reading for students and practitioners alike. The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and can be applied successfully to even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. Other chapters cover the PageRank idea and related tricks for organizing the Web, the problems of finding frequent itemsets and clustering. This second edition includes new and extended coverage on social networks, machine learning and dimensionality reduction.
發表於2025-01-03
Mining of Massive Datasets 2025 pdf epub mobi 電子書 下載
看到好多人說這本書是大綱,是目錄,沒啥內容,講的淺。 那就對瞭。 本書是Stanford CS246課程MMDS使用的講義,還有配套的Slides和HW,所以觀看本書請配套課程進行學習,同時coursera上也有配套的課程。 See more detail: http://www.mmds.org/
評分 評分並非傳統的”數據挖掘”教材,更像是,“數據挖掘”在互聯網的應用場景,所遇到的問題(數據量大)和解決方案; 不過老實說,這本書挺不好懂的。 大概 get 瞭幾個不錯的思想: 思想-1:務必充分利用數據的”稀疏性”,如數據充分稀疏時,可以利用 HASH 將數據“聚閤”成“有效...
評分並非傳統的”數據挖掘”教材,更像是,“數據挖掘”在互聯網的應用場景,所遇到的問題(數據量大)和解決方案; 不過老實說,這本書挺不好懂的。 大概 get 瞭幾個不錯的思想: 思想-1:務必充分利用數據的”稀疏性”,如數據充分稀疏時,可以利用 HASH 將數據“聚閤”成“有效...
評分麻煩支那豬以後翻譯外文書籍,先找個稍微懂行的把書看一遍行嗎! 鑒於中文翻譯縮水不準的情況,本掉韆辛萬苦找來英文原版,一看到目錄,本屌就硬瞭,尼瑪作者太牛逼瞭! 最新補充一句,話說如果這本書的名字叫做類似《數據挖掘基礎》的話,本屌絕壁不噴它。本來就是基礎的基...
圖書標籤: 數據挖掘 計算機 機器學習 Data Coursera CS 數據分析 軟件工程
勉強一刷吧。到時配閤斯坦福的課再過一遍~
評分行文很流暢,看到下麵很多人說翻譯的問題,由此推薦原版。配閤網課還是挺淺顯的,例子舉得也挺多,自學也可以。步驟寫的也很細,有條件完全可以照著碼,不晦澀,小白很喜歡。
評分內容不錯,但作為技術嚮的書有些浮於錶麵。
評分行文很流暢,看到下麵很多人說翻譯的問題,由此推薦原版。配閤網課還是挺淺顯的,例子舉得也挺多,自學也可以。步驟寫的也很細,有條件完全可以照著碼,不晦澀,小白很喜歡。
評分bug非常之多, 還找不到地方提交, 讀起來極度痛苦, 前看後忘, 也許裏麵的算法本質上就是這樣, bottom line至少近15年最新的論文成果被這麼串講一下, 本科生也能看懂
Mining of Massive Datasets 2025 pdf epub mobi 電子書 下載