Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He works for Cloudera, a company set up to offer Hadoop support and training. Previously he was an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O'Reilly, java.net, and IBM's developerWorks, and has spoken at several conferences, including ApacheCon 2008, where he spoke on Hadoop. Tom has a Bachelor's degree in Mathematics from the University of Cambridge and a Master's in Philosophy of Science from the University of Leeds, UK.
Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.
Using Hadoop 2 exclusively, author Tom White presents new chapters on YARN and several Hadoop-related projects such as Parquet, Flume, Crunch, and Spark. You’ll learn about recent changes to Hadoop, and explore new case studies on Hadoop’s role in healthcare systems and genomics data processing.
Learn fundamental components such as MapReduce, HDFS, and YARN
Explore MapReduce in depth, including steps for developing applications with it
Set up and maintain a Hadoop cluster running HDFS and MapReduce on YARN
Learn two data formats: Avro for data serialization and Parquet for nested data
Use data ingestion tools such as Flume (for streaming data) and Sqoop (for bulk data transfer)
Understand how high-level data processing tools like Pig, Hive, Crunch, and Spark work with Hadoop
Learn the HBase distributed database and the ZooKeeper distributed configuration service
Posted on 2024-05-20
Hadoop: The Definitive Guide 2024 pdf epub mobi e-book download
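I bought the first edition but was too pressed for time to read it. Then a second edition came out, billed as revised and upgraded, and I bought that without hesitation too. Later I heard that the second edition was better translated than the first, and I was quietly pleased. Then I actually read the second edition and was shocked: I'm a damn fool, passing up a perfectly good English edition to follow the fashion of buying the Chinese one. In this magical country, the milk has melamine in it, the ham...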
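I read a few chapters of the Chinese edition. It is full of errors so basic that I really couldn't keep reading. I recommend reading the original instead. The translators are truly shameless: failing to understand the English is one thing, but even their Chinese is not coherent. How do they have the nerve!! What is "however, ......, however" supposed to be? "However style"?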
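I logged in specifically to write this review. The translation is just terrible. I really, strongly recommend that anyone with good English reading ability read the original; don't waste your money on this. Besides the textual errors, even the figures are wrong: for example, in the figure on page 260 the last two years should be 1901, but here they are 1900. I'm truly speechless. A legendary book translated like this; the author would be furious. zsbd zsbd zsbd...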
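-- china-pub book giveaway -- http://www.douban.com/group/topic/20965935/ I've been rather busy and haven't finished the whole book; I've only skimmed it, reading two or three chapters closely. Here is an overall assessment for now; I'll review it again once I've read everything carefully. In terms of content, the book broadly matches the description of it found online. To put it briefly: this book...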
Book tags: Hadoop, Big Data, BigData, Computers, Distributed Systems, hadoop, Machine Learning, O'Reilly
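Really damn long. It covers most of the tools in the ecosystem, so it works well for summary and review. Readers without hands-on experience can read the first two parts, covering the MR and YARN core, and then skim the rest to see what each of the later tools is for.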
A classic.
Excellent.
A good book for getting started with Hadoop.
Have read the first part of it for an overview. Superb. Will definitely come back for the details before the third year of my career.