Discover how Apache Hadoop can unleash the power of your data. This comprehensive resource shows you how to build and maintain reliable, scalable, distributed systems with the Hadoop framework -- an open source implementation of MapReduce, the algorithm on which Google built its empire. Programmers will find details for analyzing datasets of any size, and administrators will learn how to set up and run Hadoop clusters. This revised edition covers recent changes to Hadoop, including new features such as Hive, Sqoop, and Avro. It also provides illuminating case studies that illustrate how Hadoop is used to solve specific problems. Looking to get the most out of your data? This is your book. Use the Hadoop Distributed File System (HDFS) for storing large datasets, then run distributed computations over those datasets with MapReduce Become familiar with Hadoop’s data and I/O building blocks for compression, data integrity, serialization, and persistence Discover common pitfalls and advanced features for writing real-world MapReduce programs Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud Use Pig, a high-level query language for large-scale data processing Analyze datasets with Hive, Hadoop’s data warehousing system Take advantage of HBase, Hadoop’s database for structured and semi-structured data Learn ZooKeeper, a toolkit of coordination primitives for building distributed systems "Now you have the opportunity to learn about Hadoop from a master -- not only of the technology, but also of common sense and plain talk."
--Doug Cutting, Cloudera
發表於2024-12-27
Hadoop 2024 pdf epub mobi 電子書 下載
首先,翻譯太差,很多句子就是瞎翻,根本不通順,很多時候你要停下來斷句,慢慢去理解。 然後,這本書是很多人去翻譯的,很多人連代碼都不懂,曾經一段代碼看到我濛圈,去看瞭一下源代碼,好傢夥,四行有五個錯誤。另外,從代碼瞎縮進也可以看齣這是群沒寫過代碼的人翻的,而且...
評分很多地方翻譯的不行,需要對照英文看纔能明白。。。不過對於快速學習,仍然是不錯的選擇。建議譯者看看每部分內容的重要性,不重要的瞎翻翻就算瞭,重要的部分還是好好花點功夫,不要本末倒置瞭。比如第三章的數據流部分,這麼經典的地方居然被翻譯爛的一塌糊塗。不知道譯者會...
評分-- china-pub 贈書活動 -- http://www.douban.com/group/topic/20965935/ 一直比較忙,整本書還沒讀完,隻是粗略翻瞭個大概,其中有兩三章細讀瞭一遍。先做個大體評價吧,有時間全部細讀後再評論。 從書的內容上來講,大緻上與網上該書的內容介紹一緻。簡單點概括:這本書對...
評分買瞭第一版,時間太緊,沒來得及看,後來齣瞭個號稱修訂升級的第二版,毫不猶豫又買瞭,後來聽說第二版比第一版翻譯得好,心中竊喜,再後來看瞭第二版,我震驚瞭,我TM就是一傻子,放著好好的英文版不看,趕什麼時髦買中文版呢。在這個神奇的國度,牛奶裏放的是三聚氰胺,火腿...
評分很好的Hadoop教程,比Apache和Yahoo !網頁版guide詳細很多,很多想不明白的Hadoop實現細節都可以在這本書裏找到。
圖書標籤: hadoop 計算機科學 計算機 程序設計 分布式計算 軟件開發 架構設計 互聯網
入門書
評分入門書
評分入門書
評分入門書
評分入門書
Hadoop 2024 pdf epub mobi 電子書 下載