Ready to unleash the power of your massive dataset? With the latest edition of this comprehensive resource, you'll learn how to use Apache Hadoop to build and maintain reliable, scalable, distributed systems. It's ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.

This third edition covers recent changes to Hadoop, including new material on the new MapReduce API, as well as version 2 of the MapReduce runtime (YARN) and its more flexible execution model. You'll also find illuminating case studies that demonstrate how Hadoop is used to solve specific problems.

* Store large datasets with the Hadoop Distributed File System (HDFS), then run distributed computations with MapReduce
* Use Hadoop's data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistence
* Discover common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud
* Use Pig, a high-level query language for large-scale data processing
* Analyze datasets with Hive, Hadoop's data warehousing system
* Load data from relational databases into HDFS, using Sqoop
* Take advantage of HBase, the database for structured and semi-structured data
* Use ZooKeeper, the toolkit for building distributed systems
Posted on 2024-06-26
For details, see: http://www.cnblogs.com/aprilrain/archive/2013/03/07/2947664.html
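The official website of Cobub Razor, an app analytics tool, has an article on choosing and using the Hadoop YARN schedulers. I thought it was well written and recommend it: http://www.cobub.com/the-selection-and-use-of-hadoop-yarn-scheduler/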
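I logged in just to leave this comment: the translation is terrible. I strongly recommend that anyone with good English reading skills read the original edition instead of wasting money on this one. Besides the errors in the text, even the figures contain mistakes. For example, in the figure on page 260 the last two years should be 1901, but here they are printed as 1900. I am genuinely speechless. With a classic translated this badly, the author would be furious.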
Book tags: Hadoop, distributed systems, parallel computing, data mining, big data, computing, O'Reilly, programming
Seems decent; the explanations are fairly detailed.
The English edition of Hadoop: The Definitive Guide. Really excellent.
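I finished the first part, which covers Hadoop, HDFS, and YARN (I read the fourth edition), and it gave me an initial understanding of Hadoop. The book is very logically written and worth recommending. Hadoop is not my focus for now, though, so I will stop here and come back for a closer read when I get the chance.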
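The best introduction. The book is best understood alongside hands-on practice. The part on MapReduce principles and applications is probably the most authoritative treatment available, and I read it closely; for the later parts I turned to other books. It works as a reference in day-to-day work, and is worth revisiting when time allows.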
Finally finished it. This is the third edition in the series, and it has earned rave reviews, so there is no need for me to say more.