Ready to unleash the power of your massive dataset? With the latest edition of this comprehensive resource, you'll learn how to use Apache Hadoop to build and maintain reliable, scalable, distributed systems. It's ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters. This third edition covers recent changes to Hadoop, including new material on the new MapReduce API, as well as version 2 of the MapReduce runtime (YARN) and its more flexible execution model. You'll also find illuminating case studies that demonstrate how Hadoop is used to solve specific problems. * Store large datasets with the Hadoop Distributed File System (HDFS), then run distributed computations with MapReduce * Use Hadoop's data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistence * Discover common pitfalls and advanced features for writing real-world MapReduce programs * Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud * Use Pig, a high-level query language for large-scale data processing * Analyze datasets with Hive, Hadoop's data warehousing system * Load data from relational databases into HDFS, using Sqoop * Take advantage of HBase, the database for structured and semi-structured data * Use ZooKeeper, the toolkit for building distributed systems
發表於2024-11-22
Hadoop 2024 pdf epub mobi 電子書 下載
很好的Hadoop教程,比Apache和Yahoo !網頁版guide詳細很多,很多想不明白的Hadoop實現細節都可以在這本書裏找到。
評分書中沒有透露太多實現架構方麵的細節,更多的是從使用者的角度上介紹瞭Hadoop的各種知識,包括MapReduce, HDFS, Hive, Pig, HBase, ZooKeeper。幾乎涉及瞭Hadoop的所有關於使用方麵的知識,包括安裝和使用。 你甚至可以直接在自己的電腦上裝上一個Hadoop,對著書中的例子實際演...
評分是我遇到過的翻譯最爛的一本書,在譯者的“妙語連珠”裏摺騰瞭半個鍾頭就再也沒興趣瞭。略舉幾例如下: P.6 任然 -> 仍然 P.21 輸入鍵(為什麼不像後麵那樣有個“的”?),輸入的值,輸齣的鍵…… P. 27 “計數器”(Counter),譯文附原文;"Context Object"(上下文對象),原...
評分參加豆瓣China-pub抽奬,比較幸運的得到這本Hadoop權威指南中文第二版,拿來與第一版相比,發現新加入瞭Hive和Sqoop章節,譯文質量也提高瞭不少,並且保留瞭英文索引。 這本書對Hadoop的介紹還算全麵,有實踐衝動的朋友基本可以拿著書、配閤Google百度馬上實現夢想。個人感覺“...
評分很好的Hadoop教程,比Apache和Yahoo !網頁版guide詳細很多,很多想不明白的Hadoop實現細節都可以在這本書裏找到。
圖書標籤: Hadoop 分布式 並行計算 數據挖掘 大數據 計算機 O'Reilly 編程
Hadoop權威指南英文版,非常給力
評分過瞭一遍,隻知道個大概結構。細節還不是很懂
評分看完瞭Hadoop部分,看瞭部分可選模塊章節。 真心寫的挺仔細的。 17.3月 171216-17 看完瞭PART III Hadoop Operations
評分Hadoop權威指南英文版,非常給力
評分告彆小白
Hadoop 2024 pdf epub mobi 電子書 下載