Discover how Apache Hadoop can unleash the power of your data. This comprehensive resource shows you how to build and maintain reliable, scalable, distributed systems with the Hadoop framework -- an open source implementation of MapReduce, the algorithm on which Google built its empire. Programmers will find details for analyzing datasets of any size, and administrators will learn how to set up and run Hadoop clusters. This revised edition covers recent changes to Hadoop, including new features such as Hive, Sqoop, and Avro. It also provides illuminating case studies that illustrate how Hadoop is used to solve specific problems. Looking to get the most out of your data? This is your book. Use the Hadoop Distributed File System (HDFS) for storing large datasets, then run distributed computations over those datasets with MapReduce Become familiar with Hadoop’s data and I/O building blocks for compression, data integrity, serialization, and persistence Discover common pitfalls and advanced features for writing real-world MapReduce programs Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud Use Pig, a high-level query language for large-scale data processing Analyze datasets with Hive, Hadoop’s data warehousing system Take advantage of HBase, Hadoop’s database for structured and semi-structured data Learn ZooKeeper, a toolkit of coordination primitives for building distributed systems "Now you have the opportunity to learn about Hadoop from a master -- not only of the technology, but also of common sense and plain talk."
--Doug Cutting, Cloudera
發表於2025-04-10
Hadoop 2025 pdf epub mobi 電子書 下載
其實也不算全部讀完瞭,讀它主要是為瞭技術選型,考慮升級持久層架構、提高係統可擴展性,仔細研讀瞭前幾章,對Hadoop、MapReduce、HDFS的模型、機製、使用場景有瞭一定瞭解。後麵幾章及其生態圈內的其他項目抱著瞭解的心態簡單瀏覽瞭一下。整體感覺還行,至少從我看過的章節來...
評分你的履曆添瞭一筆<hadoop權威指南>譯者,但是你不配 這是我見過的最不用心的翻譯, 字裏行間行文不通順, 請彆勉強自己,map reduce shuffle機製都沒翻譯的好 雖然原作者寫作功底也實在是一般 第 1 2 5 6 7 這幾章 翻譯的實在是太爛瞭 請不要呐Google翻譯糊弄人阿 誤人子弟 ...
評分 評分書中沒有透露太多實現架構方麵的細節,更多的是從使用者的角度上介紹瞭Hadoop的各種知識,包括MapReduce, HDFS, Hive, Pig, HBase, ZooKeeper。幾乎涉及瞭Hadoop的所有關於使用方麵的知識,包括安裝和使用。 你甚至可以直接在自己的電腦上裝上一個Hadoop,對著書中的例子實際演...
評分很多地方翻譯的不行,需要對照英文看纔能明白。。。不過對於快速學習,仍然是不錯的選擇。建議譯者看看每部分內容的重要性,不重要的瞎翻翻就算瞭,重要的部分還是好好花點功夫,不要本末倒置瞭。比如第三章的數據流部分,這麼經典的地方居然被翻譯爛的一塌糊塗。不知道譯者會...
圖書標籤: hadoop 計算機科學 計算機 程序設計 分布式計算 軟件開發 架構設計 互聯網
入門書
評分入門書
評分入門書
評分入門書
評分入門書
Hadoop 2025 pdf epub mobi 電子書 下載