Discover how Apache Hadoop can unleash the power of your data. This comprehensive resource shows you how to build and maintain reliable, scalable, distributed systems with the Hadoop framework -- an open source implementation of MapReduce, the algorithm on which Google built its empire. Programmers will find details for analyzing datasets of any size, and administrators will learn how to set up and run Hadoop clusters.
This revised edition covers recent changes to Hadoop, including new features such as Hive, Sqoop, and Avro. It also provides illuminating case studies that illustrate how Hadoop is used to solve specific problems. Looking to get the most out of your data? This is your book.
Use the Hadoop Distributed File System (HDFS) for storing large datasets, then run distributed computations over those datasets with MapReduce
Become familiar with Hadoop’s data and I/O building blocks for compression, data integrity, serialization, and persistence
Discover common pitfalls and advanced features for writing real-world MapReduce programs
Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud
Use Pig, a high-level query language for large-scale data processing
Analyze datasets with Hive, Hadoop’s data warehousing system
Take advantage of HBase, Hadoop’s database for structured and semi-structured data
Learn ZooKeeper, a toolkit of coordination primitives for building distributed systems
"Now you have the opportunity to learn about Hadoop from a master -- not only of the technology, but also of common sense and plain talk."
--Doug Cutting, Cloudera
Posted on 2024-10-06
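The translation is poor in many places, and you have to check it against the English to understand it. Still, it is a decent choice for learning quickly. I would suggest the translators weigh the importance of each part: the unimportant parts can be translated roughly, but the important parts deserve real effort, so as not to get the priorities backwards. Take the data flow section in Chapter 3: such a classic passage, and the translation is a complete mess. I don't know whether the translators...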
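I entered the Douban and China-pub prize draw and was lucky enough to win this second Chinese edition of Hadoop: The Definitive Guide. Comparing it with the first edition, I found that it adds chapters on Hive and Sqoop, that the translation quality has improved considerably, and that the English index has been retained. The book's coverage of Hadoop is fairly comprehensive; readers who are itching to practice can pretty much take the book, pair it with Google and Baidu, and get started right away. My personal impression is "...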
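I bought the first edition but was too short on time to read it. Then a second edition came out, billed as revised and upgraded, and I bought that too without hesitation. Later I heard that the second edition was better translated than the first, and I was secretly pleased. Then I actually read the second edition and was shocked: what a damn fool I am, passing over a perfectly good English edition to follow the trend and buy the Chinese one. In this magical country, the milk is laced with melamine, the ham...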
Book tags: hadoop, computer science, computers, programming, distributed computing, software development, architecture design, Internet
An introductory book.