Ready to unleash the power of your massive dataset? With the latest edition of this comprehensive resource, you'll learn how to use Apache Hadoop to build and maintain reliable, scalable, distributed systems. It's ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.

This third edition covers recent changes to Hadoop, including new material on the new MapReduce API, as well as version 2 of the MapReduce runtime (YARN) and its more flexible execution model. You'll also find illuminating case studies that demonstrate how Hadoop is used to solve specific problems.

* Store large datasets with the Hadoop Distributed File System (HDFS), then run distributed computations with MapReduce
* Use Hadoop's data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistence
* Discover common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud
* Use Pig, a high-level query language for large-scale data processing
* Analyze datasets with Hive, Hadoop's data warehousing system
* Load data from relational databases into HDFS, using Sqoop
* Take advantage of HBase, the database for structured and semi-structured data
* Use ZooKeeper, the toolkit for building distributed systems
Posted on 2024-11-08
Hadoop 2024 pdf epub mobi e-book download
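I read a few chapters of the Chinese edition. It is full of errors, many of them very basic, and I really could not keep going. I recommend reading the original instead. The translators have some nerve: failing to translate the English properly is one thing, but even the Chinese is not written fluently. Aren't they embarrassed? What is "however, ..., however" supposed to be, some kind of "however style"?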
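An excellent Hadoop tutorial, far more detailed than the Apache and Yahoo! web guides. Many Hadoop implementation details that I could not figure out are explained in this book.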
Book tags: Hadoop, distributed systems, parallel computing, data mining, big data, computers, O'Reilly, programming
Finally finished it. This is the third edition in the series and it has been widely praised, so there is no need for me to add more.
Read it mainly for the design ideas. The design ideas. The design ideas.
No longer a beginner.
Finished the Hadoop part and read some of the chapters on the optional modules. It really is written quite carefully. 17.3月 171216-17: finished Part III, Hadoop Operations.
Went through it once and only have a rough idea of the overall structure. I still do not really understand the details.