Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He works for Cloudera, a company set up to offer Hadoop support and training. Previously he was an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O'Reilly, java.net, and IBM's developerWorks, and has spoken at several conferences, including ApacheCon 2008, where he spoke on Hadoop. Tom has a Bachelor's degree in Mathematics from the University of Cambridge and a Master's in Philosophy of Science from the University of Leeds, UK.
Posted on 2024-05-14
Hadoop: The Definitive Guide 2024 pdf epub mobi e-book
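-- china-pub book giveaway -- http://www.douban.com/group/topic/20965935/ I have been fairly busy and have not finished the whole book; I only skimmed it and read two or three chapters closely. Here is an overall impression for now; I will write a fuller review once I have read all of it carefully. In terms of content, it broadly matches the description of the book found online. To put it briefly: this book...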
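This is the worst-translated book I have ever come across. After half an hour of struggling through the translator's "pearls of wit" I lost all interest. A few examples: P.6 任然 -> 仍然 (a typo for "still"); P.21 输入键 ("input key"; why is there no 的 here, as there is in the later terms?), 输入的值 ("input value"), 输出的键 ("output key")...; P.27 "计数器" (Counter), the translation followed by the original term; "Context Object" (上下文对象), the original...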
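I logged in just to leave this review: the translation is terrible. I really, strongly recommend that anyone who reads English well get the original edition and not waste money on this one. Besides the textual errors, even the figures are wrong: for example, in the figure on page 260 the last two years should be 1901, but here they are printed as 1900. I give up. For a classic to be translated like this, the author must be furious. zsbd zsbd zsbd...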
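Your résumé now lists you as the translator of "Hadoop: The Definitive Guide", but you do not deserve it. This is the most careless translation I have ever seen; the prose is awkward throughout. Please do not force yourself; even the MapReduce shuffle mechanism is not translated well, although the original author's writing is honestly only average too. Chapters 1, 2, 5, 6, and 7 are translated really badly. Please do not fob people off with Google Translate; it misleads readers. ...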
Book tags: Hadoop, Big Data, BigData, Computer Science, Distributed Systems, hadoop, Machine Learning, O'Reilly
Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.
Using Hadoop 2 exclusively, author Tom White presents new chapters on YARN and several Hadoop-related projects such as Parquet, Flume, Crunch, and Spark. You’ll learn about recent changes to Hadoop, and explore new case studies on Hadoop’s role in healthcare systems and genomics data processing.
Learn fundamental components such as MapReduce, HDFS, and YARN
Explore MapReduce in depth, including steps for developing applications with it
Set up and maintain a Hadoop cluster running HDFS and MapReduce on YARN
Learn two data formats: Avro for data serialization and Parquet for nested data
Use data ingestion tools such as Flume (for streaming data) and Sqoop (for bulk data transfer)
Understand how high-level data processing tools like Pig, Hive, Crunch, and Spark work with Hadoop
Learn the HBase distributed database and the ZooKeeper distributed configuration service
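I read the first half (the fundamentals) in the English fourth edition, then coasted through the second half (related projects and case studies) in the Chinese third edition. Typical of the Definitive Guide series: plenty of substance and plenty of padding. Hadoop itself is complex and hard to use; if Spark ends up displacing it, that would be only natural.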
I read the first three parts; time to go read the source code.
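Very comprehensive, mainly in the first two parts and especially the MapReduce part. The later material on clusters and the various related projects can just be skimmed; it is not covered in much detail, and the Apache documentation is enough when you actually need it.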
Luckily I don't have to write Java when I use it (
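I read the first edition when I was starting out, the second edition when I actually needed it at work, and came back to the third edition after a year of working in this area. I got something new out of it each time.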