Martin is a researcher in distributed systems at the University of Cambridge. Previously he was a software engineer and entrepreneur at Internet companies including LinkedIn and Rapportive, where he worked on large-scale data infrastructure. In the process he learned a few things the hard way, and he hopes this book will save you from repeating the same mistakes.
Martin is a regular conference speaker, blogger, and open source contributor. He believes that profound technical ideas should be accessible to everyone, and that deeper understanding will help us develop better software.
Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, including relational databases, NoSQL datastores, stream or batch processors, and message brokers. What are the right choices for your application? How do you make sense of all these buzzwords?
In this practical and comprehensive guide, author Martin Kleppmann helps you navigate this diverse landscape by examining the pros and cons of various technologies for processing and storing data. Software keeps changing, but the fundamental principles remain the same. With this book, software engineers and architects will learn how to apply those ideas in practice, and how to make full use of data in modern applications.
Peer under the hood of the systems you already use, and learn how to use and operate them more effectively
Make informed decisions by identifying the strengths and weaknesses of different tools
Navigate the trade-offs around consistency, scalability, fault tolerance, and complexity
Understand the distributed systems research upon which modern databases are built
Peek behind the scenes of major online services, and learn from their architectures
發表於2025-03-30
Designing Data-Intensive Applications 2025 pdf epub mobi 電子書 下載
摘要,後麵再讀補總結。字數還說不能少於140 字[捂臉哭]簡直瞭。 書中比較詳細描述瞭數據庫,從數據庫衍生派生數據,批處理,流處理等衍生。主要是問題-思路-問題的方式,但其問題角度考慮極端,分布式也正是這些極端場景的發源地。對自己是一個比較全麵的分布式認知,對於很多...
評分1.數據模型 從數據庫的數據模型上,可以將數據庫分為關係型、文檔型以及圖型數據庫等。這三類數據庫在形式和使用的場景上各有不同。 1.1.關係型 關係型模型基於集閤論中的關係理論,將數據視為Tuple的集閤。關係型在1970年被提齣,人們一開始質疑這類技術是否能高效地實現,但...
評分從高層視角係統的介紹瞭數據在存儲和讀取中的方方麵麵,非常有助於建立自己的知識體係。書中很多內容都是平時會接觸到的一些問題,但是在實際工作中未必能夠把這些零碎的東西串起來,形成體係框架。這本書可以讓你整理你自己的知識,讓他們連接在一起,變成一個樹結構,後麵你...
評分 評分Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, including relati...
圖書標籤: 分布式 大數據 係統設計 計算機 architecture 架構 data O'Reilly
好的不像樣子。理論的來龍去脈,講得非常清楚。大數據工程化必讀。
評分對大數據係統有瞭一個整體的認識,以後遇到問題之前能知道解決方嚮。
評分全麵、深入的大數據係統介紹,不但有how,還有why,建立技術全景。
評分我靠,這本書實在太牛瞭。 趕緊讀!趕緊讀!趕緊讀!
評分廣度有瞭,深度不夠,不過給齣瞭好多引用,夠看兩年...
Designing Data-Intensive Applications 2025 pdf epub mobi 電子書 下載