Post category - Paper Analysis

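Paper Analysis -- Dapper, a Large-Scale Distributed Systems Tracing Infrastructure
Summary: This is the foundational paper for distributed tracing. Most of its content is already covered by the blog post at https://logz.io/blog/distributed-tracing-dapper-jaeger/. The scenario Dapper targets: Google, as a search engine, handles a huge volume of network requests, and a single request passes through RPC...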

posted @ 2022-08-15 18:10 fxjwind

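Paper Analysis -- Big Metadata: When Metadata is Big Data
Summary: The problem addressed is how a cloud-native database should manage its ever-growing metadata. Traditional databases store the catalog in system tables. Big data systems take other routes: Colossus, for example, stores metadata in Bigtable; the Hadoop ecosystem has the Hive Metastore; Delta Lake records metadata in a transaction log. And for AP...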

posted @ 2022-05-18 16:17 fxjwind

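Paper Analysis -- Db2 Event Store: A Purpose-Built IoT Database Engine (PVLDB 2020)
Summary: INTRODUCTION The Needs of IoT systems. Current systems cannot adequately address the following four requirements of IoT systems: high throughput, efficient storage, real-time queries, and high availability. With the rapid proliferation (rapid increase in numbers) of conne...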

posted @ 2021-04-16 15:13 fxjwind

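Paper Analysis -- A Cloud-native Architecture for Replicated Data Services (HotCloud '20)
Summary: Introduction. Traditional services were designed for local servers and local disks, so they run into a storage mismatch when migrated to the cloud. This is especially true of distributed data systems, each of which traditionally manages its own local disks. The approach in the cloud is to wrap cloud storage services as block storage or file storage, so that systems can access them the way they always have. Infrastructu...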

posted @ 2021-04-07 14:46 fxjwind

Paper Analysis -- How Good Are Query Optimizers, Really? (TUM PVLDB 2015)
Summary: INTRODUCTION The problem of finding a good join order is one of the most studied problems in the database field. Figure 1 illustrates the classical, c...

posted @ 2021-04-02 16:04 fxjwind

Paper Analysis -- Building An Elastic Query Engine on Disaggregated Storage (NSDI 2020)
Summary: Introduction (introducing the shared-nothing architecture) Shared-nothing architectures have been the foundation of traditional query execution engines and data warehousing syst...

posted @ 2021-03-25 13:50 fxjwind

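Paper Analysis -- Efficiently Approximating Selectivity Functions using Low Overhead Regression Models (2020)
Summary: This paper is mainly about how to reduce the cost of training, from two angles: shrinking the training set, and lowering the cost of collecting the label for each training sample. ABSTRACT (Status quo) Today's query optimizers use fast selectivity estimation techniques but ar...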

posted @ 2021-03-03 17:50 fxjwind

Paper Analysis -- Selectivity Estimation for Range Predicates using Lightweight Models (2019)
Summary: Abstract (Main point) Query optimizers depend on selectivity estimates of query predicates to produce a good execution plan. (Current approach) When a query contains multiple pr...

posted @ 2021-01-19 15:45 fxjwind

Paper Analysis -- LeanStore: In-Memory Data Management Beyond Main Memory
Summary: INTRODUCTION Managing large data sets has always been the raison d'être (a French expression commonly used in English, meaning "reason for being" or...

posted @ 2021-01-05 14:55 fxjwind

Paper Analysis -- Anti-Caching: A New Approach to Database Management System Architecture
Summary: INTRODUCTION (The problem with disk-oriented databases: maintaining a buffer pool) DBMSs invariably (always) maintain a buffer pool of blocks in main memory for faster access. When an ex...

posted @ 2020-12-25 16:08 fxjwind

Paper Analysis -- What’s Really New with NewSQL?
Summary: ABSTRACT (Main point) A new class of database management systems (DBMSs) called NewSQL tout (boast of) their ability to scale modern on-line transaction processing (O...

posted @ 2020-12-23 18:06 fxjwind

Paper Analysis -- An Empirical Evaluation of In-Memory Multi-Version Concurrency Control
Summary: ABSTRACT (Main point) Multi-version concurrency control (MVCC) is currently the most popular transaction management scheme in modern database management system...

posted @ 2020-12-16 17:38 fxjwind

Paper Analysis -- TiDB: A Raft-based HTAP Database
Summary: Abstract Hybrid Transactional and Analytical Processing (HTAP) databases require processing transactional and analytical queries in isolation to remove the...

posted @ 2020-12-01 15:50 fxjwind

Paper Analysis -- The Snowflake Elastic Data Warehouse
Summary: Abstract We live in the golden age of distributed computing. (Background: cloud platforms and SaaS) Public cloud platforms now offer virtually unlimited compute and storage resources...

posted @ 2019-12-09 14:15 fxjwind