Summary: Comparison of the Open Source Query Engines: Trino and StarRocks https://www.starrocks.io/blog/comparison-of-the-open-source-query-engines-trino-and-s
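Summary: Trino query engine resource bottlenecks. Cutting costs by 40%+! ThinkingData (数数科技) builds a cloud-native query engine to comprehensively optimize the data analysis experience - ThinkingData https://www.thinkingdata.cn/thinking/blog/features/5966.html Open-source technology | The high-availability architecture of the TA big data analysis system, from design to implementation - ThinkingData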
https://github.com/StarRocks/starrocks/blob/main/be/src/exprs/agg/window_funnel.h enum FunnelMode : int { DEDUPLICATION = 1, FIXED = 2, DEDUPLICATION_
Use HyperLogLog for approximate count distinct | StarRocks https://docs.starrocks.io/zh/docs/using_starrocks/Using_HLL/
bitmap_to_array not supported size > 1000000 bitmap_to_string not supported size > 1000000
StarRocks Best Practices - 📚 StarRocks Best Practices - StarRocks Chinese Community Forum https://forum.mirrorship.cn/t/topic/11572
Retail eCommerce Funnel Analysis Demo with 1 million members and 87 million record dataset using StarRocks · StarRocks/starrocks · Discussion #26075
__del__ __exit__ https://docs.python.org/3/reference/datamodel.html
window_funnel | StarRocks https://docs.starrocks.io/zh/docs/sql-reference/sql-functions/aggregate-functions/window_funnel/
Roaring BitMap, a powerful tool for massive data processing: an introduction to its principles https://mp.weixin.qq.com/s/aNHXKHNViWchWKcyRxjOIg
MySQL :: MySQL 8.0 Reference Manual :: 15.2.13 SELECT Statement https://dev.mysql.com/doc/refman/8.0/en/select.html
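Slow queries on large tables with no predicate pushdown: when joining multiple large tables, the old planner sometimes does not push predicates down automatically, for example: Query FAQ | StarRocks https://docs.starrocks.io/zh/docs/faq/Sql_faq/#%E5%A4%A7%E8%A1%A8%E6%9F%A5%E8%AF%A2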
Asynchronous materialized views | StarRocks https://docs.starrocks.io/zh/docs/using_starrocks/Materialized_view/ Synchronous materialized views | StarRocks https://docs.starrocks.io/zh/docs/using_sta
In modern distributed databases, the common data distribution methods are Round-Robin, Range, List, and Hash. Data distribution | StarRocks https://docs.starrocks.io/zh/docs/table_design/Data_distribution/ StarR
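The four distribution methods named in the entry above can be illustrated with a minimal Python sketch. The node count, range boundaries, region-to-node map, and function names are all hypothetical and chosen for illustration only; this is not a description of how StarRocks implements partitioning or bucketing.

```python
from bisect import bisect_right
from zlib import crc32

NUM_NODES = 3  # hypothetical cluster size

# Hypothetical range boundaries:
# keys < 100 -> node 0, 100..199 -> node 1, keys >= 200 -> node 2.
RANGE_BOUNDS = [100, 200]

# Hypothetical explicit value-to-node mapping used for List distribution.
LIST_MAP = {"cn": 0, "us": 1, "eu": 2}


def round_robin(row_index: int) -> int:
    """Round-Robin: rows are dealt to nodes in turn, ignoring their content."""
    return row_index % NUM_NODES


def by_range(key: int) -> int:
    """Range: each node owns one contiguous interval of key values."""
    return bisect_right(RANGE_BOUNDS, key)


def by_list(value: str) -> int:
    """List: each node owns an explicitly enumerated set of values."""
    return LIST_MAP[value]


def by_hash(key: str) -> int:
    """Hash: a deterministic hash of the key picks the node."""
    return crc32(key.encode("utf-8")) % NUM_NODES
```

Round-Robin spreads rows evenly but cannot route a lookup by key, whereas Range, List, and Hash all let a reader compute the owning node from the key alone.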
Summary: class T: def __init__(self): self.lst = [] # biz for i in range(6): self.lst.append(i) def f(self, i): print(i) if i in self.lst: self.lst.remove(i) d
Leaderboard System Design-01-Introduction to 6 Major Ranking Algorithms | Echo Blog https://houbb.github.io/2021/08/02/rank-system-01-overview
User cohorts | ThinkingData https://docs.thinkingdata.cn/ta-manual/v4.1/user_guide/user/cohort/cohort.html User tags | ThinkingData https://docs.thinkingdata.cn/ta
Bilibili's practice of large-scale user behavior analysis based on ClickHouse - Bilibili https://www.bilibili.com/read/cv21106590/
Applications of RoaringBitMap in Bilibili's big data platform https://mp.weixin.qq.com/s/hZsZoaMfEo3G51OLv2keyQ
Summary: Standard Library Types - Pydantic https://docs.pydantic.dev/2.7/api/standard_library_types/#typeddict from pydantic import BaseModel from typing import
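ClickHouse in a big data analytics platform: application to retention analysis_Big Data_Tencent Cloud Big Data_InfoQ Writing Community https://xie.infoq.cn/article/c7af40e5ba5f5f5beaccde990 ClickHouse in practice: retention, path, funnel, and session analysis - Tencent Cloud Developer Community - Tencent Cloud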
Summary: UDFs User Defined Functions | ClickHouse Docs https://clickhouse.com/docs/en/sql-reference/functions/udf def gen_if_sql(range_list, field='cnt', group
https://clickhouse.com/docs/zh/sql-reference/aggregate-functions/parametric-functions
Summary: Aggregate function combinators | ClickHouse Docs https://clickhouse.com/docs/zh/sql-reference/aggregate-functions/combinators Example: consider a people table with the following data: ┌─name───┬─age─┬─wa
Summary: https://clickhouse.com/docs/en/sql-reference/aggregate-functions/reference/varpop quantile | ClickHouse Docs https://clickhouse.com/docs/zh/sql-refere
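Summary: Refactoring the MQ processing architecture: applying MVEL expressions and the Chain of Responsibility design pattern in practice https://mp.weixin.qq.com/s/_UZhfi1BiGNHQAHWhGus8Q 3. The Chain of Responsibility design pattern [3.1 Definition] The Chain of Responsibility pattern is a behavioral design pattern that allows you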
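As a general illustration of the pattern named in the entry above, here is a minimal Python sketch in which a message travels along a chain of handlers until one accepts it. It is not the cited article's code: the `Handler` class, the message shape, and the message types are hypothetical, and the plain Python predicates merely stand in for whatever routing rules (such as MVEL expressions) a real system would use.

```python
from typing import Callable, Optional


class Handler:
    """One link in the chain: handle the message or pass it to the successor."""

    def __init__(
        self,
        can_handle: Callable[[dict], bool],
        action: Callable[[dict], str],
        successor: Optional["Handler"] = None,
    ) -> None:
        self.can_handle = can_handle
        self.action = action
        self.successor = successor

    def handle(self, message: dict) -> Optional[str]:
        if self.can_handle(message):
            return self.action(message)
        if self.successor is not None:
            return self.successor.handle(message)
        return None  # no handler in the chain accepted the message


# Hypothetical message types; the chain is order -> refund.
refund = Handler(
    lambda m: m.get("type") == "refund",
    lambda m: f"refund {m['id']} processed",
)
order = Handler(
    lambda m: m.get("type") == "order",
    lambda m: f"order {m['id']} processed",
    successor=refund,
)
```

Callers only ever talk to the head of the chain (`order.handle(...)`), so a new message type can be supported by appending a handler without touching the existing ones.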
Summary: l=[{'a':45},{"b":34}] c=0 for i in l: i['a']=c*100 c+=1
Summary: class Test: def __init__(self, totals, uniques, pre): self.totals = totals self.uniques = uniques self.cumulative_sum = totals if pre: self.cumulative
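Summary: Optimizing MySQL paginated queries in distributed task scheduling https://mp.weixin.qq.com/s/VhSzxYIRv83T3D3JD4cORg 3. Optimization plan 3.1 Choosing the optimization plan: the current SQL execution plan traverses the primary key sequentially, which is a range scan, somewhat like going door to door by house number through a very large residential area looking for a few specific people,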
Understanding the seven basic GC (garbage collection) algorithms in one article https://mp.weixin.qq.com/s/RSWYSAzpvPJD056vZDQPig
Summary: Functions for Searching in Strings | ClickHouse Docs https://clickhouse.com/docs/en/sql-reference/functions/string-search-functions Hyperscan is gener
Regex matching: Functions for Searching in Strings | ClickHouse Docs https://clickhouse.com/docs/en/sql-reference/functions/string-search-functions If you only wan
ClickHouse date and time functions in practice (detailed)_clickhouse time functions-CSDN Blog https://blog.csdn.net/u010180815/article/details/105250864 Date and time functions | ClickHouse Docs https://clickhouse.
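Funnel analysis | ThinkingData https://docs.thinkingdata.cn/ta-manual/v4.3/user_guide/analysis/funnel.html Associated property: for example, if you want to measure users' [product impression > click > purchase] funnel, you can specify the product ID as the associated property, ensuring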
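Advanced ClickHouse | How to build your own enterprise-grade, high-performance gateway component? https://mp.weixin.qq.com/s/UFTtlD2KQH9e_Y91sEF5xA A detailed introduction to ClickHouse query optimization https://mp.weixin.qq.com/s/38RMVbw25P3iuE4IIuxdog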
Summary: MySQL LEFT JOIN Keyword https://www.w3schools.com/mysql/mysql_join_left.asp MySQL LEFT JOIN Keyword The LEFT JOIN keyword returns all records from the
StarRocks | A High-Performance Analytical Database https://www.starrocks.io/ Extremely fast query performance in all scenarios Whether you're working
