随笔分类 - 大数据
摘要:Docker is an open platform for developing, shipping, and running applications. Docker enables you to separate your applications from your infrastructu
阅读全文
摘要:how to network your containers. Launch a container on the default network Docker includes support for networking containers through the use of network
阅读全文
摘要:Make sure you have published the friendlyhello image you created by pushing it to a registry. We’ll be using that shared image here. Be sure your imag
阅读全文
摘要:Make sure you have published the friendlyhello image you created by pushing it to a registry. We’ll use that shared image here. Be sure your image wor
阅读全文
摘要:Make sure you have published the friendlyhello image you created by pushing it to a registry. We’ll be using that shared image here. Be sure your imag
阅读全文
摘要:Docker学习笔记 — Swarm搭建Docker集群 Swarm在schedule节点运行容器的时候,会根据指定的策略来计算最适合运行容器的节点,目前支持的策略有:spread, binpack, random. Random顾名思义,就是随机选择一个Node来运行容器,一般用作调试用, spr
阅读全文
摘要:Make sure you have published the friendlyhello image you created by pushing it to a registry. We’ll use that shared image here. Be sure your image wor
阅读全文
摘要:Document An image is a lightweight, stand-alone, executable package that includes everything needed to run a piece of software, including the code, a
阅读全文
摘要:Big Data Era: 1.More and more data becoming available on Hadoop2.Limitations in existing Business Intelligence (BI) Tools Limited support for Hadoo
阅读全文
摘要:Kylin is an open source Distributed Analytics Engine from eBay Inc。that provides SQL interface and multi-dimensional analysis (OLAP) on Hadoop support
阅读全文
摘要:CDH:Cloudera‘s Distribution,including Apache Hadoop. Hadoop众多分支中的一种,可直接用于成产环境 CM:Cloudera Manager
阅读全文
摘要:核心思想:预计算。 对多维分析可能用到的度量进行预计算,将计算好的结果保存成Cube,并存在HBase中,供查询时直接访问 将高复杂度的聚合运算、多表连接……操作转换成对预计算结果的查询。决定了Kylin拥有很好的快速查询、高并发能力 理论基础:空间换时间 Cuboid:Kylin中将维度任意组合成
阅读全文
摘要:RDBMS: 关系数据库管理系统(Relational Database Management System),是将数据组织为相关的行和列的系统,而管理关系数据库的计算机软件就是关系数据库管理系统, 常用的数据库软件有Oracle、SQL Server等。 机分析处理OLAP: 是一种软件技术,它使
阅读全文
摘要:Hadoop核心 1.HDFS:高容错性、高伸缩性……,允许用户将Hadoop部署在廉价的硬件上,构建分布式系统 2.MapReduce:分布式计算框架,允许用户在不了解分布式系统底层细节的情况下,开发并行、分步的应用程序。充分利用大规模的计算资源,解决传统高性能单机无法解决的大数据用户 应用:分析
阅读全文