Summary: Motivation: finding "similar" sets in high-dimensional space. Definition: Distance measures: aim to find "near neighbors" in high-dimensional space. We ...
posted @ 2019-11-11 17:18 FrancisForeverhappy
Summary: 1. Standard architecture for big data computation: a cluster of commodity Linux nodes, connected by a commodity network (Ethernet). 2. ...
posted @ 2019-11-05 01:45 FrancisForeverhappy
Summary: 1. Characteristics of Big Data: 4V. Volume: from terabytes to exabytes to zettabytes of existing data to process. Velocity: batch data, real-time data, st ...
posted @ 2019-11-03 23:55 FrancisForeverhappy