上一页 1 ··· 43 44 45 46 47 48 49 50 51 ··· 53 下一页

大数据索引技术 - B+ tree vs LSM tree

摘要: MySQL索引背后的数据结构及算法原理, http://www.codinglabs.org/html/theory-of-mysql-index.html HBase Architecture, http://duanple.blog.163.com/blog/static/70971767201191661620641/ 数据库如何抵抗随机IO:问题、方法与现实, http://wangy... 阅读全文
posted @ 2012-06-09 17:47 fxjwind 阅读(9826) 评论(5) 推荐(8) 编辑

Data replication 同步技术

摘要: http://horicky.blogspot.com/2009/11/nosql-patterns.html, NOSQL Patterns There are 2 ways how the master propagate updates to the slave; State transfer and Operation transfer. In State tran... 阅读全文
posted @ 2012-06-08 15:11 fxjwind 阅读(739) 评论(0) 推荐(0) 编辑

Why Vector Clock are Easy or Hard?

摘要: 通过实际例子来阐述vector clock其实是容易理解的, easy 同样通过实际例子来描述在使用vector clock时会遇到哪些难以解决的问题, hard Why Vector Clocks are Easy http://basho.com/blog/technical/2010/01/29/why-vector-clocks-are-easy/ Vector Clocks... 阅读全文
posted @ 2012-06-06 11:31 fxjwind 阅读(935) 评论(0) 推荐(0) 编辑

FP基础

摘要: 由Mapreduce想到Function programming, 于是看到programming paradigms, 下面的链接进行了比较, 有兴趣可以认真研究一下... http://en.wikipedia.org/wiki/Comparison_of_programming_paradigms Imperative programming – defines computati... 阅读全文
posted @ 2012-05-11 15:40 fxjwind 阅读(837) 评论(0) 推荐(0) 编辑

Data Mining with R

摘要: Ubuntu下安装R sudo apt-get install r-base r-base-devCRAN (Comprehensive R Archive Network), R的插件管理和releaseInstall add-on packages available for R at CRAN比如安装一个RMySQL> options(CRAN=’http://cran.r-projec... 阅读全文
posted @ 2012-05-04 14:17 fxjwind 阅读(620) 评论(0) 推荐(0) 编辑

SentiAnalysis

摘要: “Opinion Lexicon” Based 1. Refer to the “Mining Twitter for Airline Consumer Sentiment”(1) Loading twitter data into R with twitterR package Loading Hu and Liu’s “opinion lexicon”(2), nearly 6,800 ... 阅读全文
posted @ 2012-03-31 16:09 fxjwind 阅读(895) 评论(0) 推荐(0) 编辑

Distributed Message System

摘要: http://dongxicheng.org/search-engine/log-systems/ 包括facebook的scribe,apache的chukwa,linkedin的kafka和cloudera的flume Kafka http://www.cnblogs.com/fxjwind/archive/2013/03/22/2975573.html http://www.c... 阅读全文
posted @ 2012-03-17 17:49 fxjwind 阅读(642) 评论(0) 推荐(0) 编辑

Regression, 回归问题

摘要: Refer to http://www.r-tutor.com/elementary-statistics/simple-linear-regression simple linear regression A simple linear regression model that describes the relationship between two variables x a... 阅读全文
posted @ 2012-03-02 14:13 fxjwind 阅读(907) 评论(0) 推荐(0) 编辑

Hypothesis Testing

摘要: Refer to R Tutorial andExercise SolutionResearchers retain or reject hypothesis based on measurements of observed samples. The decision is often based on a statistical mechanism called hypothesis testing.假设检验是数理统计学中根据一定假设条件由样本推断总体的一种方法。具体作法是:根据问题的需要对所研究的总体作某种假设,记作H0;选取合适的统计量,这个统计量的选取要使得在假设H0成立时,其分布为 阅读全文
posted @ 2012-02-28 17:19 fxjwind 阅读(1047) 评论(0) 推荐(0) 编辑

CAP – Consistency, Availability, Partition Tolerance

摘要: ---------------------------------------------------------------------------------------------------------------------- http://blog.nahurst.com/visual-guide-to-nosql-systems 相当不错的ppt: http://www.slid... 阅读全文
posted @ 2012-02-24 17:41 fxjwind 阅读(1333) 评论(0) 推荐(1) 编辑
上一页 1 ··· 43 44 45 46 47 48 49 50 51 ··· 53 下一页