摘要:
Basic Solution The simplest way is to build a web crawler that runs on a single machine with single thread. So, a basic web crawler should be like thi 阅读全文
摘要:
Overview Apache ZooKeeper is an effort to develop and maintain an open-source server which enables highly reliable distributed coordination. zookeeper 阅读全文
摘要:
1.Multi-thread Two ways to create thread: extends from thread class, or implement runnable interface (prefer). Yield() and sleeping(): yield changes t 阅读全文
摘要:
Overview 讨论一些常见大数据框架的容错机制 Fault Tolerance in Hadoop MapReduce Heartbeat心跳机制:如果在一定时间内没有收到心跳,则reschedule all pending and in progress tasks to another Ta 阅读全文