2012年7月9日
摘要: Data conversion – the first step towards data processing Convert all string to integers: ranging from 0 to n.Agecontinuous.WorkclassPrivate, Self-emp-not-inc, Self-emp-inc, Federal-gov, Local-gov, State-gov, Without-pay, Never-worked.Fnlwgtcontinuous.EducationBachelors, Some-college, 11th, HS-grad, 阅读全文
posted @ 2012-07-09 20:38 Jiang, X. 阅读(458) 评论(0) 推荐(0) 编辑
摘要: 模式识别网络资源链接一、模式识别相关网址及其论坛网址1、中国模式识别与机器学习论坛http://bbs.pr-ml.cn (推荐)2、振动论坛——人工智能与模式识别http://www.chinavib.com/forum/forum-108-1.html3、研学论坛——人工智能与模式识别http://219.232.49.40/index.php4、中国图像网——模式识别http://www.china-image.cn/mssb/index.aspx5、中国人工智能网 >> 人工智能、模式识别、图像处理 http://www.chinaai.org/6、模式识别国家重点实验室h 阅读全文
posted @ 2012-07-09 16:54 Jiang, X. 阅读(241) 评论(0) 推荐(0) 编辑
摘要: DescriptionIn computer science and data mining, Apriori[1] is a classic algorithm for learning association rules. Apriori is designed to operate on databases containing transactions (for example, collections of items bought by customers, or details of a website frequentation). Other algorithms are d 阅读全文
posted @ 2012-07-09 15:55 Jiang, X. 阅读(430) 评论(0) 推荐(0) 编辑
摘要: Definitions:•Set of items: I={I1,I2,…,Im}•Transactions: D={t1,t2, …, tn}, tj∈I•Itemset: {Ii1,Ii2, …, Iik} ∈I•Support of an itemset: Percentage of transactions which contain that itemset.•Large (Frequent) itemset: Itemset whose number of occurrences is above a threshold.•Association Rule (AR): implic 阅读全文
posted @ 2012-07-09 15:51 Jiang, X. 阅读(199) 评论(0) 推荐(0) 编辑
摘要: A set has closure under an operation if performance of that operation on members of the set always produces a member of the same set. For example, the real numbers are closed under subtraction, but the natural numbers are not: 3 and 8 are both natural numbers, but the result of 3 − 8 is not a natura 阅读全文
posted @ 2012-07-09 13:11 Jiang, X. 阅读(312) 评论(0) 推荐(0) 编辑
摘要: Distribution-based methodsDistance-based methodsDensity-based methodsClustering-based methods 阅读全文
posted @ 2012-07-09 09:27 Jiang, X. 阅读(134) 评论(0) 推荐(0) 编辑
摘要: Outlier mining - A data mining task aiming to find a specific number of objects that are considerably dissimilar, exceptional and inconsistent with respect to the majority records in the input databases.Subspace - A combination of features of attributes of a database.Outlying subspaces -An outlying 阅读全文
posted @ 2012-07-09 09:16 Jiang, X. 阅读(156) 评论(0) 推荐(0) 编辑