m-estimate of probability

In practice, we estimate a conditional probability as P(A|B) = n/N, where n is the number of times A and B occur together and N is the number of times B occurs in the training data.

But what if n is very small, or even 0? Or very large, even equal to N? The raw ratio then gives an extreme estimate of 0 or 1 from only a few samples. So the estimated probabilities should be smoothed.

To avoid this, we fix the following numbers p and m beforehand:

A nonzero prior estimate p for P(A|B);

A number m that says how confident we are of our prior estimate p, as measured in number of samples.

So P(A|B) is estimated by (n + m*p)/(N + m).

Just think of this as adding m virtual samples, a fraction p of which have A, to start the whole process.
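A minimal sketch of the formula in Python may help here. The function name m_estimate and the example numbers are made up for illustration; only the formula (n + m*p)/(N + m) comes from the text above.

```python
def m_estimate(n, N, p, m):
    """m-estimate of P(A|B): (n + m*p) / (N + m).

    n: number of times A and B occur together
    N: number of times B occurs
    p: prior estimate of P(A|B)
    m: weight of the prior, measured in number of (virtual) samples
    """
    if N + m <= 0:
        raise ValueError("N + m must be positive")
    return (n + m * p) / (N + m)


# Illustrative numbers only: A never occurs with B in 10 samples.
raw = 0 / 10                               # plain n/N gives exactly 0
smoothed = m_estimate(0, 10, p=0.5, m=2)   # pulled slightly toward the prior
print(raw, smoothed)
```

With m = 0 the function falls back to n/N, and as m grows the estimate moves toward p, which matches the reading of m as a number of added samples.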

If we don't have any knowledge of p, assume the attribute is uniformly distributed over all its possible values. If the attribute has k possible values, then p = 1/k.
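As a rough sketch of the uniform-prior case, suppose the attribute has k possible values, so the prior for each value is 1/k. The names k, counts, and m_estimate_uniform, and the color counts, are assumptions for illustration. Choosing m = k turns the formula into (n + 1)/(N + k), which is the usual add-one (Laplace) smoothing.

```python
def m_estimate_uniform(n, N, k, m):
    """m-estimate with a uniform prior p = 1/k over k possible values."""
    p = 1.0 / k
    return (n + m * p) / (N + m)


# Hypothetical counts of a 3-valued attribute among the samples where B holds.
counts = {"red": 0, "green": 3, "blue": 7}
N = sum(counts.values())   # 10 samples in total
k = len(counts)            # 3 possible values

# With m = k, each estimate equals (n + 1) / (N + k).
estimates = {v: m_estimate_uniform(n, N, k, m=k) for v, n in counts.items()}
for value, prob in estimates.items():
    print(value, prob)
```

The value that was never observed ("red") now gets a small nonzero probability, and the estimates over all k values still sum to 1.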

posted @ 2013-12-28 11:38 joythink89
