Python中实现count(distinct )

假设一个表有6个字段c1,c2,c3,c4,c5,c6,有如下的sql语句:

select c1,count(distinct(c6)) from tbl where c3>1 group by c3;

Python中asq模块可以实现类似的查询,以下用一个示例说明。

 

#!/usr/bin/env python2.7  
#-*- encoding: utf-8 -*-  
  
import os  
from datetime import *  
import time  
from asq.initiators import query  
  
def test1():  
        src_list = [[6,6,6,3,7,9],[3,1,1,45,6,34],[2,9,5,435,56,34],[8,6,8,23,4,32],[2,6,8,435,2,8],[1,3,6,432,5,22],[1,3,5,6,2,435],[2,4,6,34,2,12]]  
  
        res_list = query(src_list).where(lambda p: p[2]>1).group_by(lambda p:p[2]).select(lambda p:[p[0][0],p.distinct(lambda p: p[5]).count()]).to_list()  
  
        print res_list  
  
if __name__ == '__main__':  
  
     test1()  

 

 

 

 

运行结果:

 

[[6, 3], [2, 2], [8, 2]]
[Finished in 0.1s]

 

asq函数的逻辑顺序依次是: where-->group by --> select --> distinct -->返回结果列表。

posted @ 2017-02-21 19:23  最最么么哒  阅读(13345)  评论(0编辑  收藏  举报