社会科学问题研究的计算实践——10、表达：方式与挑战（计算实践：孔多塞模型下的表决综合）

学习资源来自，一个哲学学生的计算机作业 (karenlyu21.github.io)

1、背景问题

在一个共同体里，人们观点相异极为常见。在异见的基础上决议或得出“集体意见”，表决是一种重要方式。

1.1、孔多塞原则与孔多塞悖论

表决有几种经典设计。一个代表是孔多塞原则：

m（奇数）个投票人，n≥2个候选项；
每人给出对候选项的个人偏好全排序，作为自己的投票；
两两检查候选项，比较哪一候选项受到更多人偏好（少数服从多数的原则）；
给出候选项的全排序，作为集体结果。

这一过程可以用有向图来表示，下图是依据孔多塞原则表决的一个例子：

在集体全排序中，边a→b意味着至少(m+1)/2个个体偏好中有a→b。如果投票人为奇数，任意两个节点之间有且仅有一条有向边。

这一表决原则的问题在于，可能会出现孔多塞悖论。假设三个人甲、乙、丙要对三个候选项A、B、C投票，

甲：A≻B≻C，
乙：B≻C≻A，
丙：C≻A≻B。

最终的结果：A≻B,B≻C,C≻A，无法形成全排序，也说不清楚谁最被偏爱。这一悖论表明，即使个体理性、制度合理，也不一定能导致群体理性。

1.2、孔多塞悖论的解决

以下两种方法可以用来解决孔多塞悖论，保证找出胜者：

议程设置：按一定的议程顺序，每一轮分别对两个候选人投票，胜者进入下一轮。但是，如果存在孔多塞悖论，不同的议程设置会导致不同的胜者，但议程——这一无关变量——本不应该影响表决结果。
波达计数法：把每个人的全排序转换成给候选人的分数，排名越高的候选人分数越高。依此计算每一候选人的积分。但是，如果有的投票人使坏、采取策略性投票，仍然有可能出现有违公平的表决结果。例如，第二受欢迎的候选人B的投票者，为了把本来最受欢迎的A搞下去，可能把A排在最末；如果他们秉公投票，A在他们身上获得的分数并不应该如此低。一般性地讲，依照积分制比较A和B（A≻B⟺∑(a_i−b_i)），可能与“少数服从多数”的原则（A≻B⟺∑sig(a_i−b_i)）矛盾。

完美的表决方式似乎很难找到。阿罗在1950年提出了阿罗不可能定理：不存在同时满足以下三条公理的表决聚合规则，

非独裁：结果序不能等于某个参与人的个人序
一致性认同：如果大家都认为A>B，结果中就应该是A>B
独立于无关项：只要每个个体序中A和B的关系没变，结果中就不应该变

也就是说，任何一种表决规则至少违反以上三条公理之一。那么，是否存在对投票人的个人序做某种合理要求，能够克服阿罗不可能定理指出的困难呢？课堂上引入了一个新的标准——属性序——用以判断投票者是否足够理性，不理性的投票者的意见不予考虑。只考虑理性投票者似乎确实合理，这一方法也成功解决了孔多塞悖论。

在现实生活中，事物的性质往往能够按顺序排列，例如，候选项是建筑预算时，我们可以按照数额从大到小排列。对于这类事物，理性的投票行为应当符合单峰性质。例如，如果一个人对高、中、低三种建筑预算的排序是高预算低预算中预算高预算≻低预算≻中预算，这就违背了单峰性质，直观上也是不理性的选择：如果他最偏好高预算，得不到高预算时的次佳选项，应当是更接近首选的候选项，即中预算。

单峰偏好的定义如下：

假设候选项有一个属性序，A₁≻A₂≻…≻A_k≻…≻A_n。
给出的表决排序A_k≻…（即A_k排第一）满足单峰性质（单峰序），则对于所有i<j<k和所有i>j>k，A_i排在A_j的后面。

单峰性质可能呈现为以下三种形态。

删去不理性的个体后，我们可以用中位项定理得出孔多塞全排序结果：给定满足单峰性质的m个排序R₁,R₂,…,R_m（m是奇数）。若将所有人的全排序里的第一名R_i(1)按属性序R₀排列，得到S，则其中间项（A_k）是孔多塞原则下的“胜者”。

这样操作的凭依就是排序的单峰性。设S为…,A_t,…,A_k,…,A_s,…（中间项为A_k）。

对于投票人i，ta最偏好的候选项是A_k或者任意一个属性序低于A_k的候选项A_t。由于个人全排序符合单峰性，ta一定觉得A_k比任何一个属性序高于A_k的候选项A_s好。由于A_k是中间项，投票人总数是奇数，这样想的人超过总数的一半。因此，超过一半的人认为A_k≻A_s。
同理，超过一半人偏好A_k或者偏好一个属性序高于A_k的候选项A_s，他们一定认为A_k比任何一个属性序低于A_k的候选项A_t好。

根据“少数服从多数”原则，中位项定理找出的A_k一定是孔多塞胜者。

这样操作第一次，设孔多塞胜者为A_k1，它就是第一名。删去每个人全排序中的A_k1，单峰性质不变。用同样的方法，找出剩下的候选项中的孔多塞胜者A_k2，它就是第二名。迭代操作，最终获得整个集体排序。

2、计算实践：孔多塞模型下的表决综合

2.1、作业描述与算法思路

本次作业就是模拟表决过程，给出表决结果。如果投票人的个人排序中不存在孔多塞悖论，就按照孔多塞原则确定群体排序；如果存在，就删去非理性投票者，按照中位项定理一个个确定胜者。

设有m（奇数）个人参与对n个项目（p₁,p₂,…,p_n）的投票，每人提交一个对项目偏好的排序R₁,R₂,…,R_m。你要做的事情是基于大家表达的偏好，尝试给出一个那些项目的群体排序R，满足：若p_i在p_j前面，必须是m个人中的多数认为p_i应该在p_j前面。我们知道，因为孔多塞悖论现象，基于个体偏好全序和多数原则综合的评判有可能形成不了一个体现群体意见的全序。你的任务是：

如果R₁,R₂,…,R_m之中没有隐含孔多塞悖论，则按照孔多塞原则给出体现群体意见的全序R。
如果有悖论：
- 假设（p₁,p₂,…,p_n）是人们关心的n个项目的一种属性序，判断R₁,R₂,…,R_m之中有哪些不满足单峰偏好性质，认为它们不是一个“理性投票”，将它们剔除。
- 剩下的k个若不是奇数，则以（p₁,p₂,…,p_n）为附加个体序，凑成k+1（奇数）。
- 按中位项定理的操作方法给出全序R。

这一算法有三个重点：

孔多塞悖论的判定标准：关键在于结果序中是否存在有向环。如果存在有向环，这一表决就有孔多塞悖论。根据第九讲里有向图的性质，如果一个有穷有向图中不存在有向环，则该图中既存在出度为0的节点，也存在入度为0的节点。尝试不断删除入度为0的节点。当且仅当找不到入度为0的节点、但还没有删除完所有节点，该图就存在有向环。
孔多塞排序：
- 李老师提供的算法：和上述操作类似，在孔多塞有向图结果中不断删除入度为0的节点（入度为0意味着比所有其他候选项都更受偏好），如果一直可以删除到最后，则删除的顺序就是集体排序。
- 我的算法：冒泡排序。一开始设定第一个候选项是第一名，排序列表里只有一个元素。依次考虑剩下的候选项。对于未加入排名的候选项i，从队末开始，比较i和序列中的元素，直到找到比i更受偏爱的元素j，把i插入到j后面。
检查个体偏好序是否符合单峰性质：找出投票人最为偏好的候选项A_k。对于属性序低于A_k所有的候选项，偏好序应当关于属性序单减；对于属性序高于A_k所有的候选项，偏好序应当关于属性序单增。

2.2、编程实现与要点说明

首先，打开用户指定的数据文件，读取所有投票者的投票结果，存储在一个numpy 2d-array votes里。

# 打开文件读取排序/投票结果
def votes_reader(filename):
    f = open(filename, 'r')
    votes_lines = f.readlines()
    votes = []
    for line in votes_lines:
        line = line.strip()
        votes_individual = line.split()
        votes_indv_new = []
        for vote in votes_individual:
            vote = int(vote)
            votes_indv_new.append(vote)
        votes.append(votes_indv_new)
    votes = np.array(votes)
    return votes


cnt = 0
while True:
    cnt += 1
    if cnt > 3:
        print('使用默认文件b2。')
        votes = votes_reader('./input/b2.txt')
    try:
        filename = input('输入文件名（如b2.txt)：')
        pathname = os.path.join('./input', filename)
        votes = votes_reader(pathname)
    except:
        print('输入错误。', end='')
        continue
    break

print('投票结果：')
print(votes)

一个数据文件b4.txt的例子如下：

9 8 7 6 5 4 3 2 1 0
2 3 1 4 5 6 0 7 8 9
7 6 5 8 4 3 2 1 0 9
3 4 2 5 6 1 7 0 8 9
6 7 5 4 3 2 1 8 0 9
2 1 3 0 4 5 6 7 8 9
0 1 2 3 4 5 6 7 8 9
3 4 5 2 1 6 7 0 8 9
3 0 9 7 6 8 1 5 4 2
9 7 5 8 1 3 2 0 4 6
3 6 1 7 2 9 8 0 4 5
2 9 0 6 8 3 1 5 7 4
8 7 1 2 3 4 9 6 5 0

Initiating...
输入文件名（如b2.txt)：b4.txt
投票结果：
[[9 8 7 6 5 4 3 2 1 0]
 [2 3 1 4 5 6 0 7 8 9]
 [7 6 5 8 4 3 2 1 0 9]
 [3 4 2 5 6 1 7 0 8 9]
 [6 7 5 4 3 2 1 8 0 9]
 [2 1 3 0 4 5 6 7 8 9]
 [0 1 2 3 4 5 6 7 8 9]
 [3 4 5 2 1 6 7 0 8 9]
 [3 0 9 7 6 8 1 5 4 2]
 [9 7 5 8 1 3 2 0 4 6]
 [3 6 1 7 2 9 8 0 4 5]
 [2 9 0 6 8 3 1 5 7 4]
 [8 7 1 2 3 4 9 6 5 0]]

定义一个函数compare，用以比较两个候选项中哪个候选项得到了多数人的支持。advantage大于0，i比j得到了更多人支持。

# compare votes on two candidates
def compare(i, j, votes):
    n = len(votes)  # number of voters
    advantage = 0  # accumulative advantage of i over j according to all voters
    i_indices = np.where(votes == i)
    j_indices = np.where(votes == j)
    for k in range(n):
        i_index = i_indices[1][k]
        j_index = j_indices[1][k]
        if i_index < j_index:  # i ranks higher than j, according to voter k
            advantage += 1
        else:
            advantage -= 1
    return advantage

定义函数condorcet，根据孔多塞定理，得到集体序order。

# Condorcet ranking (a different algorithm from the one taught in class)
def condorcet(votes):
    n = len(votes)  # number of voters
    assert n % 2 == 1  # make sure there are an odd number of voters
    m = len(votes[0])  # number of candidates
    order = [0]

我这里使用的是冒泡排序法。对于尚未加入集体序的元素i，把它同该序列中的元素一一比较大小，从最后一名开始比。直到找到比i更受欢迎的元素j，把i插在j后面。

    # bubble sorting
    for i in range(1, m):
        for j_index in range(len(order)):
            flag = False
            j = order[len(order) - j_index - 1]
            advantage = compare(i, j, votes)
            if advantage < 0:
                order.insert(j_index + 1, i)
                flag = True
                break

如果没有找到比i更受欢迎的元素j，这意味着i是目前为止得票最多的候选项，把它放在第一位。

        if flag == False:
            order.insert(0, i)  # All existing candidates are less favored than the new one
    return order

定义函数condorcet_paradox_check，检查投票结果是否隐含孔多塞悖论。这一函数的内部结构略微有些复杂，我定义了两个局部函数：preference_matrix_generator、most_preferred_deleter。在函数condorcet_paradox_check内部的主程序中，我先调用preference_matrix_generator，生成孔多塞有向图，在对这一有向图迭代调用most_preferred_deleter，不断删除入度为0的节点，来判断孔多塞悖论是否存在。

# the main act: check if there is Condorcet paradox
def condorcet_paradox_check(votes):
    m = len(votes[0])
    n = len(votes)

函数preference_matrix_generator根据投票结果votes生成孔多塞排序有向图preference_matrix，这一过程主要依靠迭代调用compare函数完成。preference_matrix[i][j] = 1表示i→j，preference_matrix[i][j] = -1表示j→i。

    # generate a matrix which denotes which of two candidates is more preferred
    def preference_matrix_generator(votes):
        preference_matrix = np.zeros((m, m))
        assert n % 2 == 1
        for i in range(m - 1):
            for j in range(i + 1, m):
                advantage = compare(i, j, votes)
                if advantage > 0:  # i is more preferred than j
                    preference_matrix[i][j] = 1
                    preference_matrix[j][i] = -1
                else:
                    preference_matrix[i][j] = -1
                    preference_matrix[j][i] = 1
        return preference_matrix

    preference_matrix = preference_matrix_generator(votes)
    print('孔多塞排序有向图（初始）：')
    print(preference_matrix)

孔多塞排序有向图（初始）：
[[ 0. -1. -1. -1. -1. -1. -1. -1.  1.  1.]
 [ 1.  0. -1. -1.  1.  1. -1.  1.  1.  1.]
 [ 1.  1.  0. -1.  1.  1.  1. -1.  1.  1.]
 [ 1.  1.  1.  0.  1.  1.  1.  1.  1.  1.]
 [ 1. -1. -1. -1.  0.  1.  1. -1. -1.  1.]
 [ 1. -1. -1. -1. -1.  0. -1. -1.  1.  1.]
 [ 1.  1. -1. -1. -1.  1.  0.  1.  1.  1.]
 [ 1. -1.  1. -1.  1.  1. -1.  0.  1.  1.]
 [-1. -1. -1. -1.  1. -1. -1. -1.  0.  1.]
 [-1. -1. -1. -1. -1. -1. -1. -1. -1.  0.]]

接下来的任务是判断这一有向图是否存在有向环。函数most_preferred_deleter用以检查有向图中是否存在入度为0的点，如果存在，就删除它。返回该有向图是否存在入度为0的点的bool值deleter_bool，和删除入度为0的点后的有向图preference_matrix。

    # delete the nodes whose entry degree is 0
    def most_preferred_deleter(preference_matrix):
        deleter_bool = False  # whether there is indeed a node to be deleted
        for i in range(len(preference_matrix)):
            preference_individual = preference_matrix[i]
            flag = True
            for item in preference_individual:
                if item < 0:
                    flag = False
                    break
            if flag == True:
                deleter_bool = True
                preference_matrix = np.delete(preference_matrix, i, 0)
                preference_matrix = np.delete(preference_matrix, i, 1)
                break  # There's no possibility that a given matrix has two nodes whose entry degrees are 0 at the same time
        return preference_matrix, deleter_bool

循环调用most_preferred_deleter函数，直到无节点可删。此时，如果所有节点都删除掉了，有向图中不存在环，孔多塞悖论不存在；反之则存在孔多塞悖论。

    while True:
        preference_matrix, deleter_bool = most_preferred_deleter(preference_matrix)
        # print(preference_matrix)
        if deleter_bool == False:
            break  # break when there is no node to be deleted
    if len(preference_matrix) == 0:
        paradox_bool = False
        print('不存在孔多塞悖论。')
    else:
        paradox_bool = True
        print('存在孔多塞悖论。')
    return paradox_bool

如果孔多塞悖论存在，我们需要检查个体序是否符合单峰性质，删除非理性的投票者。我的程序中完成这一任务的是函数irrational_voters_deleter。我定义了两个局部函数monotonic_check和single_peak_check，后者需要调用前者。在irrational_voters_deleter的主程序中，我一一检查每个节点，调用single_peak_check，检查它们是否具有单峰性，删除不具有单峰性的节点。

# 删除违反单峰性质的voters
def irrational_voters_deleter(votes):

先定义一个函数monotonic_check，检查字典的值value是否随着键key单调变化。输入：

order_dict变量：值是某一候选项的属性位次，键是它的个体偏好位次；
right_or_left（字符串）：order_dict里的候选项在单峰的左侧还是右侧。

单峰左侧的候选项，偏好位次随着属性位次单减，返回mononotic_bool=True，否则返回mononotic_bool=False；单峰右侧的候选项，偏好位次随着属性位次单增，返回mononotic_bool=True，否则返回mononotic_bool=False。

    # check whether values vary monotonically with keys
    def monotonic_check(order_dict, right_or_left):
        # order_dict -- candidate : order
        monotonic_bool = True

首先把字典的键和值分别按照从小到大排序，得到keys和values。

        keys = list(order_dict.keys())
        keys.sort()
        values = list(order_dict.values())
        values.sort()

考虑处于keys第i位的key。如果单调递增，处于values第i位value应当恰好等于order_dict[key]；如果单调递减，处于values从队末开始数第i位（从队首开始数第values_len - i - 1）的value应当恰好等于order_dict[key]。

        for i in range(len(keys)):
            key = keys[i]
            values_len = len(values)
            if right_or_left == 'right':  # value = f(key) monotonic rising
                value = values[i]
            elif right_or_left == 'left':  # value = f(key) monotonic falling
                value = values[values_len - i - 1]
            else:
                raise Exception(
                    'The argument “right_or_left” for function *monotonic_check* need a string variable which is either "right" or "left".')
            if order_dict[key] != value:  # actual value != the correspondent value with regard to the monotonic order
                monotonic_bool = False
                break
        return monotonic_bool

在检查单峰性的函数single_peak_check中，我们只需要确定最受偏好的候选项most_preferred，把它左侧的候选项存入字典leftwing中，右侧的候选项存入rightwing中，再分别调用monotonic_check即可。返回rightwing_bool and leftwing_bool：左右候选项同时通过单调测试，这一投票者的偏好序就具有单峰性。

    # check whether an indivisual's votes satisfy the single-peak attribute
    def single_peak_check(votes_individual):
        most_preferred = votes_individual[0]
        rightwing = {}  # votes on those candidates whose attributive order is higher than the most preferred one
        leftwing = {}  # votes on those candidates whose attributive order is lower than the most preferred one
        for i in range(most_preferred + 1, m):
            i_index = np.where(votes_individual == i)
            rightwing[i] = i_index[0][0]
        for i in range(0, most_preferred):
            i_index = np.where(votes_individual == i)
            leftwing[i] = i_index[0][0]
        # The voter's behavior satisfies single-peak attribute, when both the left wing and the right wing are monotonic
        rightwing_bool = monotonic_check(rightwing, 'right')
        leftwing_bool = monotonic_check(leftwing, 'left')
        return rightwing_bool and leftwing_bool

一一检查有向图中的节点，输出不理性的投票者。

    m = len(votes[0])
    irrational_voters = []
    for i in range(len(votes)):
        votes_individual = votes[i]
        single_peak_bool = single_peak_check(votes_individual)
        if single_peak_bool == False:
            irrational_voters.append(i)
    print('违反单峰性质的voters序号（初始矩阵行号）：', end='')
    print(irrational_voters)

违反单峰性质的voters序号（初始矩阵行号）：[8, 9, 10, 11, 12]

删除非理性投票者，生成新的有向图矩阵。

    # generate a new matrix which excludes the irrational voters
    n = len(votes)
    cnt = 0
    for i in range(n):
        if i in irrational_voters:
            continue
        if cnt == 0:
            votes_new = votes[i]
            votes_new = np.reshape(votes_new, (-1, m))  # 1d to 2d (in order to concatenate into a 2-d array)
        else:
            votes_appended = votes[i]
            votes_appended = np.reshape(votes_appended, (-1, m))
            votes_new = np.concatenate((votes_new, votes_appended), axis=0)
        cnt += 1

检查投票者是否为奇数个。如果不是，补齐一个一个投票者，其偏好序为属性序。

    # make sure there is an odd number of voters
    if len(votes_new) % 2 == 0:
        print('删除非理性投票者后，总人数不为奇数，补齐一个投票者，其偏好序为属性序。')
        votes_default = np.array([[0, 1, 2, 3, 4, 5, 6, 7, 8, 9]])
        votes_new = np.concatenate((votes_new, votes_default), axis=0)
    print('删除非理性投票者后的矩阵为：')
    print(votes_new)

    return votes_new

删除非理性投票者后，总人数不为奇数，补齐一个投票者，其偏好序为属性序。
删除非理性投票者后的矩阵为：
[[9 8 7 6 5 4 3 2 1 0]
 [2 3 1 4 5 6 0 7 8 9]
 [7 6 5 8 4 3 2 1 0 9]
 [3 4 2 5 6 1 7 0 8 9]
 [6 7 5 4 3 2 1 8 0 9]
 [2 1 3 0 4 5 6 7 8 9]
 [0 1 2 3 4 5 6 7 8 9]
 [3 4 5 2 1 6 7 0 8 9]
 [0 1 2 3 4 5 6 7 8 9]]

对于理性投票者集合，我们采用中位项定理排序。先定义函数median_identifier，给定一组投票结果votes，集合每一投票者最偏好的候选项most_preferred_choices，找出其中的中位项i，这就是当前votes中的孔多塞胜者。

# 根据中位项定理排序
def median_identifier(votes):
    candidates_num = len(votes[0])
    n = len(votes)
    most_preferred_choices = []  # collect the most preferred choice of all voters
    for votes_individual in votes:
        most_preferred_choices.append(votes_individual[0])

为了找出中位项，我调用了numpy的unique方法，得到一个字典occurences，其中记录了每一候选项成为最受偏好者的次数（被多少人选为了最佳）。

    most_preferred_choices_array = np.array(most_preferred_choices)
    # identify the choice of the median voter
    unique, counts = np.unique(most_preferred_choices_array, return_counts=True)
    occurences = dict(zip(unique, counts))

occurences的键按照属性序从小到大排列，加总每一候选项的出现次数，总和存储在变量accumulation里。当accumulation >= (n+1) // 2时，终止循环，此时的candidate即为中间项。

    accumulation = 0
    for candidate in occurences.keys():  # add up occurences, starting from the first one in the attributive order
        accumulation += occurences[candidate]
        if accumulation >= (n + 1) // 2:
            break  # when we break the loop here, i denotes the choice of the median voter
    print(candidate, end=', ')
    return candidate

以下为主程序。

首先调用condorcet_paradox_check，检查投票结果是否存在孔多塞悖论。

paradox_bool = condorcet_paradox_check(votes)

如果不存在孔多塞悖论，就调用condorcet，按照孔多塞原则排序。

if paradox_bool == False:
    condorcet_order = condorcet(votes)
    print('根据孔多塞定理，全排序如下：', end='')
    print(condorcet_order)

否则就调用irrational_voters_deleter，删除非理性投票者，得到votes_new。

else:
    votes_new = irrational_voters_deleter(votes)
    rational_voters_num = len(votes_new)

接着调用median_identifier，按照中位项定理确定当前的孔多塞胜者median_choice，然后在投票结果votes_new中删去这一候选项。迭代操作，直到所有候选项都被排到集体序中。

    print('根据中位项定理，全排序如下：', end='')
    while True:
        median_choice = median_identifier(votes_new)
        if len(votes_new[0]) == 1:
            break
        # exclude the already chosen candidates, so as to repeat the process of identifying the choice of the median voter
        m = len(votes_new[0]) - 1
        votes_new = votes_new[votes_new != median_choice]
        votes_new = np.reshape(votes_new, (rational_voters_num, m))

根据中位项定理，全排序如下：3, 4, 2, 5, 1, 6, 7, 0, 8, 9,

上面的数据例子存在孔多塞悖论。以下是一个不存在孔多塞悖论的数据例子：

Initiating...
输入文件名（如b2)：b2.txt
投票结果：
[[4 3 2 1 0]
 [3 2 1 0 4]
 [1 0 2 3 4]
 [4 1 0 3 2]
 [1 3 2 0 4]
 [4 2 3 1 0]
 [4 2 0 3 1]]
孔多塞排序有向图（初始）：
[[ 0. -1. -1. -1. -1.]
 [ 1.  0. -1. -1. -1.]
 [ 1.  1.  0. -1. -1.]
 [ 1.  1.  1.  0. -1.]
 [ 1.  1.  1.  1.  0.]]
不存在孔多塞悖论。
根据孔多塞定理，全排序如下：[4, 3, 2, 1, 0]

3、完整代码

孔多塞模型下的表决综合

import numpy as np
import os


print('Initiating...')


# 打开文件读取排序/投票结果
def votes_reader(filename):
    f = open(filename, 'r')
    votes_lines = f.readlines()
    votes = []
    for line in votes_lines:
        line = line.strip()
        votes_individual = line.split()
        votes_indv_new = []
        for vote in votes_individual:
            vote = int(vote)
            votes_indv_new.append(vote)
        votes.append(votes_indv_new)
    votes = np.array(votes)
    return votes


cnt = 0
while True:
    cnt += 1
    if cnt > 3:
        print('使用默认文件b2。')
        votes = votes_reader('./input/b2.txt')
    try:
        filename = input('输入文件名（如b2.txt)：')
        pathname = os.path.join('./input', filename)
        votes = votes_reader(pathname)
    except:
        print('输入错误。', end='')
        continue
    break

print('投票结果：')
print(votes)


# compare votes on two candidates
def compare(i, j, votes):
    n = len(votes)  # number of voters
    advantage = 0  # accumulative advantage of i over j according to all voters
    i_indices = np.where(votes == i)
    j_indices = np.where(votes == j)
    for k in range(n):
        i_index = i_indices[1][k]
        j_index = j_indices[1][k]
        if i_index < j_index:  # i ranks higher than j, according to voter k
            advantage += 1
        else:
            advantage -= 1
    return advantage


# Condorcet ranking (a different algorithm from the one taught in class)
def condorcet(votes):
    n = len(votes)  # number of voters
    assert n % 2 == 1  # make sure there are an odd number of voters
    m = len(votes[0])  # number of candidates
    order = [0]
    # bubble sorting
    for i in range(1, m):
        for j_index in range(len(order)):
            flag = False
            j = order[len(order) - j_index - 1]
            advantage = compare(i, j, votes)
            if advantage < 0:
                order.insert(j_index + 1, i)
                flag = True
                break  # compare the new candidate to existing candidates in the order, from the least favored to the most,
                # until we find a more favored existing candidate, so that we can insert the new candidate after it
        if flag == False:
            order.insert(0, i)  # All existing candidates are less favored than the new one
    return order


# 检查投票结果是否隐含孔多塞悖论
# the main act: check if there is Condorcet paradox
def condorcet_paradox_check(votes):
    m = len(votes[0])
    n = len(votes)

    # generate a matrix which denotes which of two candidates is more preferred
    def preference_matrix_generator(votes):
        preference_matrix = np.zeros((m, m))
        assert n % 2 == 1
        for i in range(m - 1):
            for j in range(i + 1, m):
                advantage = compare(i, j, votes)
                if advantage > 0:  # i is more preferred than j
                    preference_matrix[i][j] = 1
                    preference_matrix[j][i] = -1
                else:
                    preference_matrix[i][j] = -1
                    preference_matrix[j][i] = 1
        return preference_matrix

    preference_matrix = preference_matrix_generator(votes)
    print('孔多塞排序有向图（初始）：')
    print(preference_matrix)

    # delete the nodes whose entry degree is 0
    def most_preferred_deleter(preference_matrix):
        deleter_bool = False  # whether there is indeed a node to be deleted
        for i in range(len(preference_matrix)):
            preference_individual = preference_matrix[i]
            flag = True
            for item in preference_individual:
                if item < 0:
                    flag = False
                    break
            if flag == True:
                deleter_bool = True
                preference_matrix = np.delete(preference_matrix, i, 0)
                preference_matrix = np.delete(preference_matrix, i, 1)
                break  # There's no possibility that a given matrix has two nodes whose entry degrees are 0 at the same time
        return preference_matrix, deleter_bool

    while True:
        preference_matrix, deleter_bool = most_preferred_deleter(preference_matrix)
        # print(preference_matrix)
        if deleter_bool == False:
            break  # break when there is no node to be deleted
    if len(preference_matrix) == 0:
        paradox_bool = False
        print('不存在孔多塞悖论。')
    else:
        paradox_bool = True
        print('存在孔多塞悖论。')
    return paradox_bool


# 删除违反单峰性质的voters
def irrational_voters_deleter(votes):
    # check whether values vary monotonically with keys
    def monotonic_check(order_dict, right_or_left):
        # order_dict -- candidate : order
        monotonic_bool = True
        keys = list(order_dict.keys())
        keys.sort()
        values = list(order_dict.values())
        values.sort()
        for i in range(len(keys)):
            key = keys[i]
            values_len = len(values)
            if right_or_left == 'right':  # value = f(key) monotonic rising
                value = values[i]
            elif right_or_left == 'left':  # value = f(key) monotonic falling
                value = values[values_len - i - 1]
            else:
                raise Exception(
                    'The argument “right_or_left” for function *monotonic_check* need a string variable which is either "right" or "left".')
            if order_dict[key] != value:  # actual value != the correspondent value with regard to the monotonic order
                monotonic_bool = False
                break
        return monotonic_bool

    # check whether an indivisual's votes satisfy the single-peak attribute
    def single_peak_check(votes_individual):
        most_preferred = votes_individual[0]
        rightwing = {}  # votes on those candidates whose attributive order is higher than the most preferred one
        leftwing = {}  # votes on those candidates whose attributive order is lower than the most preferred one
        for i in range(most_preferred + 1, m):
            i_index = np.where(votes_individual == i)
            rightwing[i] = i_index[0][0]
        for i in range(0, most_preferred):
            i_index = np.where(votes_individual == i)
            leftwing[i] = i_index[0][0]
        # The voter's behavior satisfies single-peak attribute, when both the left wing and the right wing are monotonic
        rightwing_bool = monotonic_check(rightwing, 'right')
        leftwing_bool = monotonic_check(leftwing, 'left')
        return rightwing_bool and leftwing_bool

    m = len(votes[0])
    irrational_voters = []
    for i in range(len(votes)):
        votes_individual = votes[i]
        single_peak_bool = single_peak_check(votes_individual)
        if single_peak_bool == False:
            irrational_voters.append(i)
    print('违反单峰性质的voters序号（初始矩阵行号）：', end='')
    print(irrational_voters)

    # generate a new matrix which excludes the irrational voters
    n = len(votes)
    cnt = 0
    for i in range(n):
        if i in irrational_voters:
            continue
        if cnt == 0:
            votes_new = votes[i]
            votes_new = np.reshape(votes_new, (-1, m))  # 1d to 2d (in order to concatenate into a 2-d array)
        else:
            votes_appended = votes[i]
            votes_appended = np.reshape(votes_appended, (-1, m))
            votes_new = np.concatenate((votes_new, votes_appended), axis=0)
        cnt += 1

    # make sure there is an odd number of voters
    if len(votes_new) % 2 == 0:
        print('删除非理性投票者后，总人数不为奇数，补齐一个投票者，其偏好序为属性序。')
        votes_default = np.array([[0, 1, 2, 3, 4, 5, 6, 7, 8, 9]])
        votes_new = np.concatenate((votes_new, votes_default), axis=0)
    print('删除非理性投票者后的矩阵为：')
    print(votes_new)

    return votes_new


# 根据中位项定理排序
def median_identifier(votes):
    candidates_num = len(votes[0])
    n = len(votes)
    most_preferred_choices = []  # collect the most preferred choice of all voters
    for votes_individual in votes:
        most_preferred_choices.append(votes_individual[0])
    most_preferred_choices_array = np.array(most_preferred_choices)
    # identify the choice of the median voter
    unique, counts = np.unique(most_preferred_choices_array, return_counts=True)
    occurences = dict(zip(unique, counts))
    accumulation = 0
    for candidate in occurences.keys():  # add up occurences, starting from the first one in the attributive order
        accumulation += occurences[candidate]
        if accumulation >= (n + 1) // 2:
            break  # when we break the loop here, i denotes the choice of the median voter
    print(candidate, end=', ')
    return candidate


paradox_bool = condorcet_paradox_check(votes)
if paradox_bool == False:
    condorcet_order = condorcet(votes)
    print('根据孔多塞定理，全排序如下：', end='')
    print(condorcet_order)
else:
    votes_new = irrational_voters_deleter(votes)
    rational_voters_num = len(votes_new)
    print('根据中位项定理，全排序如下：', end='')
    while True:
        median_choice = median_identifier(votes_new)
        if len(votes_new[0]) == 1:
            break
        # exclude the already chosen candidates, so as to repeat the process of identifying the choice of the median voter
        m = len(votes_new[0]) - 1
        votes_new = votes_new[votes_new != median_choice]
        votes_new = np.reshape(votes_new, (rational_voters_num, m))

posted @ 2022-02-16 09:53 我在吃大西瓜呢阅读(594) 评论(0) 编辑收藏举报

会员力量，点亮园子希望

刷新页面返回顶部

我在吃大西瓜呢