怎么卡飞哈希

#include<bits/stdc++.h>
using namespace std;
int main(){
    printf("100000 20\n");
    for(int i = 1;i <= 100000;i++){
        putchar(rand() % 26 + 'a');
    }
    return 0;
}

top

自然溢出

自然溢出，即 hash 数组使用 unsigned long long，也就是对于 $2^{64}$ 取模。不光其值域不难出现哈希冲突，而且代码长度与常数都会大大减小，得到了不少的同学的青睐。

将 $N = 2^{64}, n = 10^{5}$ 带入上面的函数后，我们发现出现哈希冲突的可能性 $P$ 无限接近与 $0$ ，所以使用生日攻击成功的可能性极小。

底数为偶数

可以构造全部为 $a$ 的子串和第一个为 $b$ 其余均为 $a$ 的两个长度相等且长苏大于 $64$ 的两个不一样的字符串。

因为底数的 $64$ 次方以上模 $2^{64}$ 都是 $0$ ，所以即是两个字符串不同，他们的哈希值也都会一样。

底数为奇数

设一些串 $s$ ， $s_{i}$ 表示第 $i$ 个串， $s_{i}$ 的哈希值为 $h a s h (s_{i})$ 。

定义 $f (s)$ 为字符串 $s$ 内全部的 $a$ 都变为 $b$ ，所有的 $b$ 都变成 $a$ 。

定义 $s_{i} + s_{j}$ 的意思为将 $s_{j}$ 添加到 $s_{i}$ 的末尾形成的新的字符串。

构造方法为： $s_{1} = a$ ， $s_{i} = s_{i - 1} + f (s_{i - 1})$ ，所以 $| s_{i} | = 2^{i - 1}$ 。

所以：

h a s h (s_{i}) = h a s h (s_{i - 1}) \cdot b a s e^{| s_{i - 1} |} + h a s h (f (s_{i - 1})) = h a s h (s_{i - 1}) \cdot b a s e^{2^{i - 2}} + h a s h (f (s_{i - 1}))

h a s h (f (s_{i - 1})) = h a s h (f (s_{i - 2})) \cdot b a s e^{2^{i - 2}} + h a s h (s_{i - 1})

h a s h (s_{i}) - h a s h (f (s_{i - 1})) = (h a s h (s_{i - 1}) - h a s h (f (s_{i - 2}))) \cdot b a s e^{2^{i - 2}} - (h a s h (s_{i - 1}) - h a s h (f (s_{i - 2})))

h a s h (s_{i}) - h a s h (f (s_{i - 1})) = (h a s h (s_{i - 1}) - h a s h (f (s_{i - 2}))) \cdot (b a s e^{2^{i - 2}} - 1)

因为希望产生哈希冲突，即 $2^{64} ∣ h a s h (s_{i}) - h a s h (f (s_{i}))$ 。

设 $g_{i}$ 表示 $h a s h (s_{i}) - h a s h (f (s_{i}))$ ，那么 $g$ 满足一下性质：

g_{i} = g_{i - 1} \cdot (b a s e^{i - 2} - 1)

因为每一个 $b a s e^{2^{i - 1}} - 1$ 都是偶数，所以是的 $g$ 到达第 $64$ 项就可以 hack 了。

因为 $b a s e^{2^{i - 1}} - 1 = (b a s e^{2^{i - 2}} - 1) \cdot (b a s e^{2^{i - 2}} + 1)$ 且为一个偶数乘一个偶数, 而左边的可以继续递归下去, 所以到第 $12$ 位其实就可以 hack 了。

#include <iostream>
#include <cstring>
using namespace std;
char s[10000];
int main(){
	cout<<(1<<12)+65<<' '<<(1<<11)<<'\n';
	int now=1;
	s[1]='a';
	for (int i=1;i<=12;i++){
		for (int j=1;j<=now;j++) s[now+j]=s[j]=='a'?'b':'a';
		now<<=1;
	}
	for (int i=1;i<=now;i++) printf("%c",s[i]);
	for (int i=1;i<=65;i++) putchar('a');
	return 0;
}