poj3693
Time Limit: 1000MS | Memory Limit: 65536K | |
Total Submissions: 9067 | Accepted: 2772 |
Description
The repetition number of a string is defined as the maximum number R such that the string can be partitioned into R same consecutive substrings. For example, the repetition number of "ababab" is 3 and "ababa" is 1.
Given a string containing lowercase letters, you are to find a substring of it with maximum repetition number.
Input
The input consists of multiple test cases. Each test case contains exactly one line, which
gives a non-empty string consisting of lowercase letters. The length of the string will not be greater than 100,000.
The last test case is followed by a line containing a '#'.
Output
For each test case, print a line containing the test case number( beginning with 1) followed by the substring of maximum repetition number. If there are multiple substrings of maximum repetition number, print the lexicographically smallest one.
Sample Input
ccabababc daabbccaa #
Sample Output
Case 1: ababab Case 2: aa
Source
在后缀数组神文中也这题的题解。
比较容易理解的部分就是枚举长度为L,然后看长度为L的字符串最多连续出现几次。
既然长度为L的串重复出现,那么str[0],str[l],str[2*l]……中肯定有两个连续的出现在字符串中。
那么就枚举连续的两个,然后从这两个字符前后匹配,看最多能匹配多远。
即以str[i*l],str[i*l+l]前后匹配,这里是通过查询suffix(i*l),suffix(i*l+l)的最长公共前缀
通过rank值能找到i*l,与i*l+l的排名,我们要查询的是这段区间的height的最小值,通过RMQ预处理
达到查询为0(1)的复杂度,
设LCP长度为M, 则答案显然为M / L + 1, 但这不一定是最好的, 因为答案的首尾不一定再我们枚举的位置上. 我的解决方法是, 我们考虑M % L的值的意义, 我们可以认为是后面多了M % L个字符, 但是我们更可以想成前面少了(L - M % L)个字符! 所以我们求后缀j * L - (L - M % L)与后缀(j + 1) * L - (L - M % L)的最长公共前缀。
即把之前的区间前缀L-M%L即可。
然后把可能取到最大值的长度L保存,由于 题目要求字典序最小,通过sa数组进行枚举,取到的第一组,肯定是字典序最小的。
AC代码:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 | <code-pre class = "code-pre" id= "pre-hJNRJY" ><code-line class = "line-numbers-rows" ></code-line>#include<cstdio> <code-line class = "line-numbers-rows" ></code-line>#include<cstring> <code-line class = "line-numbers-rows" ></code-line>#include<cmath> <code-line class = "line-numbers-rows" ></code-line>#include<iostream> <code-line class = "line-numbers-rows" ></code-line> using namespace std; <code-line class = "line-numbers-rows" ></code-line>#define N 100010 <code-line class = "line-numbers-rows" ></code-line> int wa[N],wb[N],wv[N],ww[N]; <code-line class = "line-numbers-rows" ></code-line> int sa[N],lcp[N],rank[N],f[N][21]; <code-line class = "line-numbers-rows" ></code-line> char str[N]; <code-line class = "line-numbers-rows" ></code-line> inline bool cmp( int *r, int a, int b, int len){ <code-line class = "line-numbers-rows" ></code-line> return r[a]==r[b]&&r[a+len]==r[b+len]; <code-line class = "line-numbers-rows" ></code-line>} <code-line class = "line-numbers-rows" ></code-line> void construct_sa( int n, int m){ <code-line class = "line-numbers-rows" ></code-line> int i,j,p,*x=wa,*y=wb,*t;n++; <code-line class = "line-numbers-rows" ></code-line> for (i=0;i<m;i++) ww[i]=0; <code-line class = "line-numbers-rows" ></code-line> for (i=0;i<n;i++) ww[x[i]=str[i]]++; <code-line class = "line-numbers-rows" ></code-line> for (i=1;i<m;i++) ww[i]+=ww[i-1]; <code-line class = "line-numbers-rows" ></code-line> for (i=n-1;i>=0;i--) sa[--ww[x[i]]]=i; <code-line class = "line-numbers-rows" ></code-line> for (j=p=1;p<n;j<<=1,m=p){ <code-line class = "line-numbers-rows" ></code-line> for (p=0,i=n-j;i<n;i++) y[p++]=i; <code-line class = "line-numbers-rows" ></code-line> for (i=0;i<n;i++) if (sa[i]>=j) y[p++]=sa[i]-j; <code-line class = "line-numbers-rows" ></code-line> for (i=0;i<m;i++) ww[i]=0; <code-line class = "line-numbers-rows" ></code-line> for (i=0;i<n;i++) ww[wv[i]=x[y[i]]]++; <code-line class = "line-numbers-rows" ></code-line> for (i=1;i<m;i++) ww[i]+=ww[i-1]; <code-line class = "line-numbers-rows" ></code-line> for (i=n-1;i>=0;i--) sa[--ww[wv[i]]]=y[i]; <code-line class = "line-numbers-rows" ></code-line> for (t=x,x=y,y=t,x[sa[0]]=0,p=i=1;i<n;i++) <code-line class = "line-numbers-rows" ></code-line> x[sa[i]]=cmp(y,sa[i-1],sa[i],j)?p-1:p++; <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line>} <code-line class = "line-numbers-rows" ></code-line> void construct_lcp( int n){ <code-line class = "line-numbers-rows" ></code-line> for ( int i=0;i<=n;i++) rank[sa[i]]=i; <code-line class = "line-numbers-rows" ></code-line> int h=0; <code-line class = "line-numbers-rows" ></code-line> lcp[0]=0; <code-line class = "line-numbers-rows" ></code-line> for ( int i=0;i<n;i++){ <code-line class = "line-numbers-rows" ></code-line> int j=sa[rank[i]-1]; <code-line class = "line-numbers-rows" ></code-line> if (h>0) h--; <code-line class = "line-numbers-rows" ></code-line> for (;j+h<n&&i+h<n;h++) if (str[i+h]!=str[j+h]) break ; <code-line class = "line-numbers-rows" ></code-line> lcp[rank[i]-1]=h; <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line>} <code-line class = "line-numbers-rows" ></code-line> void st( int n){ <code-line class = "line-numbers-rows" ></code-line> for ( int i=1;i<=n;i++) f[i][0]=lcp[i-1]; <code-line class = "line-numbers-rows" ></code-line> for ( int i=1;(1<<i)<=n;i++){ <code-line class = "line-numbers-rows" ></code-line> for ( int j=1;j+(1<<i)-1<=n;j++){ <code-line class = "line-numbers-rows" ></code-line> f[j][i]=min(f[j][i-1],f[j+(1<<(i-1))][i-1]); <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line>} <code-line class = "line-numbers-rows" ></code-line> int query( int i, int j){ <code-line class = "line-numbers-rows" ></code-line> i=rank[i];j=rank[j]; <code-line class = "line-numbers-rows" ></code-line> if (i>j) swap(i,j);i++; <code-line class = "line-numbers-rows" ></code-line> int k= log (j-i+1)/ log (2); <code-line class = "line-numbers-rows" ></code-line> return min(f[i][k],f[j-(1<<k)+1][k]); <code-line class = "line-numbers-rows" ></code-line>} <code-line class = "line-numbers-rows" ></code-line> int tmp[N],kkk; <code-line class = "line-numbers-rows" ></code-line> bool vis[N]; <code-line class = "line-numbers-rows" ></code-line> int slove( int n){ <code-line class = "line-numbers-rows" ></code-line> int ans=0,sum; <code-line class = "line-numbers-rows" ></code-line> kkk=0; <code-line class = "line-numbers-rows" ></code-line> memset (vis,0, sizeof (vis)); <code-line class = "line-numbers-rows" ></code-line> for ( int len=1;len<=n;len++){ <code-line class = "line-numbers-rows" ></code-line> for ( int i=0;i+len<=n;i+=len){ <code-line class = "line-numbers-rows" ></code-line> int t=query(i,i+len); <code-line class = "line-numbers-rows" ></code-line> sum=t/len+1; <code-line class = "line-numbers-rows" ></code-line> int pos=i-(len-t%len); <code-line class = "line-numbers-rows" ></code-line> if (pos>=0&&t%len!=0) if (query(pos,pos+len)>=(len-t%len)) sum++; <code-line class = "line-numbers-rows" ></code-line> if (sum>ans) ans=sum; <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line> for ( int len=1;len<=n;len++){ <code-line class = "line-numbers-rows" ></code-line> for ( int i=0;i+len<=n;i+=len){ <code-line class = "line-numbers-rows" ></code-line> int t=query(i,i+len); <code-line class = "line-numbers-rows" ></code-line> sum=t/len+1; <code-line class = "line-numbers-rows" ></code-line> int pos=i-(len-t%len); <code-line class = "line-numbers-rows" ></code-line> if (pos>=0&&t%len!=0) if (query(pos,pos+len)>=(len-t%len)) sum++; <code-line class = "line-numbers-rows" ></code-line> if (sum==ans&&vis[len]==0){ <code-line class = "line-numbers-rows" ></code-line> vis[len]=1; <code-line class = "line-numbers-rows" ></code-line> tmp[kkk++]=len; <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line> return ans; <code-line class = "line-numbers-rows" ></code-line>} <code-line class = "line-numbers-rows" ></code-line> int main(){ <code-line class = "line-numbers-rows" ></code-line> int cas=0; <code-line class = "line-numbers-rows" ></code-line> while ( scanf ( "%s" ,str)==1){ <code-line class = "line-numbers-rows" ></code-line> if (str[0]== '#' ) break ; <code-line class = "line-numbers-rows" ></code-line> int len= strlen (str); <code-line class = "line-numbers-rows" ></code-line> construct_sa(len,200); <code-line class = "line-numbers-rows" ></code-line> construct_lcp(len); <code-line class = "line-numbers-rows" ></code-line> st(len); <code-line class = "line-numbers-rows" ></code-line> int ans=slove(len); <code-line class = "line-numbers-rows" ></code-line> printf ( "Case %d: " ,++cas); <code-line class = "line-numbers-rows" ></code-line> int pos=0,leng=0,flag=0; <code-line class = "line-numbers-rows" ></code-line> for ( int i=1;i<=len;i++){ <code-line class = "line-numbers-rows" ></code-line> for ( int j=0;j<kkk;j++){ //枚举可以使重复次数到达ans的长度 <code-line class = "line-numbers-rows" ></code-line> int t=query(sa[i],sa[i]+tmp[j]); <code-line class = "line-numbers-rows" ></code-line> if (t/tmp[j]+1==ans){ //再次计算sa[i]开始的重复次数 <code-line class = "line-numbers-rows" ></code-line> pos=sa[i];leng=tmp[j];flag=1; break ; //满足即跳出 <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line> if (flag) break ; <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line> for ( int i=pos,j=0;j<leng*ans;j++,i++) printf ( "%c" ,str[i]); <code-line class = "line-numbers-rows" ></code-line> printf ( "\n" ); <code-line class = "line-numbers-rows" ></code-line> } <code-line class = "line-numbers-rows" ></code-line> return 0; <code-line class = "line-numbers-rows" ></code-line>} <code-line class = "line-numbers-rows" ></code-line> </code-pre> |
__EOF__

本文链接:https://www.cnblogs.com/shenben/p/5765823.html
关于博主:评论和私信会在第一时间回复。或者直接私信我。
版权声明:本博客所有文章除特别声明外,均采用 BY-NC-SA 许可协议。转载请注明出处!
声援博主:如果您觉得文章对您有帮助,可以点击文章右下角【推荐】一下。您的鼓励是博主的最大动力!
【推荐】国内首个AI IDE,深度理解中文开发场景,立即下载体验Trae
【推荐】编程新体验,更懂你的AI,立即体验豆包MarsCode编程助手
【推荐】抖音旗下AI助手豆包,你的智能百科全书,全免费不限次数
【推荐】轻量又高性能的 SSH 工具 IShell:AI 加持,快人一步
· 如何编写易于单元测试的代码
· 10年+ .NET Coder 心语,封装的思维:从隐藏、稳定开始理解其本质意义
· .NET Core 中如何实现缓存的预热?
· 从 HTTP 原因短语缺失研究 HTTP/2 和 HTTP/3 的设计差异
· AI与.NET技术实操系列:向量存储与相似性搜索在 .NET 中的实现
· 10年+ .NET Coder 心语 ── 封装的思维:从隐藏、稳定开始理解其本质意义
· 地球OL攻略 —— 某应届生求职总结
· 周边上新:园子的第一款马克杯温暖上架
· Open-Sora 2.0 重磅开源!
· 提示词工程——AI应用必不可少的技术