HDU 1686 Oulipo(KMP)

                  Oulipo



Problem Description
The French author Georges Perec (1936–1982) once wrote a book, La disparition, without the letter 'e'. He was a member of the Oulipo group. A quote from the book:

Tout avait Pair normal, mais tout s’affirmait faux. Tout avait Fair normal, d’abord, puis surgissait l’inhumain, l’affolant. Il aurait voulu savoir où s’articulait l’association qui l’unissait au roman : stir son tapis, assaillant à tout instant son imagination, l’intuition d’un tabou, la vision d’un mal obscur, d’un quoi vacant, d’un non-dit : la vision, l’avision d’un oubli commandant tout, où s’abolissait la raison : tout avait l’air normal mais…

Perec would probably have scored high (or rather, low) in the following contest. People are asked to write a perhaps even meaningful text on some subject with as few occurrences of a given “word” as possible. Our task is to provide the jury with a program that counts these occurrences, in order to obtain a ranking of the competitors. These competitors often write very long texts with nonsense meaning; a sequence of 500,000 consecutive 'T's is not unusual. And they never use spaces.

So we want to quickly find out how often a word, i.e., a given string, occurs in a text. More formally: given the alphabet {'A', 'B', 'C', …, 'Z'} and two finite strings over that alphabet, a word W and a text T, count the number of occurrences of W in T. All the consecutive characters of W must exactly match consecutive characters of T. Occurrences may overlap.

 

 

Input
The first line of the input file contains a single number: the number of test cases to follow. Each test case has the following format:

One line with the word W, a string over {'A', 'B', 'C', …, 'Z'}, with 1 ≤ |W| ≤ 10,000 (here |W| denotes the length of the string W).
One line with the text T, a string over {'A', 'B', 'C', …, 'Z'}, with |W| ≤ |T| ≤ 1,000,000.
 

 

Output
For every test case in the input file, the output should contain a single number, on a single line: the number of occurrences of the word W in the text T.

 

 

Sample Input
3
BAPC
BAPC
AZA
AZAZAZA
VERDI
AVERDXIVYERDIAN
 
 

 

Sample Output
1
3
0
 
KMP第一题。
 
 1 #include<cstdio>
 2 #include<cstring>
 3 using namespace std;
 4 
 5 int next[10005];
 6 char p[10005];
 7 char s[1000005];
 8 
 9 void getnext(char *p)
10 {
11     int len=strlen(p);
12     next[0]=-1;
13     int k=-1,j=0;
14     while(j<len)
15     {
16         if(k==-1||p[j]==p[k])
17         {
18             ++k;
19             ++j;
20             next[j]=k;
21         }
22         else
23         k=next[k];
24     }
25 }
26 
27 int kmp(char *p,char *s)
28 {
29     int cnt=0,i=0,j=0;
30     int slen=strlen(s);
31     int plen=strlen(p);
32     while(i<slen)
33     {
34         if(j==-1||s[i]==p[j])
35         {
36             i++;
37             j++;
38         }
39         else
40         j=next[j];
41         if(j==plen)
42         {
43             cnt++;
44             j=next[j];
45 
46         }
47     }
48     return cnt;
49 }
50 
51 int main()
52 {
53     //freopen("in.txt","r",stdin);
54     int t;
55     scanf("%d",&t);
56     while(t--)
57     {
58         scanf("%s",p);
59         getnext(p);
60         scanf("%s",s);
61         int ans=kmp(p,s);
62         printf("%d\n",ans);
63     }
64     return 0;
65 }

 

posted on 2015-08-03 21:31    阅读(134)  评论(0编辑  收藏  举报

导航