Find substring with K distinct characters

Given a string and number K, find the substrings of size K with K distinct characters. If no, output empty list. Remember to emit the duplicate substrings, i.e. if the substring repeated twice, only output once.

  • 字符串中等题。Sliding window algorithm + Hash。
  • 使用移动窗口算法,两个指针标记window起点left/终点right,还有一个计数器count记录hit count,初始值为K。另外还有一个hash数组来记录当前window中所有char。
  • 注意hit的条件是hash[i] == 0,代表当前window中没有重复char。如果hit,则-- count代表找到一个满足条件char。
  • ++ hash[right]来标记已经在当前window中出现过,并扩展right。
  • 如果window size为K,那么就可以判断如果count == 0,代表已经找到K个不重复的char,可以放入结果集。这里注意下需要用STL算法find()去重。
  • 接着要把window往右移,同时对right char做过的操作进行恢复。
  • 注意如果hash[left] == 1,代表以前满足过hash[right] == 0,所以需要-- count来恢复。而对于hash[left] > 1,因为重复char只会hit一次,只会对count + 1,所以不需要-- count,只要等到hash[left] == 1的时候再count - 1就行。同时因为left要移出window了,所以-- hash[left]来恢复,并右移left扩展到下一个window。
  • find - C++ Reference
    • http://www.cplusplus.com/reference/algorithm/find/?kw=find
 1 //
 2 //  main.cpp
 3 //  LeetCode
 4 //
 5 //  Created by Hao on 2017/3/16.
 6 //  Copyright © 2017年 Hao. All rights reserved.
 7 //
 8 
 9 #include <iostream>
10 #include <vector>
11 #include <unordered_map>
12 using namespace std;
13 
14 class Solution {
15 public:
16     vector<string> subStringKDist(string S, int K) {
17         vector<string> vResult;
18         
19         // corner case
20         if (S.empty()) return vResult;
21         
22         unordered_map<char, int>    hash;
23         
24         // window start/end pointer, hit count
25         int left = 0, right = 0, count = K;
26         
27         while (right < S.size()) {
28             if (hash[S.at(right)] == 0) // hit the condition 1 dup char
29                 -- count;
30             
31             ++ hash[S.at(right)]; // increase hash value to mark that the char exists in the current window
32             
33             ++ right; // move window end pointer rightward
34             
35             // window size reaches K
36             if (right - left == K) {
37                 if (0 == count) { // find K distinct chars
38                     if (find(vResult.begin(), vResult.end(), S.substr(left, K)) == vResult.end()) // using STL find() to avoid dup
39                         vResult.push_back(S.substr(left, K));
40                 }
41 
42                 // be careful for the restore condition. Count is only increased when hash[i] == 0, so only hash[i] == 1 means that count was increased.
43                 if (hash[S.at(left)] == 1)
44                     ++ count;
45                 
46                 -- hash[S.at(left)]; // decrease to restore hash value
47                 
48                 ++ left; // move window start pointer rightward
49             }
50         }
51         
52         return vResult;
53     }
54 };
55 
56 int main(int argc, char* argv[])
57 {
58     Solution    testSolution;
59     
60     vector<string>  sInputs = {"awaglknagawunagwkwagl", "abccdef", "", "aaaaaaa"};
61     vector<int>     iInputs = {4, 2, 1, 2};
62     vector<string>  result;
63     
64     /*
65      {wagl aglk glkn lkna knag gawu awun wuna unag nagw agwk kwag }
66      {ab bc cd de ef }
67      {}
68      {}
69      */
70     for (auto i = 0; i < sInputs.size(); ++ i) {
71         result = testSolution.subStringKDist(sInputs[i], iInputs[i]);
72         
73         cout << "{";
74         for (auto it : result)
75             cout << it << " ";
76         cout << "}" << endl;
77     }
78 
79     return 0;
80 }
View Code

 

posted on 2018-02-12 15:26  浩然119  阅读(754)  评论(0编辑  收藏  举报