哈希查找字符串

哈希查找的优越性：
当查找的字串长度小于机器字长时，可以把字串当做 long 进行比较。由于移位操作相当快（一个时钟周期），所以执行操作花费的时间比较少。
这种查找办法通用性不好。（ big-endian or little-endian 问题）。但不可否认，它的查找速度确实快。

// "CSDN"　的查找算法
// 思路是：把 "csdn" 按整数比较。每比较一次，对目标串进行移位操作。
void dwordsearch(const char pstr, const char* str )
{
    const int* ptarget = (int*)str;
    const int hash= *(int*)pstr;
    // 出现的次数
    size_t index = 0;
    size_t index2 = 0;
    vector< size_t > posCollect;

    //如果不是整数的倍数，保存尾巴和最后一个 int。
    int len = strlen( str ) - strlen( str ) % sizeof( int );
    const char* end  =  str + len ;
    if ( *end )
    {
       cout <<   "有尾巴"  <<  endl;
    }
    // 判断开头相等不
    //
    if ( hash == ptarget[index] )
    {
        cout << "找到了" << endl;
    }
        // 判断结尾相等不
// 以后再加吧。
    for( index = 0; index < len / 4 - 1; index++  )
    {
           for( index2 = 8;index2 <= 32; index2 += 8 )
           {
               // 向左移？向右移？视 big-endding 和 small-endding 不同而不同
               if ( hash ==  ( ( ptarget[index] >> index2 ) | ( ptarget[index+1] << ( 32- index2 ) ) ) )
               {
                   break;
               }
           }
           if(  index2 <= 32  )
           {
               posCollect.push_back( index * 4 + index2 / 8   );
           }
    }
    cout <<  "找到的次数："    <<  posCollect.size()  << endl;

    for(  vector<size_t>::const_iterator iter = posCollect.begin(); iter!= posCollect.end(); iter++  )
    {
        cout << *iter    << endl;
    }
    system( "pause" );

}

posted @ 2009-09-16 07:31 范晨鹏阅读(904) 评论(0) 收藏举报

刷新页面返回顶部

范晨鹏的博客

哈希查找字符串

公告