随笔 - 54  文章 - 0  评论 - 184  阅读 - 58万

trie

 Trie(retrieval) is an order tree data structure that is used to store “string”.

 Unlike normal tree, trie does not store data in its nodes,node position indicates what data the node associates with.

 below is what a trie looks like holding string: ab, ac,b,cd,cde,d. 

 

 

A simple definition of a trie node:

struct node
{
 Bool isWord;//is current node the last node of a word?
 node*child[26];
};

As you can see in the graph above, each edge from a node denotes a ‘letter’ (the relative position where the edge comes from indicates which letter it represents),
When traversing from root to some of its descendant with a dfs (depth first searching), we got a ‘string’.

Note that every node has unique parent, so there is always a unique path from child to the root. 

Initially, trie has a root node only. Then we insert string into it:

复制代码
void insert(char*pWord)
{
 if(!pWord || *pWord == 0)
     return ;
 char* pw = pWord;
 char c = *pw++;
 node *p = root;
 while( c )
 {
   if(p->child[c] == null)
   {
      node*q = new node();
      p->child[c] = q;
   }

   P = p->child[c];
   c = *pw++;
 }

 p->isWord = true;
}
复制代码

 

The ‘isWord’ in the node struct indicates whether path from here to the root represent a string.

This variable is of help when we need to insert strings like these: “abc”, “abcde”, whose path in the trie completely overlaps each other. 

When searching for a string, we traverse each node like this:

 

复制代码
bool SearchWord(const char*pWord)
{
  if(!pWord || *pWord == 0)
     return false;
  char*pw = pWord;
  node*p = root;
  char c = *pw++;
  while( c )
  {
     if(p->[c] == null)
       return false;

      p = p->[c];
      c = *pw++;
   }

   return p->isWord;
}
复制代码

 

 Above is a simple version of implementation of a trie.

 All I want to show is the idea how a trie helps searching for a string in a dictionary in an efficient way.

 Searching complexity is linear time of the length of the string being search, irrelevant to the size of the string set the trie is holding.

 Of course, setting up the trie will still cost a little, which is always an evitable operation.

 

posted on   twoon  阅读(384)  评论(0编辑  收藏  举报
编辑推荐:
· AI与.NET技术实操系列:向量存储与相似性搜索在 .NET 中的实现
· 基于Microsoft.Extensions.AI核心库实现RAG应用
· Linux系列:如何用heaptrack跟踪.NET程序的非托管内存泄露
· 开发者必知的日志记录最佳实践
· SQL Server 2025 AI相关能力初探
阅读排行:
· winform 绘制太阳,地球,月球 运作规律
· 震惊!C++程序真的从main开始吗?99%的程序员都答错了
· 【硬核科普】Trae如何「偷看」你的代码?零基础破解AI编程运行原理
· 超详细:普通电脑也行Windows部署deepseek R1训练数据并当服务器共享给他人
· AI与.NET技术实操系列(五):向量存储与相似性搜索在 .NET 中的实现
< 2013年1月 >
30 31 1 2 3 4 5
6 7 8 9 10 11 12
13 14 15 16 17 18 19
20 21 22 23 24 25 26
27 28 29 30 31 1 2
3 4 5 6 7 8 9

点击右上角即可分享
微信分享提示