Trie 树(转)

看了很多 Trie 树的介绍，这篇讲的最好,简单易懂(特别是代码部分)，直接转载：http://www.cnblogs.com/dolphin0520/archive/2011/10/11/2207886.html

Trie树也称字典树，因为其效率很高，所以在在字符串查找、前缀匹配等中应用很广泛，其高效率是以空间为代价的。

一.Trie树的原理

利用串构建一个字典树，这个字典树保存了串的公共前缀信息，因此可以降低查询操作的复杂度。

下面以英文单词构建的字典树为例，这棵Trie树中每个结点包括26个孩子结点，因为总共有26个英文字母(假设单词都是小写字母组成)。

则可声明包含Trie树的结点信息的结构体:

 #define MAX 26

 typedef struct TrieNode               //Trie结点声明

 {

     bool isStr;                      //标记该结点处是否构成单词

     struct TrieNode *next[MAX];      //儿子分支

 }Trie;

　其中next是一个指针数组，存放着指向各个孩子结点的指针。

如给出字符串"abc","ab","bd","dda"，根据该字符串序列构建一棵Trie树。则构建的树如下:

Trie树的根结点不包含任何信息，第一个字符串为"abc"，第一个字母为'a'，因此根结点中数组next下标为'a'-97的值不为NULL，其他同理，构建的Trie树如图所示，红色结点表示在该处可以构成一个单词。很显然，如果要查找单词"abc"是否存在，查找长度则为O(len)，len为要查找的字符串的长度。而若采用一般的逐个匹配查找，则查找长度为O(len*n)，n为字符串的个数。显然基于Trie树的查找效率要高很多。

但是却是以空间为代价的，比如图中每个结点所占的空间都为(26*4+1)Byte=105Byte，那么这棵Trie树所占的空间则为105*8Byte=840Byte，而普通的逐个查找所占空间只需(3+2+2+3)Byte=10Byte。

二.Trie树的操作

在Trie树中主要有3个操作，插入、查找和删除。一般情况下Trie树中很少存在删除单独某个结点的情况，因此只考虑删除整棵树。

1.插入

假设存在字符串str，Trie树的根结点为root。i=0，p=root。

1)取str[i]，判断p->next[str[i]-97]是否为空，若为空，则建立结点temp，并将p->next[str[i]-97]指向temp，然后p指向temp；

若不为空，则p=p->next[str[i]-97]；

2)i++，继续取str[i]，循环1)中的操作，直到遇到结束符'\0'，此时将当前结点p中的isStr置为true。

2.查找

假设要查找的字符串为str，Trie树的根结点为root，i=0，p=root

1)取str[i]，判断判断p->next[str[i]-97]是否为空，若为空，则返回false；若不为空，则p=p->next[str[i]-97]，继续取字符。

2)重复1)中的操作直到遇到结束符'\0',若当前结点p不为空并且isStr为true，则返回true，否则返回false。

3.删除

删除可以以递归的形式进行删除。

测试程序:

 #include <iostream>

 #include <cstdlib>

 #include <stdio.h>

 #define MAX 26

 using namespace std;

 typedef struct TrieNode

 {

     bool isWord;

     struct TrieNode *next[MAX];

 }Trie;

 void insert(Trie *root, const char *s)

 {

     if(root == NULL || (*s) == '\0')

         return;

     Trie *p = root;

     int i;

     while((*s) != '\0')

     {

         if(p->next[(*s) - 'a'] == NULL)

         {

             Trie *temp = (Trie *)malloc(sizeof(Trie));

             for(i = ; i < MAX; ++i)

             {

                 temp->next[i] = NULL;

             }

             temp->isWord = false;

             p->next[(*s) - 'a'] = temp;

             p = p->next[(*s) - 'a'];

         }

         else

         {

             p = p->next[(*s) - 'a'];

         }

         s++;

     }

     p->isWord = true;

 }

 void del(Trie *root)

 {

     int i;

     for(i = ; i < MAX; ++i)

     {

         if(root->next[i] != NULL)

         {

             del(root->next[i]);

         }

     }

     free(root);

 }

 int search(Trie *root, const char *s)

 {

     Trie *p = root;

     while(p != NULL&&*s != '\0')

     {

         p = p->next[(*s) - 'a'];

         s++;

     }

     return (p != NULL && (p->isWord == true));

 }

 int main()

 {

     int i;

     int n,m;

     char s[];

     Trie *root = (Trie *)malloc(sizeof(Trie));

     for(i = ; i < MAX; ++i)

     {

         root->next[i] = NULL;

     }

     root->isWord = false;

     scanf("%d",&n);

     getchar();

     for(i = ; i < n; ++i)

     {

         scanf("%s",s);

         insert(root, s);

     }

     while(scanf("%d",&m)!=EOF)

     {

         for(i = ; i < m; ++i)

         {

             scanf("%s",s);

             if(search(root, s))

             {

                 printf("Yes, find it!\n");

             }

             else

             {

                 printf("No, loss it!\n");

             }

         }

     }

     del(root);

     return ;

 }

Trie 树(转)的更多相关文章

基于trie树做一个ac自动机
基于trie树做一个ac自动机 #!/usr/bin/python # -*- coding: utf-8 -*- class Node: def __init__(self): self.value ...
基于trie树的具有联想功能的文本编辑器
之前的软件设计与开发实践课程中,自己构思的大作业题目.做的具有核心功能,但是还欠缺边边角角的小功能和持久化数据结构,先放出来,有机会一点点改.github:https://github.com/chu ...
hihocoder-1014 Trie树
hihocoder 1014 : Trie树 link: https://hihocoder.com/problemset/problem/1014 题意: 实现Trie树,实现对单词的快速统计. # ...
洛谷P2412 查单词 [trie树 RMQ]
题目背景滚粗了的HansBug在收拾旧英语书,然而他发现了什么奇妙的东西. 题目描述 udp2.T3如果遇到相同的字符串,输出后面的蒟蒻HansBug在一本英语书里面找到了一个单词表,包含N个单词 ...
通过trie树实现单词自动补全
/** * 实现单词补全功能 */ #include <stdio.h> #include <stdlib.h> #include <string.h> #incl ...
#1014 Trie树
本题主要是求构造一棵Trie树,即词典树用于统计单词. C#代码如下: using System; using System.Collections.Generic; using System.Lin ...
Trie树-字典查找
描述小Hi和小Ho是一对好朋友,出生在信息化社会的他们对编程产生了莫大的兴趣,他们约定好互相帮助,在编程的学习道路上一同前进. 这一天,他们遇到了一本词典,于是小Hi就向小Ho提出了那个经典的问题: ...
Trie树的创建、插入、查询的实现
原文:http://blog.chinaunix.net/xmlrpc.php?r=blog/article&uid=28977986&id=3807947 1.什么是Trie树 Tr ...
Trie树（c++实现）
转:http://www.cnblogs.com/kaituorensheng/p/3602155.html http://blog.csdn.net/insistgogo/article/detai ...
[转]双数组TRIE树原理
原文名称: An Efficient Digital Search Algorithm by Using a Double-Array Structure 作者: JUN-ICHI AOE 译文: 使 ...

随机推荐

poj1308 Is It A Tree?（并查集）详解
poj1308 http://poj.org/problem?id=1308 题目大意:输入若干组测试数据,输入 (-1 -1) 时输入结束.每组测试数据以输入(0 0)为结束标志.然后根据所给的 ...
StringToInt
public class Stringtoint { public static void stringtoint(String s){ //判断字符串是否为null和是否为空 ...
Spring配置，JDBC数据源及事务
<?xml version="1.0" encoding="UTF-8"?> <beans xmlns="http://www.sp ...
15_动态SQL
[UserMapper.xml]和之前的作对比 <mapper namespace="com.Higgin.Mybatis.mapper.UserMapper"> &l ...
PAT_1026 程序运行时间
问题描述: 要获得一个C语言程序的运行时间,常用的方法是调用头文件time.h,其中提供了clock()函数,可以捕捉从程序开始运行到clock()被调用时所耗费的时间.这个时间单位是clock ti ...
IOS 学习笔记 2015-03-24 OC-API-不可变字符串
大部分是模仿// // main.m // OC-API-不可变字符串 // // Created by wangtouwang on 15/3/25. // Copyright (c) 2015年 ...
Web用户自定义控件
在新建项的时候,选择Web用户控件,可用来自定义自己的控件,做好后,直接拖到页面即可使用自定义控件与WEB交互,需要在自定义控件里面写属性,如: public string CityID { g ...
ceph入门学习链接
https://tobegit3hub1.gitbooks.io/ceph_from_scratch/content/introduction/component.html
jQuery实现购物车多物品数量的加减+总价+删除计算
<?php session_start(); ?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//E ...
php发送http请求
http请求有get,post. php发送http请求有三种方式[我所知道的有三种,有其他的告诉我]. file_get_contents();详情见:http://www.cnblogs.com/ ...

Trie 树(转)

Trie 树(转)的更多相关文章

随机推荐

热门专题