Trie Trees and a Java Implementation

The name comes from the English word "retrieval". A trie is a tree of characters, and its core idea is to trade space for time.

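Here is a simple example.
You are given 100000 words, each at most 10 characters long. For each word, we need to determine whether it has appeared before and, if so, at which position it first appeared.
This problem can of course be solved with hashing, but what I want to introduce is the trie, which is more useful in some respects. For example, given a word, I may want to ask whether one of its prefixes has appeared. That is awkward with a hash table, but still simple with a trie.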
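
For comparison, the hashing approach mentioned above can be sketched roughly as follows. This is only an illustration: the class name FirstOccurrence and the method firstPositions are hypothetical, and the convention of returning 0 for a word that has not appeared before is an assumption, not something the problem statement fixes.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper; the names here are illustrative only.
class FirstOccurrence {
    // For each word, returns the 1-based position of its first occurrence
    // if it was seen earlier in the list, or 0 if this is its first appearance.
    static int[] firstPositions(String[] words) {
        Map<String, Integer> firstSeen = new HashMap<>();
        int[] result = new int[words.length];
        for (int i = 0; i < words.length; i++) {
            // putIfAbsent returns the existing value, or null if the key was new.
            Integer earlier = firstSeen.putIfAbsent(words[i], i + 1);
            result[i] = (earlier == null) ? 0 : earlier;
        }
        return result;
    }

    public static void main(String[] args) {
        int[] r = firstPositions(new String[] {"abc", "b", "abc", "abd", "b"});
        System.out.println(Arrays.toString(r)); // prints [0, 0, 1, 0, 2]
    }
}
```

A hash map answers exact-match questions well, but it has no notion of "all keys that start with ab", which is where the trie becomes attractive.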

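Back to the example. With the most naive method, for each word we search all the words before it to see whether it is among them. That algorithm is O(n^2), which is clearly unacceptable for n = 100000. Now let us think about it differently. Suppose the word being queried is abcd. Among the words before it, those starting with b, c, d, f and so on clearly need not be considered; we only need to check whether abcd exists among the words starting with a. Likewise, among the words starting with a, we only need to consider those whose second letter is b, and so on. A tree model gradually takes shape.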

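Suppose we have the 7 words b, abc, abd, bcd, abcd, efg and hii. The tree we build looks like this.

[Figure: the trie built from these seven words, with the nodes where a word ends marked red]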

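For every node, the path from the root to that node spells a word. If the node is marked red, that word exists; otherwise it does not.
So, for a given word, I only need to follow it from the root to the corresponding node and check whether that node is marked red to know whether the word has appeared. Marking the node red amounts to inserting the word.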
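
The walk-and-mark procedure described above can be sketched as follows. This is a minimal illustration, not the implementation given later in the article: the class MarkedTrie and its methods are hypothetical names, the "red mark" is represented by a non-zero firstPos field (an assumption that also lets us answer the first-occurrence question from the example), and the input is assumed to contain only lowercase letters a to z.

```java
// Hypothetical minimal trie; class and method names are illustrative only.
// Assumes every word consists solely of lowercase letters 'a'..'z'.
class MarkedTrie {
    private static final class Node {
        Node[] next = new Node[26]; // one slot per lowercase letter
        int firstPos = 0;           // 0 = not marked; otherwise 1-based first position
    }

    private final Node root = new Node();
    private int inserted = 0; // number of insert calls so far

    // Walks from the root, creating nodes as needed, then marks the end node.
    // Returns 0 if the word is new, otherwise the position where it first appeared.
    int insert(String word) {
        inserted++;
        Node cur = root;
        for (int i = 0; i < word.length(); i++) {
            int idx = word.charAt(i) - 'a';
            if (cur.next[idx] == null) {
                cur.next[idx] = new Node();
            }
            cur = cur.next[idx];
        }
        if (cur.firstPos != 0) {
            return cur.firstPos;
        }
        cur.firstPos = inserted; // "mark the node red"
        return 0;
    }

    // Follows the path for s; returns null if the path does not exist.
    private Node walk(String s) {
        Node cur = root;
        for (int i = 0; i < s.length() && cur != null; i++) {
            cur = cur.next[s.charAt(i) - 'a'];
        }
        return cur;
    }

    // True only if the path exists and its last node is marked.
    boolean contains(String word) {
        Node n = walk(word);
        return n != null && n.firstPos != 0;
    }

    // True if some inserted word starts with the given prefix.
    boolean hasPrefix(String prefix) {
        return walk(prefix) != null;
    }

    public static void main(String[] args) {
        MarkedTrie trie = new MarkedTrie();
        for (String w : new String[] {"b", "abc", "abd", "bcd", "abcd", "efg", "hii"}) {
            trie.insert(w);
        }
        System.out.println(trie.contains("ab"));   // false: path exists but node is unmarked
        System.out.println(trie.hasPrefix("ab"));  // true
        System.out.println(trie.insert("abc"));    // 2: abc first appeared at position 2
    }
}
```

Each operation touches at most one node per character, so the cost depends on the length of the word rather than on how many words have been inserted.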

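As we can see, level i of a trie can hold as many as 26^i nodes in the worst case. So, to save space, we allocate nodes dynamically as a linked structure, or use arrays to simulate dynamic allocation. The space cost then does not exceed (number of words) × (word length) nodes. (Reposted from an expert.)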
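
One way to "simulate dynamic allocation with arrays" is to keep all nodes in a preallocated table and hand out node ids sequentially. The sketch below is an assumption about what that could look like, not code from the original author: ArrayTrie, the maxChars constructor parameter and nodeCount are hypothetical names, and only lowercase letters a to z are supported.

```java
// Hypothetical array-backed trie; names and the capacity parameter are illustrative.
// Assumes every word consists solely of lowercase letters 'a'..'z'.
class ArrayTrie {
    private final int[][] next;  // next[node][letter] = child node id, 0 = no child
    private final boolean[] end; // end[node] = true if a word ends at this node
    private int size = 1;        // node 0 is the root; new ids are handed out in order

    // maxChars: upper bound on the total number of characters over all inserted words.
    // Each character creates at most one node, so maxChars + 1 slots always suffice.
    ArrayTrie(int maxChars) {
        next = new int[maxChars + 1][26];
        end = new boolean[maxChars + 1];
    }

    void insert(String word) {
        int cur = 0;
        for (int i = 0; i < word.length(); i++) {
            int idx = word.charAt(i) - 'a';
            if (next[cur][idx] == 0) {
                next[cur][idx] = size++; // "allocate" the next free slot
            }
            cur = next[cur][idx];
        }
        end[cur] = true;
    }

    boolean contains(String word) {
        int cur = 0;
        for (int i = 0; i < word.length(); i++) {
            cur = next[cur][word.charAt(i) - 'a'];
            if (cur == 0) {
                return false; // the root is never a child, so 0 means "missing"
            }
        }
        return end[cur];
    }

    // Number of nodes in use, including the root.
    int nodeCount() {
        return size;
    }

    public static void main(String[] args) {
        String[] words = {"b", "abc", "abd", "bcd", "abcd", "efg", "hii"};
        ArrayTrie trie = new ArrayTrie(20); // the seven words contain 20 characters in total
        for (String w : words) {
            trie.insert(w);
        }
        System.out.println(trie.nodeCount());     // 15, within the bound of 21
        System.out.println(trie.contains("abd")); // true
    }
}
```

Shared prefixes are stored once, which is why the seven sample words need fewer nodes than their total character count. The table is sized for the worst case up front, so this variant trades some memory for avoiding per-node object allocation.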

The Java code for the trie is as follows:

import java.util.ArrayList;
import java.util.List;

/**
 * A word trie which can only deal with the 26 letters of the alphabet.
 * @author Leeclipse
 * @since 2007-11-21
 */
public class Trie {

    private Vertex root; // a trie has a single root node

    // Inner class
    protected class Vertex { // node class
        protected int words;      // number of words that end at this node
        protected int prefixes;   // number of words that pass through this node and continue
        protected Vertex[] edges; // each node has 26 child slots (of its own type)

        Vertex() {
            words = 0;
            prefixes = 0;
            edges = new Vertex[26]; // array elements default to null
        }
    }

    public Trie() {
        root = new Vertex();
    }

    /**
     * List all words in the Trie.
     *
     * @return the distinct words stored in the trie, in alphabetical order
     */
    public List<String> listAllWords() {
        List<String> words = new ArrayList<String>();
        Vertex[] edges = root.edges;
        for (int i = 0; i < edges.length; i++) {
            if (edges[i] != null) {
                String word = "" + (char) ('a' + i);
                depthFirstSearchWords(words, edges[i], word);
            }
        }
        return words;
    }

    /**
     * Depth First Search words in the Trie and add them to the List.
     *
     * @param words the list that collects the words found
     * @param vertex the vertex to search from
     * @param wordSegment the characters on the path from the root to this vertex
     */
    private void depthFirstSearchWords(List<String> words, Vertex vertex, String wordSegment) {
        // Record the word if one ends here. Checking only for leaf nodes would miss
        // words that are prefixes of longer words (for example "b" alongside "ban").
        if (vertex.words > 0) {
            words.add(wordSegment);
        }
        Vertex[] edges = vertex.edges;
        for (int i = 0; i < edges.length; i++) {
            if (edges[i] != null) {
                String newWord = wordSegment + (char) ('a' + i);
                depthFirstSearchWords(words, edges[i], newWord);
            }
        }
    }

    public int countPrefixes(String prefix) {
        return countPrefixes(root, prefix);
    }

    private int countPrefixes(Vertex vertex, String prefixSegment) {
        if (prefixSegment.length() == 0) { // all characters of the prefix have been consumed
            return vertex.prefixes;
        }
        // Lowercase the character, as addWord does
        int index = Character.toLowerCase(prefixSegment.charAt(0)) - 'a';
        if (vertex.edges[index] == null) { // the prefix does NOT exist
            return 0;
        } else {
            return countPrefixes(vertex.edges[index], prefixSegment.substring(1));
        }
    }

    public int countWords(String word) {
        return countWords(root, word);
    }

    private int countWords(Vertex vertex, String wordSegment) {
        if (wordSegment.length() == 0) { // all characters of the word have been consumed
            return vertex.words;
        }
        // Lowercase the character, as addWord does
        int index = Character.toLowerCase(wordSegment.charAt(0)) - 'a';
        if (vertex.edges[index] == null) { // the word does NOT exist
            return 0;
        } else {
            return countWords(vertex.edges[index], wordSegment.substring(1));
        }
    }

    /**
     * Add a word to the Trie.
     *
     * @param word The word to be added.
     */
    public void addWord(String word) {
        addWord(root, word);
    }

    /**
     * Add the word from the specified vertex.
     * @param vertex The specified vertex.
     * @param word The word to be added.
     */
    private void addWord(Vertex vertex, String word) {
        if (word.length() == 0) { // all characters of the word have been added
            vertex.words++;
        } else {
            vertex.prefixes++;
            char c = Character.toLowerCase(word.charAt(0));
            int index = c - 'a';
            if (vertex.edges[index] == null) { // the edge does NOT exist yet
                vertex.edges[index] = new Vertex();
            }
            addWord(vertex.edges[index], word.substring(1)); // go to the next character
        }
    }

    public static void main(String[] args) { // just used for testing
        Trie trie = new Trie();
        trie.addWord("China");
        trie.addWord("China");
        trie.addWord("China");
        trie.addWord("crawl");
        trie.addWord("crime");
        trie.addWord("ban");
        trie.addWord("China");
        trie.addWord("english");
        trie.addWord("establish");
        trie.addWord("eat");

        System.out.println(trie.root.prefixes);
        System.out.println(trie.root.words);

        List<String> list = trie.listAllWords();
        for (String s : list) {
            System.out.println(s);
        }

        int count = trie.countPrefixes("ch");
        int count1 = trie.countWords("china");
        System.out.println("the count of c prefixes:" + count);
        System.out.println("the count of china countWords:" + count1);
    }
}

Run:

C:\test>java Trie
10
0
ban
china
crawl
crime
eat
english
establish
the count of c prefixes:4
the count of china countWords:4
