Lucene Full-Text Search: Add, Delete, Update, and Query Examples

　　Creating an Index

The previous post already covered the overall flow of creating an index with Lucene, so here is only a brief recap:

Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_CURRENT);

Directory directory = FSDirectory.open(new File("/tmp/testindex"));

IndexWriterConfig config = new IndexWriterConfig(Version.LUCENE_CURRENT, analyzer);

IndexWriter iwriter = new IndexWriter(directory, config);

Document doc = new Document();

String text = "This is the text to be indexed.";

doc.add(new Field("fieldname", text, TextField.TYPE_STORED));

iwriter.addDocument(doc);

iwriter.close();

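　　1 Create a Directory that points at the index location

　　2 Create an analyzer, then create the IndexWriter

　　3 Create Document objects to hold the data and add them to the writer

　　4 Close the IndexWriter, which commits the changes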

/**

     * Build the index

     *

     * @throws Exception

     */

    public static void index() throws Exception {

        String text1 = "hello,man!";

        String text2 = "goodbye,man!";

        String text3 = "hello,woman!";

        String text4 = "goodbye,woman!";

        Date date1 = new Date();

        analyzer = new StandardAnalyzer(Version.LUCENE_CURRENT);

        directory = FSDirectory.open(new File(INDEX_DIR));

        IndexWriterConfig config = new IndexWriterConfig(

                Version.LUCENE_CURRENT, analyzer);

        indexWriter = new IndexWriter(directory, config);

        Document doc1 = new Document();

        doc1.add(new TextField("filename", "text1", Store.YES));

        doc1.add(new TextField("content", text1, Store.YES));

        indexWriter.addDocument(doc1);

        Document doc2 = new Document();

        doc2.add(new TextField("filename", "text2", Store.YES));

        doc2.add(new TextField("content", text2, Store.YES));

        indexWriter.addDocument(doc2);

        Document doc3 = new Document();

        doc3.add(new TextField("filename", "text3", Store.YES));

        doc3.add(new TextField("content", text3, Store.YES));

        indexWriter.addDocument(doc3);

        Document doc4 = new Document();

        doc4.add(new TextField("filename", "text4", Store.YES));

        doc4.add(new TextField("content", text4, Store.YES));

        indexWriter.addDocument(doc4);

        indexWriter.commit();

        indexWriter.close();

        Date date2 = new Date();

        System.out.println("Index creation took: " + (date2.getTime() - date1.getTime()) + "ms\n");

    }

　　Adding to an Index Incrementally

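Lucene supports incremental indexing: new documents can be added without affecting what is already indexed, and Lucene merges the index files automatically at an appropriate time.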

/**

     * Add a document to the existing index

     *

     * @throws Exception

     */

    public static void insert() throws Exception {

        String text5 = "hello,goodbye,man,woman";

        Date date1 = new Date();

        analyzer = new StandardAnalyzer(Version.LUCENE_CURRENT);

        directory = FSDirectory.open(new File(INDEX_DIR));

        IndexWriterConfig config = new IndexWriterConfig(

                Version.LUCENE_CURRENT, analyzer);

        indexWriter = new IndexWriter(directory, config);

        Document doc1 = new Document();

        doc1.add(new TextField("filename", "text5", Store.YES));

        doc1.add(new TextField("content", text5, Store.YES));

        indexWriter.addDocument(doc1);

        indexWriter.commit();

        indexWriter.close();

        Date date2 = new Date();

        System.out.println("Index insert took: " + (date2.getTime() - date1.getTime()) + "ms\n");

    }

　　Deleting from an Index

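Deletion also goes through IndexWriter, using its deleteDocuments method. Passing a Term deletes every document that contains that term in the given field. If you only want to delete a single document, it is best to define a unique ID field and delete by that field.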

/**

     * Delete from the index

     *

     * @param str the term to delete by, matched against the filename field

     * @throws Exception

     */

    public static void delete(String str) throws Exception {

        Date date1 = new Date();

        analyzer = new StandardAnalyzer(Version.LUCENE_CURRENT);

        directory = FSDirectory.open(new File(INDEX_DIR));

        IndexWriterConfig config = new IndexWriterConfig(

                Version.LUCENE_CURRENT, analyzer);

        indexWriter = new IndexWriter(directory, config);

        indexWriter.deleteDocuments(new Term("filename",str));  

        indexWriter.close();

        Date date2 = new Date();

        System.out.println("Index delete took: " + (date2.getTime() - date1.getTime()) + "ms\n");

    }
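To make the advice about a unique ID field concrete, here is a minimal sketch in plain Java. It is not Lucene code: `TermDeleteSketch`, `tokenize`, and `deleteByTerm` are hypothetical names, and the tokenizer is only a rough stand-in that assumes the analyzer lowercases text and splits it on punctuation. It shows that deleting by a common term removes every matching document, while deleting by a value that only one document carries removes just that one.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Toy in-memory index for illustration only; NOT how Lucene stores data. */
final class TermDeleteSketch {
    // Each document is a map of field name to field value.
    private final List<Map<String, String>> docs = new ArrayList<>();

    /** Rough stand-in for an analyzer (assumption): lowercase, split on non-alphanumerics. */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    void add(String filename, String content) {
        Map<String, String> fields = new HashMap<>();
        fields.put("filename", filename);
        fields.put("content", content);
        docs.add(fields);
    }

    /** Removes every document whose field contains the term; returns how many were removed. */
    int deleteByTerm(String field, String term) {
        int before = docs.size();
        docs.removeIf(fields -> {
            String value = fields.get(field);
            return value != null && tokenize(value).contains(term);
        });
        return before - docs.size();
    }

    int size() {
        return docs.size();
    }

    public static void main(String[] args) {
        TermDeleteSketch index = new TermDeleteSketch();
        index.add("text1", "hello,man!");
        index.add("text2", "goodbye,man!");
        index.add("text3", "hello,woman!");
        index.add("text4", "goodbye,woman!");
        // A common content term matches several documents.
        System.out.println("deleted by content term: " + index.deleteByTerm("content", "man"));
        // A value unique to one document matches exactly that document.
        System.out.println("deleted by filename: " + index.deleteByTerm("filename", "text3"));
        System.out.println("remaining: " + index.size());
    }
}
```

In the article's own code the filename field plays the role of the unique ID, which works because each document is given a distinct filename.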

　　Updating an Index

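Lucene has no true in-place update. updateDocument takes a Term that identifies the documents to replace, but under the hood it deletes the matching documents and then adds the new document.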

/**

     * Update the index

     *

     * @throws Exception

     */

    public static void update() throws Exception {

        String text1 = "update,hello,man!";

        Date date1 = new Date();

        analyzer = new StandardAnalyzer(Version.LUCENE_CURRENT);

        directory = FSDirectory.open(new File(INDEX_DIR));

        IndexWriterConfig config = new IndexWriterConfig(

                Version.LUCENE_CURRENT, analyzer);

        indexWriter = new IndexWriter(directory, config);

        Document doc1 = new Document();

        doc1.add(new TextField("filename", "text1", Store.YES));

        doc1.add(new TextField("content", text1, Store.YES));

        indexWriter.updateDocument(new Term("filename","text1"), doc1);

        indexWriter.close();

        Date date2 = new Date();

        System.out.println("Index update took: " + (date2.getTime() - date1.getTime()) + "ms\n");

    }
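The delete-then-add behaviour described for updates can be sketched in plain Java. This is a toy model, not Lucene's data structures: `UpdateSketch`, `Entry`, and the integer doc ids are hypothetical, chosen only to show two consequences. The replacement is appended as a new entry rather than written over the old one, and every live entry matching the key is removed, not just the first.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy model of delete-then-add updates; NOT Lucene's real implementation. */
final class UpdateSketch {
    static final class Entry {
        final int docId;
        final String filename;
        final String content;
        boolean deleted;

        Entry(int docId, String filename, String content) {
            this.docId = docId;
            this.filename = filename;
            this.content = content;
        }
    }

    private final List<Entry> entries = new ArrayList<>();

    /** Appends a document and returns its (hypothetical) internal doc id. */
    int add(String filename, String content) {
        int docId = entries.size();
        entries.add(new Entry(docId, filename, content));
        return docId;
    }

    /** Marks every live entry with this filename as deleted, then appends the replacement. */
    int update(String filename, String content) {
        for (Entry e : entries) {
            if (!e.deleted && e.filename.equals(filename)) {
                e.deleted = true;
            }
        }
        return add(filename, content);
    }

    /** Counts the entries that are not marked deleted. */
    int liveCount() {
        int n = 0;
        for (Entry e : entries) {
            if (!e.deleted) {
                n++;
            }
        }
        return n;
    }

    public static void main(String[] args) {
        UpdateSketch index = new UpdateSketch();
        index.add("text1", "hello,man!");
        index.add("text2", "goodbye,man!");
        int newId = index.update("text1", "update,hello,man!");
        System.out.println("new doc id: " + newId + ", live docs: " + index.liveCount());
    }
}
```

One practical consequence: if the term passed to an update matches several documents, all of them are replaced by the single new document, which is another reason to key updates on a unique field.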

　　Searching the Index by Keyword

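Lucene offers many kinds of queries, which are not covered in detail here. A search returns a collection of ScoreDoc objects, somewhat like a ResultSet; for each hit we load the matching Document and read the stored values we want by field name.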

/**

     * Keyword search

     *

     * @param str the query string, parsed against the content field

     * @throws Exception

     */

    public static void search(String str) throws Exception {

        directory = FSDirectory.open(new File(INDEX_DIR));

        analyzer = new StandardAnalyzer(Version.LUCENE_CURRENT);

        DirectoryReader ireader = DirectoryReader.open(directory);

        IndexSearcher isearcher = new IndexSearcher(ireader);

        QueryParser parser = new QueryParser(Version.LUCENE_CURRENT, "content",analyzer);

        Query query = parser.parse(str);

        ScoreDoc[] hits = isearcher.search(query, null, 1000).scoreDocs;

        for (int i = 0; i < hits.length; i++) {

            Document hitDoc = isearcher.doc(hits[i].doc);

            System.out.println(hitDoc.get("filename"));

            System.out.println(hitDoc.get("content"));

        }

        ireader.close();

        directory.close();

    }
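The two-step access pattern in the search method (get hits that carry a document number and a score, then load the stored document and read fields by name) can be sketched without Lucene. Everything below is hypothetical: `SearchSketch` and `Hit` are illustration names, and the score is simply the number of times the term occurs, which is an assumption made for the sketch and not Lucene's scoring.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Toy search over stored fields; NOT Lucene's query or scoring implementation. */
final class SearchSketch {
    /** Mirrors the shape of a hit: an internal document number plus a score. */
    static final class Hit {
        final int doc;
        final float score;

        Hit(int doc, float score) {
            this.doc = doc;
            this.score = score;
        }
    }

    private final List<Map<String, String>> stored = new ArrayList<>();

    /** Stores a document and returns its position, used here as the doc number. */
    int add(String filename, String content) {
        Map<String, String> fields = new HashMap<>();
        fields.put("filename", filename);
        fields.put("content", content);
        stored.add(fields);
        return stored.size() - 1;
    }

    /** Returns up to n hits for one term, highest score first (score = occurrence count). */
    List<Hit> search(String term, int n) {
        List<Hit> hits = new ArrayList<>();
        for (int doc = 0; doc < stored.size(); doc++) {
            String content = stored.get(doc).get("content").toLowerCase(Locale.ROOT);
            int count = 0;
            for (String token : content.split("[^a-z0-9]+")) {
                if (token.equals(term)) {
                    count++;
                }
            }
            if (count > 0) {
                hits.add(new Hit(doc, count));
            }
        }
        // Stable sort: ties keep insertion order.
        hits.sort(Comparator.comparingDouble((Hit h) -> -h.score));
        return hits.subList(0, Math.min(n, hits.size()));
    }

    /** Loads the stored fields for a doc number, like fetching a document for a hit. */
    Map<String, String> doc(int docNumber) {
        return stored.get(docNumber);
    }

    public static void main(String[] args) {
        SearchSketch index = new SearchSketch();
        index.add("text1", "hello,man!");
        index.add("text2", "goodbye,man!");
        index.add("text3", "hello,woman!");
        for (Hit hit : index.search("man", 1000)) {
            Map<String, String> hitDoc = index.doc(hit.doc);
            System.out.println(hitDoc.get("filename") + " / " + hitDoc.get("content"));
        }
    }
}
```

Because matching is done on whole tokens, a search for "man" in this sketch does not match "woman", which is also what one would expect from the article's example data.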

　　Full Code

package test;

import java.io.File;

import java.util.Date;

import java.util.List;

import org.apache.lucene.analysis.Analyzer;

import org.apache.lucene.analysis.standard.StandardAnalyzer;

import org.apache.lucene.document.Document;

import org.apache.lucene.document.LongField;

import org.apache.lucene.document.TextField;

import org.apache.lucene.document.Field.Store;

import org.apache.lucene.index.DirectoryReader;

import org.apache.lucene.index.IndexWriter;

import org.apache.lucene.index.IndexWriterConfig;

import org.apache.lucene.index.Term;

import org.apache.lucene.queryparser.classic.QueryParser;

import org.apache.lucene.search.IndexSearcher;

import org.apache.lucene.search.Query;

import org.apache.lucene.search.ScoreDoc;

import org.apache.lucene.store.Directory;

import org.apache.lucene.store.FSDirectory;

import org.apache.lucene.util.Version;

public class TestLucene {

    // Directory where the index is stored

    private static String INDEX_DIR = "D:\\luceneIndex";

    private static Analyzer analyzer = null;

    private static Directory directory = null;

    private static IndexWriter indexWriter = null;

    public static void main(String[] args) {

        try {

//            index();

            search("man");

//            insert();

//            delete("text5");

//            update();

        } catch (Exception e) {

            e.printStackTrace();

        }

    }

    /**

     * Update the index

     *

     * @throws Exception

     */

    public static void update() throws Exception {

        String text1 = "update,hello,man!";

        Date date1 = new Date();

        analyzer = new StandardAnalyzer(Version.LUCENE_CURRENT);

        directory = FSDirectory.open(new File(INDEX_DIR));

        IndexWriterConfig config = new IndexWriterConfig(

                Version.LUCENE_CURRENT, analyzer);

        indexWriter = new IndexWriter(directory, config);

        Document doc1 = new Document();

        doc1.add(new TextField("filename", "text1", Store.YES));

        doc1.add(new TextField("content", text1, Store.YES));

        indexWriter.updateDocument(new Term("filename","text1"), doc1);

        indexWriter.close();

        Date date2 = new Date();

        System.out.println("Index update took: " + (date2.getTime() - date1.getTime()) + "ms\n");

    }

    /**

     * Delete from the index

     *

     * @param str the term to delete by, matched against the filename field

     * @throws Exception

     */

    public static void delete(String str) throws Exception {

        Date date1 = new Date();

        analyzer = new StandardAnalyzer(Version.LUCENE_CURRENT);

        directory = FSDirectory.open(new File(INDEX_DIR));

        IndexWriterConfig config = new IndexWriterConfig(

                Version.LUCENE_CURRENT, analyzer);

        indexWriter = new IndexWriter(directory, config);

        indexWriter.deleteDocuments(new Term("filename",str));  

        indexWriter.close();

        Date date2 = new Date();

        System.out.println("Index delete took: " + (date2.getTime() - date1.getTime()) + "ms\n");

    }

    /**

     * Add a document to the existing index

     *

     * @throws Exception

     */

    public static void insert() throws Exception {

        String text5 = "hello,goodbye,man,woman";

        Date date1 = new Date();

        analyzer = new StandardAnalyzer(Version.LUCENE_CURRENT);

        directory = FSDirectory.open(new File(INDEX_DIR));

        IndexWriterConfig config = new IndexWriterConfig(

                Version.LUCENE_CURRENT, analyzer);

        indexWriter = new IndexWriter(directory, config);

        Document doc1 = new Document();

        doc1.add(new TextField("filename", "text5", Store.YES));

        doc1.add(new TextField("content", text5, Store.YES));

        indexWriter.addDocument(doc1);

        indexWriter.commit();

        indexWriter.close();

        Date date2 = new Date();

        System.out.println("Index insert took: " + (date2.getTime() - date1.getTime()) + "ms\n");

    }

    /**

     * Build the index

     *

     * @throws Exception

     */

    public static void index() throws Exception {

        String text1 = "hello,man!";

        String text2 = "goodbye,man!";

        String text3 = "hello,woman!";

        String text4 = "goodbye,woman!";

        Date date1 = new Date();

        analyzer = new StandardAnalyzer(Version.LUCENE_CURRENT);

        directory = FSDirectory.open(new File(INDEX_DIR));

        IndexWriterConfig config = new IndexWriterConfig(

                Version.LUCENE_CURRENT, analyzer);

        indexWriter = new IndexWriter(directory, config);

        Document doc1 = new Document();

        doc1.add(new TextField("filename", "text1", Store.YES));

        doc1.add(new TextField("content", text1, Store.YES));

        indexWriter.addDocument(doc1);

        Document doc2 = new Document();

        doc2.add(new TextField("filename", "text2", Store.YES));

        doc2.add(new TextField("content", text2, Store.YES));

        indexWriter.addDocument(doc2);

        Document doc3 = new Document();

        doc3.add(new TextField("filename", "text3", Store.YES));

        doc3.add(new TextField("content", text3, Store.YES));

        indexWriter.addDocument(doc3);

        Document doc4 = new Document();

        doc4.add(new TextField("filename", "text4", Store.YES));

        doc4.add(new TextField("content", text4, Store.YES));

        indexWriter.addDocument(doc4);

        indexWriter.commit();

        indexWriter.close();

        Date date2 = new Date();

        System.out.println("Index creation took: " + (date2.getTime() - date1.getTime()) + "ms\n");

    }

    /**

     * Keyword search

     *

     * @param str the query string, parsed against the content field

     * @throws Exception

     */

    public static void search(String str) throws Exception {

        directory = FSDirectory.open(new File(INDEX_DIR));

        analyzer = new StandardAnalyzer(Version.LUCENE_CURRENT);

        DirectoryReader ireader = DirectoryReader.open(directory);

        IndexSearcher isearcher = new IndexSearcher(ireader);

        QueryParser parser = new QueryParser(Version.LUCENE_CURRENT, "content",analyzer);

        Query query = parser.parse(str);

        ScoreDoc[] hits = isearcher.search(query, null, 1000).scoreDocs;

        for (int i = 0; i < hits.length; i++) {

            Document hitDoc = isearcher.doc(hits[i].doc);

            System.out.println(hitDoc.get("filename"));

            System.out.println(hitDoc.get("content"));

        }

        ireader.close();

        directory.close();

    }

}
