将数据库的数据导入solr索引库中

在solr与tomcat整合文章中，我用的索引库是mycore，现在就以这个为例。

首先要准备jar包：solr-dataimporthandler-4.8.1.jar、solr-dataimporthandler-extras-4.8.1.jar和mysql-connector-java-5.0.7-bin.jar这三个包到solr的tomcat的webapps\solr\WEB-INF\lib下

在这个文件夹的conf下配置两个文件，添加一个文件。先配置solrconfig.xml。

在该文件下添加一个新节点。

<requestHandler name="/dataimport" class="org.apache.solr.handler.dataimport.DataImportHandler">

        <lst name="defaults">

            <str name="config">data-config.xml</str>

        </lst>

    </requestHandler>

在solrconfig.xml的同目录下创建data-config.xml。

配置：

<dataConfig>

    <dataSource type="JdbcDataSource"

              driver="com.mysql.jdbc.Driver"

              url="jdbc:mysql://localhost:3306/courseman"

              user="root"

              password="mysql" />

    <document>

        <entity name="student"

            query="SELECT * FROM student">

            <field column="id" name="id" />

            <field column="name" name="name" />

            <field column="gender" name="gender" />

            <field column="major" name="major" />

            <field column="grade" name="grade" />

        </entity>

    </document>

</dataConfig>

schemal.xml的配置

<?xml version="1.0" ?>

<!--

 Licensed to the Apache Software Foundation (ASF) under one or more

 contributor license agreements.  See the NOTICE file distributed with

 this work for additional information regarding copyright ownership.

 The ASF licenses this file to You under the Apache License, Version 2.0

 (the "License"); you may not use this file except in compliance with

 the License.  You may obtain a copy of the License at

     http://www.apache.org/licenses/LICENSE-2.0

 Unless required by applicable law or agreed to in writing, software

 distributed under the License is distributed on an "AS IS" BASIS,

 WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.

 See the License for the specific language governing permissions and

 limitations under the License.

-->

<schema name="example core one" version="1.1">

    <fieldtype name="string"  class="solr.StrField" sortMissingLast="true" omitNorms="true"/>

    <fieldType name="long" class="solr.TrieLongField" precisionStep="0" positionIncrementGap="0"/>

    <fieldType name="int" class="solr.TrieIntField" precisionStep="0" positionIncrementGap="0"/>

  <!-- general -->

  <field name="id"   type="int"    indexed="true"  stored="true"  />

  <field name="gender"  type="string"    indexed="true"  stored="true"   />

  <field name="name"      type="string"    indexed="true"  stored="true"   />

  <field name="major"     type="string"    indexed="true"  stored="true"   />

  <field name="grade"     type="string"    indexed="true"  stored="true"   />

  <field name="_version_" type="long"      indexed="true"  stored="true"/>

 <!-- field to use to determine and enforce document uniqueness. -->

 <uniqueKey>id</uniqueKey>

 <!-- field for the QueryParser to use when an explicit fieldname is absent -->

 <defaultSearchField>name</defaultSearchField>

 <!-- SolrQueryParser configuration: defaultOperator="AND|OR" -->

 <solrQueryParser defaultOperator="OR"/>

</schema>

默认的文件不是这样的，稍微改动了一下。

field 的type类型是根据fieldtype 的name定义的。class是solr自定义的不能更改。

shcema.xml文件的field字段的属性介绍：

（1）name：字段名称

（2）type：字段类型（此处type不是java类型，而是下面定义的fieldType）

（3）indexed：是否索引？true--solr会对这个字段进行索引，只有经过索引的字段才能被搜索、排序等；false--不索引

（4）stored：是否存储？true--存储，当我们需要在页面显示此字段时，应设为true，否则false。

（5）required：是否必须？true--此字段为必需，如果此字段的内容为空，会报异常；false--不是必需

（6）multiValued：此字段是否可以保存多个值？

（7）omitNorms：是否对此字段进行解析？有时候我们想通过某个字段的完全匹配来查询信息，那么设置 indexed="true"、omitNorms="true"。

（8）default：设置默认值

有这样一个FieldType描述：

<fieldType name="text_general" positionIncrementGap="100">

        <analyzer type="index">

                    <tokenizer/>

                    <filter ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true" />

                     <filter/>

        </analyzer>

        <analyzer type="query">

              <tokenizer/>

              <filter ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true" />

               <filter synonyms="synonyms.txt" ignoreCase="true" expand="true"/>

                <filter/>

         </analyzer>

 </fieldType>

属性说明：

（1）name：类型名称，<field>中的type引用的就是这个name

（2）class：solr自定义的类型

（3）<analyzer type="index">定义建立索引时使用的分词器及过滤器

（4）<analyzer type="query">定义搜索时所使用的分词器及过滤器

（5）<tokenizer/>定义分词器

（6）<filter/>定义过滤器

uniqueKey属性

<uniqueKey>id</uniqueKey>
类似于数据表数据的id，solr索引库中最好定义一个用于标示document唯一性的字段，此字段主要用于删除document。

defaultSearchField属性

就是你在做query搜尋時若不指定特定欄位做檢索時, Solr就會只查這個欄位.
<defaultSearchField>default</defaultSearchField>

copyField属性

是用來複製你一個欄位裡的值到另一欄位用. 如你可以將name裡的東西copy到major裡, 這樣solr做檢索時也會檢索到name裡的東西.

<copyField source="name" dest="major"/>

现在可以将数据库的数据导入solr了。

点击Execute就可以了。

数据库的数据导进来了。

数据库的表。

数据库表结构。

将数据库的数据导入solr索引库中的更多相关文章

将Mysq数据导入solr索引库
本文的基础环境都是在centos 64bit,jdk1.7.79 将mysql 的jar 包添加到/home/hadoop/cloudsolr/solr-4.10.4/contrib/dataimpo ...
商城06——solr索引库搭建&solr搜索功能实现&图片显示问题解决
1. 课程计划 1.搜索工程的搭建 2.linux下solr服务的搭建 3.Solrj使用测试 4.把数据库中的数据导入索引库 5.搜索功能的实现 2. 搜索工程搭建要实现搜索功能,需要搭建 ...
solr 索引库的维护
一.配置中文分析器:IK-analyzer,在FieldType中指定中文分析器:1 复制IK-analyzer到你的服务器指定目录中.2 在该目录中,我们需要的东西有:IKAnalyzer的jar包 ...
第一个lucene程序，把一个信息写入到索引库中、根据关键词把对象从索引库中提取出来、lucene读写过程分析
新建一个Java Project :LuceneTest 准备lucene的jar包,要加入的jar包至少有: 1)lucene-core-3.1.0.jar (核心包) 2) lucene- ...
使用solrj操作solr索引库
(solrj)初次使用solr的开发人员总是很郁闷,不知道如何去操作solr索引库,以为只能用<五分钟solr4.5教程(搭建.运行)>中讲到的用xml文件的形式提交数据到索引库,其实没有 ...
使用solrj操作solr索引库,solr是lucene服务器
客户端开发 Solrj 客户端开发 Solrj Solr是搭建好的lucene服务器当然不可能完全满足一般的业务需求可能要针对各种的架构和业务调整这里就需要用到Solrj了 Solrj是Sol ...
如何在分布式环境中同步solr索引库和缓存信息
天气依旧很好,主要是凉快.老习惯,我在北京向各位问好. 搜索无处不在,相信各位每天都免不了与它的亲密接触,那么我想你确实有必要来了解一下它们,就上周在公司实现的一个小需求来给各位分享一下:如何在分布式 ...
项目中通过Sorlj获取索引库中的数据
在开发项目中通过使用Solr所提供的Solrj(java客户端)获取索引库中的数据,这才是真正对项目起实质性作用的功能,提升平台的检索性能及检索结果的精确性第一步,引入相关依赖的jar包第二步,根 ...
solr索引库的创建
solr索引库的创建一.找到你安装的[solrhome]目录(我的是这个) 二.进入该目录三.选择其中任意一个索引库复制一份到该目录下并更名为要创建的索引库名称四.进入[myindex]目录下, ...

随机推荐

Spark编程模型及RDD操作
转载自:http://blog.csdn.net/liuwenbo0920/article/details/45243775 1. Spark中的基本概念在Spark中,有下面的基本概念.Appli ...
Linux的一些简单命令(四)-用户和组账户管理
linux操作系统是一个多用户操作系统,它允许多用户同时登录到系统上并使用资源.系统会根据账户来区分每个用户的文件,进程,任务和工作环境,使得每个用户工作都不受干扰 1.保存用户信息的文件:/etc/ ...
Ajax实现页面动态加载，添加数据
前台代码: <%@ Page Language="C#" AutoEventWireup="true" CodeBehind="Products ...
AngularJs中，如何在父元素中调用子元素为自定义Directive中定义的函数？
最近一段时间准备使用AngularJs中的自定义Directive重构一下代码. 在这里说明一下,把自定义控件封装成Directive并不一定是要复用,而是要让代码结构更加清晰.就好像你将一个长方法拆 ...
VR应用向导，全球Top10 VR应用排行榜
2016年国际知名产商索尼.三星.HTC.Oculus.YouTube等等都推出了自己的VR设备,与此同时还有自有的VR应用平台,供各位玩家下载应用体验沉浸式VR,当然每个平台的VR应用下载量各不相同 ...
iOS UIBezierPath类介绍
使用UIBezierPath类可以创建基于矢量的路径,这个类在UIKit中.此类是Core Graphics框架关于path的一个封装.使用此类可以定义简单的形状,如椭圆或者矩形,或者有多个直线和 ...
Python入门学习
1.访问www.python.org/download下载并安装Python3版本 2.命令行输入python启动交互式命令shell,>>>代表当前行用户的输入 3.基本算数运算: ...
PHP Cookei记录用户历史浏览信息的代码
[基础]Cookie常用方法:$_COOKIE['RecordLuHuiDUDU'] 得到Cookiesetcookie('RecordLuHuiDUDU',",time()-3600*24 ...
angular.js升序降序过滤器
向指令添加过滤器过滤器可以通过一个管道字符(|)和一个过滤器添加到指令中. orderBy 过滤器根据表达式排列数组: 例如: <div ng-app="myApp" ng ...
YII2.0 数据库增删改查
/*==================== dkhBaseModel 数据库增删改查方法 start ================================*/ //新增一条数据 publ ...

将数据库的数据导入solr索引库中

将数据库的数据导入solr索引库中的更多相关文章

随机推荐

热门专题