Lucene 4.4 环境测试
2013-09-26 10:48
190 查看
package com.zsj.test;
import java.io.IOException;
import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.TokenStream;
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute;
import org.apache.lucene.util.Version;
/**
* Lu
* @author hadoop
*
*/
public class FirstLucene {
public static void main(String[] args) throws IOException {
/**
* 标准分析器是Lucene内置的分析器,会将语汇单元转成小写形式,
* 并去除停用词及标点符号,很明显也是不适合于中文环境
*/
Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_44);
TokenStream tokenStream = analyzer.tokenStream("",
"this is my first lucene");
CharTermAttribute charTermAttribute = tokenStream
.addAttribute(CharTermAttribute.class);
tokenStream.reset();
while (tokenStream.incrementToken()) {
System.out.println(charTermAttribute.toString());
}
tokenStream.end();
tokenStream.close();
}
}
import java.io.IOException;
import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.TokenStream;
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute;
import org.apache.lucene.util.Version;
/**
* Lu
* @author hadoop
*
*/
public class FirstLucene {
public static void main(String[] args) throws IOException {
/**
* 标准分析器是Lucene内置的分析器,会将语汇单元转成小写形式,
* 并去除停用词及标点符号,很明显也是不适合于中文环境
*/
Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_44);
TokenStream tokenStream = analyzer.tokenStream("",
"this is my first lucene");
CharTermAttribute charTermAttribute = tokenStream
.addAttribute(CharTermAttribute.class);
tokenStream.reset();
while (tokenStream.incrementToken()) {
System.out.println(charTermAttribute.toString());
}
tokenStream.end();
tokenStream.close();
}
}
相关文章推荐
- Oracle的unlimited tablespace系统权限
- C++空类中的默认函数
- LDA-线性判别分析(三)推广到 Multi-classes 情形
- LDA-线性判别分析(四)其他几个相关问题
- 行业不限凡事细心观察才会发现万物都是在变化之中
- notepad++中快速插入当前时间方法
- LDA-线性判别分析(二)Two-classes 情形的数学推导
- LDA-线性判别分析(一)预备知识
- java 读写Excel
- Java中的System类
- Redhat Linux网卡配置与绑定
- 入门新手可以看看,关于Java连接WebsphereMQ(也就是MQSeries)的内容
- Spring (二) OOP V.S AOP
- linux下open和fopen的区别
- LDA-线性判别分析(一)预备知识
- LDA-线性判别分析(二)Two-classes 情形的数学推导
- Air Max Shoes is Best Alternative if You Want to Purchase Suitable Shoes For You
- notepad++中快速插入当前时间方法
- Spring (二) OOP V.S AOP
- 当前路径和上级目录的表示方法(http://hi.baidu.com/asd837660/item/16f198f56bfe484a932af2c9)