您的位置：首页 > 其它

使用Lucene和IKAnalyzer实现中文简单分词

2012-08-03 15:04 726 查看

import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;

import org.wltea.analyzer.IKSegmentation;
import org.wltea.analyzer.Lexeme;

public class Test {
public static ArrayList<String> getStringSet(String s)throws IOException{
ArrayList<String> ali = new ArrayList<String>();

StringReader reader = new StringReader(s);
IKSegmentation ik = new IKSegmentation(reader , true);
Lexeme lexeme = null;
while ((lexeme = ik.next()) != null) {
ali.add(lexeme.getLexemeText());
}
return ali;
}
public static void main(String[] args) throws IOException {
String s = "头痛是一种病";

ArrayList<String> aList = Test.getStringSet(s);
while(!aList.isEmpty()){
System.out.println(aList.get(0));
aList.remove(0);
}
}
}

// 输出：

头痛
是
一种
病

——————————————————————————————————

实现输入的语句的分词。

内容来自用户分享和网络整理，不保证内容的准确性，如有侵权内容，可联系管理员处理

标签：

相关文章推荐

新的分享

章节导航

使用Lucene和IKAnalyzer实现 中文简单 分词

使用Lucene和IKAnalyzer实现中文简单分词