segmentation chinese word based on max boudary mix with CRF
2010-06-27 19:16
381 查看
The Sighan BakeOff result have release, and i receive the fifth place in Word Segmentation for Simplified Chinese open test (in compute) 、second place in Word Segmentation for Traditional Chinese open test (in compute、in medical、in finace、in literature). The result is satificate for me , but have some pity that i don't receive the first place. i am earger for the first.
This year, there are 19 teams participate in this competition for the world, include CMU、Queensland University of Technology,SEG-IASL,pku, Fudan and so on. Today Chinese word segmentation will focus on the cross-domain performance of Chinese word segmentation algorithms.
I think i was the only master to participate in this competition. and i was only one person. the result is get me many confidence.
I use the method "segmentation chinese word based on max boudary mix with CRF". the algorithm contain two step. first, i seg the Chinse word based on max bondary to generate the candition chunk, and to mark.The i use CRF to segment.
The detail please wait my paper pubilcing.
This year, there are 19 teams participate in this competition for the world, include CMU、Queensland University of Technology,SEG-IASL,pku, Fudan and so on. Today Chinese word segmentation will focus on the cross-domain performance of Chinese word segmentation algorithms.
I think i was the only master to participate in this competition. and i was only one person. the result is get me many confidence.
I use the method "segmentation chinese word based on max boudary mix with CRF". the algorithm contain two step. first, i seg the Chinse word based on max bondary to generate the candition chunk, and to mark.The i use CRF to segment.
The detail please wait my paper pubilcing.
相关文章推荐
- HTTPCWS 是一款基于HTTP协议的开源中文分词系统。(HTTPCWS is an Chinese Word Segmentation System Based on the HTTP proto
- 一个基于搜索的中文分词方法( A Search-based Chinese Word Segmentation Method)
- MMSEG: A Word Identification System for Mandarin Chinese Text Based on Two Variants of the Maximum M
- 笔记-2006-Subword-based Tagging by Conditional Random Fields for Chinese Word Segmentation
- A Gap-Based Framework for Chinese Word Segmentation via Very Deep Convolutional Network
- 外文翻译_A Search-based Chinese Word Segmentation Method
- Linux中修改密码出现it is based on a dictionary word解决方法
- Ground Segmentation based on Loopy Belief Propagation for Sparse 3D Point Clouds (论文速读)
- 目标跟踪之“Robust Visual Tracking with Deep Convolutional Neural Network based Object Proposals on PETS”
- Softmax on Digits Data with TensorFlow
- Swift based iBeacon App Development with CoreLocation on Apple iOS 7/8
- 笔记-2012-Unsupervized Word Segmentation the case for Mandarin Chinese
- 笔记-2002-Combining Classifiers for Chinese Word Segmentation
- Codeforces Round #468 (Div. 2, based on Technocup 2018 Final Round)E. Game with String(枚举)
- Resolve and Remove "BAD PASSWORD: It is Based on a Dictionary Word "in Linux
- Recursive drivable road detection with shadows based on two-camera systems
- 中文分词文献列表 Bibliography of Chinese Word Segmentation
- 论文导读(person re-identification)——Person Re-identification based on nonlinear ranking with difference
- 笔记-2003-Chinese Word Segmentation as LMR Tagging
- 搜索引擎之中文分词(Chinese Word Segmentation)简介