论文阅读(Xiang Bai——【PAMI2017】An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition)
2016-11-06 17:53
1171 查看
白翔的CRNN论文阅读
1. 论文题目Xiang Bai——【PAMI2017】An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
2. 论文思路和方法
1) 问题范围: 单词识别
2) CNN层:使用标准CNN提取图像特征,利用Map-to-Sequence表示成特征向量;
3) RNN层:使用双向LSTM识别特征向量,得到每列特征的概率分布;
4) Transcription层:利用CTC和前向后向算法求解最优的label序列;
3. 亮点和创新点
1) 端到端可训练(把CNN和RNN联合训练)
2) 任意长度的输入(图像宽度任意,单词长度任意)
3) 训练集无需有字符的标定
4) 带字典和不带字典的库(样本)都可以使用
5) 性能好,而且模型小(参数少)
4. 相关链接
1) 白翔的个人主页:http://mc.eistar.net/~xbai/
2) 论文的下载地址:https://arxiv.org/pdf/1507.05717v1.pdf
3) 代码的下载地址:
http://mc.eistar.net/~xbai/CRNN/crnn_code.zip
5. 论文细节
1) 论文的框架
2) 特征提取层
3) 序列标定层
4) 翻译层
5) 网络训练
6) 实验
7) 总结
8) 问题
相关文章推荐
- 论文笔记:An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application
- An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to S
- BaiXiang——【arXi2015】An End-to-End Trainable Neural Network for Image-based Sequence Recognition and
- 论文阅读(Lukas Neumann——【ICCV2017】Deep TextSpotter_An End-to-End Trainable Scene Text Localization and Recognition Framework)
- 论文阅读(Weilin Huang——【TIP2016】Text-Attentional Convolutional Neural Network for Scene Text Detection)
- 论文阅读:End-to-End Learning of Deformable Mixture of Parts and Deep Convolutional Neural Networks for H
- 论文阅读(Xiang Bai——【TIP2014】A Unified Framework for Multi-Oriented Text Detection and Recognition)
- 论文阅读:Sparsifying Neural Network Connections for Face Recognition
- 论文阅读:《Associative Embedding:End-to-End Learning for Joint Detection and Grouping》
- A Novel Text Structure Feature Extractor for Chinese Scene Text Detection and Recognition论文笔记
- 论文阅读(Lukas Neuman——【ICDAR2015】Efficient Scene Text Localization and Recognition with Local Character Refinement)
- An End-to-End Trainable Neural Network...笔记
- 论文笔记之:Let there be Color!: Joint End-to-end Learning of Global and Local Image Priors for Automatic
- 论文阅读:Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis
- 论文笔记之:Let there be Color!: Joint End-to-end Learning of Global and Local Image Priors for Automatic
- 论文阅读 | An Artificial Neural Network-based Stock Trading System Using Technical Analysis and Big Data
- ”Hierarchical Recurrent Neural Network for Skeleton Based Action Recognition“阅读小结
- 论文读书笔记-Using neural network to combine measures of word semantic similarity for image annotation
- 【论文阅读】Fused Text Segmentation Network for Multi-oriented Scene Text Detection
- Domain Transform for Edge-Aware Image and Video Processing - 论文阅读