Face Detection using Deep Learning: An Improved Faster RCNN Approach论文解读
2018-02-03 17:08
786 查看
Flowchart of the training procedure
First of all, we train the CNN model of Faster RCNN using the WIDER FACE dataset [30]. We
further use the same dataset to test the pre-trained model so as to generate hard negatives. Thesehard negatives are fed into the network as the second step of our training procedure. The resulting
model will be further fine-tuned on the FDDB dataset. During the final fine-tuning process, we
apply the multi-scale training process, and adopt a feature concatenation strategy to further boost
the performance of our model. For the whole training processes, we follow the similar end-to-end
training strategy as Faster RCNN.
文章在faster RCNN上做了一些改进,主要体现在三个方面:
1、Feature Concatenation
Network architecture of the proposed feature concatenation scheme
我理解的就是把得出的ROIs映射到Conv3_3,Conv4_3,conv5_3,之后连接一个1X1的卷积层以保持深度一致,然后将这些ROIs输入到roi pool层,分别得到ROI_pool3,ROI_pool4,ROI_pool5,然后将这些ROI_pool进行concatenate(具体concatenate是怎么操作的文章中没写)。特别的,比较浅的特征层ROI_poo之后有一个L2-normalized的操作。
2、Hard Negative Mining
hard negatives are the regions where the network has failed to make correct prediction.
Thus, the hard negatives are fed into the network again as a reinforcement for improving our
trained model.
把hard negatives输入到用wider face预训练好的RPN部分,保持正负样本比例为1:3
3、Multi-Scale Training
文章对每张图片取了三种尺度进行训练,短边不超过480; 600; 750. 长边不超过1250.实验结果表示多尺度的训练让模型对不同尺寸的图片更具有鲁棒性,而且提高了检测的性能。
Experiments
1、VGG16 was selected to be our backbone CNN network, which had been pre-trained on ImageNet.
2、训练数据是WIDER FACE的training +validation datasets。
3、We gave each ground-truth annotation a difficulty value
忽略掉加起来分数大于2的ground-truth
4、具体的一些参数设置见文章
5、用WIDER FACE训练完后,将confidence scores大于0.8,与ground-truth之间的IOU小于0.5的hard negatives fed into网络中.
First of all, we train the CNN model of Faster RCNN using the WIDER FACE dataset [30]. We
further use the same dataset to test the pre-trained model so as to generate hard negatives. Thesehard negatives are fed into the network as the second step of our training procedure. The resulting
model will be further fine-tuned on the FDDB dataset. During the final fine-tuning process, we
apply the multi-scale training process, and adopt a feature concatenation strategy to further boost
the performance of our model. For the whole training processes, we follow the similar end-to-end
training strategy as Faster RCNN.
文章在faster RCNN上做了一些改进,主要体现在三个方面:
1、Feature Concatenation
Network architecture of the proposed feature concatenation scheme
我理解的就是把得出的ROIs映射到Conv3_3,Conv4_3,conv5_3,之后连接一个1X1的卷积层以保持深度一致,然后将这些ROIs输入到roi pool层,分别得到ROI_pool3,ROI_pool4,ROI_pool5,然后将这些ROI_pool进行concatenate(具体concatenate是怎么操作的文章中没写)。特别的,比较浅的特征层ROI_poo之后有一个L2-normalized的操作。
2、Hard Negative Mining
hard negatives are the regions where the network has failed to make correct prediction.
Thus, the hard negatives are fed into the network again as a reinforcement for improving our
trained model.
把hard negatives输入到用wider face预训练好的RPN部分,保持正负样本比例为1:3
3、Multi-Scale Training
文章对每张图片取了三种尺度进行训练,短边不超过480; 600; 750. 长边不超过1250.实验结果表示多尺度的训练让模型对不同尺寸的图片更具有鲁棒性,而且提高了检测的性能。
Experiments
1、VGG16 was selected to be our backbone CNN network, which had been pre-trained on ImageNet.
2、训练数据是WIDER FACE的training +validation datasets。
3、We gave each ground-truth annotation a difficulty value
忽略掉加起来分数大于2的ground-truth
4、具体的一些参数设置见文章
5、用WIDER FACE训练完后,将confidence scores大于0.8,与ground-truth之间的IOU小于0.5的hard negatives fed into网络中.
相关文章推荐
- 论文解读Face Detection using Deep Learning: An Improved Faster R-CNN Approach
- paper 159:文章解读:From Facial Parts Responses to Face Detection: A Deep Learning Approach--2015ICCV
- 【论文笔记】From Facial Parts Responses to Face Detection: A Deep Learning Approach
- 论文《From Facial Parts Responses to Face Detection: A Deep Learning Approach》笔记
- 《A Lightened CNN for Deep Face Representation》论文解读 本文来自中科院,原文地址为: https://arxiv.org/abs/1511.02683
- From Facial Parts Responses to Face Detection: A Deep Learning Approach
- Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks(MTCNN)论文和代码解读
- RBM-An approach for text summarization using deep learning algorithm
- [论文解读] Vote3Deep: Fast Object Detection in 3D Point Clouds Using Efficient CNN
- From Facial Parts Responses to Face Detection: A Deep Learning Approach
- 论文摘要: Multi-view Face Detection Using Deep Convolutional Neural Networks
- 论文阅读之: Hierarchical Object Detection with Deep Reinforcement Learning
- 论文阅读:《Natural Image Matting Using Deep CNN》ECCV 2016
- 论文PCANet: A Simple Deep Learning Baseline for Image Classification?的matlab源码解读(四)
- Joint Deep Learning For Pedestrian Detection(论文笔记-深度学习:行人检测)
- USING DEEP LEARNING FOR ANOMALY DETECTION IN RADIOLOGICAL IMAGES
- A Discriminative Feature Learning Approach for Deep Face Recognition 原理及在caffe实验复现
- A Discriminative Feature Learning Approach for Deep Face Recognition 的源码部分分析
- deep learning for face detection
- deep learning for face detection