车辆2D/3D--Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis
2017-07-27 09:26
441 查看
Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis from monocular image
CVPR2017
https://arxiv.org/abs/1703.07570
自动驾驶 很快就可以达到实用的水平了。
本文的功能是:给一张灰度图像,使用 多任务CNN网络 Deep MANTA 可以给出6个信息: region proposal, detection, 2D box regression, part localization, part visibility and 3D template prediction,通过定义 Many-task loss functions 实现
先上图来个感性认识:
Deep MANTA 整个网络流程图如下所示:
Conv layers with the same color share the same weights
怎么从2D 信息推理出 3D 信息了?
首先我们利用了2个3D 的数据库 3D shape and template datasets
2D/3D vehicle model
数据标记问题怎么解决
Semi-automatic annotation process
Experiments
http://www.cvlibs.net/datasets/kitti/eval_object_detail.php?&result=6759889c0a252c63765d5e2e69cb8b1433cadb0a
Running time: 0.7 s
Environment: GPU @ 2.5 Ghz (Python + C/C++)
CVPR2017
https://arxiv.org/abs/1703.07570
自动驾驶 很快就可以达到实用的水平了。
本文的功能是:给一张灰度图像,使用 多任务CNN网络 Deep MANTA 可以给出6个信息: region proposal, detection, 2D box regression, part localization, part visibility and 3D template prediction,通过定义 Many-task loss functions 实现
先上图来个感性认识:
Deep MANTA 整个网络流程图如下所示:
Conv layers with the same color share the same weights
怎么从2D 信息推理出 3D 信息了?
首先我们利用了2个3D 的数据库 3D shape and template datasets
2D/3D vehicle model
数据标记问题怎么解决
Semi-automatic annotation process
Experiments
http://www.cvlibs.net/datasets/kitti/eval_object_detail.php?&result=6759889c0a252c63765d5e2e69cb8b1433cadb0a
Running time: 0.7 s
Environment: GPU @ 2.5 Ghz (Python + C/C++)
相关文章推荐
- 车辆检测“Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis from monoc”
- 论文阅读:Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis
- 多任务学习“Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics”
- lie groups for 2d and 3d transformations
- 论文阅读笔记:A 3D Coarse-to-Fine Framework for Automatic Pancreas Segmentation
- 论文阅读:《Associative Embedding:End-to-End Learning for Joint Detection and Grouping》
- BaiXiang——【arXi2015】An End-to-End Trainable Neural Network for Image-based Sequence Recognition and
- How to fix hung_task_timeout_secs and blocked for more than 120 seconds problem
- 3D人体姿态估计--Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose
- How to Monitor Your Network Usage in Windows 8 (And Prevent Paying For The Extra Bandwidth)
- An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to S
- 论文笔记:An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application
- Joint Sequence Learning and Cross-Modality Convolution for 3D Biomedical Segmentation笔记
- 语义分割--Label Refinement Network for Coarse-to-Fine Semantic Segmentation
- Asphyre Sphinx is a cross-platform framework for developing 2D/3D video games and interactive business applications
- 论文笔记:Label Refinement Network for Coarse-to-Fine Semantic Segmentation
- Introduction to AutoCAD 2008: 2D and 3D Design
- 论文阅读(Xiang Bai——【PAMI2017】An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition)
- 姿态检测整理--06-Associative Embedding: End-to-End Learning for Joint Detection and Grouping
- Howto Install and Configure Doxygen for QtCreator on Ubuntu