您的位置：首页 > 移动开发 > Objective-C

初窥Tensorflow Object Detection API 源码

2018-02-13 13:09 776 查看

介绍

Creating accurate machine learning models capable of localizing and identifying multiple objects in a single image remains a core challenge in computer vision. The TensorFlow Object Detection API is an open source framework built on top of TensorFlow that makes it easy to construct, train and deploy object detection models. At Google we’ve certainly found this codebase to be useful for our computer vision needs, and we hope that you will as well.

Object Detection API是Google开源的一套基于Google的Tensorflow开发的目标检测API，里面涵盖了Faster RCNN,SSD,Mask RCNN,YOLO(还在开发中)等检测框架，原则上掌握了这套API就可以“无差别”使用各种检测框架。前提是要会配置检测参数——模型的config文件^.^

github

分析源码的原因

我使用了该API一段时间后，我发现一些配置的设定是相关的，

于是我开始阅读API的源码，寻找配置在框架中的具体意义（还要结合论文）

说实话，我是个python新手，但已从事软件行业多年，这种阅读源码的事对我来说问题并不是很大，最主要的难点在tensorflow的op操作和论文的理解上。不知道是tensorflow的设计原因还是python的特性，总感觉一些矩阵操作不是特别好理解，有时得配合具体输出才知道某段代码的具体作用。可能我还是个新手吧。

另外，作为state-of-the-art的论文，理解上还是要有点难度的，这是实话。

分析背景

框架：Faster RCNN

pre-trained model:faster_rcnn_resnet101_coco

数据集：The Oxford-IIIT Pet Dataset

tensorboard log下载链接: https://pan.baidu.com/s/1o9bghIi 密码: 4y2f (结合graph，便于理解）

看什么缺的以后补充

总结

分析中很大部分是粘贴了源码，因为我觉得代码更有描述性。

内容来自用户分享和网络整理，不保证内容的准确性，如有侵权内容，可联系管理员处理

标签：

相关文章推荐

新的分享

章节导航