Skip to content

Latest commit

 

History

History
executable file
·
158 lines (120 loc) · 5.74 KB

README.md

File metadata and controls

executable file
·
158 lines (120 loc) · 5.74 KB

Object Detection by Tensorflow Object Detection API

  • Workshop Tutorial
  • August 2, 2018

...Object detection has a long history in computer vision community, usually considered as adifficult task in terms of computation and accuracy. However , in the recent years, it has becomevery common and easy to implement, thanks to the deep learning and convolutions. The difference from the classification tasks, in general, if we aim to identify the location of objects ordetect the number of instances in an image, we use object detection.

...There are two main approaches to apply the object detection which are single-stage andtwo-stage detection. Both have even several ways to be implemented. In this workshop , we are mostly going to talk about RCNN-Family(Faster-RCNN , Fast-RCNN,RCNN). In the implementation , we will use Google's Tensorflow Object Detection API which covers both singleand two-stage models at whole.

...The TensorFlow Object Detection API is an open source framework built on top of TensorFlowthat makes it easy to construct, train and deploy object detection models. This API is capable ofidentifying many types of objects like cars, pedestrians, person, kite, dog and many more. Weare going to learn how to use this API for our specific problems by preparing our own datapipeline and apply on different models (including two-stages like YOLO and SSD).

alt text


  • About the Repo
    • API : Tensorflow's Object Detection API
    • ckpt : trained model parameters saved
    • config : Configuration settings
    • eval : Evaluation result folder
    • figures : helper figures for understanding Faster-RCNN
    • frozen_model : Inference graph of trained model
    • images :
      • train/val/test data
      • converted csv files
      • converted tfrecord files
      • labelmap.pbtxt : defines labels
    • pretrained : Pretrained model parameters

1. Introduction

1.1. Datasets

--> Datasets

  • COCO
  • KITTI
  • Pascal
  • Oid
  • AVA v2.1 trained models
  • Open Images-trained models

1.2. Landmark Papers

1.3. Feature Extractors

  • VGG
  • ZFNet
  • MobileNet
  • Inception
  • ResNet
  • DenseNet

2. Prepare Input Pipeline

2.1.Download Images

  • Google Advanced Image Search

    • birds , tigers , rabbits etc..
  • Downloader Tools (Chrome Extensions):

    • Download All Images 2.0.4
    • Fatkun Batch Download Image 2.23

2.2. Annotate Objects

  • Bounding Box
  • Polygon
  • Semantic Segmentation
  • Bounding Box
  • Line

2.3. Data Format Conversion

  • xml_to_csv
  • csv_to_tfrecord
  • resizer
  • extension check
  • size check

2.4. API Introduction

3. Set Colab Environment

  • Install Requirements
  • Connect Drive
  • Clone Repository
  • Set python environment
  • train
  • ..

4. Useful Links (References)

├───data
├───images
├───src
│   ├───chainercv
│   │   └───train
│   ├───data
│   ├───keras_retinanet
│   │   ├───keras_retinanet
│   │   │   ├───backend
│   │   │   ├───bin
│   │   │   ├───callbacks
│   │   │   ├───layers
│   │   │   ├───models
│   │   │   ├───preprocessing
│   │   │   └───utils
│   │   ├───solar_panel
│   │   └───tests
│   │       ├───backend
│   │       ├───bin
│   │       ├───layers
│   │       ├───models
│   │       ├───preprocessing
│   │       └───utils
│   └───utils
└───tf_od_eval
    └───protos
        └───__pycache__