Faster R-CNN is an object detection algorithm that finds objects of various classes within an image and simultaneously regresses bounding boxes enclosing these objects. Unfortunately, these boxes have to be aligned with the x- and y-axes of the image.
We've adapted the original Faster R-CNN source code (see Faster R-CNN Tensorflow) such that rectangular boxes of arbitrary rotation can be predicted.
This code is intended as a starting point for work with Faster R-CNN on rotated boxes, and we hope you may benefit from this implementation. Nevertheless, we can neither guarantee its correctness nor provide support.
- The network is developed for and tested on aerial images, in which we detect buildings. This version therefore currently supports only a single object class.
- We do not use flipped images, as the data is augmented in other ways.
- We've implemented a GPU version of the new RoI pooling operation for rotated rectangles, but no CPU version. The RoI pooling could also be optimized in terms of efficiency. (A sketch of the underlying idea follows after this list.)
- We ignore RoIs that cross the image boundaries (this could be handled in a better way).
- All objects within one of our training images are rotated by the same angle. We assume that everything works fine even if this is not the case, but we have not verified it.
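To make the rotated RoI pooling and the boundary handling from the list above more concrete, here is a minimal NumPy sketch of the underlying idea. It is illustrative only and not the GPU implementation from this repository: all function and variable names are ours, and it takes a single nearest-neighbor sample per output bin instead of a max over the bin.

```python
import numpy as np

def rotated_roi_pool(feature_map, roi, pooled_h=7, pooled_w=7):
    """Pool a 2D feature map over a rotated rectangular RoI.

    roi = (cx, cy, w, h, alpha): box center, box size and rotation angle.
    Illustrative only: one nearest-neighbor sample per output bin,
    no channels, batching or gradients.
    """
    H, W = feature_map.shape
    cx, cy, w, h, alpha = roi
    cos_a, sin_a = np.cos(alpha), np.sin(alpha)

    # Rotate the four corner offsets (+-w/2, +-h/2) and ignore RoIs
    # that cross the image boundaries, as described in the list above.
    dx = np.array([-1.0, 1.0, 1.0, -1.0]) * w / 2
    dy = np.array([-1.0, -1.0, 1.0, 1.0]) * h / 2
    corners_x = cx + dx * cos_a - dy * sin_a
    corners_y = cy + dx * sin_a + dy * cos_a
    if (corners_x.min() < 0 or corners_y.min() < 0
            or corners_x.max() >= W or corners_y.max() >= H):
        return None  # RoI crosses the image boundary -> ignored

    output = np.empty((pooled_h, pooled_w), dtype=feature_map.dtype)
    for i in range(pooled_h):
        for j in range(pooled_w):
            # Bin center in the box's own axis-aligned coordinates ...
            u = (j + 0.5) / pooled_w * w - w / 2
            v = (i + 0.5) / pooled_h * h - h / 2
            # ... rotated by alpha and shifted to the box center.
            x = cx + u * cos_a - v * sin_a
            y = cy + u * sin_a + v * cos_a
            output[i, j] = feature_map[int(y), int(x)]
    return output
```

A real implementation would additionally take the maximum over several samples per bin, handle channels and batches, and compute gradients, which is what the GPU op has to provide.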
The requirements are the same as for the original Faster R-CNN Tensorflow implementation (see Faster R-CNN Tensorflow).
- Clone the Faster R-CNN repository:

```bash
# Clone the git repository
git clone --recursive https://github.com/runa91/FRCNN_git.git
```
- Build the Cython modules:

```bash
cd $FRCNN_ROOT/lib
make
```
- In case you'd like to train a model, download a pretrained VGG model here: Pretrained VGG
  Store it as follows: $FRCNN_ROOT/data/pretrained_model/VGG_imagenet.npy
- A model trained on buildings can be found at the same location: Trained Network
  Store it as follows: $FRCNN_ROOT/output/faster_rcnn_end2end_sI/building_train/VGGnet_fast_rcnn_iter_60000.ckpt
- We've included a few test images, so you can now run a demonstration:

```bash
cd $FRCNN_ROOT/tools
python2.7 building_evaluation_git.py --model "$FRCNN_ROOT/output/faster_rcnn_end2end_sI/building_train/VGGnet_fast_rcnn_iter_60000.ckpt" --data "$FRCNN_ROOT/data/building_data/"
```
- Create your own data set and add a new class, similar to the existing 'building' class, to the folder $FRCNN_ROOT/lib/datasets/. You may have a look at $FRCNN_ROOT/changes_wrt_orig_frcnn.odt for more information. (A rough skeleton of such a class follows after these steps.)
- Train a model:

```bash
cd $FRCNN_ROOT
./experiments/scripts/faster_rcnn_end2end_new.sh 0 VGG16 building
```
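As announced above, the following skeleton sketches roughly what a new dataset class might look like. It is a hypothetical illustration only: the authoritative reference is the imdb base class in $FRCNN_ROOT/lib/datasets/ and the existing 'building' class, and all names, fields and file layouts below are our own assumptions, not taken from this repository.

```python
# Hypothetical skeleton of a new dataset class; all names, fields and
# file layouts are illustrative assumptions, not taken from this repo.
import os
import numpy as np
from datasets.imdb import imdb  # base class from the original code

class my_rotated_dataset(imdb):
    def __init__(self, image_set, data_path):
        imdb.__init__(self, 'my_rotated_dataset_' + image_set)
        self._data_path = data_path
        self._classes = ('__background__', 'my_class')  # single object class
        # One image name per line in e.g. train.txt (assumed layout).
        with open(os.path.join(data_path, image_set + '.txt')) as f:
            self._image_index = [line.strip() for line in f]

    def image_path_at(self, i):
        # Map an index to the path of the corresponding image file.
        return os.path.join(self._data_path, 'images',
                            self._image_index[i] + '.png')

    def gt_roidb(self):
        # One record per image: ground-truth boxes, their rotation
        # angles and class labels (field names are assumptions).
        roidb = []
        for name in self._image_index:
            boxes, alphas = self._load_annotation(name)
            roidb.append({
                'boxes': boxes,          # N x 4: height, width, x, y
                'alphas': alphas,        # N angles in [-pi/4, pi/4)
                'gt_classes': np.ones(len(boxes), dtype=np.int32),
                'flipped': False,        # flipped images are not used
            })
        return roidb

    def _load_annotation(self, name):
        # Replace with your own parser for your annotation format.
        raise NotImplementedError
```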
The following two images show the structure of the training and test networks. Purple denotes network input, which consists of the ground truth boxes (height, width, x-position and y-position), the orientation of these boxes, and the image itself. All loss terms, which are summed up and jointly minimized in the end, are colored orange. Layers between input and loss are colored either blue or green: blue indicates that a layer is unchanged with respect to the original version of the code, while green highlights changes.
We use the same loss terms as the original Faster R-CNN, but in addition we introduce two loss terms for the orientation: one for the orientation estimated within the RPN part of the network and one for the estimate within the R-CNN part. For both orientation estimates we implement an L2 loss function; a weight of 0.2 was found to work well for each of the two rotation loss terms. We predict the rotation α of a box as the angle by which the box is rotated with respect to an x-/y-axis-aligned box: −π/4 ≤ α < π/4. (A sketch of the combined objective follows below.)
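To make the combined objective concrete, here is a minimal NumPy sketch of how the loss terms fit together. It is illustrative only: the function names are ours, the original Faster R-CNN terms are left symbolic, and whether the repository wraps angles exactly as below is an assumption.

```python
import numpy as np

ROT_LOSS_WEIGHT = 0.2  # weight used for each of the two rotation loss terms

def wrap_angle(alpha):
    # Map an arbitrary angle to the equivalent rotation in [-pi/4, pi/4);
    # under this convention, box rotations repeat with period pi/2.
    return (alpha + np.pi / 4) % (np.pi / 2) - np.pi / 4

def orientation_l2_loss(alpha_pred, alpha_gt):
    # L2 loss between predicted and ground-truth box rotations.
    return np.mean((np.asarray(alpha_pred) - np.asarray(alpha_gt)) ** 2)

def total_loss(frcnn_loss_terms,
               alpha_rpn_pred, alpha_rpn_gt,
               alpha_rcnn_pred, alpha_rcnn_gt):
    # Original Faster R-CNN terms plus the two new orientation terms,
    # one from the RPN part and one from the R-CNN part.
    return (sum(frcnn_loss_terms)
            + ROT_LOSS_WEIGHT * orientation_l2_loss(alpha_rpn_pred, alpha_rpn_gt)
            + ROT_LOSS_WEIGHT * orientation_l2_loss(alpha_rcnn_pred, alpha_rcnn_gt))
```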
Training network:
Test network:
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks