Deep learning vision of birds classification implemented throught Caffe.
We modified AlexNet and VGG by combining network similar to Spatial Transformer,in order to fully utilize information of target region rather than the entire image.
To view full details of our model,please read Project_Report.pdf in this repository.