Deep Image Retrieval: Learning global representations for image search In ECCV, 2016学习笔记

contribution is twofold:

(i) we leverage a ranking framework to learn convolution and projection weights that are used to build the region features;

(ii) we employ a region proposal network to learn which regions should be pooled to form the final global descriptor.

当前最先进的是：

the state of the art is currently held by conventional methods relying on local descriptor matching and re-ranking with elaborate spatial verfication

当前使用CNN被限制在：using a pre-trained network as local feature extractor

当前的难点和解决方法有有：

1）图像被压缩还要保留大部分细节；本文可以精确的表达不同大小的长宽比的图像，解决CNN缺少的几何不变的特性；

2）深度学习的图像检索性能落后于传统方法的原因是缺少特定实例检索任务的数据集，基于深度学习的图像检索一般是使用Imagenet预训练的网络提取局部特征，这些特征被用来学习不同的语义分类，但是在类内的变化却是鲁棒的，这对实例检索不利，因为we are interested in distinguishing between particular objects – even if they belong to the same semantic category。

本文的解决手段：

1）建立在R-MAC（regional maximum activation of convolution）基础之上， It aggregates several image regions into a compact feature vector of fixed length and is thus robust to scale and translation（平移）.这种表示可以处理不同长宽比的高分辨率图像，并获得相当好的准确性。构建R-MAC表示所涉及的所有步骤都是可区分的，因此可以以端到端的方式学习权重；

2）use a three-stream Siamese network that explicitly optimizes the weights of the R-MAC representation for the image retrieval task by using a triplet ranking loss；

3）使用Landmarks dataset，并提出清理的方法；

4）池化机制使用region proposal network而不是rigid grid。

rigid grid的问题：

First, as the grid is independent of the image content,it is unlikely that any of the grid regions accurately align with the object of interest.

Second, many of the regions only cover background.

RPN的优点：

First, the region proposals typically cover the object of interest more tightly than the rigid grid.

Second, even if they do not overlap exactly with the region of interest, most of the proposals do overlap significantly with it, which means that increasing the number of proposals per image not only helps to increase the coverage but also helps in the many-to-many matching.

Representations of different images can be then compared using the dot-product（点积）。

使用 shifting and a fully connected (FC) layer代替PCA

Deep Image Retrieval: Learning global representations for image search In ECCV, 2016学习笔记的更多相关文章

Learning to Track at 100 FPS with Deep Regression Networks ECCV 2016 论文笔记
Learning to Track at 100 FPS with Deep Regression Networks ECCV 2016 论文笔记工程网页:http://davheld.git ...
论文解读（GraRep）《GraRep: Learning Graph Representations with Global Structural Information》
论文题目:<GraRep: Learning Graph Representations with Global Structural Information>发表时间: CIKM论文作 ...
Deep learning with Python 学习笔记（5）
本节讲深度学习用于文本和序列用于处理序列的两种基本的深度学习算法分别是循环神经网络(recurrent neural network)和一维卷积神经网络(1D convnet) 与其他所有神经网络一 ...
Deep High-Resolution Representation Learning for Human Pose Estimation
Deep High-Resolution Representation Learning for Human Pose Estimation 2019-08-30 22:05:59 Paper: CV ...
Deep Learning（深度学习）学习笔记整理
申明:本文非笔者原创,原文转载自:http://www.sigvc.org/bbs/thread-2187-1-3.html 4.2.初级(浅层)特征表示既然像素级的特征表示方法没有作用,那怎样的表 ...
Deep Learning（深度学习）学习笔记整理系列之（五）
Deep Learning(深度学习)学习笔记整理系列 zouxy09@qq.com http://blog.csdn.net/zouxy09 作者:Zouxy version 1.0 2013-04 ...
【转载】Deep Learning（深度学习）学习笔记整理
http://blog.csdn.net/zouxy09/article/details/8775360 一.概述 Artificial Intelligence,也就是人工智能,就像长生不老和星际漫 ...
Deep Learning（深度学习）学习笔记整理系列之（八）
Deep Learning(深度学习)学习笔记整理系列 zouxy09@qq.com http://blog.csdn.net/zouxy09 作者:Zouxy version 1.0 2013-04 ...
Deep Learning（深度学习）学习笔记整理系列之（七）
Deep Learning(深度学习)学习笔记整理系列 zouxy09@qq.com http://blog.csdn.net/zouxy09 作者:Zouxy version 1.0 2013-04 ...

随机推荐

解决方法：Could not load file or assembly 'WebGrease, Version=1.5.1.25624, Culture=neutral, PublicKeyToken=31bf3856ad364e35' or one of its dependencies.
最近使用VS2015调试ASP.NET 程序遇到了该问题: 在网上找了很多方法都不能解决,最后自己解决了,方法如下: 在project -> NuGet管理中找到已安装的所有程序:将Web Op ...
「CQOI2016」不同的最小割
「CQOI2016」不同的最小割传送门建出最小割树,把每一个点对的最小割抠出来 \(\text{unique}\) 一下就好了. 参考代码: #include <algorithm> ...
Mockito 中文文档 ( 2.0.26 beta )
Mockito 中文文档 ( 2.0.26 beta ) 由于缺乏校对,难免有谬误之处,如果发现任何语句不通顺.翻译错误,都可以在github中的项目提出issue.谢谢~ Mockito框架官方地址 ...
[Misc] ZSH 常用快捷键
安装 zsh 终端执行 brew install zsh 终端执行 vim ~/.bash_profile 命令,打开 .bash_profile 文件如果没有 vim,请自行安装在打开的文件中, ...
执行脚本，且以脚本名保存log
!/bin/bash path="/sys/devices/platform/soc/fd880000.i2c-pld/i2c-0/i2c-4/i2c-15/15-0060" f_ ...
lua叠代器
注意:叠待值遇到nil就退出叠代器,是符合for遍历框架,需要满足条件 1-叠代函数,常量,控制变量 2-叠代函数可以接受二个参数,当然也可以忽略处理(利用闭包封装参数作为控制变量和状态变量) 无状 ...
Elasticsearch 如何使用RESTful API
所有其他语言可以使用 RESTful API 通过端口 9200 和 Elasticsearch 进行通信,你可以用你最喜爱的 web 客户端访问 Elasticsearch .事实上,正如你所看到的 ...
c++将字符转换成字符串
转载:https://blog.csdn.net/dididisailor/article/details/83189135 char c; string str; stringstream stre ...
如何编写.gitignore文件
为什么要有.gitignore文件项目中经常会生成一些Git系统不需要追踪(track)的文件.典型的是在编译生成过程中产生的文件或是编程器生成的临时备份文件.当然,你不追踪(track)这些文件 ...
46 求1+2+3+...+n 静态成员函数和静态变量
题目描述求1+2+3+...+n,要求不能使用乘除法.for.while.if.else.switch.case等关键字及条件判断语句(A?B:C). 思路: 1)使用构造函数的方法,需要使用sta ...

Deep Image Retrieval: Learning global representations for image search In ECCV, 2016学习笔记

Deep Image Retrieval: Learning global representations for image search In ECCV, 2016学习笔记的更多相关文章

随机推荐

热门专题