图像拼接 SIFT资料合集

转自

http://blog.csdn.net/stellar0/article/details/8741780

分类：

最近也注意一些图像拼接方面的文章，很多很多，尤其是全景图拼接的，实际上类似佳能相机附加的软件，好多具备全景图拼接，多幅图像自动软件实现拼接，构成（合成）一幅全景图像（风景）。
Sift算法，我略知一二，无法仔细描述（刚也贴了2个最近的资料）。
当就尺度空间（scale space),我想，其在计算机视觉（Computer Vision)\图像的多分辨率分析（尤其近年来小波的多分辨率分析）是常见的概念。
人类视觉捕捉景物的时候，先粗略（rough),后细节（fine)的习惯，被研究图像视觉的采用。2点采样使用的情况，则整体图像被不断的1/2边长划分，不同的图像（矩阵）构成了不同分辨率的尺度空间（我们理解为不同层面的矩阵），尺度，Scale,这里就代表不同的空间比例。

我注意到David Lowe关于Sfit算法，2004年发表在Int. Journal of Computer Vision的经典论文中，对尺度空间（scal space)是这样定义的：
It has been shown by Koenderink (1984) and Lindeberg (1994) that under a variety of
reasonable assumptions the only possible scale-space kernel is the Gaussian function. Therefore,
the scale space of an image is defined as a function, L(x; y; delta) that is produced from the convolution of a variable-scale Gaussian, G(x; y; delta), with an input image, I(x; y):
因此，一个图像的尺度空间，L（x,y,delta) ,定义为原始图像I (x,y)与一个可变尺度的2维高斯函数G(x,y,delta) 卷积运算。

关于图象处理中的空间域卷积运算，可以参考经典的图像处理教材（比如美国冈萨雷斯的图象处理，第二版，或者其 Matlab版，都有如何在离散空间进行运算的例子和说明）

注：原文中 delta为希腊字母，这里无法表示，用delta代替。

Sift算法中，提到了尺度空间，请问什么是尺度和尺度空间呢？

在上述理解的基础上，尺度就是受delta这个参数控制的表示。
而不同的L（x,y,delta)就构成了尺度空间（ Space ,我理解，由于描述图像的时候，一般用连续函数比较好描述公式，所以，采用空间集合，空间的概念正规一些），实际上，具体计算的时候，即使连续的高斯函数，都要被离散为（一般为奇数大小）(2*k+1) *(2*k+1)矩阵，来和数字图像进行卷积运算。

1、 SIFT 算法提出及其改进

SIFT算法由D.G.Lowe 1999年提出，2004年完善总结。代表性文献

[1]David G. Lowe, "Object recognition from local scale-invariant features,"International Conference on Computer Vision, Corfu, Greece (September 1999), pp.1150-1157.

[2] David G. Lowe, "Distinctive image features from scale-invariant keypoints,"International Journal of Computer Vision, 60, 2 (2004), pp. 91-110.

具体的MATLAB代码在

http://www.cs.ubc.ca/~lowe/keypoints/可以下载。

Rob Hess 基于GSL和Opencv编写了C语言程序。具体的代码可以在

http://web.engr.oregonstate.edu/~hess/index.html 中下载，可以在VC++.net环境中运行，在调试时要注意对GSL和Opencv的正确配置。

后来Y.Ke将其描述子部分用PCA代替直方图的方式，对其进行改进。

[3] Y. Ke and R. Sukthankar. PCA-SIFT: A More Distinctive Representation for Local Image Descriptors.Computer Vision and Pattern Recognition, 2004

Yanke’s homepage:

http://www.andrew.cmu.edu/user/yke/

2、SIFT 算法主要思想

SIFT算法是一种提取局部特征的算法，在尺度空间寻找极值点，提取位置，尺度，旋转不变量。

3、 SIFT算法的主要特点：

a) SIFT特征是图像的局部特征，其对旋转、尺度缩放、亮度变化保持不变性，对视角变化、仿射变换、噪声也保持一定程度的稳定性。

b) 独特性(Distinctiveness)好，信息量丰富，适用于在海量特征数据库中进行快速、准确的匹配[23]。

c) 多量性，即使少数的几个物体也可以产生大量SIFT特征向量。

d) 高速性，经优化的SIFT匹配算法甚至可以达到实时的要求。

e) 可扩展性，可以很方便的与其他形式的特征向量进行联合。

4、SIFT算法步骤：

1）检测尺度空间极值点

2）精确定位极值点

3）为每个关键点指定方向参数

4）关键点描述子的生成

SIFT算法的介绍参见：SIFT 算法学习小记

前面有朋友问到Sift特征点的提取方法，这里简单做个介绍。

作为一种匹配能力较强的局部描述算子，SIFT算法的实现相当复杂，但从软件开发的角度来说，只要会使用其中几个比较重要的函数就行了。这里要感谢 David Lowe这个大牛，不但提供了一种强悍的特征匹配算法，还给出了C++的实现代码，后来有人用C#实现了这个算法，需要的朋友可到网上自行下载。

关键函数一：

int sift_features( IplImage* img, struct feature** feat )

这个函数就是用来提取图像中的特征向量。参数img为一个指向IplImage数据类型的指针，用来表示需要进行特征提取的图像。IplImage是 opencv库定义的图像基本类型（关于opencv是一个著名的图像处理类库，详细的介绍可以参见 http://www.opencv.org.cn）。参数feat 是一个数组指针，用来存储图像的特征向量。函数调用成功将返回特征向量的数目，否则返回-1.

关键函数二：

int _sift_features( IplImage* img, struct feature** feat, int intvls,double sigma, double contr_thr, int curv_thr, int img_dbl, int descr_width, int descr_hist_bins )

这个函数是函数一的重载，作用是一样的，实际上函数一只不过是使用默认参数调用了函数二，核心的代码都是在函数二中实现的。下面将详细介绍一下其他的几个参数。

intvls: 每个尺度空间的采样间隔数，默认值为3.

sigma: 高斯平滑的数量，默认值1.6.

contr_thr:判定特征点是否稳定，取值（0，1），默认为0.04，这个值越大，被剔除的特征点就越多。

curv_thr:判定特征点是否边缘点，默认为6.

img_dbl:在建立尺度空间前如果图像被放大了1倍则取值为1，否则为0.

descr_width:计算特征描述符时邻域子块的宽度，默认为4.

descr_hist_bins:计算特征描述符时将特征点邻域进行投影的方向数，默认为8，分别是 0，45，90，135，180，215，270，315共8个方向。

如果只是做特征提取，上面这个函数就足够了，不同图像之间的匹配都是建立在这个基础上的，如果你有足够强的编程能力即使不懂这个算法也能够编写出像样的程序来，因为下面的比较说白了都是数据结构的比较，跟算法已经没有太多关系了。如果想要深入了解算法，还是认真的看论文和代码吧。

、ubc:DAVID LOWE---SIFT算法的创始人，两篇巨经典经典的文章

2、cmu:YanKe---PCASIFT,总结的SIFT方面的文章SO全，巨经典

3、ubc:MBROWN---SIFT算法用于图像拼接的经典应用autopano-sift，包括一个SIFTLIB库

4、toronto:Jepson---Matlab SIFT tutorial, 超级超级超级经典~

5、ucla:Vedaldi---加州大学一个博士生编的SIFT，Matlab、C的都有，但没用过

6、一个小的拼接软件ptasmblr

几个关于sift的算法链接网址，以后要养成这种随时保存资料的好习惯！否则后面不知道又要花多少时间去找，最重要的是影响心情。

SIFT 关键点检测和匹配算法,这个算法在图像匹配方面具有很好的性能。

比原有的harris点匹配方式具有各高的佩准准确度，只是算法速度较慢

SIFT Keypoint Detector

This page provides access to a demo version of David Lowe's SIFT keypoint detector in the form of compiled binaries that can run under Linux or Windows. The demo software uses PGM format for image input. It can output keypoints and all information needed for matching them to a file in a simple ASCII format. A Matlab program and sample C code are provided that can read the keypoints and match them between images.

The image on the right shows an example of matching produced by the demo software. Features are extracted from each of the two images, and lines are drawn between features that have close matches. In this example, many matches are found and only a small fraction are incorrect.

This software is provided for research purposes only, and a license must be obtained from the University of British Columbia for commercial applications. See the LICENSE file provided with the software.

The program can be accessed from the following link in the form of a zip file containing the compiled binaries and demo code. To unpack, use "unzip siftDemoV4.zip" from Linux or an unzip utility in Windows. The code comes with a file README giving full details.

SIFT demo program (Version 4, July 2005)

Release historyVersion 4 (July 2005) There is now a Windows binary as well as one for Linux. The Matlab scripts have been improved and include code for matching keypoints between images. Version 3 (August 2004) This fixes a bug in the displayed orientation of arrows drawn for each keypoint when using the "-display" option. This affects only arrow display and not the keypoint features themselves (thanks to Yannik Fournier, Tom Stepleton, and Rob Sim for identifying the bug). In addition, a Matlab program is now provided for running the binary and loading the keypoints into Matlab when running under Linux (thanks to D. Alvaro and J.J. Guerrero for the Matlab program). Version 2 (September 2003) Fixes a bug in Version 1 of the sample code for doing matching (not the keypoints themselves) that incorrectly declared "unsigned char" as "char" (thanks to Yongqin Xiao and Suresh Lodha for their assistance in reporting this bug). This new version now finds more correct matches. The new version also contains more test data and raises the matching threshold. Version 1 (June 2003) Initial demo release. Related papers

The most complete and up-to-date reference for the SIFT feature detector is given in the following journal paper:

David G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, 60, 2 (2004), pp. 91-110. [PDF]

The SIFT approach to invariant keypoint detection was first described in the following ICCV 1999 conference paper, which also gives some more information on the applications to object recognition:

David G. Lowe, "Object recognition from local scale-invariant features," International Conference on Computer Vision, Corfu, Greece (September 1999), pp. 1150-1157. [PDF];

The following paper gives methods for performing 3D object recognition by interpolating between 2D views. It also provides a probabilistic model for verification of recognition.

David G. Lowe, "Local feature view clustering for 3D object recognition," IEEE Conference on Computer Vision and Pattern Recognition, Kau

图像拼接 SIFT资料合集的更多相关文章

Web测试到底是在测什么（资料合集）
开始今晚的主题之前先来看一张图, 这是老徐16年10月份,线上Web主题分享时整理的大纲图片略模糊看得清就好 Web测试, 进行抽离拆分,基本上就如上一些内容. 不管是测什么系统,什么功能,基本 ...
德哥的PostgreSQL私房菜 - 史上最屌PG资料合集
德哥的PostgreSQL私房菜 - 史上最屌PG资料合集
你想找的Python资料这里全都有！没有你找不到！史上最全资料合集
你想找的Python资料这里全都有!没有你找不到!史上最全资料合集 2017年11月15日 13:48:53 技术小百科阅读数:1931 GitHub 上有一个 Awesome - XXX 系列 ...
Java面试题资料合集
Java面试题资料合集 2021年最新版--Java+最常见的+200++面试题汇总+答案总结汇总.pdf BIO,NIO,AIO,Netty面试题 35道.pdf Dubbo面试题 47道.pdf ...
JavaScript推荐资料合集(前端必看)
这份合集覆盖了所有的JavaScript基本知识,从基本网络编程技巧,如变量.函数和循环语句,到高级一些的专题,如表单验证.DOM操作.客户端对象.脚本程序调试.学习前端的你不容错过! 资料名称下载 ...
学习Oracle数据库入门到精通教程资料合集
任何大型信息系统,都需要有数据库管理系统作为支撑.其中,Oracle以其卓越的性能获得了广泛的应用.本合集汇总了学习Oracle数据库从入门到精通的30份教程资料. 资料名称下载地址超详细Orac ...
Swift编程语言资料合集
在本周二凌晨召开的苹果年度开发者大会WWDC上,苹果公司推出了全新的编程语言Swift.Swift 基于C和Objective-C,是供iOS和OS X应用编程的全新语言,更加高效.现代.安全,可以提 ...
VUE相关资料合集
===官方=== https://github.com/vuejs/vue vue-components组件库 ---PC端--- https://github.com/ElemeFE/element ...
NodeJS学习资料合集
1. 官网 nodejs 2. How do I get started with Node.js,stackoverflow提问,收集非常多实用的网站 3. node-books.github收 ...

随机推荐

【6_100】Same Tree
Same Tree Total Accepted: 97481 Total Submissions: 230752 Difficulty: Easy Given two binary trees, w ...
Static Constructors
A static constructor is used to initialize any static data, or to perform a particular action that n ...
UIAlertView、 UIActionSheet
一.UIAlertView. UIActionSheet都是ios系统自带的弹出式对话框,当UIAlertView或UIActionSheet弹出来时用户无法与应用界面中的其它控件交互,UIAlert ...
如何处理webView跳转
- (void)webView:(UIWebView *)wv didFailLoadWithError:(NSError *)error { // Give iOS a chance to o ...
Magicodes.WeiChat——自定义knockoutjs template、component实现微信自定义菜单
本人一向比较喜欢折腾,玩了这么久的knockoutjs,总觉得不够劲,于是又开始准备折腾自己了. 最近在完善Magicodes.WeiChat微信开发框架时,发现之前做的自定义菜单这块太不给力了,而各 ...
eclipse导入Android项目后，项目的名称变为了主Activity的名称
不要使用File -> Import -> Android -> Existing Android Code into Workspace,而是用 File -> Import ...
Python编码/文件读取/多线程
Python编码/文件读取/多线程个人笔记~~记录才有成长编码/文件读取/多线程编码常用的一般是gbk.utf-8,而在python中字符串一般是用Unicode来操作,这样才能按照单个字 ...
ASP.NET中GridView控件删除数据的两种方法
今天在用GridView控件时,发现了一个问题,就是使用GridView控件在删除数据时的问题.接下来我们通过模板列方式和CommandField方式删除某条数据讲解下两者之间的区别. 方式一:通 ...
Entity Framework后台采用分页方式取数据与AspNetPager控件的使用
本文是一个对AspNetPager控件使用的笔记! 有关AspNetPager控件可以查看杨涛主页.这是一个开放的自定义ASP.NET控件,支持各种自定义的数据分页方式,使用很方便,而且功能也很强大, ...
atitit.查看预编译sql问号本质and原理ａｎｄ查看原生ｓｑｌ语句
atitit.查看预编译sql问号本质and原理and查看原生sql语句 1. 预编译原理. 1 2. preparedStatement 有三大优点: 1 3. How to look gene ...

图像拼接 SIFT资料合集

图像拼接 SIFT资料合集的更多相关文章

随机推荐

热门专题