Two good websites for the SUN dataset:

http://vision.princeton.edu/projects/2010/SUN/ (Princeton University)

http://groups.csail.mit.edu/vision/SUN/ (MIT)

The Princeton University SUN dataset homepage:

SUN Database: Scene Categorization Benchmark

Abstract

Scene categorization is a fundamental problem in computer vision. However, scene understanding research has been constrained by the limited scope of currently-used databases which do not capture the full variety of scene categories. Whereas standard databases for object categorization contain hundreds of different classes of objects, the largest available dataset of scene categories contains only 15 classes. In this paper we propose the extensive Scene UNderstanding (SUN) database that contains 899 categories and 130,519 images. We use 397 well-sampled categories to evaluate numerous state-of-the-art algorithms for scene recognition and establish new bounds of performance. We measure human scene classification performance on the SUN database and compare this with computational methods.

Paper

J. Xiao, J. Hays, K. Ehinger, A. Oliva, and A. Torralba.
SUN Database: Large-scale Scene Recognition from Abbey to Zoo.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

J. Xiao, K. A. Ehinger, J. Hays, A. Torralba, and A. Oliva.
SUN Database: Exploring a Large Collection of Scene Categories
International Journal of Computer Vision (IJCV)

Benchmark Evaluation

We use 397 well-sampled categories to evaluate numerous state-of-the-art algorithms for scene recognition and establish new bounds of performance. The results are shown in the figure on the right.

Download Figure4b in Matlab Editable Format (You can put your own curve in the figure.)

Results Visualization

We visualize the results using the combined kernel from all features for the first training and testing partition in the following webpage. For each of the 397 categories, we show the class name, the ROC curve, 5 sample training images, 5 sample correct predictions, the 5 most confident false positives (with true label), and the 5 least confident false negatives (with wrong predicted label).
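As a rough illustration of how such error lists could be selected for one category, here is a minimal Python sketch. The tuple layout, the function name `confident_errors`, and the reading of "least confident false negatives" as the true-class images with the lowest score for that class are all assumptions made for illustration; this is not the code behind the results webpage.

```python
def confident_errors(samples, category, k=5):
    """Pick error examples for one category.

    samples: list of (image_id, true_label, predicted_label, score),
    where score is the classifier confidence for `category`
    (hypothetical layout, assumed for this sketch).
    """
    # False positives: predicted as `category` but actually something else.
    false_pos = [s for s in samples if s[2] == category and s[1] != category]
    # False negatives: actually `category` but predicted as something else.
    false_neg = [s for s in samples if s[1] == category and s[2] != category]
    # Most confident false positives first.
    false_pos.sort(key=lambda s: s[3], reverse=True)
    # Least confident false negatives first (lowest score for the true class).
    false_neg.sort(key=lambda s: s[3])
    return false_pos[:k], false_neg[:k]
```

Calling `confident_errors(samples, "beach")` would return two lists of at most five tuples each, ready to be rendered next to the ROC curve.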

Recognition Results Webpage

Image Database

The database contains the 397 SUN categories used in the benchmark of the paper. The number of images varies across categories, but there are at least 100 images per category and 108,754 images in total. Images are in jpg, png, or gif format. The images provided here are for research purposes only.

SUN397.tar.gz (tar.gz file, 39GB, md5sum=8ca2778205c41d23104230ba66911c7a).
Download by URLs: Dataset Image URLs
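
Since the archive is large, it may be worth checking the download against the published md5sum before unpacking. The following is a minimal sketch using Python's standard `hashlib`; the function names and the local file path are assumptions, and only the checksum value comes from the page above.

```python
import hashlib

# Checksum published on the Princeton page for SUN397.tar.gz.
EXPECTED_MD5 = "8ca2778205c41d23104230ba66911c7a"


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that a multi-gigabyte archive never has to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """True if the file's MD5 matches the expected checksum."""
    return file_md5(path) == expected.lower()
```

For example, `verify_archive("SUN397.tar.gz")` (assuming the archive sits in the current directory) should return `True` for an intact download.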

Training and Testing Partition

For the results in the paper, we use a subset of the dataset with 50 training images and 50 testing images per class, averaging over the 10 partitions below. To plot the curve in Figure 4(b) of the paper, we use the first n = (1, 5, 10, 20) out of the 50 training images per class for training, and always test on the same 50 testing images per class, regardless of the training set size. (If you are using Microsoft Windows, you may need to replace / with \ in the following files.)
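
The "first n training images per class" rule can be sketched in Python as follows. This assumes each partition file lists one image path per line in the form `/a/abbey/image.jpg`, with the directory part identifying the class; that layout and the helper names are assumptions for illustration, not a description of the released files.

```python
from collections import defaultdict


def class_of(image_path):
    """Class name = directory part of a path such as '/a/abbey/x.jpg'.
    Backslashes are normalized so Windows-style paths also work."""
    return image_path.replace("\\", "/").rsplit("/", 1)[0]


def first_n_per_class(image_paths, n):
    """Keep the first n paths of each class, preserving the listed order."""
    kept = []
    seen = defaultdict(int)  # class name -> number of images kept so far
    for path in image_paths:
        label = class_of(path)
        if seen[label] < n:
            seen[label] += 1
            kept.append(path)
    return kept


def to_windows(image_path):
    """Replace / with \\ for use on Microsoft Windows."""
    return image_path.replace("/", "\\")
```

With the 50 training paths per class loaded into a list, `first_n_per_class(train_paths, 5)` would give the n=5 training subset, while the testing list is used unchanged.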

Download All Partitions (zip file).

Source Code for Benchmark Evaluation

Source Code Download

Scene Hierarchy

We have manually built an overcomplete three-level hierarchy for all 908 scene categories. The scene categories are arranged in three levels: 908 leaf nodes (SUN categories) are connected to 15 parent nodes at the second level (basic-level categories), which are in turn connected to 3 nodes at the first level (superordinate categories), with the root node at the top. Because a category can have more than one parent, the hierarchy is not strictly a tree but a directed acyclic graph. Many categories such as "hayfield" are duplicated in the hierarchy because there might be confusion over whether such a category belongs in the natural or man-made sub-hierarchies.
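
To make the "not a tree, but a directed acyclic graph" point concrete, here is a small Python sketch of a fragment of such a hierarchy. Only "hayfield" and the natural/man-made distinction come from the text above; every other node name is a hypothetical placeholder, and the parent-list representation is just one possible encoding.

```python
# Hypothetical fragment: each node maps to its list of parents.
# Names other than "hayfield" are placeholders, not the real SUN labels.
PARENTS = {
    "hayfield": ["natural_basic_level", "man_made_basic_level"],
    "natural_basic_level": ["outdoor_natural"],
    "man_made_basic_level": ["outdoor_man_made"],
    "outdoor_natural": ["root"],
    "outdoor_man_made": ["root"],
    "root": [],
}


def ancestors(node, parents=PARENTS):
    """All nodes reachable by following parent links (excluding node)."""
    found = set()
    stack = list(parents.get(node, []))
    while stack:
        current = stack.pop()
        if current not in found:
            found.add(current)
            stack.extend(parents.get(current, []))
    return found


def is_tree(parents=PARENTS):
    """A tree allows at most one parent per node; a DAG may have more."""
    return all(len(p) <= 1 for p in parents.values())
```

Here `is_tree()` is false because "hayfield" has two parents, which is exactly the duplication described above.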

Explore SUN Database

Kernel Matrices for SVM

Combined Training Kernel
Combined Testing Kernel
Best Single Training Kernel (HOG2x2)
Best Single Testing Kernel (HOG2x2)
Other kernel matrices are available at THIS LINK.

Feature Matrices

The feature matrices are available at THIS LINK.

Human Classification Experiments

Human Confusion Matrix (Of the 13 good workers): good_workers_confusion.mat.
Overall confusion matrix and code for analysis: human_release.zip.
Mturk template for the experiment: template

DrawMe: A light-weight Javascript library for line drawing on a picture

DrawMe is a light-weight Javascript library that enables client-side line drawing on a picture in a web browser. It is intended as a basis for self-defined labeling tasks for computer vision researchers. It differs from LabelMe, which provides full support but a fixed labeling interface. DrawMe is a Javascript library only, and users are required to write their own code to use it for their specific labeling needs. DrawMe does not provide any server or server-side code for labeling, but it gives the user greater flexibility. It also comes with a simple example of an Amazon Mechanical Turk interface that serializes a Javascript DOM object into text for HTML form submission. Users can easily build their own labeling interface based on this MTurk example and use Amazon Mechanical Turk for labeling, either with paid workers or by the researchers themselves in the MTurk sandbox.

Download DrawMe

------------------------------------------------------------

The MIT SUN dataset homepage:

Goals

The goal of the SUN database project is to provide researchers in computer vision, human perception, cognition and neuroscience, machine learning and data mining, computer graphics and robotics, with a comprehensive collection of annotated images covering a large variety of environmental scenes, places and the objects within. To build the core of the dataset, we counted all the entries in the WordNet English dictionary that corresponded to names of scenes, places and environments (any concrete noun which could reasonably complete the phrase "I am in a place" or "Let's go to the place"). Once we established a vocabulary for scenes, we collected images belonging to each scene category using online image search engines by querying for each scene category term, and annotated the objects in the images manually.

Scene Recognition Benchmark

To evaluate descriptors and classifiers for scene classification:

SUN397 Scene benchmark (397 scene categories), tar file (37GB, md5sum=58b3a6f1b8d6ec003458f940ada226bb) and project page for code, precomputed features, etc.

Object Detection Benchmark

The following collection contains only the fully annotated images from SUN. Each release contains the images from previous years.

SUN2012: 16,873 images, tar file (7.3GB).
Also available in PASCAL format: tar file (5.9GB). It also includes a training and testing split that we recommend following.
See the instructions for using DPMv5 + PASCAL VOC Devkit + SUN2012.
Download the DPM v5 models trained using SUN2012

Citation

If you find this dataset useful, please cite this paper (and refer to the data as SUN397, SUN2012, or SUN):

"SUN Database: Large-scale Scene Recognition from Abbey to Zoo". J. Xiao, J. Hays, K. Ehinger, A. Oliva, and A. Torralba. IEEE Conference on Computer Vision and Pattern Recognition, 2010.

To learn more about the object annotation process (and the annotator), see this technical note:

"Notes on image annotation". A. Barriuso and A. Torralba. arXiv:1210.3448 [cs.CV] (unrefereed).

Download Latest Dataset

You can download the raw SUN database using the LabelMe toolbox. If you do not have the latest version of the toolbox (or if you do not have the function SUNinstall.m), you should download the toolbox first:

LabelMe toolbox

To download the latest version of the database enter the Matlab commands:

>> yourpathimages = 'SUNDATABASE/Images';
>> yourpathannotations = 'SUNDATABASE/Annotations';
>> SUNinstall(yourpathimages, yourpathannotations);

The variables yourpathimages and yourpathannotations should point to the local paths where you want to download the images and annotations.

The first time you call SUNinstall, it will download the full set of images and annotations. Subsequent calls to SUNinstall will download only the images added since the last download, plus the full set of annotations. If the download is interrupted, the next call will not re-download images that were already downloaded.

If you want to download only one folder, you can specify a folder name:

>> folder = 'b/beach';
>> SUNinstall(yourpathimages, yourpathannotations, folder);

As new images are annotated every day, you will get a slightly different version each time you download the database. If you are looking for a frozen copy of the database, use the links in the benchmark sections above.
