Joint Deep Learning for Pedestrian Detection笔记

旅途中 2024-10-18 18:23:23 原文

1、结构图

Introduction

Feature extraction, deformation handling, occlusion handling, and classification are four important components in pedestrian detection. Existing methods learn or design these components either individually or sequentially. The interaction among these components is not yet well explored. This paper proposes that they should be jointly learned in order to maximize their strengths through cooperation. We formulate these four components into a joint deep learning framework and propose a new deep network architecture

Contribution Highlights

A unified deep model for jointly learning feature extraction, a part deformation model, an occlusion model and classification. With the deep model, these components interact with each other in the learning process, which allows each component to maximize its strength when cooperating with others .
We enrich the operation in deep models by incorporating the deformation layer into the convolutional neural networks (CNN). With this layer, various deformation handling approaches can be applied to our deep model.
The features are learned from pixels through interaction with deformation and occlusion handling models . Such interaction helps to learn more discriminative features.

Citation

If you use our codes or dataset, please cite the following papers:

W. Ouyang and X. Wang. Joint Deep Learning for Pedestrian Detection. In ICCV, 2013. PDF

Code (Matlab code on Wnidows OS)

Code and dataset on Google Drive:

For users who cannot download from Google Drive:

Code and dataset on Baidu

The files are on the GoogleDocs and Baidu. To Run the code, please read the following readme file:

Readme
1. Put all of the documents into the same folder and decompress them using the command "extract to here". Suppose the root folder is "root", then you should have three folders "root/CNN", "root/data", "root/model", "root/NN", "root/tmptoolbox", "root/util", and "root/dbEval". For "root/data", there should be 4 folders: "root/data/CaltechTest", "root/data/CaltechTrain", "root/data/ETH", and "root/data/INRIATrain".
2. Run the "cnnexamples.m" or "testing.m." in the folder "root/CNN" to obtain the results.

FAQ

Frequently Asked Question and Answer for the code (13 KB)

Joint Deep Learning for Pedestrian Detection笔记的更多相关文章

个性探测综述阅读笔记——Recent trends in deep learning based personality detection
目录 abstract 1. introduction 1.1 个性衡量方法 1.2 应用前景 1.3 伦理道德 2. Related works 3. Baseline methods 3.1 文本 ...
Deep learning with Python 学习笔记（11）
总结机器学习(machine learning)是人工智能的一个特殊子领域,其目标是仅靠观察训练数据来自动开发程序［即模型(model)］.将数据转换为程序的这个过程叫作学习(learning) 深 ...
Deep learning with Python 学习笔记（10）
生成式深度学习机器学习模型能够对图像.音乐和故事的统计潜在空间(latent space)进行学习,然后从这个空间中采样(sample),创造出与模型在训练数据中所见到的艺术作品具有相似特征的新作品 ...
Deep learning with Python 学习笔记（9）
神经网络模型的优化使用 Keras 回调函数使用 model.fit()或 model.fit_generator() 在一个大型数据集上启动数十轮的训练,有点类似于扔一架纸飞机,一开始给它一点推 ...
Deep learning with Python 学习笔记（8）
Keras 函数式编程利用 Keras 函数式 API,你可以构建类图(graph-like)模型.在不同的输入之间共享某一层,并且还可以像使用 Python 函数一样使用 Keras 模型.Ker ...
Deep learning with Python 学习笔记（7）
介绍一维卷积神经网络卷积神经网络能够进行卷积运算,从局部输入图块中提取特征,并能够将表示模块化,同时可以高效地利用数据.这些性质让卷积神经网络在计算机视觉领域表现优异,同样也让它对序列处理特别有效. ...
Deep learning with Python 学习笔记（6）
本节介绍循环神经网络及其优化循环神经网络(RNN,recurrent neural network)处理序列的方式是,遍历所有序列元素,并保存一个状态(state),其中包含与已查看内容相关的信息. ...
Deep learning with Python 学习笔记（5）
本节讲深度学习用于文本和序列用于处理序列的两种基本的深度学习算法分别是循环神经网络(recurrent neural network)和一维卷积神经网络(1D convnet) 与其他所有神经网络一 ...
Deep learning with Python 学习笔记（4）
本节讲卷积神经网络的可视化三种方法可视化卷积神经网络的中间输出(中间激活) 有助于理解卷积神经网络连续的层如何对输入进行变换,也有助于初步了解卷积神经网络每个过滤器的含义可视化卷积神经网络的过滤 ...

随机推荐

python内置函数 2
import__( name[, globals[, locals[, fromlist[, level]]]])被 import 语句调用的函数. 它的存在主要是为了你可以用另外一个有兼容接口的函数 ...
Intellij快捷键
post multipart data boundary问题使用curl 向jersey post文件
原以为curl 模拟post file跟post string类似,-d参数一加 ,header一加就完了,这次遇到个问题,却怎么都搞不定. curl模拟post提交与客户端定的协议是: Heade ...
jquery统计页面的pv/ip及停留时间等
我们在做网站的时候经常需要统计网站的访问信息,这里介绍一个用jquery写的一个统计方法新建一个js文件jun_record.js 代码如下: var start; var end; var tim ...
iOS开发UI篇—UIWindow简单介绍
iOS开发UI篇—UIWindow简单介绍一.简单介绍 UIWindow是一种特殊的UIView,通常在一个app中只会有一个UIWindow iOS程序启动完毕后,创建的第一个视图控件就是UIWi ...
JavaScript 用法
JavaScript 用法 HTML 中的脚本必须位于 <script> 与 </script> 标签之间. 脚本可被放置在 HTML 页面的 <body> 和 & ...
SharePoint开发——利用CSOM逐级获取O365中SharePoint网站的List内容
博客地址:http://blog.csdn.net/FoxDave 本文介绍如何利用SharePoint客户端对象模型(.NET)逐级获取Office 365网站中List的内容,仅仅是示例,没有 ...
IOS开发支付宝集成思路
一般情况下支付功能的交互流程比如我们去某个APP去支付一个产品,流程为:1.用户点击支付->2.客户端请求服务器用户支付->3.服务器接收请求生成金额订单等要给第三方支付的一切信息,并生 ...
BlackHat会议上将公布一款免费的汽车黑客工具
汽车,无可厚非是现代社会很重要的交通工具,但与此同时却也带来了诸多安全隐患,不管怎样,汽车安全都是我们不可忽视的一个重大问题. 即将免费分享该工具近日一名法国研究者将发布一款检测汽车安全漏洞的工具, ...
回顾Spring框架
Spring框架: 传统JavaEE解决企业级应用问题时的"重量级"架构体系,使它的开发效率,开发难度和实际的性能都令人失望.Spring是以一个救世主的身份降临在广大的程序员面 ...