DEEP LEARNING WITH STRUCTURE

Charlie Tang is a PhD student in the Machine Learning group at the University of Toronto, working with Geoffrey Hinton and Ruslan Salakhutdinov. His research interests include machine learning, computer vision and cognitive science. More specifically, he has developed various higher-order extensions to generative models in deep learning for vision.
At the Deep Learning Summit in Boston next month, Charlie will present 'Deep Learning with Structure'. Supervised neural networks trained on massive datasets have recently achieved impressive performance in computer vision, speech recognition, and many other tasks. While extremely flexible, neural nets are often criticized because their internal representations are distributed codes and lack interpretability; during his presentation at the summit, Charlie will reveal how we can address some of these concerns.
We had a quick Q&A with Charlie ahead of the Deep Learning Summit, to hear more of his thoughts on developments and challenges in deep learning.
What are the key factors that have enabled recent advancements in deep learning?
The three key factors are:
- The steadfast belief and knowledge that supervised neural networks trained with enough labelled data can achieve great test set generalization.
- The availability of high-performance hardware and software, in particular Nvidia's CUDA architecture and SDK. This allowed more experimentation and made learning from large-scale data practical.
- The development of superior models: switching to rectified linear hidden units from the sigmoid or hyperbolic tangent units and the invention of regularization techniques, specifically "Dropout".
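As a rough illustration of the two techniques named in the last point, here is a minimal pure-Python sketch of a rectified linear unit and of Dropout. It is not taken from the interview: the function names `relu` and `dropout`, the `p_drop` parameter, and the "inverted" scaling convention (rescaling the surviving units at training time) are assumptions chosen for the example.

```python
import random


def relu(values):
    """Rectified linear unit: keep positive activations, clamp the rest to 0."""
    return [max(0.0, v) for v in values]


def dropout(values, p_drop=0.5, training=True, rng=None):
    """Inverted dropout (an assumed convention for this sketch).

    During training each unit is zeroed with probability p_drop and the
    survivors are scaled by 1 / (1 - p_drop), so the expected activation
    stays the same. At test time the input passes through unchanged.
    """
    if not 0.0 <= p_drop < 1.0:
        raise ValueError("p_drop must be in [0, 1)")
    if not training or p_drop == 0.0:
        return list(values)
    rng = rng or random.Random()
    keep = 1.0 - p_drop
    return [v / keep if rng.random() < keep else 0.0 for v in values]


hidden = relu([-1.5, 0.0, 2.0, 3.0])
print(hidden)                                   # [0.0, 0.0, 2.0, 3.0]
print(dropout(hidden, p_drop=0.5))              # random mask, survivors doubled
print(dropout(hidden, training=False))          # unchanged at test time
```

Unlike sigmoid or tanh units, the rectified linear unit does not saturate for positive inputs, and the random masking in Dropout discourages hidden units from relying on one another.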
What are the main types of problems now being addressed in the deep learning space?
Almost all problems in statistical machine learning are currently being investigated using deep learning techniques. They include visual and speech recognition, reinforcement learning, natural language processing, medical and health applications, financial engineering and many others.
What are the practical applications of your work and what sectors are most likely to be affected?
The deep learning revolution allows models trained on big data to drastically improve accuracy. This means that many artificial intelligence recognition tasks that previously required a human in the loop can now be automated.
What developments can we expect to see in deep learning in the next 5 years?
Deep learning algorithms will gradually be adopted for more tasks and will "solve" more problems. For example, 5 years ago, algorithmic face recognition accuracy was still somewhat worse than human performance. Today, however, superhuman performance has been reported on the main face recognition dataset (LFW) and the standard image classification dataset (ImageNet). In the next 5 years, increasingly hard problems such as video recognition, medical imaging and text processing will be successfully tackled by deep learning algorithms. We can also expect deep learning algorithms to be ported to commercial products, much as the face detector was incorporated into consumer cameras over the past 10 years.
What advancements excite you most in the field?
I feel that the most exciting advance is the availability of low-energy mobile hardware that supports deep learning algorithms. This will inevitably lead to many real-time systems and mobile products that will become part of our daily lives.
The Deep Learning Summit is taking place in Boston on 26-27 May. For more information and to register, please visit the event website here.
Join the conversation with the event hashtag #reworkDL