PaperReading20200219

CanChen ggchen@mail.ustc.edu.cn

This is my first day sharing my reading paper and I will try to paraphrase core ideas in these papers with very simple words. Every paper consists three parts, namely, motivation, method, and contribution. In each part, no more than 3 sentences will be used.

NAS-Bench-201

Motivation:Network search algorithms are often quite expensive and different search spaces also make it difficult for us to compare these algorithms. In fact, we can treat network architectures as X and their corresponding accuracies as Y, and construct a standard dataset to solve this problem.
Method: Using cell-based strategy, we only need to find a cell and insert it into the macro structure. In the paper, 4 nodes and 5 operations are used, which means we need to train 15625 cells. As last，the author just trained 15625 models on cifar10，cifar100 and sampled-ImageNet，and provided us with the corresponding training logs.
Contribution: The paper is a ICLR paper and is not very novel（at least I think）. It shows us again: computing resources is very important. At least, it gives us a benchmark for NAS research and now we can use CPU to do NAS.

Peephole

Motivation: Can we get the network's performance without training?
Method: The author only considers sequential network architectures since we can treat the sequential network architectures as "a language". Then the author uses LSTM to deal with this problem like language modeling.
Contribution: The work is kind of limited since it only deals with sequential network architecures while other structures such shortcut paths are in fact dominating this field.

Latency-aware

Motivation: Current Darts algorithms do not take latency into consideration.
Method: First, the author train a regression model that can predict a network's latency based on the network's structure. Then the author inserts this model into bi-level optimization equation as part of the loss function.
Contribution: This work is an extension of Darts and can be very useful since latency is important in real scenarios.

PaperReading20200219的更多相关文章

随机推荐

ProtoBuf试用与JSON的比较
介绍 ProtoBuf 是google团队开发的用于高效存储和读取结构化数据的工具.什么是结构化数据呢,正如字面上表达的,就是带有一定结构的数据.比如电话簿上有很多记录数据,每条记录包含姓名.ID.邮 ...
XMPPFramework 框架
https://blog.csdn.net/qq_29846663/article/details/70170646 2017-04-14 11:37:02 于海明阅读数 478更多分类专栏: i ...
吴裕雄--天生自然Numpy库学习笔记：NumPy 统计函数
NumPy 提供了很多统计函数,用于从数组中查找最小元素,最大元素,百分位标准差和方差等. numpy.amin() 用于计算数组中的元素沿指定轴的最小值. numpy.amax() 用于计算数组中的 ...
Pandas的Categorical Data类型
pandas从0.15版开始提供分类数据类型,用于表示统计学里有限且唯一性数据集,例如描述个人信息的性别一般就男和女两个数据常用'm'和'f'来描述,有时也能对应编码映射为0和1.血型A.B.O和AB ...
springboot例子
@Mapperpublic interface FinancingMapper { @Insert("<script>" + "insert into fin ...
dpkg 命令
dpkg 是Debian Package的简写,是为Debian 专门开发的套件管理系统,方便软件的安装.更新及移除.所有源自Debian的Linux发行版都使用dpkg,例如Ubuntu.Knopp ...
Java入门笔记 01-Java基础语法
一.数据类型和运算符 1.注释可以提高程序的可读性.可划分为单行注释 // 多行注释 /.../ 文档注释 /**...*/ 2.标识符的命名规则: 标识符必须以字母.下划线_.美元符号$开头. 标 ...
mysql字符串相关函数（并与sql server对比）
https://blog.csdn.net/zhengxiuchen86/article/details/81220779 1.判断子串substr在字符串str中出现的位置例子:查询']'在‘OP ...
Thread的join方法
一个线程在执行的过程中,可能调用另一个线程,前者可以称为调用线程,后者成为被调用线程. Thread.Join方法的使用场景:调用线程挂起,等待被调用线程执行完毕后,继续执行. 如下案列: 当NewT ...
Python 中命令行参数解析工具 docopt 安装和应用
什么是 docopt? 1.docopt 是一种 Python 编写的命令行执行脚本的交互语言. 它是一种语言! 它是一种语言! 它是一种语言! 2.使用这种语言可以在自己的脚本中,添加一些规则限制. ...