Paper notes: Heterogeneous Face Attribute Estimation: A Deep Multi-Task Learning Approach

Heterogeneous Face Attribute Estimation: A Deep Multi-Task Learning Approach

2017.11.28

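Introduction:

　　Facial attributes carry a wide range of information that matters in social interaction, including the person's identity, demographics (age, gender, and race), hair style, clothing, etc. Applications built on facial attribute recognition are also multiplying, for example: (i) video surveillance; (ii) face retrieval; (iii) social media. Although attribute recognition has progressed considerably in recent years, most prior works are limited to predicting a single attribute (e.g., age), or learn a separate model for each attribute. To address these limitations, a number of works have attempted to predict multiple attributes jointly (see references [19]-[23] of the paper). These methods, however, each have shortcomings: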

　　1. The approaches in [19], [20], [22] used the same features for estimating all the attributes without considering the attribute heterogeneity.

　　2. The sum-product network (SPN) adopted in [21] for modeling attribute correlations may not be feasible because of the exponentially growing number of attribute group combinations.

　　3. The cascade network in [23] also required learning a separate Support Vector Machine (SVM) classifier for each face attribute, and is not an end-to-end learning approach.

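　　Figure 1 of the paper illustrates both the correlation and the heterogeneity of face attributes. Pairs of attributes can be positively or negatively correlated. At the same time, individual attributes are heterogeneous in data type and scale as well as in semantic meaning. As the paper puts it: "Such attribute correlation and heterogeneity should be considered in designing face attribute estimation models."

Proposed Algorithm:

　　The paper proposes a Deep Multi-Task Learning (DMTL) approach that jointly estimates multiple attributes from a single face image. The approach is inspired by existing methods, but handles both attribute correlation and attribute heterogeneity within a single network. DMTL consists of an early stage of shared feature learning followed by category-specific feature learning for the individual attribute predictions. The shared feature learning naturally exploits the correlation between tasks, which yields a more robust and efficient feature representation.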

Main Contributions:

　　(i) an efficient multi-task learning (MTL) method for joint estimation of a large number of face attributes;

　　(ii) modeling both attribute correlation and attribute heterogeneity in a single network;

　　(iii) studying the generalization ability of the proposed approach under cross-database testing scenarios;

　　(iv) compiling the LFW+ database with face images in the wild (LFW), and heterogeneous demographic attributes (age, gender, and race) via crowdsourcing.

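Proposed Approach:

　　1. Deep Multi-task Learning:

　　The goal is to estimate multiple face attributes simultaneously with one joint model. While a large number of face attributes makes efficient feature learning challenging, it also provides an opportunity to exploit the relationships between attributes ("leveraging the attribute inter-correlations to obtain informative and robust feature representation"). For example, the attributes in the CelebA dataset are strongly correlated with one another, as the correlation figure in the paper shows.

　　Given this, learning the attributes in a multi-task framework is a very intuitive choice. However, because of appearance variations and the heterogeneity of individual attributes, the mapping from the face image space to the attribute space is usually nonlinear. The joint attribute estimation model should therefore be able to capture complex, compound nonlinear transformations. CNN models are an effective way to handle both MTL and nonlinear transformation learning, so the paper adopts a CNN-based multi-task framework for this task.

　　A traditional DMTL model for joint attribute estimation can be formulated as minimizing a regularized error function (Eq. 1 in the paper).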
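　　To make "attribute correlation" concrete, here is a minimal sketch of how a pairwise correlation between binary attribute labels can be computed. The attribute names follow CelebA's naming, but the label values below are invented purely for illustration and are not taken from the dataset or the paper.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Toy binary labels for 8 faces (made up for this example).
labels = {
    "Male":             [1, 1, 1, 0, 0, 0, 1, 0],
    "No_Beard":         [0, 0, 1, 1, 1, 1, 0, 1],
    "Wearing_Lipstick": [0, 0, 0, 1, 1, 1, 0, 0],
}

# Full pairwise correlation table, keyed by (attribute_a, attribute_b).
names = list(labels)
corr = {(a, b): pearson(labels[a], labels[b]) for a in names for b in names}

for a in names:
    print(a, [round(corr[(a, b)], 2) for b in names])
```

　　In this toy data, "Male" is negatively correlated with the other two attributes, while "No_Beard" and "Wearing_Lipstick" are positively correlated; both kinds of relationship are what a shared representation can exploit.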
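　　The equation itself did not survive in these notes. A plausible reconstruction of such a per-task regularized objective is sketched below; the symbols are assumptions chosen for this sketch (M attributes, N training images X_i with labels y_i^j, per-attribute weights W^j, prediction function F, loss L, regularizer Φ with weight γ > 0), and the paper's exact notation may differ.

```latex
\arg\min_{\{W^j\}_{j=1}^{M}} \sum_{j=1}^{M} \sum_{i=1}^{N}
  \mathcal{L}\big(y_i^j,\ \mathcal{F}(X_i, W^j)\big) + \gamma\,\Phi(W^j)
```

　　Note that in this form every attribute j has its own weights W^j and nothing ties the tasks together.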

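　　That model follows the standard recipe of a loss term plus a regularization term. This formulation is not optimal, however, because the relationships between attributes are not taken into account, whereas the attribute estimators should share some features; this is also supported by other work [34]. Eq. 1 does not explicitly enforce a large portion of feature sharing during MTL. The paper therefore reformulates the objective (its Eq. 2) so that the sharing becomes explicit.

　　Here, W_c controls the features shared among the face attributes, and W^j controls how the shared features are updated for each attribute. Specifically, as shown in Fig. 2, a face image is first projected to a high-level representation through a shared deep network (W_c) consisting of a cascade of complex non-linear mappings, and then refined by shallow subnetworks ({W^j}, j = 1, ..., M) towards individual attribute estimation tasks.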
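　　As a sketch only, an objective with explicit feature sharing can be written as follows. The notation is assumed for this sketch rather than copied from the paper: N training images X_i, M attributes with labels y_i^j, shared weights W_c, attribute-specific weights W^j, the composition W^j ∘ W_c meaning "apply the shared network, then the attribute-specific one", and separate regularization weights γ_1 and γ_2.

```latex
\arg\min_{W_c,\ \{W^j\}_{j=1}^{M}} \sum_{j=1}^{M} \sum_{i=1}^{N}
  \mathcal{L}\big(y_i^j,\ \mathcal{F}(X_i, W^j \circ W_c)\big)
  + \gamma_1\,\Phi(W_c) + \gamma_2\,\Phi(W^j)
```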
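　　The shared-then-specific structure can be illustrated with a toy forward pass. This is a minimal sketch, not the paper's network: a single ReLU layer stands in for the deep shared CNN (W_c), each head is one linear layer standing in for a shallow subnetwork (W^j), and the attribute names, dimensions, and random weights are all hypothetical.

```python
import random

def relu(v):
    return [max(0.0, x) for x in v]

def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def shared_features(x, W_c):
    """Shared trunk: one nonlinear layer standing in for the deep shared network."""
    return relu(matvec(W_c, x))

def predict_all(x, W_c, heads):
    """Compute the shared features once, then apply every attribute head to them."""
    h = shared_features(x, W_c)
    return {name: matvec(W_j, h) for name, W_j in heads.items()}

def random_matrix(rng, rows, cols):
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
INPUT_DIM, SHARED_DIM = 6, 4          # hypothetical sizes

W_c = random_matrix(rng, SHARED_DIM, INPUT_DIM)
heads = {
    "age":    random_matrix(rng, 1, SHARED_DIM),  # one regression output
    "gender": random_matrix(rng, 2, SHARED_DIM),  # two class scores
    "race":   random_matrix(rng, 3, SHARED_DIM),  # three class scores
}

x = [rng.uniform(0.0, 1.0) for _ in range(INPUT_DIM)]  # stand-in for image features
outputs = predict_all(x, W_c, heads)
print({name: len(scores) for name, scores in outputs.items()})
```

　　Because every head reads the same vector of shared features, a gradient from any attribute's loss would update W_c, which is how correlated tasks end up informing one another during training.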

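Heterogeneous Face Attributes Estimation:

　　Although the DMTL above exploits attribute correlations during feature learning, the attribute heterogeneity still needs to be handled. The heterogeneity of individual face attributes has been pointed out before, but has not received much attention, for two reasons: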

　　1. many of the public-domain face databases are labeled with a single attribute, so the requirement of designing corresponding models becomes no longer urgent;

　　2. many of the published methods choose to learn a separate model for each face attribute; model learning for individual attributes does not face the attribute heterogeneity problem.

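　　The paper treats each heterogeneous attribute category separately, while the attributes within a category are expected to share feature learning and the classification model. To achieve this, the objective function is rewritten (Eq. 3 in the paper).

　　where G is the number of heterogeneous attribute categories.

　　Partitioning a large set of attributes into a few heterogeneous categories relies on prior knowledge. Here, face attribute heterogeneity is considered in terms of data type and scale (i.e., ordinal vs. nominal) and semantic meaning (i.e., holistic vs. local), and category-specific modeling is then described for each of these heterogeneous attribute categories.

　　Nominal vs. ordinal attributes.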
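　　For orientation, a category-level objective of this kind might be written as below. This is a hedged reconstruction under assumed notation, not a quotation of the paper's Eq. 3: N training images X_i, G attribute categories, M^g attributes in category g with labels y_i^j, shared weights W_c, category-specific weights W^g, a category-specific loss L^g with importance weight λ^g, and regularization weights γ_1 and γ_2.

```latex
\arg\min_{W_c,\ \{W^g\}_{g=1}^{G}} \sum_{g=1}^{G} \sum_{j=1}^{M^g} \sum_{i=1}^{N}
  \lambda^g\,\mathcal{L}^g\big(y_i^j,\ \mathcal{F}(X_i, W^g \circ W_c)\big)
  + \gamma_1\,\Phi(W_c) + \gamma_2\,\Phi(W^g)
```

　　Compared with a per-attribute formulation, the specific weights and the loss are indexed by category g rather than by attribute j, so attributes of the same kind share both.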
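　　As a rough illustration of category-specific modeling, the sketch below pairs each attribute category with its own loss and sums the weighted losses. Pairing cross-entropy with nominal attributes and squared error with ordinal ones is a common choice that is assumed here; the category weights, attribute names, and numbers are hypothetical.

```python
import math

def cross_entropy(logits, target_index):
    """Softmax cross-entropy for one sample, computed in a numerically stable way."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target_index]

def squared_error(prediction, target):
    """Squared error for one scalar prediction."""
    return (prediction - target) ** 2

# One entry per heterogeneous category: its loss, its weight, and its attributes.
CATEGORIES = {
    "nominal": {"loss": cross_entropy, "weight": 1.0, "attributes": ["gender", "race"]},
    "ordinal": {"loss": squared_error, "weight": 1.0, "attributes": ["age"]},
}

def heterogeneous_loss(predictions, targets, categories=CATEGORIES):
    """Weighted sum of category-specific losses over all attributes."""
    total = 0.0
    for spec in categories.values():
        for name in spec["attributes"]:
            total += spec["weight"] * spec["loss"](predictions[name], targets[name])
    return total

# Hypothetical model outputs and ground truth for one face image.
predictions = {"gender": [2.0, 0.0], "race": [0.0, 0.0, 0.0], "age": 30.0}
targets = {"gender": 0, "race": 1, "age": 32.0}

loss = heterogeneous_loss(predictions, targets)
print(round(loss, 4))
```

　　In practice the per-category weights would need tuning, since a squared error on raw ages can easily dominate the classification terms.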
