CVPR 2020论文收藏(转知乎:https://zhuanlan.zhihu.com/p/112337176)
CVPR 2020 共收录 1470篇文章,根据当前的公布情况,人工智能学社整理了以下约100篇,分享给读者。
代码开源情况:详见每篇注释,当前共15篇开源。(持续更新中,可关注了解)。
算法主要领域:图像与视频处理,图像分类&检测&分割、视觉目标跟踪、视频内容分析、人体姿态估计、模型加速、网络架构搜索(NAS)、生成对抗(GAN)、光学字符识别(OCR)、人脸识别、三维重建等方向。 目录如下:
# 总目录
# 图像处理
1. Deep Image Harmonization via Domain Verification
论文:Deep Image Harmonization via Domain Verification
代码:bcmi/Image_Harmonization_Datasets
2. Learning to Shade Hand-drawn Sketches
论文:Learning to Shade Hand-drawn Sketches
3. Generalized ODIN: Detecting Out-of-distribution Image without Learning from Out-of-distribution Data
论文:Generalized ODIN: Detecting Out-of-distribution Image without Learning from Out-of-distribution Data
4. Single Image Reflection Removal through Cascaded Refinement
论文:https://arxiv.org/abs/1911.06634
5. RoutedFusion: Learning Real-time Depth Map Fusion
论文:https://arxiv.org/pdf/2001.04388.pdf
# 图像分类
1. Towards Robust Image Classification Using Sequential Attention Models
论文:Towards Robust Image Classification Using Sequential Attention Models
2. Self-training with Noisy Student improves ImageNet classification
论文:Self-training with Noisy Student improves ImageNet classification
3. Image Matching across Wide Baselines: From Paper to Practice
论文:Image Matching across Wide Baselines: From Paper to Practice
4. Improved Few-Shot Visual Classification
论文:https://arxiv.org/pdf/1912.03432.pdf
5. A General and Adaptive Robust Loss Function
论文:A General and Adaptive Robust Loss Function
6. Making Better Mistakes: Leveraging Class Hierarchies with Deep Networks
论文:Making Better Mistakes: Leveraging Class Hierarchies with Deep Networks
# 目标检测和分割
![](http://www.images.studyai.com/blog/20200311002818135.jpg)
1. Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector
论文:Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector
2. Bridng the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection
论文:https://arxiv.org/abs/1912.02424
3. Semi-Supervised Semantic Image Segmentation with Self-correcting Networks
论文:Semi-Supervised Semantic Image Segmentation with Self-correcting Networks
4. Deep Snake for Real-Time Instance Segmentation
论文:Deep Snake for Real-Time Instance Segmentation
5. SketchGCN: Semantic Sketch Segmentation with Graph Convolutional Networks
论文:SketchGCN: Semantic Sketch Segmentation with Graph Convolutional Networks
6. xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation
论文:xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation
7. CenterMask : Real-Time Anchor-Free Instance Segmentation
论文:CenterMask : Real-Time Anchor-Free Instance Segmentation
8. PolarMask: Single Shot Instance Segmentation with Polar Representation
论文:PolarMask: Single Shot Instance Segmentation with Polar Representation
9. BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation
论文:BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation
# 视觉目标跟踪
![](http://www.images.studyai.com/blog/20200311003231356.png)
1. ROAM: Recurrently Optimizing Tracking Model
论文:ROAM: Recurrently Optimizing Tracking Model
# 视频内容分析(理解)
![](http://www.images.studyai.com/blog/20200311003513839.jpg)
1. Hierarchical Conditional Relation Networks for Video Question Answering
论文:Hierarchical Conditional Relation Networks for Video Question Answering
2. Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
论文:Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
代码:bbrattoli/ZeroShotVideoClassification
3. Action Modifiers:Learning from Adverbs in Instructional Video
论文:Action Modifiers: Learning from Adverbs in Instructional Videos
4. Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning
论文:Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning
5. Blurry Video Frame Interpolation
论文:Blurry Video Frame Interpolation
6. Object Relational Graph with Teacher-Recommended Learning for Video Captioning
论文:Object Relational Graph with Teacher-Recommended Learning for Video Captioning
7. Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs
论文:Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs
8. Learning Representations by Predicting Bags of Visual Words
论文:Learning Representations by Predicting Bags of Visual Words
9. Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution
论文:Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution
# 人体关键点检测和姿态估计
![](http://www.images.studyai.com/blog/20200311003340958.png)
1. Distribution-Aware Coordinate Representation for Human Pose Estimation
论文:Distribution-Aware Coordinate Representation for Human Pose Estimation
2. VIBE: Video Inference for Human Body Pose and Shape Estimation
论文:VIBE: Video Inference for Human Body Pose and Shape Estimation
3. The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation
论文:The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation
4. Optimal least-squares solution to the hand-eye calibration problem
论文:Optimal least-squares solution to the hand-eye calibration problem
5. Distribution Aware Coordinate Representation for Human Pose Estimation
论文:Distribution-Aware Coordinate Representation for Human Pose Estimation
6. D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry
论文:D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry
7. Multi-Modal Domain Adaptation for Fine-Grained Action Recognition
论文:Multi-Modal Domain Adaptation for Fine-Grained Action Recognition
8. PVN3D: A Deep Point-wise 3D Keypoints Voting Network for 6DoF Pose Estimation
论文:https://arxiv.org/abs/1911.04231
9. 4D Association Graph for Realtime Multi-person Motion Capture Using Multiple Video Cameras
论文:4D Association Graph for Realtime Multi-person Motion Capture Using Multiple Video Cameras
# 模型轻量化和加速
1. GPU-Accelerated Mobile Multi-view Style Transfer
论文:GPU-Accelerated Mobile Multi-view Style Transfer
# 神经网络架构设计和搜索NAS
![](http://www.images.studyai.com/blog/20200311002906854.jpg)
1. GhostNet: More Features from Cheap Operations
论文:GhostNet: More Features from Cheap Operations
2. CARS: Contunuous Evolution for Efficient Neural Architecture Search
论文:https://arxiv.org/pdf/1909.04977.pdf
3. Visual Commonsense R-CNN
论文:https://arxiv.org/abs/2002.12204
4. Watch your Up-Convolution: CNN Based Generative Deep Neural Networks are Failing to Reproduce Spectral
5. AdderNet: Do We Really Need Multiplications in Deep Learning?
论文:https://arxiv.org/pdf/1912.13200
6. Filter Grafting for Deep Neural Networks
论文:https://arxiv.org/pdf/2001.05868.pdf
# 生成对抗GAN
![](http://www.images.studyai.com/blog/20200311003130441.jpg)
1. Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models
论文:Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models
2. MSG-GAN: Multi-Scale Gradient GAN for Stable Image Synthesis
论文:MSG-GAN: Multi-Scale Gradient GAN for Stable Image Synthesis
3. Robust Design of Deep Neural Networks against Adversarial Attacks based on Lyapunov Theory
论文:Robust Design of Deep Neural Networks against Adversarial Attacks based on Lyapunov Theory
# 三维点云&3D重建
![](http://www.images.studyai.com/blog/20200311003605270.jpg)
1. PointAugment: an Auto-Augmentation Framework for Point Cloud Classification
论文:PointAugment: an Auto-Augmentation Framework for Point Cloud Classification
2. PF-Net: Point Fractal Network for 3D Point Cloud Completion
论文:PF-Net: Point Fractal Network for 3D Point Cloud Completion
3. Learning multiview 3D point cloud registration
论文:Learning multiview 3D point cloud registration
4. Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image
5. In Perfect Shape: Certifiably Optimal 3D Shape Reconstruction from 2D Landmarks
论文:https://arxiv.org/pdf/1911.11924.pdf
6. RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
论文:RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
7. C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds
论文:C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds
8. Representations, Metrics and Statistics For Shape Analysis of Elastic Graphs
论文:Representations, Metrics and Statistics For Shape Analysis of Elastic Graphs
9. Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion
论文:Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion
# 光学字符识别OCR
1. ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
论文:ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
代码:https://github.com/Yuliang-Liu/bezier_curve_text_spotting,https://github.com/aim-uofa/adet
# 迁移学习
![](http://www.images.studyai.com/blog/20200311002940314.jpg)
1. Meta-Transfer Learning for Zero-Shot Super-Resolution
论文:Meta-Transfer Learning for Zero-Shot Super-Resolution
2. Transferring Dense Pose to Proximal Animal Classes
论文:Transferring Dense Pose to Proximal Animal Classes
# 弱监督 & 无监督学习
1. Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation
论文:Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation
2. Disentangling Physical Dynamics from Unknown Factors for Unsupervised Video Prediction
论文:Disentangling Physical Dynamics from Unknown Factors for Unsupervised Video Prediction
3. Rethinking the Route Towards Weakly Supervised Object Localization
论文:Rethinking the Route Towards Weakly Supervised Object Localization
4. NestedVAE: Isolating Common Factors via Weak Supervision
论文:NestedVAE: Isolating Common Factors via Weak Supervision
# 人脸识别
1. Towards Universal Representation Learning for Deep Face Recognition
论文:Towards Universal Representation Learning for Deep Face Recognition
2. Suppressing Uncertainties for Large-Scale Facial Expression Recognition
论文:Suppressing Uncertainties for Large-Scale Facial Expression Recognition
代码:kaiwang960112/Self-Cure-Network
3. Face X-ray for More General Face Forgery Detection
论文:https://arxiv.org/pdf/1912.13458.pdf
# 图神经网络GNN
1. Social-STGCNN: A Social Spatio-Temporal Graph Convolutional Neural Network for Human Trajectory Prediction
2. Bundle Adjustment on a Graph Processor
论文:Bundle Adjustment on a Graph Processor
# 视觉 & 语言 混合任务研究
1. Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training
论文:Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training
2. 12-in-1: Multi-Task Vision and Language Representation Learning
论文:12-in-1: Multi-Task Vision and Language Representation Learning
3. Hierarchical Conditional Relation Networks for Video Question Answering
论文:Hierarchical Conditional Relation Networks for Video Question Answering
# 其他问题研究
1. What it Thinks is Important is Important: Robustness Transfers through Input Gradients
论文:https://arxiv.org/abs/1912.05699
2. Holistically-Attracted Wireframe Parsing
论文:Holistically-Attracted Wireframe Parsing
3. Attntive Context Normalization for Robust Permutation-Equivariant Learning
论文:Attentive Context Normalization for Robust Permutation-Equivariant Learning
5. ClusterFit: Improving Generalization of Visual Representations
论文:ClusterFit: Improving Generalization of Visual Representations
6. Learning in the Frequency Domain
论文:Learning in the Frequency Domain
7. A Characteristic Function Approach to Deep Implicit Generative Modeling
论文:A Characteristic Function Approach to Deep Implicit Generative Modeling
8. Auto-Encoding Twin-Bottleneck Hashing
论文:Auto-Encoding Twin-Bottleneck Hashing
# 论文打包下载地址
链接:https://pan.baidu.com/s/1lo3smbFWiBSNnut9JssYaQ
提取码:可在公众号内发送消息: cvpr2020
CVPR 2020论文收藏(转知乎:https://zhuanlan.zhihu.com/p/112337176)的更多相关文章
- 知乎社区核心业务 Golang 化实践 - 知乎 https://zhuanlan.zhihu.com/p/48039838
知乎社区核心业务 Golang 化实践 - 知乎 https://zhuanlan.zhihu.com/p/48039838
- HTML5之WebSocket && https://zhuanlan.zhihu.com/p/23467317
在认识websocket之前,我们必须了解的是websocket有什么用? 他能解决我们遇到的什么问题? 如果没用,那么我们就么有使用它的必要的. websocket就是建立起全双工协议的,提高了效率 ...
- 转:以下是目前已经建立的sub一览 来自:https://zhuanlan.zhihu.com/p/91935757
转:以下是目前已经建立的sub一览 来自:https://zhuanlan.zhihu.com/p/91935757 作者: Lorgar 理工科 科学(和英文r/science一样,只接受论文讨论 ...
- [转]局域网共享一键修复 18.5.8 https://zhuanlan.zhihu.com/p/24178142
@echo offcolor 2fmode con cols=50 lines=30title OKShare [制作:wnsdt]ver | findstr "6.">nu ...
- 知乎千万级高性能长连接网关 https://zhuanlan.zhihu.com/p/66807833
知乎千万级高性能长连接网关揭秘 9 天前 · 来自专栏 知乎技术专栏 实时的响应总是让人兴奋的,就如你在微信里看到对方正在输入,如你在王者峡谷里一呼百应,如你们在直播弹幕里不约而同的 666,它们的背 ...
- 来源于知乎专栏:https://zhuanlan.zhihu.com/p/29619457
1. 校验数字的表达式 1 数字:^[0-9]*$ 2 n位的数字:^\d{n}$ 3 至少n位的数字:^\d{n,}$ 4 m-n位的数字:^\d{m,n}$ 5 零和非零开头的数字:^(0|[1- ...
- HTTPS 基本流程 转载 https://zhuanlan.zhihu.com/p/27395037
协议 1.HTTP 协议(HyperText Transfer Protocol,超文本传输协议):是客户端浏览器或其他程序与Web服务器之间的应用层通信协议 . 2.HTTPS 协议(HyperTe ...
- https://zhuanlan.zhihu.com/p/32553477
科普:QUIC协议原理分析
- CVPR 2020 全部论文 分类汇总和打包下载
CVPR 2020 共收录 1470篇文章,根据当前的公布情况,人工智能学社整理了以下约100篇,分享给读者. 代码开源情况:详见每篇注释,当前共15篇开源.(持续更新中,可关注了解). 算法主要领域 ...
随机推荐
- H5解决安卓软键盘弹出遮蔽的方法
首先先判断是否为安卓,是的话才添加事件监听,获取焦点元素判断是否为input或者textarea类型,是的话,Element.scrollIntoView() 方法会让当前的元素滚动到浏览器窗口的可视 ...
- iconfont字体图标的使用方法(转)
我之前因为项目用bootstrap比较多,所以使用font awesome字体图标比较多,后来接触到了iconfont,发现想要的什么图标都有,还可以自定义图标,非常强大!之前看了一波教程,觉得繁琐, ...
- 简单认识并使用JavaScript【供后端人员作为了解】
JS(JavaScript)Web的脚本语言 脚本语言:无法独立执行,必须嵌入到其他语言当中结合使用 作用:控制页面特效展示 注:JavaScript没有访问系统的权限,并且JavaScript和Ja ...
- 关于Quartz .NET(V3.0.7)的简要说明
目录 0. 任务调度 1. Quartz .NET 1.1 基本概念 1.2 主要接口和对象 2. 使用示例 2.0 准备工作 2.1 每间隔一定时间间隔执行一次任务 2.3 某天的固定时间点执行任务 ...
- 记一次有趣的thinkphp代码执行
0x00 前言 朋友之前给了个站,拿了很久终于拿下,简单记录一下. 0x01 基础信息 漏洞点:tp 5 method 代码执行,payload如下 POST /?s=captcha _method= ...
- Scapy编写UDP扫描脚本
脚本内容如下: from scapy.all import * import optparse import threading def scan(target,port): pkt=IP(dst=t ...
- hdu3367最大伪森林(并查集)
题目链接:http://icpc.njust.edu.cn/Problem/Hdu/3367/ 题目要求一个连通图的最大伪森林,伪森林是一个最多有一个回路的图.我们只要用Kruskal最大生成树的策略 ...
- [币严BIZZAN区块链]Java生成ETH钱包助记词、私钥、地址
本文主要介绍在Java工程中如何生成ETH钱包的助记词.私钥.地址. 一.在之前创建的spring boot 项目中的 pom.xml文件中加入需要的依赖 <dependency> < ...
- CF codeforces A. New Year Garland【Educational Codeforces Round 79 (Rated for Div. 2)】
A. New Year Garland time limit per test 1 second memory limit per test 256 megabytes input standard ...
- pywin32解析office文档
from win32com.client import Dispatch xlApp = Dispatch("Excel.Application") xlApp.Visible = ...