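I have recently been surveying work on 3D algorithms and collected several papers on multi-view learning. The survey is not finished yet, so this is a rough outline for now.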

Work on RGB-D semantic segmentation focuses mainly on how to fuse RGB and depth information, and falls into three main categories (omitted here).

1. (ICCV 2017) RDFNet: RGB-D Multi-level Residual Feature Fusion for Indoor Semantic Segmentation
2. (arXiv 2018) RedNet: Residual Encoder-Decoder Network for indoor RGB-D Semantic Segmentation
3. (ICIP 2019) ACNet: Attention Based Network to Exploit Complementary Features for RGBD Semantic Segmentation
4. (NeurIPS 2020) Deep Multimodal Fusion by Channel Exchanging
5. (ECCV 2020) Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation
6. (arXiv 2021) GLPNet: Global-Local Propagation Network for RGB-D Semantic Segmentation
7. (ACCV 2016) FuseNet: Incorporating Depth into Semantic Segmentation via Fusion-based CNN Architecture
8. (SCIA 2017) Multimodal Neural Networks: RGB-D for Semantic Segmentation and Object Detection
9. (3DV 2019) 3D Neighborhood Convolution: Learning Depth-Aware Features for RGB-D and RGB Semantic Segmentation
10. (ICCV 2017) 3D Graph Neural Networks for RGBD Semantic Segmentation
Multimodal Transformer
Transformer semantic segmentation (SETR)
TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Swin-Unet: the first pure-Transformer network for medical image segmentation
Learning cross-modality deep representations for multi-modal MR image segmentation

1. (ICCV 2017) RDFNet: RGB-D Multi-level Residual Feature Fusion for Indoor Semantic Segmentation

Paper: https://openaccess.thecvf.com/content_iccv_2017/html/Park_RDFNet_RGB-D_Multi-Level_ICCV_2017_paper.html

Code: https://github.com/SeongjinPark/RDFNet

Write-up: https://blog.csdn.net/u012113559/article/details/81363756

2. (arXiv 2018) RedNet: Residual Encoder-Decoder Network for indoor RGB-D Semantic Segmentation

Paper: https://arxiv.org/abs/1806.01054

Code: https://github.com/JindongJiang/RedNet

Write-ups: https://blog.csdn.net/qq_41375318/article/details/104311597

https://blog.csdn.net/qq_41375318/article/details/103451966

3. (ICIP 2019) ACNet: Attention Based Network to Exploit Complementary Features for RGBD Semantic Segmentation

Paper: https://arxiv.org/abs/1905.10089

Code: https://github.com/anheidelonghu/ACNet

Write-ups: https://blog.csdn.net/kevin_zhao_zl/article/details/100750591

https://zhuanlan.zhihu.com/p/82193530

4. (NeurIPS 2020) Deep Multimodal Fusion by Channel Exchanging

Paper: https://arxiv.org/abs/2011.05005

Code: https://github.com/yikaiw/CEN

Write-ups: https://zhuanlan.zhihu.com/p/341959576

https://blog.csdn.net/hongyuge/article/details/109632887

Video walkthrough: https://www.bilibili.com/video/BV1ya4y1W7Hf

5. (ECCV 2020) Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation

Paper: https://arxiv.org/abs/2007.09183

Code: https://github.com/charlesCXK/RGBD_Semantic_Segmentation_PyTorch

Write-up: https://blog.csdn.net/sinat_17456165/article/details/107805136

6. (arXiv 2021) GLPNet: Global-Local Propagation Network for RGB-D Semantic Segmentation

Paper: https://arxiv.org/abs/2101.10801

Code: none

Write-up:

7. (ACCV 2016) FuseNet: Incorporating Depth into Semantic Segmentation via Fusion-based CNN Architecture

Paper: https://www.semanticscholar.org/paper/FuseNet%3A-Incorporating-Depth-into-Semantic-via-CNN-Hazirbas-Ma/9360ce51ec055c05fd0384343792c58363383952

Code: https://github.com/tum-vision/fusenet

Write-up: https://blog.csdn.net/u013841196/article/details/82939619

8. (SCIA 2017) Multimodal Neural Networks: RGB-D for Semantic Segmentation and Object Detection

Paper: https://www.researchgate.net/publication/317803469_Multimodal_Neural_Networks_RGB-D_for_Semantic_Segmentation_and_Object_Detection

Code:

Write-up: https://blog.csdn.net/qq_38316300/article/details/109546441

9. (3DV 2019) 3D Neighborhood Convolution: Learning Depth-Aware Features for RGB-D and RGB Semantic Segmentation

Paper: https://arxiv.org/abs/1910.01460

Code:

Write-up: https://blog.csdn.net/cangafuture/article/details/113822865

10. (ICCV 2017) 3D Graph Neural Networks for RGBD Semantic Segmentation

Paper: https://ieeexplore.ieee.org/document/8237818

Code: https://github.com/yanx27/3DGNN_pytorch

Write-up: https://blog.csdn.net/P_LarT/article/details/88774811

Multimodal Transformer

Paper: https://arxiv.org/abs/1906.00295

Code: https://github.com/yaohungt/Multimodal-Transformer

Write-ups: https://zhuanlan.zhihu.com/p/84678022?from_voters_page=true

https://zhuanlan.zhihu.com/p/340113856

https://blog.csdn.net/zpainter/article/details/111867693

Transformer semantic segmentation (SETR)

Paper: https://arxiv.org/abs/2012.15840

Code: https://github.com/fudan-zvg/SETR

Write-up: https://zhuanlan.zhihu.com/p/341768446

TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

Paper: https://arxiv.org/abs/2102.04306

Code: https://github.com/Beckschen/TransUNet

Write-up: https://blog.csdn.net/weixin_49627776/article/details/115710379

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

Paper: https://arxiv.org/abs/2105.15203

Code: https://github.com/NVlabs/SegFormer

Write-up: https://zhuanlan.zhihu.com/p/379054782

Swin-Unet: the first pure-Transformer network for medical image segmentation

Paper: https://arxiv.org/abs/2105.05537

Code: https://github.com/HuCaoFighting/Swin-Unet (not yet open-sourced)

Write-up: https://blog.csdn.net/amusi1994/article/details/116957208

Learning cross-modality deep representations for multi-modal MR image segmentation

Link: https://zhuanlan.zhihu.com/p/349918500
