没有问题就无法学习：

1. 文中的数据规模是多少？这些数据有什么特点？

2. 为什么要使用迁移学习？

3. 文章是如何验证自己模型的准确性的？

4. 摘要中的每一个结论，在文章都是如何具体论证的？

亮点：

1，首次使用如此庞大的标注好的高质量数据进行迁移学习，来建立人工智能系统；

2，基于光学相干断层成像（OCT）数据有效进行图像黄斑变性和糖尿病视网膜黄斑水肿的识别和严重性定量评估;

3，准确区分患儿胸部X光片上的细菌和病毒性肺炎（差异性分析和准确判断）

目标：

利用迁移学习识别视网膜（OCT）图像中的类别，同时也用迁移学习测试了胸部X光片的肺炎识别。

该研究也通过显示神经网络激活区域的方法向人们提供了机器诊断的可解释性。

真正启动是在去年（2017 年）初。启动到发表总共1年。

机器之心：神经网络的推理是一个「黑箱」，你们提出的新方法是如何解释计算机作出「诊断」的依据的？

张康：我们在视网膜 OCT 图像的研究中加入了「遮挡测试」——通过卷积一个遮挡核心到输入图像上，机器会通过计算预测做出正确诊断最可能的部位，并输出含有高亮色块的「遮挡」图，这些色块就是 AI「认为」的病变部位，得出直观的为临床医生信任的诊断依据。

首先，通过输入大量的数据，神经网络可以获得远超过人类医生的「经验」，计算出超越人类的准确结果，在我们的系统中，我们使用超过 20 万张医学图像，通过不同的疾病分类，最终使用近 11 万张视网膜 OCT 图像训练机器。在眼病方面，能在 30 秒内正确鉴别脉络膜新生血管、糖尿病黄斑水肿、玻璃膜疣以及正常视网膜的 OCT 图像，结果的准确率、敏感度、特异度均在 95% 以上，并能得出与人类相似甚至更高的准确率。其次，计算机对比图像像素与像素之间的差异，观察到人类关注不到的细节，从而得出更精准的判断，且不像人类一样受主观性干扰。另外，我们通过「迁移学习」这种算法，还能诊断不同系统的疾病，比如我们的系统目前还能准确鉴别肺炎和正常胸部 X 线平片，区分肺炎的病原体为细菌还是病毒，准确率可达 90% 以上。

「迁移学习」被认为是一种高效的学习技术，尤其是面临相对有限的训练数据时。相较于其他大多数学习模型的「从零开始」，「迁移学习」利用卷积神经网络（Convolutional Neural Network，CNN）学习已有的已经标记好的预训练网络系统，以医学图像学习为例，该系统会识别预系统中图像的特点，我们再继续导入含有第一层图像相似参数和结构的网络系统，最终构建出终极层级。在我们的系统中，第一层网络就是视网膜 OCT 图像，第二级网络系统使用第一级的图像寻找相应的特点，通过前向传播固定低层图像中的权重，找到已经学习的可辨别的结构，再提取更高层的权重，在其中进行反复的自我调整和反馈、传递，达到学习区分特定类型的图像的目的。我们首次使用如此庞大的标注好的高质量视网膜 OCT 数据进行迁移学习，进行常见视网膜致盲性疾病的检测及推荐治疗手段，得到与人类医生相似甚至更高的准确性。此人工智能系统还可以「举一反三」，将迁移学习用于小儿肺炎诊断。

迁移学习是深度学习的一个自然发展方向，迁移学习能让深度学习变得更加可靠，还能帮我们理解深度学习的模型。比如，我们能够知晓哪部分特征容易迁移，这些特征所对应的是某个领域比较高层、抽象的一些结构型概念。把它们的细节区分开，就能让我们对这个领域的知识表达形成一个更深的理解。这样一来，机器就可以像生物的神经系统一样终身学习，不断地对过去的知识进行总结、归纳，让一个系统越学越快，而且在学习过程中还能发现如何学习。

迁移学习在深度学习上面有极为广阔的应用前景，在图像数据资源有限的医疗领域，更高效、所需图像数量更少的迁移学习，可以说是未来 5 年内 AI 发展的热点以及深度学习成功应用的驱动力。

比如，有一些图像特征较为模糊的图像，如老年黄斑变性，某些较大的玻璃膜疣和脉络膜新生血管非常相似，我们就会偏向于采取更为严重的疾病诊断，因为我们研究的最终目的是帮助病人更可能的推荐给相应的专科医生，从而更快的获得治疗。另外，我们还可以通过我们的想法设定更为贴合实际的过滤器，并按照我们临床医生的需求不断调整；通过「遮挡实验」能够反映机器得出判断的依据。并且，我们的研究还能指导治疗方案的确定。因此我们的研究可能更能达到临床医生想要的效果，并且为临床医生所信任，也许能更快更直接的应用于临床。

学习以下神经网路，适合中国人的视频，简直不要太通俗：迁移学习 Transfer Learning

学习方法及资料：https://github.com/jindongwang/transferlearning

Cell paper数据和代码地址：https://data.mendeley.com/datasets/rscbjbr9sj/2

跑一下代码玩一下！

文章原文解读：

Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning

鉴定医疗诊断和可治疗疾病

Image-based deep learning classifies macular degeneration（[眼科] 黄斑变性） and diabetic retinopathy（[眼科] 糖尿病视网膜病变） using retinal optical coherence tomography（视网膜光学相干断层扫描） images and has potential for generalized applications in biomedical image interpretation and medical decision making.

An artificial intelligence system using transfer learning（迁移学习） techniques was developed
It effectively classified images for macular degeneration and diabetic retinopathy
It also accurately distinguished bacterial and viral pneumonia（细菌和病毒性肺炎） on chest X-rays
This has potential for generalized high-impact application in biomedical imaging

科普下迁移学习：什么是迁移学习 (Transfer Learning)？这个领域历史发展前景如何？

The implementation of clinical-decision support algorithms for medical imaging faces challenges with reliability and interpretability.

Here, we establish a diagnostic tool based on a deep-learning framework for the screening of patients with common treatable blinding retinal diseases.

Our framework utilizes transfer learning, which trains a neural network with a fraction of the data of conventional approaches（常规方法）. Applying this approach to a dataset of optical coherence tomography images, we demonstrate performance comparable to that of human experts in classifying agerelated macular degeneration and diabetic macular edema. We also provide a more transparent and interpretable diagnosis by highlighting the regions recognized by the neural network. We further demonstrate the general applicability of our AI system for diagnosis of pediatric pneumonia using chest X-ray images.

This tool may ultimately aid in expediting the diagnosis and referral of these treatable conditions, thereby facilitating earlier treatment, resulting in improved clinical outcomes.

Figure 2. Representative Optical Coherence Tomography Images and the Workflow Diagram

结果

The primary application of our transfer learning algorithm was in the diagnosis of retinal OCT images.

Spectral-domain OCT uses light to capture high-resolution in vivo optical cross sections of the retina that can be assembled into three-dimensional-volume images of living retinal tissue. OCT is critical to guiding the administration of anti-VEGF therapy by providing a clear cross-sectional representation of the retinal pathology in these conditions (Figure 2A), allowing visualization of individual retinal layers, which is impossible with clinical examination by the human eye or by color fundus photography.

主要就干了以下四件事：

Patient and Image Characteristics

数据：We initially obtained 207,130 OCT images. 108,312 images (37,206 with choroidal neovascularization, 11,349 with diabetic macular edema, 8,617 with drusen, and 51,140 normal) from 4,686 patients passed initial image quality review and were used to train the AI system.

After 100 epochs (iterations through the entire dataset), the training was stopped due to the absence of further improvement in both accuracy (Figure 3A) and cross-entropy loss (Figure 3B).

Performance of the Model

We evaluated our AI system in diagnosing the most common blinding retinal diseases.

Comparison of the Model with Human Experts

An independent test set of 1,000 images from 633 patients was used to compare the AI network’s referral decisions with the decisions made by human experts.

Occlusion Testing遮挡测试

We performed an occlusion test on 491 images to identify the areas contributing most to the neural network’s assignment of the predicted diagnosis.

Application of the AI System for Pneumonia Detection Using Chest X-Ray Images

To investigate the generalizability of our AI system in the diagnosis of common diseases, we applied the same transfer learning framework to the diagnosis of pediatric pneumonia.

案例二：In Silico Labeling: Predicting Fluorescent Labels in Unlabeled Images

transmitted-light
fluorescence images

计算机中的标注：预测荧光标签在没被标注的图像中

in-silico-labeling

Seeing More with In Silico Labeling of Microscopy Images

Multi-scale architectures ：可以用于处理多重输入，平时我们都是输入一张图片，现在我们可以输入多张图片。

In silico labeling, a machine-learning approach, reliably infers fluorescent measurements from transmitted-light（透射光） images of unlabeled fixed or live biological samples.

Microscopy is a central method in life sciences.

Many popular methods, such as antibody labeling, are used to add physical fluorescent labels to specific cellular constituents.

However, these approaches have significant drawbacks, including inconsistency; limitations in the number of simultaneous labels because of spectral overlap; and necessary perturbations of the experiment, such as fixing the cells, to generate the measurement.

Here, we show that a computational machine-learning approach, which we call ‘‘in silico labeling’’ (ISL), reliably predicts some fluorescent labels from transmitted-light images of unlabeled fixed or live biological samples. ISL predicts a range of labels, such as those for nuclei, cell type (e.g., neural), and cell state (e.g., cell death). Because prediction happens in silico, the method is consistent, is not limited by spectral overlap, and does not disturb the experiment. ISL generates biological measurements that would otherwise be problematic or impossible to acquire.

As such, it is unclear whether deep learning approaches would provide a significant and broad-based advance in image analysis and are capable of extracting useful, not readily apparent, information from unlabeled images.

we show it can accurately predict the location and texture of cell nuclei, the health of a cell, the type of cell in a mixture, and the type of subcellular structure.

We also show that the trained network exhibits transfer learning: once trained to predict a set of labels, it could learn new labels with a small number of additional data, resulting in a highly generalizable algorithm, adaptable across experiments.

Training and Testing Datasets for Supervised Machine Learning

Developing Predictive Algorithms with Machine Learning

Network Predictions of Cell Nuclei

Network Predictions of Cell Viability

Network Predictions of Cell Type and Subcellular Process Type

Adapting the Generic Learned Network to New Datasets: Transfer Learning

待续~

图像识别 | AI在医学上的应用 | 深度学习 | 迁移学习的更多相关文章

CS231n 斯坦福深度视觉识别课学习笔记(完结)
课程地址第1章 CS231n课程介绍 ---1.1 计算机视觉概述这门课的主要内容是计算机视觉.它是一门需要涉及很多其他科目知识的学科. 视觉数据占据了互联网的绝大多数,但是它们很难利用. --- ...
AI小白必读：深度学习、迁移学习、强化学习别再傻傻分不清
摘要:诸多关于人工智能的流行词汇萦绕在我们耳边,比如深度学习 (Deep Learning).强化学习 (Reinforcement Learning).迁移学习 (Transfer Learning ...
深度学习优质学习项目大放送！-AI Studio精选开源项目合集推荐
近期在AI Studio上发现了不少优质的开源深度学习项目,从深度学习入门到进阶,涵盖了CV.NLP.生成对抗网络.强化学习多个研究方向,还有最新的动态图,都以NoteBook的方式直接开源出来,并 ...
Feign Ribbon Hystrix 三者关系 | 史上最全, 深度解析
史上最全: Feign Ribbon Hystrix 三者关系 | 深度解析疯狂创客圈 Java 分布式聊天室[ 亿级流量]实战系列之 -25[ 博客园总入口 ] 前言疯狂创客圈(笔者尼恩创建的 ...
TVM 优化 ARM GPU 上的移动深度学习
TVM 优化 ARM GPU 上的移动深度学习随着深度学习的巨大成功,将深度神经网络部署到移动设备的需求正在迅速增长.与桌面平台上所做的类似,在移动设备中使用 GPU 既有利于推理速度,也有利于能源 ...
CUDA上的量化深度学习模型的自动化优化
CUDA上的量化深度学习模型的自动化优化深度学习已成功应用于各种任务.在诸如自动驾驶汽车推理之类的实时场景中,模型的推理速度至关重要.网络量化是加速深度学习模型的有效方法.在量化模型中,数据和模型参 ...
TVM在ARM GPU上优化移动深度学习
TVM在ARM GPU上优化移动深度学习随着深度学习的巨大成功,将深度神经网络部署到移动设备的需求正在迅速增长.与在台式机平台上所做的类似,在移动设备中使用GPU可以提高推理速度和能源效率.但是,大 ...
Deep Learning（深度学习）学习笔记整理系列之（五）
Deep Learning(深度学习)学习笔记整理系列 zouxy09@qq.com http://blog.csdn.net/zouxy09 作者:Zouxy version 1.0 2013-04 ...
Deep Learning（深度学习）学习笔记整理系列之（八）
Deep Learning(深度学习)学习笔记整理系列 zouxy09@qq.com http://blog.csdn.net/zouxy09 作者:Zouxy version 1.0 2013-04 ...

随机推荐

PID算法控制简单理解
1 传统的位式控制算法用户期望值Sv(设定值)经控制算法输出一个输出信号OUT,输出信号加载到执行部件上(像MOS管等)对控制对象进行控制(步进电机.加热器等),控制对象的当前值(Pv)如速度通过传 ...
Flask学习【第3篇】：蓝图、基于DBUtils实现数据库连接池、上下文管理等
小知识子类继承父类的三种方式 class Dog(Animal): #子类派生类 def __init__(self,name,breed, life_value,aggr): # Animal. ...
高通平台framework,hal,kernel打开log【转】
本文转载自:https://blog.csdn.net/u010164190/article/details/78625636 .Add framework log #define LOG_NDEBU ...
打造性感好用的 VS Code 编辑器
官网: https://code.visualstudio.com/ Blog链接:打造性感好用的VS Code编辑器主命令框 F1或Ctrl+Shift+P: 打开命令面板.在打开的输入框内,可以 ...
html 之浮动(待补充)
通过设置float 样式属性来决定区块标签的浮动问题:区块与非区块浮动的特点(左右) 区块与区块浮动的特点(左右) ie浏览器与谷歌等浏览器的效果差异
网络_TCP连接的建立与释放
三报文握手 1.概述 TCP是面向连接的协议.TCP建立连接的过程叫做握手,握手需要在客户和服务器之间交换三个TCP报文段,即我们说的"三次握手"(严格讲是一次握手过程中交换了三个 ...
Linux 时间矫正命令
Linux 时间矫正 sudo ntpdate -u ntp.api.bz 第一使用可能提示ntpdate没安装,用以下命令安装即可 sudo apt install ntpdate
ZooKeeper分布式过程协同技术详解2——了解ZooKeeper
这个服务如何实现这些协作方面的原语? ZooKeeper基础
接口Interface的四种含义
摘自<需求分析与系统设计(第3版)>第七章Q5 1. GUI——显示信息的计算机屏幕(注:其他终端) 2. API——是一套软件程序和开发工具,为应用程序提供函数调用,使程序可以访问一些级 ...
js精度误差
之前虽然有看到过 js 精度相关的文章.但也都没有“印象深刻” ,但是今天"有幸"遇到了. 做一个项目,进行页面调试的时候, 当数量增加到3时总价格变得好长好长立马在控制台验证了 ...

图像识别 | AI在医学上的应用 | 深度学习 | 迁移学习