A guide to artificial intelligence for cancer researchers
癌症研究人员人工智能指南
Abstract
摘要
| Sections |
| Introduction |
| Understandingdeeplearning |
| Al for biomedical image analysis |
| Alforlanguage |
| Emerging usesofAl |
| Conclusion |
| 章节 |
|---|
| 引言 |
| 理解深度学习 |
| 用于生物医学图像分析的AI |
| 用于语言的AI |
| AI的新兴应用 |
| 结论 |
Artificial intelligence (Al) has been com mod it i zed. It has evolved from a specialty resource to a readily accessible tool for cancer researchers. Al-based tools can boost research productivity in daily workflows, but can also extract hidden information from existing data, thereby enabling new scientific discoveries. Building a basic literacy in these tools is useful for every cancer researcher. Researchers with a traditional biological science focus can use Al-based tools through off-the-shelf software, whereas those who are more computationally inclined can develop their own Al-based software pipelines. In this article, we provide a practical guide for non-computational cancer researchers to understand how Al-based tools can benefit them. We convey general principles of Al for applications in image analysis, natural language processing and drug discovery. In addition, we give examples of how non-computational researchers can get started on the journey to productively use Al in their own work.
人工智能 (AI) 已被商品化。它已从专业资源演变为癌症研究人员随手可得的工具。基于AI的工具既能提升日常研究工作流程的效率,也能从现有数据中提取隐藏信息,从而推动新的科学发现。掌握这些工具的基础应用能力对每位癌症研究者都大有裨益。专注于传统生物科学的研究人员可通过现成软件使用基于AI的工具,而更倾向于计算方向的研究者则可自行开发基于AI的软件流程。本文为非计算背景的癌症研究人员提供实用指南,帮助他们理解基于AI的工具如何惠及自身研究。我们将阐述AI在图像分析、自然语言处理和药物发现领域应用的通用原则,并举例说明非计算背景研究人员如何在本职工作中高效运用AI技术。
Review article
综述文章
Introduction
引言
Artificial intelligence (Al) is a set of computational techniques that aim to enable machines to perform tasks that are usually reserved for humans.For decades,Alhas been mostly a theoretical construct with few implications for the real world. However, in the past 15 years, Al has reached human-level performance in a broad range of domains, including biomedical research (Fig.1a and Box1).Alhas matured into a multitude of products with real-world implications. Technically, many of the present-day applications of Al heavily rely on artificial neural networks (ANNs), computational models inspired by the biological neural networks in the brain. An ANN is composed of linked units called'neurons', which are organized into layers. ANNs typically have multiple layers, each containing numerous neurons: an input layer, which receives the input data;an output layer, which produces the final output; and hidden layers, which perform most of the computation. When an ANN contains a substantial number of these hidden layers, it is referred to as a deep ANN.Theterm'deep'in this context relates to the depth of the network, which is a measure of the number of layers. Deep neural networks are capable oflearning complex patterns due to their extended architecture, which allows for more levels of abstraction and representation of data. Deep neural networks are trained using large datasets and adjusting the parameters of the network (such as the weights of connections between neurons) based on the output error. This process is known as deep learning. During deep learning, the network learns to perform tasks from examples. Today, in cancer research,the concept of Aloverlaps with deep learning to a high degree, such that these terms are often used interchangeably.
人工智能 (AI) 是一套旨在让机器执行通常由人类完成的任务的计算技术。几十年来,AI 主要是一种理论构建,对现实世界影响甚微。然而在过去 15 年间,AI 已在包括生物医学研究在内的广泛领域达到人类水平性能 (图 1a 和框 1)。AI 已发展成众多具有现实意义的产品。从技术角度而言,当今许多 AI 应用严重依赖人工神经网络 (ANN)——这种计算模型受大脑中生物神经网络启发。ANN 由称为"神经元"的互联单元组成,这些单元分层排列。ANN 通常具有多个层:接收输入数据的输入层;产生最终输出的输出层;以及承担主要计算任务的隐藏层。当 ANN 包含大量隐藏层时,即被称为深度 ANN。此处的"深度"指网络深度,即层数的度量。深度神经网络凭借其扩展架构能够学习复杂模式,这种架构支持更多层级的抽象和数据表示。深度神经网络使用大型数据集进行训练,并根据输出误差调整网络参数(如神经元连接权重)。这一过程被称为深度学习。在深度学习过程中,网络通过示例学习执行任务。当今在癌症研究中,AI 的概念与深度学习高度重合,因此这些术语经常互换使用。
Al methods are becoming increasingly common in cancer research. Hence, we postulate that any cancer researcher nowadays needs to acquire a certain level of Al literacy. Today, it is important to be able to understand, interpret and critically evaluate the Al output. In addition, some cancer researchers will find it beneficial to acquire a deeper understanding of Al and develop their own Al-based software tools. Today, Al has been com mod it i zed, meaning it is no longer a specialized resource but a widely accessible tool that cancer researchers can readily utilize (Fig.la). In addition, some biomedical research software features Al systems that can be trained for specific tasks,such as image analysis tools with a graphical user interface that allow users to train custom Al models(Fig.1b and Supplementary Table 1).Although a complete and detailed understanding of the exact mechanisms of Al systems remains a challenging and evolving area of research, it is important to recognize the limitations and potential biases inherent in these systems based on a general understanding of how Al systems are trained and how they make their predictions.
人工智能方法在癌症研究中正变得越来越普遍。因此,我们主张当今任何癌症研究人员都需要掌握一定程度的人工智能素养。如今,能够理解、解释和批判性评估人工智能输出已变得尤为重要。此外,部分癌症研究人员会发现深入理解人工智能并开发自己的人工智能软件工具将大有裨益。当前,人工智能已实现商品化 (commoditized) ,这意味着它不再是专业资源,而是癌症研究人员可随时使用的普及工具 (图1a) 。同时,部分生物医学研究软件配备了可针对特定任务训练的人工智能系统,例如具有图形用户界面的图像分析工具,允许用户训练定制化人工智能模型 (图1b和补充表1) 。尽管要完全详细理解人工智能系统的确切机制仍是一个不断发展的挑战性研究领域,但基于对人工智能系统训练方式及其预测机制的基本理解,认识这些系统固有的局限性和潜在偏差至关重要。
This Review aims to serve as a comprehensive guide for cancer researchers who want to deepen their understanding of the applications of Al in cancer research. Our experience in engaging with professionals across disciplines such as biology, medicine and biochemistry has highlighted agrowing need forclarity and guidance on Alconcepts and tools that are pertinent to cancer research. This Review focuses on the key concepts and tools related to Al in cancer research, including applications in image analysis, natural language processing (NLP) and drug discovery. It is intended as a practical guide and will not cover the mathematical foundations or highly technical aspects of Al algorithms in depth.
本综述旨在为希望深入了解人工智能 (AI) 在癌症研究中应用的科研人员提供全面指南。我们在与生物学、医学和生物化学等跨学科专业人士的合作中发现,研究人员对癌症研究相关的AI概念与工具存在日益增长的明晰化指导需求。本文重点关注癌症研究中AI相关的核心概念与工具,包括其在图像分析、自然语言处理 (NLP) 和药物发现领域的应用。本指南侧重实践性,不会深入探讨AI算法的数学基础或高技术性内容。
Understanding deep learning Types of deep learning
理解深度学习 深度学习的类型
Deep learning algorithms can be split into supervised', unsupervised? and reinforcement 34 learning. Supervised learning is the most common type of deep learning. In supervised learning, a model is trained on a labelled dataset, which means that each training example (forexample,a photo) ispaired with an output label or ground truth label (for example,'cat'or'dog).A model makes predictions or decisions based on the input data, and it learns patterns for the correct predictions from the ground-truth labels. Beginners in deep learning can generally start by training a supervised deep neural network on a modestly sized dataset, typically containing hundreds to a few thousand well-labelled examples.A clean labelled dataset refers to data in which the labels are accurate and consistent, without mis labelled or ambiguous examples.
深度学习算法可分为监督学习、无监督学习和强化学习。监督学习是最常见的深度学习类型。在监督学习中,模型通过带标签的数据集进行训练,这意味着每个训练样本(例如一张照片)都配有输出标签或真实标签(例如"猫"或"狗")。模型根据输入数据做出预测或决策,并通过真实标签学习正确预测的模式。深度学习初学者通常可以从在中等规模数据集上训练监督式深度神经网络开始,这类数据集通常包含数百至数千个标注良好的样本。干净的标注数据集是指标签准确一致、没有错误标注或模糊样本的数据集。
By contrast, unsupervised learning is a method in which the model is trained toidentify patterns ina dataset without any explicit labels or annotations. It is commonly used in scenarios such as clustering, anomaly detection and association tasks,inwhich the model determines the inherent structure ofthe data to draw inferences°.Self-supervised learning (SSL) is a specific approach within unsupervised learning that involves training a deep neural network ona dataset in which labels are not provided in a traditional sense.Instead,inSSL,the model generates its own labels through a pseudo-task'. For example, the model might be tasked with reconstructing an image from a distorted version or predicting a missing word in a sentence. Nowadays, SSL is commonly used as thefirst stepto'pre-train'deep neural networks for subsequent 'fine-tuning' with supervised learning&,9.
相比之下,无监督学习 (Unsupervised Learning) 是一种模型在没有明确标签或标注的情况下识别数据集中模式的方法。它通常用于聚类 (clustering) 、异常检测 (anomaly detection) 和关联任务 (association tasks) 等场景,模型通过确定数据的内在结构来得出推断 [20]。自监督学习 (Self-supervised Learning, SSL) 是无监督学习中的一种特定方法,涉及在传统意义上未提供标签的数据集上训练深度神经网络。在 SSL 中,模型通过伪任务 (pseudo-task) 生成自己的标签。例如,模型可能被要求从失真版本重建图像,或预测句子中缺失的单词。如今,SSL 通常作为预训练 (pre-train) 深度神经网络的第一步,以便后续通过监督学习进行微调 (fine-tuning) [8,9]。
The third class of deep learning approaches is reinforcement learning. A reinforcement learning system involves agents, entities or systems that make decisions by interacting with their surroundings to achieve a goal. Over time, the agent learns the optimal behaviour based on feedback from the environment. This feedback is in the form of numerical rewards for actions that lead to successful outcomes and penalties for those that do not. The goal is for the agent to learn a policy that maximizes the cumulative reward overtime.Reinforcement learning can automate rule-based procedures such as computer games10 or drone navigation. Reinforcement learning is applied in cancer research in some specific areas, such as for finding optimal policies for personalized cancer screening? or designing clinical trials.
深度学习的第三类是强化学习 (Reinforcement Learning)。强化学习系统包含智能体 (Agent),即通过与周围环境交互来制定决策以实现目标的实体或系统。随着时间的推移,智能体会根据环境反馈学习最优行为模式。这种反馈以数值形式呈现:导致成功结果的动作获得奖励,而未达标的动作则受到惩罚。其目标是让智能体学会能随时间累积最大奖励的策略。强化学习可自动化基于规则的程序,例如电脑游戏[10]或无人机导航。在癌症研究领域,强化学习已应用于某些特定场景,例如寻找个性化癌症筛查的最优策略,或设计临床试验方案。
Neural network architectures
神经网络架构
Deep learning can effectively handle unstructured data such as images and text. Unstructured data refer to information that lacks a specific schema and does not follow conventional data models1². The discipline of automatically analysing images with computers is called computer vision, whereas the discipline dedicated to analysing and interpreting human language intext form is known as NLP. Throughout most ofthe 2010s,convolutional neural networks(CNNs)have been at the core of advancements in computer vision'3, whereas in the field of NLP, long short-term memory (LSTM) networks and related architectures have been widely usedl. LSTM networks, known for their ability to handle sequential data, were a substantial improvement over earlier models due to their capacity to remember information over long sequences. Despite their effectiveness, LSTM networks sometimes struggled with highly complex or extended sequences 15. This limitation has led to the exploration and adoption of alternative architectures in the 2020s, such as the transformer neural networks, also called transformers, which addresses some of these challenges more effectively in certain NLP tasks16. Interestingly, transformers can also be applied to images and even outperform CNNs at many medical image analysis tasks17,18. Transformers can capture long-range dependencies and global context in the images, whereas CNNs are by design constrained to more
深度学习能有效处理图像和文本等非结构化数据。非结构化数据指缺乏特定模式且不遵循传统数据模型的信息[1][2]。利用计算机自动分析图像的学科称为计算机视觉,而专门分析和解释文本形式人类语言的学科则被称为自然语言处理 (NLP)。在整个2010年代的大部分时间里,卷积神经网络 (CNN) 一直是计算机视觉领域进步的核心[3],而在自然语言处理领域,长短期记忆 (LSTM) 网络及相关架构被广泛使用。LSTM 网络以其处理序列数据的能力而闻名,由于能够记忆长序列信息,相较早期模型有显著改进。尽管效果显著,但 LSTM 网络有时难以处理高度复杂或超长序列[5]。这一局限性促使人们在2020年代开始探索采用替代架构,例如 Transformer 神经网络(也称 Transformer),该架构在某些自然语言处理任务中能更有效地应对这些挑战[16]。值得注意的是,Transformer 也可应用于图像处理,并在许多医学图像分析任务中表现优于卷积神经网络[17][18]。Transformer 能捕捉图像中的长程依赖关系和全局上下文信息,而卷积神经网络在设计上受限于局部感受野。
Review article
综述文章

Fig.1|Al workflows in cancer research.
图 1|癌症研究中的AI工作流程。
a,Applications of deep learning connect basic, translational and clinical research.In artificial intelligence (Al)-based image processing,three approaches prevail(asshowninb-d).b,Standard image analysis tasks such as cell counting or segmenting regions ofinterest(ROls)often require swift solutions with off-the-shelf tools.Numerous open-source tools are available for accomplishing these tasks.For example,ilastik is well-suited for subcellular analysis,whereas QuPath and 3D Slicer are commonly used for tissue-leveland whole-body imaging,respectively.Fiji,ImageJ and Cell Profiler are versatile tools that can handle a wide range of image analysis tasks across different scales.A selection of these tools is presented in Supplementary Table1.c,Another categoryof image analysis tasks involves those that require an iterative process of training and fine-tuning on extensive datasets.These tools are typically very task-specific and are designed to address one particular question.Constructing,training and validating these pipelines requires programming skills. d, The future of image analysis is trending towards foundation models,which are trained on extensive, heterogeneous datasets and can be finetuned on numerous tasks.Inzero-shot learning,the model is applied directly to the test set without any task-specific training,whereas in few-shot learning, the model is provided with a small number of labelled examples to adapt to the specific task.
a, 深度学习应用连接基础研究、转化研究和临床研究。在基于人工智能 (AI) 的图像处理领域,主要存在三种方法 (如 b-d 所示)。b, 标准图像分析任务 (如细胞计数或感兴趣区域 (ROI) 分割) 通常需要利用现成工具快速解决。现有众多开源工具可完成这些任务,例如 ilastik 适用于亚细胞分析,而 QuPath 和 3D Slicer 分别常用于组织水平和全身成像。Fiji、ImageJ 和 Cell Profiler 是多功能工具,可处理不同尺度的各种图像分析任务。部分工具选列见补充表 1。c, 另一类图像分析任务需要对大规模数据集进行迭代式训练和微调。这类工具通常具有高度任务特异性,旨在解决特定问题。构建、训练和验证这些流程需要编程技能。d, 图像分析的未来趋势是基础模型,这些模型在广泛异构数据集上训练而成,可针对众多任务进行微调。在零样本学习中,模型直接应用于测试集而无需任何任务特定训练;而在少样本学习中,模型通过少量标注示例来适应特定任务。
localized patterns19. As of 2024, transformers represent the state of the art in both image and language processing tasks, showcasing their versatility and capability in handling diverse and complex data types. In particular, transformers coupled with SSL have catalysed a movement towards so-called foundation models,which is discussed further below (Figs.1b-d and 2a,b).
localized patterns19. 截至2024年,Transformer架构已成为图像与语言处理任务的最先进技术,展现了其处理多样复杂数据类型的通用性与强大能力。特别是结合自监督学习 (SSL) 的Transformer架构催生了所谓基础模型 (foundation models) 的发展浪潮,下文将对此展开进一步探讨 (图1b-d 和图2a,b)。
Uses of deep learning in cancer research
深度学习在癌症研究中的应用
Deep learning has a wide array of applications in cancer research and could improve the productivity of researchers (Box 2). Its capability to analyse unstructured data is applicable for the analysis of results from many experimental assays, such as microscopic images, as well as text from various sources in research and clinical routines. Deep learning also effectively handles other complex data types, including genomic information. In these settings, deep learning is utilized in twomain ways.Thefirstis through user-friendly tools that require no programming skills, offering graphical interfaces for ease of use. For example,software such as Qu Path ori last ik provide intuitive inter- faces for analysing microscopy images and can be extended with deep learning models for tasks such as cell segmentation or classification. The second, more advanced method involves using programming languages such as Python to interact with deep learning architectures, such as CNNs or transformers, through scripts. This approach provides greater flexibility but necessitates a basic understanding of programming(Fig.1b,c).
深度学习在癌症研究中具有广泛的应用,并能提升研究人员的工作效率 (Box 2) 。其分析非结构化数据的能力适用于处理多种实验检测结果,例如显微图像,以及研究和临床流程中来自不同来源的文本。深度学习还能有效处理其他复杂数据类型,包括基因组信息。在这些场景中,深度学习主要通过两种方式应用:第一种是通过无需编程技能的友好型工具,提供图形界面以方便使用。例如,Qu Path 或 i last ik 等软件为分析显微镜图像提供了直观界面,并可通过集成深度学习模型来执行细胞分割或分类等任务。第二种更进阶的方法涉及使用 Python语言 等编程语言,通过脚本与卷积神经网络 (CNN) 或 Transformer 等深度学习架构交互。这种方法灵活性更高,但需要掌握基础的编程知识 (图 1b,c) 。
Deep learning algorithms are also used to analyse medical imaging data, such as detecting tumours in MRIor CT scans with precision comparable to that of experienced radiologists 20.21 This technology also assists in identifying subtle patterns in genetic data??, resulting in the understanding of the genetic origins of a cancer. In drug discovery, deep learning can help in screening potential compounds more efficiently by analysing both in silico (computational) and in vitro (experimental) data, speeding up the process of finding new cancer treatments 23. Furthermore, deep learning is used in his to pathology to analyse tissue samples, distinguishing between benign and malignant cells with remarkable accuracy24, or extracting clinically usable biomarkers directly from image data2.
深度学习算法也被用于分析医学影像数据,例如通过核磁共振成像或计算机断层扫描检测肿瘤,其精准度可与经验丰富的放射科医师相媲美 [20][21]。这项技术还有助于识别基因数据中的细微模式,从而理解癌症的遗传起源。在药物发现领域,深度学习通过分析计算机模拟数据和体外实验数据,能更高效地筛选潜在化合物,加速新型癌症疗法的研发进程 [23]。此外,深度学习应用于数字病理学领域时,可分析组织样本以极高准确度区分良恶性细胞 [24],或直接从影像数据中提取具有临床价值的生物标志物 [2]。
Al for biomedical image analysis
生物医学图像分析人工智能
The breakthroughs in deep learning in the 2010s,such asthe development of CNNs and the availability of large-scale annotated datasets, broadened the applicability of computer-based image analysis to more complex tasks(Fig.1c).Forexample,classical machine learning methods before the advent of deep learning could detect cells in microscopy images. These approaches were further applied for many downstream applications,such as prognostication in cancer based on enumeration of lymphocytes in histology slides25. These early computer vision tools inthe1990s andearly 2000s reliedon handcrafted features, such as edge detection, texture analysis and colour-based segmentation, which required domain expertise and were often limited in their genera liz ability across different datasets.Although simple cell detection has been improved and made more general iz able with deep learning methods,as compared with pre-deep learning tools, such as the commonly used tool StarDist26, deep learning is not strictly necessary to
20世纪10年代深度学习的突破,例如卷积神经网络 (CNN) 的发展和大规模标注数据集的可用性,将基于计算机的图像分析的适用性扩展到更复杂的任务 (图 1c) 。例如,在深度学习出现之前,经典机器学习方法可以检测显微镜图像中的细胞。这些方法被进一步应用于许多下游应用,例如基于组织学切片中淋巴细胞计数的癌症预后预测25。20世纪90年代和21世纪初的这些早期计算机视觉工具依赖于手工制作的特征,例如边缘检测、纹理分析和基于颜色的分割,这需要领域专业知识,并且在不同数据集间的泛化能力通常有限。虽然与深度学习前的工具 (例如常用工具 StarDist26) 相比,简单细胞检测已通过深度学习方法得到改进并变得更可泛化,但严格来说深度学习并非必需。
Review article
综述文章
Box1
Evolution of Al from theory to practical application
人工智能从理论到实际应用的演进
The field of artificial intelligence (Al) is largely regarded as emerging from a conference at Dartmouth College in 1956. Early work was focused on so-called symbolic Al such as rule-based systems for tasks such as playing chess. By the 1960s and 1970s, it became more common to incorporate machine learning algorithms and simplistic artificial neural networks, although these were computationally limited. Machine learning has replaced symbolic Al in virtually all advanced Al applications today, including in cancer research. In the 1980s, the development of the back propagation algorithm enhanced the training efficiency of artificial neural networks. Despite these advancements, Al research faced a winter period in the late 1980s and 1990s due to unfulfilled expectations and subsequent funding cuts. The focus then shifted towards machine learning techniques, such as support vector machines or decision trees, which demonstrated statistically significant performance in various applications. Today, we callthese machine learning methods'classical' machine learning. Nevertheless, artificial neural networks developed further and, finally, in the early 2010s, deep neural networks demonstrated remarkable performance in areas such as image and speech recognition. One of the first real-world applications of neural networks was in postal services. In the 1980s, artificial neural networks were already used to automate the sorting of postal mail by recognizing handwritten letters on envelopes 168. Today, deep learning is widespread, aiding in cancer diagnosis through image analysis, fraud detection in finance and navigation in autonomous vehicles. Al has transitioned from a theoretical discipline to a practical tool with a transformative effect on various industries. Its capabilities now range from basic pattern recognition to advanced neural networks that can learn unsupervised and process natural language in real-time. Whether in smart homes that adapt to us or recommendation algorithms that anticipate our preferences, the value of Al in tackling complex problems has proven to be enormous, and its influence continues to expand.
人工智能 (AI) 领域普遍被认为起源于1956年达特茅斯学院的一次会议。早期研究主要集中于所谓的符号人工智能,例如用于国际象棋等任务的基于规则的系统。到1960年代和1970年代,尽管存在计算能力限制,整合机器学习算法和简化人工神经网络的做法已更为普遍。如今在几乎所有先进AI应用领域(包括癌症研究),机器学习已完全取代符号人工智能。1980年代,反向传播算法的发展提升了人工神经网络的训练效率。尽管取得这些进展,由于预期未能实现及随之而来的资金削减,AI研究在1980年代末至1990年代遭遇寒冬期。研究重心随后转向支持向量机或决策树等机器学习技术,这些技术在各类应用中展现出统计学意义的显著性能。如今我们称这些机器学习方法为"经典"机器学习。然而人工神经网络持续发展,最终在2010年代初,深度神经网络在图像和语音识别等领域展现出卓越性能。神经网络的首个实际应用案例出现在邮政服务领域。1980年代,人工神经网络已通过识别信封上的手写字母来实现邮政自动分拣[168]。当前深度学习技术已广泛应用,通过图像分析辅助癌症诊断、金融欺诈检测以及自动驾驶车辆导航。AI已从理论学科转型为具有变革意义的实用工具,其能力范围涵盖从基础模式识别到能进行无监督学习并实时处理自然语言的先进神经网络。无论是在适配用户需求的智能家居,还是预测用户偏好的推荐算法中,AI在解决复杂问题方面的价值已被证明巨大无比,其影响力仍在持续扩展。
obtain a reasonable detection of cells in microscopy images. However, the situation is different in more complex tasks. In the case of more subtle properties such as biomarker analysis from patient samples, almost all clinically approved systems are based on deep learning22. Many of these biomarkers are based on the simple principle of image classification, which will be discussed below.
在显微镜图像中获得合理的细胞检测。然而,在更复杂的任务中情况有所不同。对于从患者样本中分析生物标志物这类更精细的特性时,几乎所有临床批准的系统都基于深度学习22。这些生物标志物大多基于简单的图像分类原理,下文将对此进行讨论。
Cellular and molecular imaging analysis
细胞与分子成像分析
Cancer researchers often deal with digital images.Inthe case of basic research, these experiments can involve visual assessments, such as visually examining cell cultures for confluency, morphology or growth as a preliminary step or assessing tumour growth in vivo through microscopic methods such as bright field or fluorescence microscopy. Many image analysis tasks in biological research are traditionally performed manually, however this is not only inefficient and error-prone but can also make experiments infeasible if thousands of output images have to be analysed. In general, by using deep learning to quantify experimental readouts, the analysis can be made more objective, reliable and quicker. For instance, in the context of cell detection in phase-contrast microscopy, deep learning can quickly and reliably detect individual cells and classify them as live or dead27. Such analyses are being widely used,for example,through commercial platforms such as the Incucyte Al Cell Health Analysis Software Module (Sartori usA G).
癌症研究人员经常处理数字图像。在基础研究领域,这些实验可能涉及视觉评估,例如通过目测检查细胞培养物的汇合度、形态或生长情况作为初步步骤,或通过明场或荧光显微镜等显微方法评估体内肿瘤生长。生物学研究中的许多图像分析任务传统上依赖人工完成,但这种方式不仅效率低下且容易出错,当需要分析数千张输出图像时甚至可能导致实验无法实施。总体而言,通过使用深度学习技术量化实验读数,可以使分析过程更具客观性、可靠性并提升效率。例如在相差显微镜下的细胞检测场景中,深度学习能够快速可靠地识别单个细胞并将其分类为存活或死亡细胞[27]。此类分析正得到广泛应用,例如通过商业化平台如Incucyte AI细胞健康分析软件模块(Sartorius AG)实现。
Open-source Al solutions for microscopy image analysis. In general,commercially available software utilizing deep learning forbasic science research is only available for common and standardized tasks, such as cell counting in phase-contrast images. The open-source community, however, has made dozens of deep learning methods available for the analysis of microscopy images. For example, QuPath28 is a common software for viewing gigapixel microscopy images, in which a single image file contains multiple gigabytes of compressed data, and it enables access to popular deep learning models for cell detection such as 'stardist'26 without requiring any programming skills. Similarly, ImageJ",ImageJ2 (ref. 30) and Fiji²1 are the standard tools for many image viewing and analysis tasks in biology, including viewing multichannel, multidimensional images, even in more uncommon file formats such as CZI or MRxS, via the integration with Bio-Formats32 (Supplementary Table 2).Onthese platforms,pre-trained deeplearning models can be run through various open-source platforms and plugins, including deep lm age J 33. Some pre-trained models for cell segmentation, nucleus segmentation or more specialized tasks such as segmentation of mitochondria in electron microscopy images are available through a repository or collection of pre-trained models, termed model'zoos' (for example, available at Biolmage.io).
用于显微镜图像分析的开源AI解决方案。总的来说,利用深度学习进行基础科学研究的商业软件仅适用于常见和标准化任务,例如相差图像中的细胞计数。然而,开源社区已经提供了数十种深度学习方法用于显微镜图像分析。例如,QuPath28是查看千兆像素显微镜图像的常用软件,其中单个图像文件包含多个千兆字节的压缩数据,并且无需任何编程技能即可访问流行的深度学习模型进行细胞检测,例如'stardist'26。类似地,ImageJ"、ImageJ2(参考文献30)和Fiji²1是生物学中许多图像查看和分析任务的标准工具,包括查看多通道、多维图像,甚至通过Bio-Formats32的集成(补充表2)查看更不常见的文件格式,如CZI或MRxS。在这些平台上,预训练的深度学习模型可以通过各种开源平台和插件运行,包括deep lm age J 33。一些用于细胞分割、细胞核分割或更专业任务(如电子显微镜图像中线粒体分割)的预训练模型可通过存储库或预训练模型集合(称为模型"动物园")获得(例如,可在Biolmage.io上获得)。
For many specialized niche applications, no off-the-shelf model or platform is available.In these cases,researchers are best advised to build their own software based on deep learning and redistribute it to other researchers under an open-source license. For example, deep learning has been successfully used in research software pipelines by several academic research groups for the purpose of evaluating tumour organoids in bright field microscopy 34,35. These research groups have made their software available to other researchers under open-source licenses, enabling wider adoption and collaboration. Furthermore, some advanced biological imaging techniques, such as reconstructing high-resolution fluorescence images from low-resolution or noisy data, cannot be performed efficiently without deep learning3. Although ImageJ can perform basic image reconstruction tasks such as tiling, deep learning-based methods can substantially improve the quality of the reconstructed images by learning to remove noise, enhance resolution and infer missing information from the available data.Often, research groups must develop advanced computational methods,such as custom deep learning architectures or novel training strategies,to enable them to solve specific image analysis needs for which nogeneral solution is available. These custom models can circumvent the limitations of traditional image reconstruction techniques by learning to exploit patterns and relationships within the data that are not easily captured by handcrafted algorithms.
对于许多专业细分领域的应用场景,市面上并没有现成的模型或平台可供使用。在这种情况下,研究人员的最佳选择是基于深度学习技术自主开发软件,并通过开源许可协议分享给其他研究者。例如,多个学术研究团队已成功将深度学习应用于研究软件流程中,用于明场显微镜下的肿瘤类器官评估 [34,35] 。这些研究团队通过开源许可将软件开放给其他研究者,促进了更广泛的采用与合作。此外,某些先进的生物成像技术(如从低分辨率或含噪数据重建高分辨率荧光图像)若没有深度学习将无法有效实现 [3] 。尽管 ImageJ 能够执行基本的图像重建任务(如图像拼接),但基于深度学习的方法通过学习去除噪声、增强分辨率以及从现有数据推断缺失信息,能显著提升重建图像的质量。研究团队通常需要开发先进的计算方法(例如定制化的深度学习架构或创新的训练策略),以满足特定图像分析需求,因为这类需求往往缺乏通用解决方案。这些定制模型通过学习数据中难以通过手工算法捕捉的模式与关联,能够规避传统图像重建技术的局限性。
His to pathology
组织病理学
Capabilities of deep learning in pathology image analysis. In basic and translational cancer research, tumour tissue from patients or animal models is analysed by histology 17 to diagnose and evaluate tumours.The his to logical morphology of a tumour represents the
深度学习在病理图像分析中的能力。在基础与转化癌症研究中,来自患者或动物模型的肿瘤组织通过组织学17进行分析以诊断和评估肿瘤。肿瘤的组织学形态代表了
Review article
综述文章
result of a plethora of molecular processes in the genome ofatumour, epigenome and environment. Because ofthis,it is essential to digitize these slides into high-resolution, gigapixel images. The challenge in analysing these images arises from their immense size combined with their detailed content. This complexity often exceedsthe capabilities of standard microscopy image analysis tools, such as ImageJ or Fiji29.31 particularly when dealing with a large volume of data. Although ImageJ and Fiji provide powerful tools for image processing and analysis, including the ability to create macros and scripts for automated workflows, they are not optimized for handling gigapixel images or large datasets. Other software, such as QuPath, is optimized for handling such gigapixel images, but there are many other ways to use deep learning in pathology image analysis, which we describe below.
肿瘤基因组、表观基因组和环境中大量分子过程作用的结果。正因如此,必须将这些玻片数字化为高分辨率的十亿像素级图像。分析这些图像的挑战在于其巨大的尺寸与精细内容相结合。这种复杂性往往超出了标准显微镜图像分析工具(如ImageJ或Fiji [29.31])的处理能力,特别是在处理海量数据时。尽管ImageJ和Fiji提供了强大的图像处理与分析工具(包括创建宏和脚本实现自动化工作流程),但它们并未针对十亿像素级图像或大型数据集进行优化。其他软件(如QuPath)虽针对此类十亿像素级图像进行了优化,但在病理图像分析中应用深度学习还有许多其他方法,我们将在下文详述。
Computational pathology. Although digital pathology involves obtaining, storing and viewing these images, the analysis is referred to as computational pathology. From atechnical point of view, before the broader application of deep learning models, image analysis pipelines were composed of successive, highly engineered steps,tailored to specific types of analysis. For example, a machine learning model would detect cells, after which a subsequent model would classify cells into distinct cell types, and a final model would predict patient prognosis from these measurements?5. However, it is difficult to build such multistep pipelines with classical machine learning methods because of the complexity of integrating the multiple processing stages. The complexities of these tasks have resulted in the growing adoption of deep learning-based end-to-end models as an alternative approach17, as they consist of fewer steps and rely on less explicitly defined expert knowledge. They use deep neural networks and can learn directly from raw data, to generate the output without the need for different intermediate steps.
计算病理学。虽然数字病理学涉及获取、存储和查看这些图像,但其分析过程被称为计算病理学。从技术角度来看,在深度学习模型广泛应用之前,图像分析流程由一系列连续的、高度工程化的步骤组成,这些步骤针对特定类型的分析进行了定制。例如,机器学习模型会先检测细胞,随后另一个模型将细胞分类为不同细胞类型,最终模型根据这些测量结果预测患者预后[5]。然而,由于需要整合多个处理阶段,使用经典机器学习方法构建这种多步骤流程十分困难。这些任务的复杂性促使人们越来越多地采用基于深度学习的端到端模型作为替代方案[17],因为这类模型步骤更少,且依赖较少明确定义的专家知识。它们使用深度神经网络,能够直接从原始数据中学习并生成输出,无需不同的中间步骤。
Tasks in computational pathology. Computational pathology can be applied to solve two types oftasks. One type oftask includes recapitulating human evaluation of images, such as counting cells or measuring tissue size. According to Echle37, basic'tasks are intended to simplify routine workflows that are currently performed solely by pathologists. These basic tasks can, in principle, be performed by humans, but are time-consuming and not scalable if done by hand38. By contrast, advanced tasks, as per the previously established definition 22.37, comprise the prediction of more high-level properties directly from image data, such as predicting genetic alterations directly from haematoxylin and eosin (H&E)-stained tissue37. Some genetic alterations, such as micro satellite instability (Msl) in colorectal cancer, are linked
计算病理学的任务。计算病理学可用于解决两类任务。一类任务包括重现人类对图像的评估,例如细胞计数或组织尺寸测量。根据Echle[37]的观点,基础任务旨在简化目前仅由病理学家执行的常规工作流程。这些基础任务原则上可由人工完成,但耗时且难以规模化操作[38]。相比之下,按照先前确立的定义[22,37],高级任务包含直接从图像数据预测更高层次的特性,例如直接从苏木精-伊红(H&E)染色组织预测基因突变[37]。某些基因突变(如结直肠癌中的微卫星不稳定性(MSI))与

a Fig.2|Development from simple,specialized,shallow models to deep, multimodal,generalist models for computer vision.a, The progress of artificial intelligence (Al) techniques in medical imaging started to be widely used with supervised learning using machine learning models with handcrafted features in the early 2o00s. These models relied on domain expertise to manually extract relevant features from images,which were then used totrainthe models on labelled datasets.Around 2012, supervised deep learning emerged, in which Al models,particularly convolutional neural networks (CNNs),were trained on large labelled datasets to automatically learn hierarchical features directly from raw image data.This approach markedly improved the performance and general iz ability of Al models in medical imaging tasks.Inthe early2020s, many research groups started to use the emerging self-supervised learning methods,which enable models to learn meaningful patterns from unlabelled data by predicting properties of the data itself, without relying on external labels.b, In the initial stages of Al application to cancer research,the approach
图 2 | 从简单、专用、浅层模型到深度多模态通用模型的计算机视觉发展历程。a, 人工智能 (AI) 技术在医学影像中的进展始于 2000 年代初采用具有手工特征特征的机器学习模型进行监督学习。这些模型依赖领域专业知识从图像中手动提取相关特征,然后利用标记数据集训练模型。约 2012 年,监督深度学习兴起,AI 模型(特别是卷积神经网络 (CNN))通过大型标记数据集训练,直接从原始图像数据中自动学习层次化特征。该方法显著提升了 AI 模型在医学影像任务中的性能与泛化能力。2020 年代初,许多研究团队开始采用新兴的自监督学习方法,使模型能够通过预测数据自身属性(无需依赖外部标签)从未标记数据中学习有意义模式。b, 在 AI 应用于癌症研究的初始阶段,该方法
Potential impact in oncology
肿瘤学领域的潜在影响
was to rely on carefully selected patterns from medical images as a basis for analysis.Overtime,the need for manual features election was eliminated,and models were able to learn essential features directly from the data.Presently, the emphasis is on integrating various data sources through multimodal models, which combine information from different data modalities,such as radiology images,pathology slides,genomic data and clinical records.The envisioned future involves foundation models,which are large-scale,self-supervised models pre-trained on diverse, unlabelled datasets across multiple modalities. These models can be fine-tuned for various downstream tasks with minimal task-specific training data.The ultimate aim of Al application in oncology could be a universal generalist model, a multi-purpose tool with the ability to analyse, interpret and interact with both patients and medical professionals. This generalist model would integrate data from multiple sources, support diagnosis and treatment recommendations, and explain its decisions in a hum z er pre table manner.
过去的方法依赖于从医学图像中精心挑选模式作为分析基础。随着时间推移,不再需要手动特征选择,模型能够直接从数据中学习关键特征。当前的重点是通过多模态模型整合各种数据源,这些模型融合了来自不同数据模态的信息,例如放射影像、病理切片、基因组数据和临床记录。未来的愿景是建立基础模型 (foundation models),这些大规模自监督模型通过跨多模态的多样化未标注数据集进行预训练,只需极少量任务特定训练数据即可微调适应各种下游任务。肿瘤学人工智能应用的终极目标可能是开发通用全能模型,这种多用途工具能够分析、解读数据并与患者及医疗专业人员互动。该全能模型将整合多源数据,支持诊断治疗建议,并以人类可理解的方式解释其决策依据。
Review article
综述文章
Box2
Productivity gains through deep learning
通过深度学习提升生产效率
Beyond direct research applications, deep learning is transforming research workflows by optimizing everyday tasks. Researchers can use artificial intelligence (Al) for administrative tasks, such as drafting standard e-mail responses or creating visual sketches. According to a 2023 report by McKinsey titled The Economic Potential of Generative Al: The Next Productivity Frontier 169, generative Al could affect between $2.6%$ and $4.5%$ of annual revenues in the health-care and pharma sector, translating to an annual additional value between $\cup\mathbb{S}\$60$ billion and $0\mathsf{S}\$110$ billion globally. Similarly, a recent study from the Boston Consulting Group concluded that Al made management consultants significantly more productive by using large language models in routine tasks that involve the generation of text170. Although these are projections not certainties, it could be extrapolated to individual researchers who can become more productive by incorporating Al methods into their daily workflows.
除了直接的研究应用,深度学习正在通过优化日常任务来改变研究工作流程。研究人员可以利用人工智能 (AI) 处理行政事务,例如起草标准电子邮件回复或创建视觉草图。根据麦肯锡2023年发布的报告《生成式AI的经济潜力:下一轮生产力前沿》[169],生成式AI可能影响医疗健康和制药行业年收入的2.6%至4.5%,相当于全球每年新增600亿至1100亿美元价值。同样,波士顿咨询集团近期研究表明,通过在涉及文本生成的常规任务中使用大语言模型,AI使管理咨询师的工作效率显著提升[170]。尽管这些是预测而非确定结论,但可以推断个体研究人员通过将AI方法融入日常工作流程也能获得效率提升。
to known morphological features, which was known even before deep learning39. However, deep learning methods can automatically analyse these morphological features and make quantitative predictions about underlying genetic alterations just by observing routine pathology slides. Deep learning methods have rediscovered known links between the genotype and the phenotype 18,40.41 and could therefore potentially be extended to detect new genotype-phenotype links for many other genetic alterations that have unknown morphological manifestations. Various studies have shown that the mutation status of individual genes42-48, defective DNA repair mechanisms such as $\mathbf{M}\mathbf{S}\mathbf{I}^{18,40,49-51}$ and homologous recombination deficiency*, or tumour mutation burden53-5 are predictable from H&E slides by deep learning. Furthermore, these models have the capability to discern morphological patterns within both epithelial and stromal tumour regions, or to predict the hormone receptor expression status in breast cancer samples without the need for the immuno his to chemistry or immuno fluorescence against the receptors 4256-58. In summary,the application of deeplearning to pathology has demonstrated that the molecular properties of tumours can be identified from basic H&E images.
已知形态学特征与基因型之间的关联在深度学习出现之前就已为人所知39。然而深度学习方法能够自动分析这些形态学特征,仅通过观察常规病理切片即可对潜在基因改变进行定量预测。深度学习方法重新发现了基因型与表型之间的已知联系18,40,41,因此有望扩展到检测许多其他具有未知形态学表现的基因改变的新基因型-表型关联。多项研究表明,单个基因的突变状态42-48、缺陷DNA修复机制如$\mathbf{MSI}^{18,40,49-51}$和同源重组缺陷52,或肿瘤突变负荷53-55,都可以通过深度学习从H&E切片中预测。此外,这些模型能够辨别上皮和间质肿瘤区域内的形态学模式,或在不需针对受体的免疫组织化学或免疫荧光的情况下预测乳腺癌样本中的激素受体表达状态42,56-58。总之,深度学习在病理学中的应用证明,肿瘤的分子特性可以从基础的H&E图像中识别。
Identifying and quantifying biomarkers from images. Computerbased image classification methods can classify or categorize images into distinct groups based on their visual content. A notable application of this is in the field of medical image analysis, particularly in his to pathology where image classification approaches are used to differentiate his to pathological images based on the presence or absence of cancerous cellss. For example, deep learning can be used to identify tumours within H&E-stained his to logical tissue section slides59. The utility of image classification extends to more complex tasks, which have been referred to as advanced computational pathology tasks 223738. An example of such an application is the categorization of his to pathological images to directly predict high-level properties from images, such as molecular alterations 40.44, tumour of origin for patients with cancer of unknown primary6° survival of patients61.62 or patient responses to immuno therapy'3. These advanced computational pathology tasks demonstrate the potential of deep learning to extract clinically relevant biomarkers directly from his to pathological images. By learning to recognize complex patterns and associations within the image data, deep learning models can identify visual features that correlate with specific molecular alterations, tumour types, patient outcomes or treatment responses.Consequently,image classification methods not only assist in diagnostic processes but also contribute to the development of predictive biomarkers,offering valuable insights for personalized treatment strategies°4. In the next sections, we discuss such his to pathology image analysis tasks in cancer research in moredetail.
识别和量化图像中的生物标志物。基于计算机的图像分类方法可以根据图像的视觉内容将其分类或归入不同的组别。一个显著的应用是在医学图像分析领域, 特别是在组织病理学中, 图像分类方法被用于根据癌细胞的存在与否来区分组织病理学图像。例如, 深度学习可用于识别H&E染色的组织学切片中的肿瘤59。图像分类的效用扩展到更复杂的任务, 这些任务被称为高级计算病理学任务223738。此类应用的一个例子是对组织病理学图像进行分类, 以直接从图像预测高级属性, 例如分子改变40.44, 原发灶不明癌症患者的肿瘤起源6°, 患者生存期61.62或患者对免疫疗法的反应'3。这些高级计算病理学任务展示了深度学习直接从组织病理学图像中提取临床相关生物标志物的潜力。通过学习识别图像数据中的复杂模式和关联, 深度学习模型可以识别与特定分子改变, 肿瘤类型, 患者结局或治疗反应相关的视觉特征。因此, 图像分类方法不仅有助于诊断过程, 还有助于预测性生物标志物的开发, 为个性化治疗策略提供有价值的见解°4。在接下来的章节中, 我们将更详细地讨论癌症研究中的此类组织病理学图像分析任务。
Commercially available and custom-built deep learning tools for computational pathology.Easy-to-use software such as QuPath28 offer some deep learning or machine learning-based capabilities for analysis (Supplementary Table 2) including automated tissue segmentation,tumour detection and classification. Through the use of scripts, QuPath also allows users to train custom models to identify specific cell types or quantify biomarkers. In addition, QuPath supports the integration of external machine learning models, enabling users to apply advanced algorithms for complex analyses without training any new models.Although QuPath provides a user-friendly interface for basic computational pathology tasks,most research groups focused on more advanced applications of Alin his to pathology are typically using self-developed pipelines written in the programming language Python. In addition,researchers often develop their own tools for data annotation or visualization ofthe results,but can also use existing tools; in the past 3 years, several multi-purpose open-source software packages for computational pathology incancer research have emerged, many of which have been re-used widely. These include Fast Pathology 65 s, TIA Toolbox 66, $\mathbf{CLAM}^{67}$ and STAMP6s, which are built on top of generalpurpose data-processing and machine learning frameworks, such as PyTorch°, MONA1°, OpenSlide7 and LibVips"2 (Supplementary Table1).Non-computational cancer researchers will find ituseful to first familiarize themselves with Python programming and then complete basic courses on Python-based image processing (available on many online platforms including commercial platforms such as YouTube,Coursera andedx)before delving into the documentation and application of the computational pathology pipeline repositories.
用于计算病理学的商用和定制深度学习工具。诸如QuPath28等易用软件提供部分基于深度学习或机器学习的功能用于分析(附表2),包括自动化组织分割、肿瘤检测与分类。通过脚本使用,QuPath还允许用户训练定制模型以识别特定细胞类型或量化生物标志物。此外,QuPath支持集成外部机器学习模型,使用户无需训练新模型即可应用先进算法进行复杂分析。虽然QuPath为基础计算病理学任务提供了用户友好界面,但大多数专注于人工智能在病理学中更高级应用的研究团队通常使用基于Python语言自行开发的流程。此外,研究人员常自主开发数据标注或结果可视化的工具,但也可使用现有工具;过去三年间,涌现出若干用于癌症研究中计算病理学的多功能开源软件包,其中许多已被广泛复用。这些包括Fast Pathology65、TIA Toolbox66、$\mathbf{CLAM}^{67}$和STAMP68,它们构建于通用数据处理和机器学习框架之上,例如PyTorch69、MONAI70、OpenSlide71和LibVips72(附表1)。非计算领域的癌症研究者会发现,在深入钻研计算病理学流程库的文档和应用之前,先熟悉Python语言编程并完成基于Python语言的图像处理基础课程(可在许多在线平台获取,包括YouTube、Coursera和edX等商业平台)会很有帮助。
Challenges in computational pathology. Computational pathology not only yields practically useful tools but it can also generate new scientific insight via explain ability methods73.Such methods help us to understand which morphological patterns are related to a prediction by the deep learning system74. However, explain able Al methods are still limited, as retrofitting a deep learning pipeline with explain ability capabilities is not trivial. This is because most deep learning models are inherently complex and opaque, with millions of parameters and multiple nonlinear transformations, making it difficult to trace the decisionmaking process from input to output. Adding explain ability to an existing deep learning pipeline requires carefully designed techniques, such as attention mechanisms, saliency maps or concept activation vectors,which can highlight the relevant regions or features in the input data that contribute to the predictions of the model. Developing and integrating these techniques into a production-ready computational pathology workflow is a non-trivial task that requires expertise in both
计算病理学面临的挑战。计算病理学不仅能产生实用工具,还能通过可解释性方法73生成新的科学见解。这类方法帮助我们理解哪些形态学模式与深度学习系统的预测相关74。然而,可解释人工智能方法仍存在局限,因为为深度学习流程添加可解释性功能并非易事。这主要是因为大多数深度学习模型本质上复杂且不透明,具有数百万个参数和多重非线性变换,使得从输入到输出的决策过程难以追溯。为现有深度学习流程添加可解释性需要精心设计的技术,例如注意力机制、显著图或概念激活向量,这些技术可以突出显示输入数据中与模型预测相关的区域或特征。将这些技术开发并整合到可用于实际生产的计算病理学工作流程中,是一项需要双重专业知识的非平凡任务。
Review article
综述文章
deep learning and interpret able Al. Moreover, the real-world clinical adoption of Al-based work flows in cancer his to pathology is hampered by the lack of digital workflows for pathology in North America and Europe7. The regulatory and ethical considerations, integration challenges within existing workflows, and issues of Al interpret ability and trust among health-care professionals further limits this.
深度学习与可解释人工智能。此外,基于人工智能的癌症组织病理学工作流在北美和欧洲的实际临床应用受到病理学数字化工作流缺失的阻碍[7]。监管与伦理考量、现有工作流程中的整合挑战,以及医疗专业人员对人工智能可解释性与信任度的问题进一步限制了其发展。
Radiology
放射学
Clinical radiology for diagnosing,treating and monitoring therapeutic response can include CT, MRl or positron emission tomography (PET) for cancer detection. Unlike his to pathology, radiology produces inherently digital images and is therefore not limited by the digitization process.
临床放射学用于诊断、治疗和监测疗效反应时,可包括CT、MRI或正电子发射断层扫描 (PET) 用于癌症检测。与组织病理学不同,放射学本身生成数字图像,因此不受数字化过程的限制。
The integration of computer vision models into radiology has markedly enhanced the analytical capabilities ofthe field, revolutionizing the way in which researchers analyse and interpret radiological images. This has paved the way for increased diagnostic accuracy, personalized treatment plans and a deeper understanding of cancer. Traditional machine learning models, which often rely on predefined, handcrafted radiomics features, have shown considerable success in identifying patterns and features in imaging data that are not immediately apparent to the human eye. At the same time, deep learning models, which process raw radiology data, offer a more flexible approach by automatically identifying relevant features without the need for manual intervention. This capability is particularly valuable in handling vast amounts of data, in which deep learning models can uncover complex patterns and relationships within the images. Attempts to inject handcrafted prior knowledge into any machine learning systems are less effective than hypothesis-free or data-driven deep learning approaches, which tend to excel as more data are used, thereby ultimately outperforming handcrafted pipelines. In other words, as the amount of available data increases, deep learning models that automatically learn relevant features from the data tend to surpass the performance of models that rely on manually designed features based on prior knowledge. In radiology, the vast amounts of data lead deep learning to typically outperform classical machine learning methods such as handcrafted radiomics7.7. However, in some scenarios, classical machine learning can offer advantages over deep learning. In situations with limited data(onlyafew dozens to 1 o 0 cases are available),the high capacity of deep learning modelsto learn from data may lead to over fitting, and thus machine learning models using simpler'features'such as tumour sizeand shapecan be more reliable. In general, it is important to choose the appropriate methods for the problem, and only a deep understanding ofthe methodology principles and the data at hand can enable such appropriate decisions.
将计算机视觉模型整合到放射学领域显著增强了该领域的分析能力,彻底改变了研究人员分析和解读放射学图像的方式。这为提高诊断准确性、制定个性化治疗方案以及更深入地了解癌症铺平了道路。传统机器学习模型通常依赖预定义的手工设计放射组学特征,在识别成像数据中肉眼难以直接观察到的模式和特征方面已取得显著成功。与此同时,处理原始放射学数据的深度学习模型通过自动识别相关特征而无需人工干预,提供了更灵活的方法。这种能力在处理海量数据时尤其宝贵,深度学习模型能够从中发掘图像内复杂的模式与关联。相较于依赖假设或无监督的数据驱动深度学习方法,试图将人工先验知识注入任何机器学习系统的效果往往较差——随着数据量增加,深度学习方法往往表现更优,最终超越人工设计的流程。换言之,随着可用数据量的增长,从数据中自动学习相关特征的深度学习模型往往超越依赖基于先验知识手动设计特征的模型性能。在放射学中,海量数据使得深度学习通常优于传统机器学习方法(例如手工放射组学)。但在某些场景下,经典机器学习仍能展现优势。当数据有限(仅几十至百例病例)时,深度学习模型强大的数据学习能力可能导致过拟合,此时使用更简单特征(如肿瘤尺寸和形状)的机器学习模型反而更可靠。总体而言,根据具体问题选择合适方法至关重要,只有深入理解方法论原理和现有数据特性才能做出恰当决策。
Applying Al in radiology: key considerations and approaches. When applying machine learning or deep learning models to medical imaging, a crucial step is preprocessing the data to reduce variability arising from diverse acquisition protocols within the same imaging modality. This involves standardizing image parameters such as slice thickness and voxel size, reducing noise and normalizing signal intensity7. Subsequently, in-depth information from radiological images can be extracted using predefined algorithms that compute standardized radiomics features79. These are handcrafted parameters, also known as engineered radiomics features, extracted from medical images using mathematical algorithms designed to quantify particular aspects of the image, such as signal intensity, texture and shape798o. These provide valuable information about voxel density or signal intensity (depending on the image modality) and also offer insights into the size and shape of the region of interest (ROl). Among the numerous algorithms for extracting handcrafted radiomics from medical images, Py radio mic s is a widely used Python package*°. To use it, a user must first install Py radio mic s and import the necessary libraries into your Python notebook. Next, set up the feature extractor to meet your specific needs,including image preprocessing and feature selection options. After loading the medical image and its associated segmentation mask, which outlines the ROl, the user can proceed to extract radiomics features from the ROl. This customizable process allows for tailoring to your research requirements. Using handcrafted radiomics features, which are usually coupled with classical machine learning models (that is, those utilized before the advent of deep learning architectures, such as logistic regression, support vector machines(SvMs)or random forest)can be trained to test various hypotheses. However, it is important to remember that applying handcrafted radiomics requires meticulous image annotation, such as delineating the R Ol corresponding to tumours in cancer studies. This delineation process should be and is typically performed manually by expert radiologists.Although some software,suchas3D Slicer? or ITK-SNAP?2, provide tools to facilitate this process,it remains a highly time-consuming task.
人工智能在放射学中的应用: 关键考量与方法
将机器学习或深度学习模型应用于医学影像时,关键步骤是通过预处理减少同一成像模态中不同采集协议带来的变异性。这包括标准化图像参数 (如层厚和体素尺寸) 、降噪和信号强度归一化7。随后,可使用预定义算法计算标准化放射组学特征79,从放射影像中提取深度信息。这些通过数学算法从医学影像中提取的手工参数 (亦称人工设计的放射组学特征) ,可量化图像的特定方面,如信号强度、纹理和形状7980。这些特征不仅提供关于体素密度或信号强度 (取决于成像模态) 的宝贵信息,还能揭示感兴趣区域 (ROI) 的大小和形态特征。在众多从医学影像提取手工放射组学的算法中, PyRadiomics 是广泛使用的 Python 语言工具包*°。使用时需先安装 PyRadiomics 并将必要库导入 Python 笔记本,接着根据具体需求配置特征提取器 (包括图像预处理和特征选择选项) 。加载医学影像及其对应的勾画 ROI 的分割掩模后,即可从 ROI 提取放射组学特征。这种可定制流程能有效适配研究需求。
手工放射组学特征通常与经典机器学习模型 (即深度学习架构出现前使用的模型,如逻辑回归、支持向量机或随机森林) 结合,可训练模型验证各种假设。但需注意,应用手工放射组学需要精细的图像标注,例如在癌症研究中勾画对应肿瘤的 ROI。该勾画过程应当且通常由专业放射科医师手动完成。尽管 3D Slicer? 或 ITK-SNAP?2 等软件提供了辅助工具,这仍是极其耗时的任务。
Deep learning applications for radiology. Alternatively, neural networks offer a powerful approach in medical image analysis. Although they can also be trained with data from pre-delineated tumours, they excel at processing whole images. Training neural networks with whole image data can obviatethe labour-intensive process of manual delineation of areas of interest. This also allows the model to learn patterns not only from the tumour but also from the rest of the anatomical structures captured in medical scans. Moreover, unlike traditional methods that rely on predefined handcrafted features, deep learning models can automatically learn relevant features directly from the image data62.83.84. This has a key role in advancing image quality while reducing acquisition timess5.86 Recently, deep learning in radiology has become more robust through large-scale training and is now capable of precisely delineating every organ in the body from whole-body scans, setting a new standard in medical imaging?7. Currently several computer-aided systems are being used to enhance radiology imaging analysis,for instance, as medical devicesfor improving the detection oflung cancer (such as aview LCS, syngo.CT Lung CAD,Samsung Auto Lung Nodule Detection and InferRead CD Lung) and breast cancer (such as breasts cape v1.0 and M ammo Screen in screening programmes and brain tumour diagnosis8. These tumour screening applications of Al-enabled radiology image analysis have also been validated in largescale prospective clinical trials89. Exceeding this, machine and deep learning models have also shown great promise in identifying subtle cancer patterns in radiology imaging, which are associated with specific mutations and the likelihood of responding to different treatments, in both pre clinical and clinical settings90-94. As pre clinical experiments often involve smaller datasets, usually just a few dozen mice compared with hundreds of patients inclinical trials,some applications consider each voxel of the image as input, which helps to mitigate the challenges posed bylimited datafor deep learning models"s. This voxel-wise approach allows for a more detailed analysis by treating each voxel as a separate data point, potentially increasing the sensitivity to subtle features relevant to the study. This method contrasts with using the full image as a source of information. Analysing whole images allows for the integration of contextual and spatial information. Again, having an
深度学习在放射学中的应用。或者,神经网络为医学影像分析提供了强大方法。虽然它们也能通过预勾画肿瘤数据进行训练,但其优势在于处理完整图像。使用完整图像数据训练神经网络可避免耗时费力的人工感兴趣区域勾画过程。这使得模型不仅能够从肿瘤中学习模式,还能从医学扫描捕获的其余解剖结构中学习。此外,与传统依赖预定义手工特征的方法不同,深度学习模型能直接从图像数据中自动学习相关特征[62][83][84]。这在提升图像质量同时缩短采集时间方面具有关键作用[85][86]。近年来,放射学中的深度学习通过大规模训练变得更为稳健,现已能够从全身扫描中精确勾画体内每个器官,为医学影像设立了新标准[87]。目前已有若干计算机辅助系统用于增强放射影像分析,例如作为医疗器械用于提升肺癌检测(如aView LCS、syngo.CT Lung CAD、Samsung Auto Lung Nodule Detection和InferRead CD Lung)和乳腺癌检测(如筛查项目中的breasts cape v1.0和Mammo Screen)以及脑肿瘤诊断[88]。这些基于人工智能的放射影像分析在肿瘤筛查中的应用已通过大规模前瞻性临床试验验证[89]。不仅如此,机器和深度学习模型在识别放射影像中与特定突变及不同治疗响应可能性相关的细微癌症模式方面也展现出巨大潜力,涵盖临床前和临床环境[90-94]。由于临床前实验通常涉及较小数据集(通常仅数十只小鼠,而临床试验有数百名患者),某些应用将图像的每个体素作为输入,这有助于缓解深度学习模型面临的数据有限挑战[95]。这种逐体素方法通过将每个体素视为独立数据点,允许进行更精细分析,可能提高对研究中相关细微特征的敏感性。该方法与使用完整图像作为信息源形成对比。分析完整图像可实现上下文和空间信息的整合。
Review article
综述文章
intuition about the available tools (not necessarily the skills to imple ment all tools in computer code) helps researchers to seek the most appropriate analysis tool for their problem at hand.
对可用工具的直觉理解 (不一定需要掌握用计算机代码实现所有工具的技能) 有助于研究人员为其当前问题寻找最合适的分析工具。
Foundation models in biomedical image analysis
生物医学影像分析中的基础模型
Foundation models, also referred to as foundational models, are a type of deep learning model that is pre-trained on a large, diverse dataset using SSL. Foundation models can be trained on any data, including images or text or both. The goal of these models is to learn general features and patterns that can be applied to a wide range oftasks,rather than being trained for a specific task from scratch.Forexample,various microscopy images, including bright field and fluorescent imaging, obtained under different conditions and with different microscopes, are used to train a foundation model. These models learn general features and patterns across the data they are trained on; for example, a model trained on microscopy images learns about intensities, shapes and structure relations.Sucha pre-training allows the model to capture a broad understanding of the data, which can then be fine-tuned for specific tasks using smaller labelled datasets. Since the early 2020s, foundation models have emerged in biomedical image analysis, enabled by the advent of SSL9°. Foundation models are typically used as a starting point to be fine-tuned using a smaller, labelled dataset, yielding more focused, specialized models. In medical image analysis, this procedure can yield better models for any task, for example, image classification tasks8.18.97. Similar foundation models are being developed for other medical imaging modalities, including cross-modality applications(Fig.1d).Cross-modality applications refer to A I systems that can process and integrate information from different data types, such as combining radiology images with clinical notes or pathology images with genomic data98, or generally, images and text97. To do so requires substantial computational power and engineering resources, as wellas access to extensive datasets.Ideally,acomputing cluster with multiple,oreven dozens of, graphics processing units (GPUs) is necessary. In 2023, various research groups and commercial entities released open-source foundation models, making them widely access ble 9799-101.
基础模型 (Foundation Model) ,也称为基础性模型,是一种通过自监督学习在大型多样化数据集上预训练的深度学习模型。基础模型可以训练任何数据,包括图像、文本或两者兼有。这些模型的目标是学习可广泛应用于各种任务的通用特征和模式,而非针对特定任务从头训练。例如,使用在不同条件下通过不同显微镜获取的各类显微图像(包括明场和荧光成像)来训练基础模型。这些模型能够从训练数据中学习通用特征和模式;例如,在显微图像上训练的模型会学习强度、形状和结构关系。这种预训练使模型能够掌握对数据的广泛理解,随后可通过较小的标注数据集针对具体任务进行微调。
自2020年代初以来,随着自监督学习的出现,基础模型开始在生物医学图像分析领域涌现。基础模型通常作为起点,通过使用较小的标注数据集进行微调,从而产生更专注的专业化模型。在医学图像分析中,这种方法可为任何任务(例如图像分类任务)生成更优的模型。类似的基模型正在被开发用于其他医学成像模态,包括跨模态应用(图1d)。跨模态应用指能够处理和整合不同数据类型信息的AI系统,例如将放射学图像与临床记录结合,或将病理图像与基因组数据整合,或通常意义上的图像与文本结合。
实现这些应用需要大量计算能力、工程资源以及获取广泛数据集的权限。理想情况下,需要配备多个甚至数十个图形处理器 (GPU) 的计算集群。2023年,多个研究团队和商业实体发布了开源基础模型,使其得以广泛使用。
In summary,the rise of foundation models marks an advancement in cancer research9, which has already led to practical improve ments in performance in relevant tasks, enabling training of more effective models with less data97101,102(Fig. 2a,b). Utilizing these models still requires Python programming skills, but future developments may com mod it ize this technology further, making it accessible to researchers with limited or no programming expertise.
总之, 基础模型 (Foundation Model) 的兴起标志着癌症研究的重大进展 [9], 这些模型已在相关任务中带来实际性能提升, 能够用更少的数据训练出更有效的模型 [97,101,102] (图 2a,b) 。使用这些模型目前仍需 Python语言 编程技能, 但未来的发展可能进一步普及这项技术, 使编程经验有限或完全不懂编程的研究人员也能使用。
Al for language Natural language processing
面向语言的自然语言处理
Natural language is unstructured data, of which computer-based analysis has been highly challenging. Since the early 2020s, the advent of large language models (LLMs) has markedly improved the capability of computer-based methodsfor NLP,and are nowthe state-of-the-art approach for processing any text103(Fig.3a). NLPis abroad field within AIthat focuses onthe interaction between computers and human language, whereas LLMs are a specific type of deep learning model used for NLP tasks. LLMs, which are based on transformer architectures, havebecome the most popularand effective method for NLPin recent years. LLMs are part of generative Al, which means they can not only rephrase,summarize or translate text but also synthesize new text. The increasing availability and standardization of LLMs have made NLP methods more accessible, enabling non-experts to solve NLP tasks with off-the-shelf models, such as OpenAl's chatGPT. Beyond storing and retrieving knowledge, LLMs can reason about text104.15s, are capable of translating and stylistically altering text,and can extract structured information from radiology reports106,pathology reports107 and medical notes10°. LLMs also have medical knowledge 109, and can use this, for example, to give medical recommendations based on imaging reportsllo. These capabilities are particularly impactful in cancer research, which relies on text on several levels - from logging initial ideas and experimental data to exchanging insights and disseminating scientific findings (Fig. 3b). The increasing availability and adoption of LLMs in cancer research settings is expected to substantially impact research methodologies!",although their clinical application is still in the early stages due to regulatory and validation challenges 2.
自然语言是非结构化数据,其基于计算机的分析一直极具挑战性。自2020年代初以来,大语言模型 (LLM) 的出现显著提升了基于计算机的自然语言处理方法的能力,并已成为处理任何文本的最先进方法[103] (图 3a)。自然语言处理是人工智能中的一个广泛领域,专注于计算机与人类语言之间的交互,而大语言模型是用于自然语言处理任务的一种特定类型的深度学习模型。基于Transformer架构的大语言模型,已成为近年来最流行和有效的自然语言处理方法。大语言模型是生成式AI (Generative AI) 的一部分,这意味着它们不仅可以重述、总结或翻译文本,还能合成新文本。大语言模型日益增长的可用性和标准化使得自然语言处理方法更易于使用,让非专家能够使用现成模型解决自然语言处理任务,例如OpenAI的ChatGPT。除了存储和检索知识外,大语言模型还能对文本进行推理[104,105],能够翻译和风格化修改文本,并能从放射学报告[106]、病理学报告[107]和医疗记录[108]中提取结构化信息。大语言模型还具备医学知识[109],并能运用这些知识,例如根据影像报告提供医疗建议[110]。这些能力在癌症研究中尤其具有影响力,因为癌症研究在多个层面依赖文本——从记录初步想法和实验数据到交流见解和传播科学发现 (图 3b)。大语言模型在癌症研究环境中日益增长的可用性和采用预计将显著影响研究方法,尽管由于监管和验证方面的挑战,其临床应用仍处于早期阶段[2]。
The advent of LLMs opens a myriad of applications in cancer research, many of which are only beginning to be explored 103. Researchers can now use these readily available modelsfor various tasks:parsing and refining their ideas, summarizing laboratory notebooks, reasoning about complex concepts, acquiring new skills, condensing docu ments and disseminating research findings more effectively 103,1o5,1,14. As NLP evolves further, we are continuously uncovering new uses,each with the potential to enhance the efficiency and scope of academic research and, ultimately, change the clinical practice of oncology!1s. Of note, LLMs signify a departure from traditional machine learning approaches, as classical machine learning requires specific datasets for problem-solving, followed by model training on these data and evaluation. LLMs such as OpenAl's popular 'generative pre-trained transformer(GPT)'models,including the models underlying chat GP T, are by design foundation models9°. Trained on extensive text corpora covering various domains using SSLli6, including on medical and scientific text, LLMs accumulate a broad knowledge base. Consequently, researchers generally donot retrain these modelsfrom scratch dueto the immense computational resources required. Instead, foundation LLMs are typically applied directly to research tasks in a zero-shot way.
大语言模型 (LLM) 的出现为癌症研究开辟了无数应用领域,其中许多才刚刚开始被探索[103]。研究人员现在可以利用这些现成模型完成多种任务:梳理和完善想法、总结实验笔记、进行复杂概念推理、获取新技能、精简文档以及更有效地传播研究成果[103,105,1,14]。随着自然语言处理 (NLP) 的进一步发展,我们不断发现新的应用场景,每个场景都有可能提升学术研究的效率与范围,并最终改变肿瘤学的临床实践[15]。值得注意的是,大语言模型标志着与传统机器学习方法的分离——经典机器学习需要针对特定问题准备专用数据集,随后基于这些数据进行模型训练和评估。而诸如OpenAI广受欢迎的生成式预训练Transformer (GPT) 模型(包括ChatGPT底层模型)本质上都是基础模型[90]。通过自监督学习 (SSL) [116] 在涵盖多个领域的海量文本语料(包括医学和科学文本)上进行训练,大语言模型积累了广泛的知识基础。因此,由于所需计算资源巨大,研究人员通常不会从头开始重新训练这些模型,而是以零样本 (Zero-shot) 方式直接将基础大语言模型应用于研究任务。
Zero-shot applications. The prevalent method is called zero-shot application,in whichan LLMisused directly on a task without previous specific training on any training examples 104. For instance, a researcher can prompt the model to structure or summarize thei
