Integrating single-cell RNA and T cell/B cell receptor sequencing with mass cytometry reveals dynamic trajectories of human peripheral immune cells from birthtoold ag
整合单细胞RNA与T细胞/B细胞受体测序及质谱流式技术揭示人类外周免疫细胞从出生到老年的动态轨迹
Check for updates
检查更新
A comprehensive understanding of the evolution of the immune landscape in humans across the entire lifespan at single-cell transcriptional and protein levels, during development, maturation and senescence is currently lacking. We recruited atotal of 220 healthy volunteers from the Shanghai Pudong Cohort (NC T 05206643), spanning 13 age groups from 0 to over 90 years, and profiled their peripheral immune cells through single-cell RNA-sequencing coupled with single T cell and B cell receptor sequencing, high-throughput mass cytometry, bulk RNA-sequencing and flow cytometry validation experiments. We revealed that T cells were the most strongly affected by age and experienced the most intensive rewiring in cell-cell interactions during specific age. Different T cell subsets displayed different aging patterns in both transcript ome s and immune repertoires; examples included GNLY+CD8+ effector memory T cells, which exhibited the highest clonal expansion among all T cell subsets and displayed distinct functional signatures in children and theelderly; and $\mathbf{CD8}^{+}$ MAIT cells,which reached their peaks of relative abundance, clonal diversity and antibacterial capability in adolescents and then gradually tapered off. Interestingly, we identified and experimentally verified a previously unrecognized'cytotoxic' B cell subset that was enriched in children. Finally, an immune age prediction model was developed based on lifecycle-wide single-cell data that can evaluate the immune status of healthy individuals and identify those with disturbed immune functions. Our work provides both valuable insights and resources for further understanding the aging of the immune system across the whole human lifespan.
目前尚缺乏对人类整个生命周期(从发育、成熟到衰老)中免疫景观在单细胞转录和蛋白质水平上演变的全面理解。我们从上海浦东队列(NCT05206643)招募了220名健康志愿者,覆盖0岁至90岁以上共13个年龄组,通过单细胞RNA测序结合单T细胞和B细胞受体测序、高通量质谱流式、批量RNA测序及流式细胞术验证实验,对其外周免疫细胞进行了系统分析。研究发现T细胞受年龄影响最为显著,并在特定年龄段经历最剧烈的细胞间互作重构。不同T细胞亚群在转录组和免疫受体库中呈现差异化衰老模式:例如GNLY+CD8+效应记忆T细胞在所有T细胞亚群中克隆扩增程度最高,且在儿童与老年群体中表现出截然不同的功能特征;而$\mathbf{CD8}^{+}$MAIT细胞在青少年时期达到相对丰度、克隆多样性和抗菌能力的峰值后逐渐衰减。值得注意的是,我们鉴定并通过实验验证了一个在儿童群体中富集的、此前未被认识的"细胞毒性"B细胞亚群。最后,基于全生命周期单细胞数据开发的免疫年龄预测模型,可评估健康个体的免疫状态并识别免疫功能紊乱者。该研究为深入理解人类全生命周期免疫系统衰老提供了重要见解和资源。
Development and aging run throughout human life, accompanied by gradual maturation and inevitable degeneration of the immune system12. After birth, the immune system gradually matures and provides better protection against foreign pathogens'. Conversely, immune aging leads to higher incidences of infection,cardiovascular diseases+ and cancers'. Moreover, it was recently reported that aged immune cells drive the aging of solid organs, such as the heart and liver, which is a probable cause of systemic senescence°. However, it remains unclear when and how the immune system changes during human aging, which typically spans nearly 90 years and encompasses development, maturation and senescence across the entire lifespan.
发育与衰老贯穿人类生命始终,伴随免疫系统的逐步成熟与不可避免的退化[12]。出生后,免疫系统逐渐成熟,能更有效抵御外来病原体侵袭;反之,免疫衰老会导致感染、心血管疾病和癌症发病率升高。最新研究发现,衰老的免疫细胞会驱动心脏、肝脏等实体器官衰老,这可能是全身性衰老的原因之一。然而,人类免疫系统在长达近90年的生命周期中,究竟何时以及如何经历发育、成熟到衰老的转变,目前仍未阐明。
Human peripheral blood mono nuclear cells(PBMCs)area diverse mixture of highly specialized immune cells that are critical components of the immune system. Previous omics studies of human PBMCs have demonstrated their involvement in immune path o logie s (for example, immuno sen esc en ce',autoimmune disorders*and infectious diseases°)
人外周血单个核细胞 (PBMCs) 是高度特化的免疫细胞组成的多样化混合物,是免疫系统的关键组成部分。先前对人类 PBMCs 的组学研究已证明其参与免疫病理过程 (例如免疫衰老、自身免疫性疾病和感染性疾病)。
and in diseases of solid organs1o.1. Significant age-related changes in P BMC composition and function in healthy individuals have been demonstrated by flow cytometry and bulk sequencing analyses12,13. Recently, single-cell RNA sequencing (scRNA-seq) and single-cell T cell receptor (TCR) and B cell receptor (BCR) sequencing (scTCR/BCR-seq),as well as single-cell mass cyto me try (cyto me try by time-of-flight, CyTOF) protein profiling have provided opportunities to comprehensively identify previously undefined celltypes at single-cell resolution 14-16 Several scRNA-seq studies on peripheral immune cells at specific ages have yielded valuable findings17-21. Although they provide valuable insights into age-related changes in immune cells at a single-cell resolution, these scRNA-seq studies involved limited age groups and did not cover the full temporal span of human aging.Moreover,none ofthescRNA-seq studies provided single-cell protein validation, and human immune-aging datasets across the entire lifespan at single-cell protein levels are currently lacking.
在健康个体中,流式细胞术和批量测序分析已证实P BMC(外周血单个核细胞)组成和功能存在显著的年龄相关变化[12,13]。近年来,单细胞RNA测序(scRNA-seq)、单细胞T细胞受体(TCR)和B细胞受体(BCR)测序(scTCR/BCR-seq),以及单细胞质谱流式(飞行时间质谱流式,CyTOF)蛋白质分析技术,为全面鉴定单细胞分辨率下未定义的细胞类型提供了新机遇[14-16]。针对特定年龄段外周免疫细胞的若干scRNA-seq研究已取得重要发现[17-21]。尽管这些研究以单细胞分辨率提供了免疫细胞年龄相关变化的宝贵见解,但涉及的年龄组别有限,未能覆盖人类衰老的完整时间跨度。此外,现有scRNA-seq研究均未提供单细胞蛋白质验证数据,目前尚缺乏涵盖全生命周期的单细胞蛋白质水平人类免疫衰老数据集。
Here, through combined scRNA-seq, scTCR/BCR-seq and CyTOF analyses, we investigated the temporal dynamics ofthe cell type compositions, transcriptional and protein profiles, cell-cell interactions (CCls) and immune repertoires of peripheral immune cells during development, maturation and senescence in humans throughout the entire lifespan, using PBMCs from healthy participants aged O to ${\ge}90$ years, spanning 13 age groups. Using this unique lifecycle-wide dataset, we developed asingle-cell immune age (siAge) prediction modelto assess an individual's immune age and describe the immune status in health anddisease.Our work provides a rich resource for understanding the evolution of peripheral immune cells during human development, maturation and aging across the whole lifespan. The data are publicly available in a web portal for easy exploration (https://pu-lab.sjtu.edu. cn/shiny/lifespan) and through the preprint version ofour data ( www.biorxiv.0rg/content/10.1101/2022.07.11.498621v2).
通过整合单细胞RNA测序(scRNA-seq)、单细胞TCR/BCR测序(scTCR/BCR-seq)和飞行时间质谱流式(CyTOF)分析技术,我们利用从0岁到${\ge}90$岁健康参与者(涵盖13个年龄组)的外周血单个核细胞(PBMCs),研究了人类整个生命周期中发育、成熟和衰老阶段外周免疫细胞的细胞类型组成、转录与蛋白质谱、细胞间相互作用(CCls)以及免疫组库的时序动态变化。基于这一独特的全生命周期数据集,我们开发了单细胞免疫年龄(siAge)预测模型,用于评估个体免疫年龄并描述健康与疾病状态下的免疫特征。本研究为理解人类全生命周期中外周免疫细胞的演变提供了丰富资源,数据已通过网页门户(https://pu-lab.sjtu.edu.cn/shiny/lifespan)和预印本平台(https://www.biorxiv.org/content/10.1101/2022.07.11.498621v2)公开。
Results
结果
A single-cell atlas of peripheral immune cells across the human lifespan
人类生命周期外周免疫细胞的单细胞图谱
A totalof 309 participants from both internal and external cohorts were included in this study. To characterize the dynamics of peripheral immune cell composition and function throughout the human lifespan, 220 healthy volunteers aged 0 years (neonates) to ${\ge}90$ years (nonagenarians) were recruited from the Shanghai Pudong Cohort (NC T 05206643) (Fig. 1a and Supplementary Table 1). The partic i pants spanned the entire human lifespan and were partitioned into 13 age groups (0-,1-,2-,6-,12-,18-,30-,40-,50-,60-,70-,80-and ${\tt>}90$ -year-olds)22-24. We further developed an siAge prediction model based on this lifecycle-wide single-cell dataset and validated the siAge model with an external cohort of 89 additional individuals, including 33 healthy individuals and 56 with a disturbed immune function (Supplementary Table 2).ForscRNA-seq analyses,after filtering out low-quality droplets, atotal of538,266 cells were obtained. According to unsupervised clustering and the expression of signature genes for each subset (Supplementary Table 3), 25 distinct PBMC subsets were identified and visualized by uniform manifold approximation and projection (UMAP) (Fig. 1b and Extended Data Fig. la; details in the Methods). The representative signature genes of the cell subsets and expression levels along with the numberof detected genes and unique molecular identifiers (UMis) for each subset are shown in heatmaps, feature plots and violin plots (Fig. 1c, Extended Data Fig. 1b-e and Supplementary Table3).
本研究共纳入来自内部和外部队列的309名参与者。为表征人类生命周期中外周免疫细胞组成和功能的动态变化,我们从上海浦东队列(NC T 05206643)招募了220名年龄从0岁(新生儿)到${\ge}90$岁(九旬老人)的健康志愿者(图1a和补充表1)。这些参与者覆盖了人类整个生命周期,并被划分为13个年龄组(0岁、1岁、2岁、6岁、12岁、18岁、30岁、40岁、50岁、60岁、70岁、80岁和${\tt>}90$岁)[22-24]。基于这一全生命周期的单细胞数据集,我们进一步开发了siAge预测模型,并使用包含33名健康个体和56名免疫功能紊乱个体的外部队列(共89人)对该模型进行了验证(补充表2)。
在单细胞RNA测序(scRNA-seq)分析中,经过滤除低质量液滴后,共获得538,266个细胞。根据无监督聚类和各亚群的特征基因表达(补充表3),我们鉴定出25个不同的外周血单个核细胞(PBMC)亚群,并通过均匀流形近似与投影(UMAP)进行可视化(图1b和扩展数据图1a;具体方法见"方法"部分)。各细胞亚群的代表性特征基因及其表达水平,以及检测到的基因数和唯一分子标识符(UMIs)数量,通过热图、特征图和小提琴图展示(图1c、扩展数据图1b-e和补充表3)。
Fig.1/Overview of single-cell transcript ome s of peripheral immune cells across each stage of the human lifespan.a,Work flow of the overall experimental design of this study. Panel a created in BioRender. Huang, T. (2024) https:/BioRender.com/a58r007.b,UMAP plot illustrating the peripheral immune cell subsets identified within PBMCs from 61 samples.c, Heatmaps showingthe expression of signature genes in each cellsubset. d, Dot plot showing the age-related variations in proportion of the 25immune cell subsets
图 1: 人类生命周期各阶段外周免疫细胞的单细胞转录组概览
a) 本研究整体实验设计的工作流程。本图使用 BioRender 制作 (Huang, T. (2024) https://BioRender.com/a58r007
b) UMAP 图展示 61 个样本 PBMC 中鉴定的外周免疫细胞亚群
c) 热图显示各细胞亚群特征基因的表达情况
d) 点图展示 25 种免疫细胞亚群比例的年龄相关性变化
Given that cell-type-specific aging analyses largely depend on accurate cell-type annotation, we took multiple approaches to ensure the reliability of our annotation. The cell subset annotation derived from scRNA-seq was first validated at the single-cell protein level using high-throughput CyTOF analyses with a panel of metal isotope-tagged protein markers(Supplementary Table4) and demonstrated good agreement between the two measures (Extended Data Fig. 2a-f).Additional validations were performed usingtwo other approaches,the automatic annotation tool Azimuth2s (Extended Data Fig. 2g) and gene set enrichment analysis(Extended Data Fig.2h), with both showing good agreement.
鉴于细胞类型特异性衰老分析在很大程度上依赖于准确的细胞类型注释,我们采用了多种方法来确保注释的可靠性。首先通过高通量CyTOF分析结合一组金属同位素标记的蛋白标志物(补充表4),在单细胞蛋白水平验证了源自scRNA-seq的细胞亚群注释,结果显示两种测量方法具有良好的一致性(扩展数据图2a-f)。我们还使用了另外两种方法进行额外验证:自动注释工具Azimuth2s(扩展数据图2g)和基因集富集分析(扩展数据图2h),两者均显示出良好的一致性。
Peripheral immune cell composition dynamics across the lifespan
外周免疫细胞组成在生命周期中的动态变化
To obtain a global view ofthe dynamics of PBMC composition, the proportion ofeach ofthe 25 cell subsets in total PB MCs was calculated for each sample from the 13 age groups(Extended Data Fig.3a).Ofthe 25 cell subsets,22 exhibited significant differences in proportions among different age groups (Fig.1d and Extended Data Fig.3a) (Kruskal-Wallis test, false discovery rate $(\mathrm{FDR})<0.05)$ .A proportion ofcertain cell subsets were positively correlated with age, such as CD 4 TEM GN LY cells and CD 8 TEM GN LY cells (Extended Data Fig. 3b) and others were negatively correlated with age,such as CD 4 Naive C CR 7 cells, CD 8 Naive LEF 1 cells, NK CD 56 high cells and B_Naive cells (Extended Data Fig. 3c). Several cell subsets exhibited enrichment in certain age groups. For example, $\mathbf{CD14^{+}}$ monocyte cells were enriched in the umbilical cord blood group, plasma cells were enriched in children (peaking in the 6-year-old group) and CD 8 MA IT S LC 4 A 10 cells were enriched in adolescents (peaking in the 18-year-old group) (Extended Data Fig. 3d). Three cell subsets did not show significant changes in proportion with respect to age (forexample, natural killer (NK) proliferating, CD16+ monocytes and plasma cyto id dendritic cells), suggesting that they are relatively stable across the human lifespan (Kruskal-Wallis test, $\mathrm{FDR}>0.05 $ .Cell composition dynamics were further analyzed at the single-cell protein level using high-throughput CyTOF and showed good agreement (Extended Data Fig. These results demonstrate differential correlations of different PBMC subsets with age at single-cell resolution.
为全面了解PBMC组成的动态变化,我们计算了13个年龄组样本中25种细胞亚群在总PBMC中的占比(扩展数据图3a)。其中22种细胞亚群的比例在不同年龄组间存在显著差异(图1d和扩展数据图3a)(Kruskal-Wallis检验,错误发现率(FDR)<0.05)。部分细胞亚群比例与年龄呈正相关,如CD4 TEM GNLY细胞和CD8 TEM GNLY细胞(扩展数据图3b);而CD4 Naive CCR7细胞、CD8 Naive LEF1细胞、NK CD56 high细胞和B_Naive细胞等则呈负相关(扩展数据图3c)。若干细胞亚群在特定年龄组富集,例如CD14+单核细胞在脐带血组富集,浆细胞在儿童组富集(6岁组达峰值),CD8 MAIT SLC4A10细胞在青少年组富集(18岁组达峰值)(扩展数据图3d)。NK增殖细胞、CD16+单核细胞和浆细胞样树突状细胞等3种亚群的比例未随年龄发生显著变化(Kruskal-Wallis检验,FDR>0.05),提示这些细胞在人类生命周期中保持相对稳定。通过高通量CyTOF技术在单细胞蛋白层面的进一步分析显示一致结果(扩展数据图3e-g),这些发现以单细胞分辨率揭示了不同PBMC亚群与年龄的差异化关联规律。
Peripheral immune cell transcriptional dynamics across the lifespan
外周免疫细胞转录动态的终身变化
To systematically investigate the transcript ome dynamics of PBMC subsets over the entire lifespan, we ranked the 25 PBMC subsets bythe number of differential ly expressed genes (DEGs) (Fig. 2a). The top ten cell subsets with the highest DEG numbers all belonged to lymphoid lineage cell subsets (that is, T cell, NK cell and B cell types), indicating that these cell subsets were the most strongly affected by age.Among the top ten cell subsets, eight were T cells, and marked differences in the numbers of DEGs were observed among different Tcell subsets (Fig. 2a). No myeloid lineage (that is, monocytes, dendritic cells (DCs) and mega kary o cyte s)had ${\tt z l00}$ DEGs.
为系统研究外周血单个核细胞 (PBMC) 亚群在整个生命周期中的转录组动态变化,我们根据差异表达基因 (DEGs) 数量对25个PBMC亚群进行了排序 (图 2a)。DEG数量最高的前十种细胞亚群均属于淋巴系细胞亚群 (即T细胞、NK细胞和B细胞类型),表明这些细胞亚群受年龄影响最为显著。在前十种细胞亚群中,有八种是T细胞,且不同T细胞亚群间的DEG数量存在显著差异 (图 2a)。髓系细胞 (即单核细胞、树突状细胞 (DCs) 和巨核细胞) 未出现 ${\tt z l00}$ 个DEGs。
To explore the functional dynamics ofPBMC subsets across the lifespan,the DEGsinthe 25 subsets were subjected to functional enrichment analyses separately.DEGs were enriched in biological processes including pathways related to positive regulation of cell activation (24 cell subsets), lymphocyte-mediated immunity (23 cell subsets), regulation of leukocyte cell-cell adhesion (22 cell subsets) and type Il interferon production (22 cell subsets) (Fig. 2b).We then focused on age-related genes that were monotonically correlated with age across 13 age groups.Kruskal-Wallis one-way analysis was used to identify cell subsets with significant differences across age groups. FDR correction was applied to account for multiple testing and the values are shown above the dot plot. Horizontal dashed line marks the threshold for FDR significance $\langle\mathrm{FDR}<0.05\rangle$ .mDCs, myeloid dendritic cells; pDCs, plasma cyto id dendritic cells; UMAP, uniform manifold approximation and projection; y, years.
为探究PBMC亚群在生命周期中的功能动态变化,我们对25个亚群的差异表达基因(DEG)分别进行了功能富集分析。这些DEG显著富集于以下生物学过程:细胞活化正调控(24个细胞亚群)、淋巴细胞介导的免疫反应(23个细胞亚群)、白细胞间黏附调控(22个细胞亚群)以及II型干扰素产生(22个细胞亚群)(图2b)。随后我们聚焦于在13个年龄组中与年龄呈单调相关性的基因。采用Kruskal-Wallis单因素方差分析鉴定不同年龄组间存在显著差异的细胞亚群,并通过FDR校正进行多重假设检验,校正后p值显示于点图上方。水平虚线表示FDR显著性阈值$\langle\mathrm{FDR}<0.05\rangle$。mDCs指髓系树突状细胞;pDCs为浆细胞样树突状细胞;UMAP表示均匀流形近似与投影;y代表年份。
Fig.2|Global transcript ome dynamics of peripheral immune cells throughout the human lifespan.a, Bar plot showing the number of DEGs derived from scRNA-seq analyses of all cell subsets.b, Heatmap showing the functional enrichment analyses of DEGsin all cell subsets. $P$ values were determined by hyper geometric test.c,Bar plots showingthe frequencies of up regulated and down regulated M-DEGs derived from scRNA-seq analyses observed in all cell subsets.All M-DEGs with a frequency of ≥5 are displayed. d, Line plots showing the expression trajectories of up regulated(left panel) and down regulated(right panel)age-related genes identified by bulk RNA-seq analyses.Data areshown
图 2: 人类生命周期中外周免疫细胞的全局转录组动态
a) 柱状图显示所有细胞亚群scRNA-seq分析得出的差异表达基因(DEGs)数量。
b) 热图展示所有细胞亚群DEGs的功能富集分析结果。$P$值通过超几何检验计算得出。
c) 柱状图显示所有细胞亚群中观察到的上调(up regulated)和下调(down regulated)M-DEGs频率。仅展示频率≥5的M-DEGs。
d) 折线图展示通过bulk RNA-seq分析识别的上调(左图)和下调(右图)年龄相关基因表达轨迹。
in terms of gene expression z-score.Eachgray line represents the expression trajectory of a gene.e,Venn diagrams showingthe number of overlapping genes between common M-DEGs identified in scRNA-seq data and age-related genes identified in bulk RNA-seq data.Amongthe 56 common M-DEG genes defined by scRNA-seq, 13 intersected with the age-related genes defined by bulk RNA-seq. f, Heatmap showing the expression of SELL gene in seven cell subsets by scRNAseq and CD62L protein (encoded by SELL) in corresponding cell subsets by CyTOF amongeight life stages.
就基因表达z值而言。每条灰线代表一个基因的表达轨迹。e,维恩图展示了在单细胞RNA测序(scRNA-seq)数据中识别的共有M-DEGs与批量RNA测序(bulk RNA-seq)数据中识别的年龄相关基因之间的重叠基因数量。在scRNA-seq定义的56个共有M-DEG基因中,有13个与bulk RNA-seq定义的年龄相关基因存在交集。f,热图显示SELL基因在七个细胞亚群中的scRNAseq表达情况,以及CyTOF检测的相应细胞亚群中CD62L蛋白(由SELL编码)在八个生命阶段的表达水平。
(monotonic al DEGs, M-DEGs). Consistent with results from the DEG analyses,thetop tencell subsets with the highest number of M-DEGs $(\geq30)$ were lymphoid lineage cell subsets, supporting that these lymphoid lineage cell subsets are susceptible to aging (Extended Data Fig. 4a,b and Supplementary Table 5). Eight of the top ten were T cell subsets,one was anNKcell subset and onewasa Bcell subset.The top three cell subsets with the largest number of M-DEGs were T cell subsets:CD 4 Naive C CR 7,CD 4 TCM A QP 3 and CD 8 Naive LEF 1 cells (Extended Data Fig. 4a,b). Functional enrichment analyses showed that the up regulated M-DEGs were enriched in pro-inflammatory biological processes, such as type Il interferon production and positive regulation of I kinase/NF-kB signaling (Extended Data Fig. 4c and
(单调性差异表达基因, M-DEGs)。与DEG分析结果一致,M-DEGs数量最多$(\geq30)$的前十个细胞亚群均为淋巴系细胞亚群,表明这些淋巴系细胞亚群易受衰老影响(扩展数据图4a、b和附表5)。其中八个为T细胞亚群,一个为NK细胞亚群,一个为B细胞亚群。M-DEGs数量最多的前三个细胞亚群均为T细胞亚群:CD4 Naive CCR7、CD4 TCM AQP3和CD8 Naive LEF1细胞(扩展数据图4a、b)。功能富集分析显示,上调的M-DEGs富集于促炎性生物过程,如II型干扰素产生和I激酶/NF-κB信号的正调控(扩展数据图4c和
Supplementary Table 6), whereas the down regulated M-DEGs were enriched in telomere organization and actin filament organization (Extended Data Fig.4d and Supplementary Table 6).
补充表 6),而下调的 M-DEGs 在端粒组织和肌动蛋白丝组织中富集 (扩展数据图 4d 和补充表 6)。
Next,weranked theM-DEGsbythe number of cell subsets in which they were involved and determined the common M-DEGs shared by multiple cell subsets. M-DEGs shared by greater than or equal to five cell subsets were defined as common M-DEGs. The analyses identified 18 up regulated common M-DEGsand 38 down regulated common M-DEGs (Fig. 2c and Supplementary Table 5). Functional enrichment analyses showed that the up regulated commonM-DEG s were enriched inpathways associated with positive regulation of I-kB kinase/NF-kB signaling, positive regulation of cell-cell adhesion and major his to compatibility complex class Il protein complex assembly (Extended Data Fig. 4e). Down regulated common M-DEG s were enriched in pathways associated with telomere maintenance, mono nuclear cell differentiation and actin filament organization (Extended Data Fig.4f). In addition to monotonic changes across the life stages, age-related DEGs defined by scRNA-seq exhibited enrichment at specific age ranges.For instance,144 DEGs from 12 cell types were enriched at the child stage (Extended Data Fig. 4g,i), whereas 90 DEG s from 18 cell types were enriched atthe70-80-year-old (elderly) stage (Extended Data Fig.4h,j). The scRNA-seq-defined common M-DEGs represent genes that are both widely expressed in different cell subsets and strongly correlated with age. We next sought to validate these scRNA-seq-defined common M-DEGs using traditional bulk RNA-seq in an independent cohort of 34 healthy individuals (O to ${\ge}90$ years old), which provides an average gene expression profile for the entire population of cells26,27. We identified 1,071age-related genes in PBMCs inthebulk RNA-seq dataset(Fig.2d,Extended Data Fig.4k and Supplementary Table 7). Remarkably, among the 56 common M-DEG genes defined by scRNA-seq, 13 intersected with the age-related genes defined by bulk RNA-seq, including four genes up regulated with age and nine down regulated with age (Fig. 2e). This overlap underscores a high concordance between the findings obtained from scRNA-seq, covering different cell subsets, and those from bulk RNA-seq, giving a global view.Of the four up regulated common M-DEGs,S100A6,S100A4 and L GALS len code proteins that can promote inflammation and drive tumor i genesis and metastasis 28-3. Although S100A4 was previously reported to be highly elevated in centenarians?,its potential functional role in aging has not been investigated. Using an in vitro aging model, we found that S100A4 acted as a positive regulator in the aging process (Supplementary Fig. 1and Supplementary Information Section1.1). Of the down regulated common M-DEGs,SELL,CCR7and TMI GD 2 encode proteins that maintain immune homeostasis by modulating the balance of immunity and tolerance 31-33. The age-related common M-DEG SELL (encoding CD62L) was further validated atthe single-cell protein level with CyTOF (Fig. 2f). These results suggest that the age-related common M-DEGs shared by different cell subsets may represent potential immune biomarkers across the lifespan.
接下来,我们根据M-DEGs涉及的细胞亚群数量进行排序,并确定多个细胞亚群共有的M-DEGs。将共享于五个及以上细胞亚群的M-DEGs定义为共有M-DEGs。分析共鉴定出18个上调共有M-DEGs和38个下调共有M-DEGs(图2c和补充表5)。功能富集分析显示,上调共有M-DEGs在I-kB激酶/NF-kB信号正向调控、细胞间黏附正向调控及主要组织相容性复合体II类蛋白复合体组装等通路中富集(扩展数据图4e);下调共有M-DEGs则在端粒维持、单核细胞分化和肌动蛋白丝组织等通路中富集(扩展数据图4f)。除生命周期中的单调变化外,scRNA-seq定义的年龄相关DEGs在特定年龄段呈现富集特征:例如12种细胞类型的144个DEGs在儿童期富集(扩展数据图4g,i),而18种细胞类型的90个DEGs在70-80岁(老年)阶段富集(扩展数据图4h,j)。这些scRNA-seq定义的共有M-DEGs代表在不同细胞亚群中广泛表达且与年龄强相关的基因。
我们随后在34名健康人(0至${\ge}90$岁)的独立队列中使用传统bulk RNA-seq验证这些scRNA-seq定义的共有M-DEGs,该方法可提供细胞群体的平均基因表达谱[26,27]。在bulk RNA-seq数据集中,我们鉴定了1,071个PBMCs年龄相关基因(图2d、扩展数据图4k和补充表7)。值得注意的是,在scRNA-seq定义的56个共有M-DEGs中,有13个与bulk RNA-seq定义的年龄相关基因重叠,包括4个随年龄上调基因和9个下调基因(图2e)。这种重叠凸显了scRNA-seq(覆盖不同细胞亚群)与bulk RNA-seq(提供全局视角)结果的高度一致性。
在4个上调共有M-DEGs中,S100A6、S100A4和LGALS1编码的蛋白可促进炎症并驱动肿瘤发生与转移[28-30]。尽管此前报道S100A4在百岁老人中显著升高,但其在衰老中的潜在功能尚未研究。通过体外衰老模型,我们发现S100A4在衰老进程中发挥正向调控作用(补充图1和补充信息章节1.1)。在下调共有M-DEGs中,SELL、CCR7和TMIGD2编码的蛋白通过调节免疫与耐受平衡维持免疫稳态[31-33]。年龄相关共有M-DEG SELL(编码CD62L)在CyTOF单细胞蛋白水平得到进一步验证(图2f)。这些结果表明,不同细胞亚群共享的年龄相关共有M-DEGs可能代表贯穿生命周期的潜在免疫生物标志物。
Immunological processes involve interactions among various immune cell types to coordinate their behavior during an immune response34,35s We thus further investigated the rewiring of CCIs among peripheral immune cells across the lifespan and found that the CCis displayed different patterns at different stages ofthe lifespan (Fig. 3a-f). Topological characteristics indicated that immune checkpoints have a more critical role in the age-modulated CCl network than non-immune checkpoints (Fig.3g-j). The alterations of immune checkpoints across the whole human lifespan mainly occur in T cells and might have essential roles during human development and aging (Fig. 3k,l and Extended Data Fig. 5a,b). Detailed information about CCls is provided in Supplementary Information Section 1.2.
免疫过程涉及多种免疫细胞类型之间的相互作用,以协调它们在免疫反应中的行为[34,35]。因此,我们进一步研究了外周免疫细胞在整个生命周期中细胞间通讯(CCI)的重连情况,发现CCI在生命周期的不同阶段表现出不同的模式(图 3a-f)。拓扑特征表明,在年龄调节的CCI网络中,免疫检查点比非免疫检查点具有更关键的作用(图 3g-j)。免疫检查点在整个人类生命周期中的变化主要发生在T细胞中,并可能在人类发育和衰老过程中发挥重要作用(图 3k,l 和扩展数据图 5a,b)。关于CCI的详细信息见补充信息第1.2节。
Clonal expansion ofeffector T cells across the lifespan We further explored the cl on al expansion in all T cell subsets and their V(D)J gene usages across the entire lifespan by reconstructing
效应T细胞的克隆扩增跨越整个生命周期
我们通过重建所有T细胞亚群及其V(D)J基因使用情况,进一步探究了这些细胞在整个生命周期中的克隆扩增现象。
Fig.3|Rewiring ofcCls among peripheral immune cells across the human lifespan.a, Heatmap showing that CCI probability varied significantly across the lifespan.The probability foreach CCl was row-scaled with az-score. The significantly varied CCls among eight different human life stages were determined using Kruskal-Wallis one-way analyses. $P$ valueswere adjusted with FDR, and an adjusted $P$ value $({P_{\mathrm{{adj}}}})$ of $<0.0001$ was considered statistically significant. b, Bar plot showing that T cells experience the most intensive rewiring in CCls with age.c-f, Network graphs showing the CCI cluster 1(c), cluster 2 (d),cluster3(e)and cluster4(f)froma.The hub cell subset of each cluster is highlighted with a larger node size inc-f.The hub cell subset of each cluster is highlighted with a larger node size.g,Network graph showing the CCI network mediated by differential ly expressed ligands and receptors. h, Bar plot showing
图 3: 人类生命周期中外周免疫细胞间 CCls 的重连情况。a, 热图显示 CCI 概率在整个生命周期中变化显著。每个 CCl 的概率通过 z 分数进行行标准化。使用 Kruskal-Wallis 单因素分析确定了八个不同人生阶段中显著变化的 CCls。$P$ 值经 FDR 校正,校正后 $P$ 值 $({P_{\mathrm{adj}}})$ $<0.0001$ 被认为具有统计学意义。b, 条形图显示 T 细胞在 CCls 中随年龄变化经历最剧烈的重连。c-f, 网络图展示来自 a 的 CCI 簇 1(c)、簇 2(d)、簇 3(e) 和簇 4(f)。每个簇的中心细胞亚群在 c-f 中用较大的节点突出显示。g, 网络图展示由差异表达的配体和受体介导的 CCI 网络。h, 条形图显示
TCR sequences from scTCR-seq. We obtained matched TCR and transcript ome information from over $90%$ ofall Tcells(Extended Data Fig.6a). In prior research focused onthe adult population,an increased peak in T cell cl on al expansion was observed only in the elderly36. However,our lifecycle-widecohort,with exclusive access to childhood and adolescent data, reveals that high T cell clonal expansions manifest two peaks, occurring duringthe early life developmental stage (2-12 years old) and the late-life elderly stage (70-90 years old) (Fig. 4a). Further comparison of the V(D) gene usage profiles among these six age groups showed that 43 genes were differential ly used (Kruskal-Wallis test, $P<0.05)$ (Fig. 4b). No significant differences were observed in the length distribution of the complement ari ty-determining region-3 (CDR3) amino acid sequences in TCR chains and chains(Extended Data Fig. 6b). Using hierarchical clustering from the differential ly used v(D)J genes, the six age groups were clustered into two distinct groups: the 2-12-year-old group (early life high T cell cl on al expansion (high-TCE) group) and the 70-90-year-old group (late-life high-TCE group) (Fig.4b). The selective usage of V(D) genes reflects the different features of T cell responses in different age groups and may be associated with human immune development or senescence.
来自单细胞TCR测序的TCR序列。我们从超过$90%$的所有 T细胞中获得了匹配的TCR和转录组信息(扩展数据图6a)。在先前针对成年人群的研究中,仅在老年人中观察到T细胞克隆扩增的高峰[36]。然而,我们覆盖全生命周期的队列,凭借独有的儿童和青少年数据,揭示了高T细胞克隆扩增呈现两个高峰,分别出现在生命早期发育阶段(2-12岁)和晚年老年阶段(70-90岁)(图4a)。进一步比较这六个年龄组的V(D)基因使用模式,发现43个基因存在差异使用(Kruskal-Wallis检验,$P<0.05$)(图4b)。在TCR 链和链的互补决定区-3(CDR3)氨基酸序列长度分布上未观察到显著差异(扩展数据图6b)。通过差异使用的V(D)J基因进行层次聚类,六个年龄组被分为两个明显类别:2-12岁组(生命早期高T细胞克隆扩增(high-TCE)组)和70-90岁组(晚年high-TCE组)(图4b)。V(D)基因的选择性使用反映了不同年龄组T细胞应答的不同特征,可能与人类免疫发育或衰老相关。
Next, we explored the distribution of high-TCE cells among all 11 T cell subsets. We detected different degrees of clonal expansion in different T cell subsets: high-TCE cells (clone size of $\mathord{?}\geq50$ cells)only appeared in three effector memory T cell subsets (CD 8 TEM GN LY, CD 4 TEM GN LY and CD 8 TEM CM C 1 cells) with high STARTRAC expansion scores (Fig. 4c-e and Extended Data Fig. 6c).Among them, CD8 TEM_GNLY cells accounted for the highest proportions of high-TCE cells (Fig. 4c and Extended Data Fig.6c).
接下来,我们探究了高TCE细胞在所有11种αβ T细胞亚群中的分布情况。在不同T细胞亚群中检测到不同程度的克隆扩增:高TCE细胞(克隆规模≥50个细胞)仅出现在三个具有高STARTRAC扩增评分的效应记忆T细胞亚群中(CD8 TEM GNLY、CD4 TEM GNLY和CD8 TEM CM C1细胞)(图4c-e和扩展数据图6c)。其中,CD8 TEM_GNLY细胞的高TCE细胞占比最高(图4c和扩展数据图6c)。
Next, we performed DEG analyses to explore the functional features underlying the clonal expansion of CD 8 TEM GN LY cells in the two high-TCE groups. In the early life high-TCE group, 635 significantly up regulated DEGs were identified, including cytotoxic genes (GZMB and PRF1) and energy metabolism-related genes (COX5B,COX5A,NDUFAB1,NDUFB10,NDUFS6and NDUFS3)(Fig.4f).Functional enrichment analyses showed that these DEGs were enriched in the regulation of T cell activation, cell killing, regulation of leukocyte proliferation and oxidative phosphor yl ation pathways (Fig. 4g). In the late-life high-TCE group, 87 up regulated DEGs were identified, including interferon-related genes (IFITM3 and /SG20) and anti-apoptosis genes (TSC22D3 and IL7R)37 (Fig. 4f).Functional enrichment analyses showed that these DEGs were enriched in the response to virus, response to type Il interferon, negative regulation of lymphocyte apoptotic process and response to type Il interferon pathways (Fig. 4g).Accordingly, we analyzed the expression of cyto toxicity, oxidative phosphor yl ation, leukocyte proliferation and type Il interferon-related pathways in the clonal expansion of CD 8 TEM GN LY cells in the early life and late-life high-TCE groups. We found that the expressions of cyto toxicity, oxidative phosp hory la tion and positive regulation of mono nuclear cell proliferation were significantly up regulated in early life high-TCE groups, whereas the expression of genes in the response to type Il interferon pathway was up regulated in late-life high-TCE groups (Fig. 4h). Thus,the upregulated DEGs in high-TCE CD 8 TEM GN LY cells at the elderly stage were the top ten biological processes enriched inthe differential ly expressed ligands and receptors. $P$ values were determined by hyper geometric test.i,Box plot showingthe average shortest path length and degree of immune checkpoints and non-immune checkpoints in g. Box plots show median, first (lower hinge) and third(upper hinge) quartiles;whiskers show $1.5\times$ the interquartile range. $P$ values were determined by a two-sided unpaired Wilcoxon rank-sum test.j, Bar plot showing,for each immune checkpoint ligand and receptor,the number of cell subsets in which they were detected as differential ly expressed. k, Heatmaps showing the scaled expression of CD27,ICOS and PDCD1 genes across the lifespan in the indicated cell subsets, detected by scRNA-seq.I, Heatmaps showing the scaled expression of CD27,ICOS and PD1 proteins across the lifespan in the indicated cell subsets, detected by CyTOF. UCB, umbilical cord blood.
接下来,我们通过差异表达基因(DEG)分析来探究两个高TCE组中CD8 TEM GN LY细胞克隆扩增的功能特征。在生命早期高TCE组中,鉴定出635个显著上调的DEG,包括细胞毒性基因(GZMB和PRF1)及能量代谢相关基因(COX5B、COX5A、NDUFAB1、NDUFB10、NDUFS6和NDUFS3)(图4f)。功能富集分析显示这些DEG在T细胞活化调控、细胞杀伤、白细胞增殖调控及氧化磷酸化通路中显著富集(图4g)。而在生命晚期高TCE组中,发现87个上调DEG,包含干扰素相关基因(IFITM3和ISG20)及抗凋亡基因(TSC22D3和IL7R)[37](图4f),这些基因在病毒应答、II型干扰素响应、淋巴细胞凋亡负调控及II型干扰素通路中富集(图4g)。
据此,我们分析了生命早期和晚期高TCE组CD8 TEM GN LY细胞克隆扩增中细胞毒性、氧化磷酸化、白细胞增殖及II型干扰素相关通路的表达水平。结果显示,生命早期高TCE组的细胞毒性、氧化磷酸化及单核细胞增殖正调控基因表达显著上调,而生命晚期高TCE组的II型干扰素通路响应基因表达升高(图4h)。因此,老年阶段高TCE组CD8 TEM GN LY细胞中上调的DEG是差异表达配体-受体最富集的十大生物过程。$P$值通过超几何检验计算。
图i箱线图展示免疫检查点与非免疫检查点在网络中的平均最短路径长度和连接度。箱体表示中位数及第一(下铰链)、第三(上铰链)四分位数,须线为1.5倍四分位距。$P$值通过双尾非配对Wilcoxon秩和检验确定。图j条形图显示各免疫检查点配体-受体在细胞亚群中被鉴定为差异表达的频次。图k热图展示scRNA-seq检测的CD27、ICOS和PDCD1基因在生命周期各阶段的标准化表达(按指定细胞亚群)。图l热图展示CyTOF检测的CD27、ICOS和PD1蛋白在生命周期各阶段的标准化表达(按指定细胞亚群)。UCB指脐带血。
involved in biological processes related to chronic inflammation,which has been linked to aging of solid organs3738. By contrast, the higher expression of cytotoxic genes in high-TCE CD 8 TEM GN LY cells at the developmental stage indicated that these T cells may possess powerful killing functions against infected and cancerous cells.Furthermore, transcription factor analyses indicated that the transcription factor regulon GATA3 displayed the highest activity in the early life high-TCE group (Fig. 4i), whereas the transcription factor regulon ETS1 displayed the highest activity in the late-life high-TCE group (Fig. 4i). GATA3 regulates T cell development, proliferation and maintenance 39.40. ETS1 acts as a regulator of human healthy aging by decreasing ribosomal activity41. Thus, although both groups displayed high clonal expansions of CD 8 TEM GN LY cells, there were distinct functions underlying these expanded T cells between the early life and late-life high-TCE groups.
涉及与慢性炎症相关的生物过程,该过程与实体器官衰老有关[37][38]。相比之下,高TCE CD8 TEM GN LY细胞在发育阶段细胞毒性基因的高表达表明,这些T细胞可能对感染细胞和癌细胞具有强大的杀伤功能。此外,转录因子分析表明,转录因子调控子GATA3在生命早期高TCE组中表现出最高活性(图4i),而转录因子调控子ETS1在生命晚期高TCE组中表现出最高活性(图4i)。GATA3调控T细胞的发育、增殖和维持[39][40]。ETS1通过降低核糖体活性来调节人类健康衰老[41]。因此,尽管两组均表现出CD8 TEM GN LY细胞的高克隆扩增,但生命早期和生命晚期高TCE组之间这些扩增T细胞的功能存在明显差异。
Fig.4|Cl on al expansion of T cells across theentire lifespan.a,Stacked bar graph showing the distribution of T cell clone sizes in the 13 age groups. b, Heatmap showing the row-scaled usage of significantly differential ly expressedV-Jgenesfrom TCR chains and T CR chains across six high-TCE groups (2,6,12,70,80 and 90 years).The sixage groups were clustered into two distinct groups:the 2-12-year-old group (early life high-TCEgroup) and the 70-90-year-old group (late-life high-TCE group).c,Bar plot showing the distribution of T cell clone sizes in each αβ T cell subset. d,e, Box plot showing the clonal expansion levels of total Tcell subsets(d)and CD 8 TEM GN LY cells (e) across 13 age groups quantified by the STARTRAC-expa metric. The black line indicates LOESS regression.f, Volcano plot showing the DEGs in highTCE CD 8 TEM GN LY cells (clone size $\ge50$ cells) in the late-life high-TCE group (70-90 years old) compared with those in the early life high-TCE group (2, 6 and 12 years old). The representative genes are labeled.Dashed lines demarcate twosided Benjamini-Hochberg-corrected $P_{\mathrm{adj}}=0.05$ and $\log_{2}$ fold change $\left(\mathrm{FC}\right)=0.25$ g,Barplots showing the representative enriched biological processes of DEGs up regulated in high-TCE CD 8 TEM GN LY cells in early life (left panel) and in late life (right panel). $P$ values were determined by hyper geometric test.h,Box plots showing the gene module scores for the pathways in high-TCE CD 8 TEM GN LY cells in the early life and late-life high-TCE groups. The two-sided Wilcoxon rank-sum test was used to calculate the $P$ values.i,Rankfor TFregulons in high- TCE CD 8 TEM GN LY cells in the early life (left panel) and late-life (right panel) high-TCE groups using the regulon specificity score. Ind,eand h, box plots show median, first (lower hinge) and third (upper hinge) quartiles; whiskers show $1.5\times$ the interquartile range.MHC,major his to compatibility complex; TF, transcription factor.
图4 | T细胞克隆扩增在整个生命周期的变化。a,堆叠条形图显示13个年龄组中T细胞克隆大小的分布。b,热图显示TCRα链和TCRβ链中显著差异表达的V-J基因在六个高TCE组(2、6、12、70、80和90岁)的行标准化使用情况。这六个年龄组被聚类为两个不同的组:2-12岁组(生命早期高TCE组)和70-90岁组(生命晚期高TCE组)。c,条形图显示各αβ T细胞亚群中T细胞克隆大小的分布。d,e,箱线图显示通过STARTRAC-expa指标量化的13个年龄组中总αβ T细胞亚群(d)和CD8 TEM GN LY细胞(e)的克隆扩增水平。黑线表示LOESS回归。f,火山图显示生命晚期高TCE组(70-90岁)与生命早期高TCE组(2、6和12岁)相比,高TCE CD8 TEM GN LY细胞(克隆大小≥50个细胞)中的差异表达基因(DEGs)。代表性基因已标注。虚线标出双侧Benjamini-Hochberg校正的Padj=0.05和log2倍数变化(FC)=0.25。g,条形图显示生命早期(左图)和生命晚期(右图)高TCE CD8 TEM GN LY细胞中上调DEGs的代表性富集生物过程。P值通过超几何检验确定。h,箱线图显示生命早期和生命晚期高TCE组中高TCE CD8 TEM GN LY细胞通路的基因模块评分。使用双侧Wilcoxon秩和检验计算P值。i,使用调控子特异性评分对生命早期(左图)和生命晚期(右图)高TCE组中高TCE CD8 TEM GN LY细胞的转录因子(TF)调控子进行排序。在d、e和h中,箱线图显示中位数、第一(下铰链)和第三(上铰链)四分位数;须线显示1.5倍四分位距。MHC,主要组织相容性复合体;TF,转录因子。
Different aging patterns in naive versus cells
naive CD4+ 与 CD8+ 细胞的不同衰老模式
A key characteristic of age-related immune dysfunction is the loss of naive T cells5.42. Previous studies that relied on adult data have concludedthat $\mathbf{CD8}^{+}$ cells show a more pronounced decline with age than their $\mathrm{CD4^{+}}$ counterparts 36. However, using scRNA-seq-derived cell classification with our full lifespan coverage, we observed that although both naive T cell subsets (CD 8 Naive LEF 1 and CD 4 Naive C CR 7) exhibited a decreased trend with age (Fig. 5a), their aging trajectories diverge beginning at birth. Specifically, the two naive T cell subsets followed different decreasing trajectories across the human lifespan: naive $\mathsf{C D4^{+}}$ T cells exhibited an abrupt decrease during the early life developmental stage followed by a slower decline after entering adulthood; conversely, naive $\mathbf{CD8}^{+}$ T cells exhibited a gradual decrease through the early life stage followed by a rapid decline in adulthood. The aging patterns ofthe two naive T cell subsets were further supported by CyTOF data (Fig.5b).
与年龄相关的免疫功能障碍的一个关键特征是幼稚T细胞的减少[5.42]。先前基于成人数据的研究认为,$\mathbf{CD8}^{+}$细胞比$\mathrm{CD4^{+}}$细胞表现出更明显的年龄相关性下降[36]。然而,通过scRNA-seq细胞分类结合全生命周期覆盖数据,我们观察到虽然两种幼稚T细胞亚群(CD8 Naive LEF1和CD4 Naive CCR7)均呈现随年龄下降趋势(图5a),但它们的衰老轨迹从出生起就存在差异。具体表现为:幼稚$\mathsf{C D4^{+}}$T细胞在生命早期发育阶段急剧减少,成年后下降趋缓;而幼稚$\mathbf{CD8}^{+}$T细胞在生命早期缓慢减少,成年后则快速下降。CyTOF数据进一步验证了这两种幼稚T细胞亚群的衰老模式(图5b)。
We next sought to examine the functional dynamics of the two naive T cell subsets and identified 76 up regulated M-DEGs for naive $\mathbf{CD4^{+}}$ T cells and 73 for naive $\mathbf{CD8^{+}}$ T cells. The M-DEGs for naive $\mathbf{CD4^{+}}$ T cells changed rapidly in earlier life and reached a plateau-like state in laterlife,whereas theM-DEG s for naive $\mathbf{CD8}^{+}$ T cells showed a conti nuous and straight increase throughout life (Fig.5c,d). Functional enrichment analyses of the up regulated M-DEG s in naive $\mathrm{CD4^{+}}$ andnaive $\mathbf{CD8}^{+}$ T cells revealed several common biological processes shared bythe two naive T cell subsets related to pro-inflammatory pathways, including response to type Il interferon and regulation of autophagy+3 (Fig.5e).
我们接下来研究了两种初始T细胞亚群的功能动态变化,鉴定出初始 $\mathbf{CD4^{+}}$ T细胞有76个上调的M-DEGs,初始 $\mathbf{CD8^{+}}$ T细胞有73个。初始 $\mathbf{CD4^{+}}$ T细胞的M-DEGs在生命早期快速变化,后期趋于稳定平台期;而初始 $\mathbf{CD8}^{+}$ T细胞的M-DEGs则呈现持续线性增长 (图5c,d)。对初始 $\mathrm{CD4^{+}}$ 和初始 $\mathbf{CD8}^{+}$ T细胞中上调M-DEGs的功能富集分析显示,这两个初始T细胞亚群共享多个与促炎通路相关的生物学过程,包括II型干扰素应答和自噬调控+3 (图5e)。
Next, we examined the dynamic patterns of TCR diversity in naive $\mathrm{CD4^{+}}$ and $\mathbf{CD8^{+}}$ T cells and found that their patterns were also different (Fig. 5f,g and Extended Data Fig. 6d). Again, naive $\mathbf{CD4}^{+}$ T cells exhibited a more abrupt decrease during the early life developmental stage than naive $\mathbf{CD8}^{+}$ cells;conversely,naive $\operatorname{CD}8^{+}$ Tcells displayed amore prominent decline in TCR diversity atolder ages than naive $\mathbf{CD4}^{+}$ T cells. These results reveal the different aging-associated transcriptional patterns and repertoire dynamics between naive $\mathrm{CD}4^{+}$ and $\mathbf{CD8}^{+}$ T cells.
接下来,我们检测了初始型 $\mathrm{CD4^{+}}$ 和 $\mathbf{CD8^{+}}$ T细胞中TCR多样性的动态模式,发现它们的模式也存在差异 (图 5f,g 和扩展数据图 6d)。同样,初始型 $\mathbf{CD4}^{+}$ T细胞在生命早期发育阶段表现出比初始型 $\mathbf{CD8}^{+}$ 细胞更急剧的下降;相反,初始型 $\operatorname{CD}8^{+}$ T细胞在老年时期比初始型 $\mathbf{CD4}^{+}$ T细胞表现出更显著的TCR多样性下降。这些结果揭示了初始型 $\mathrm{CD}4^{+}$ 和 $\mathbf{CD8}^{+}$ T细胞之间不同的衰老相关转录模式和受体库动态变化。
Expansion of MAIT cells during maturation
成熟过程中 MAIT细胞的扩增
Mucosal-associated invariant T (MAIT) cells are unique innate-like T lymphocytes that bridge innate and adaptive immunity+.MAITcells rapidly produce cytokines and cytotoxic effectors through both TCR-dependent and TCR-independent mechanisms 45.46 in response to a broad range ofbacteria and viruses,such as Mycobacterium tuberculosist7 and hepatitis B virus. We show that CD 8 MA IT S LC 4 A 10 cells exhibit a uni modal pattern of enrichment during adolescence with a peakin the 18-year-old group (Fig. 5h). The results from the CyTOF analys es exhibited a pronounced concordance withthose from scRNA-seq (Fig. 5i).Clonal analyses showed that the TCR diversity of CD8_MAIT_ SLC4A10 cells also peaked at adolescence (Fig.5j and Extended Data
黏膜相关不变T细胞 (MAIT cells) 是一类独特的先天样T淋巴细胞,连接着先天免疫与适应性免疫。MAIT细胞能通过TCR依赖性和非依赖性机制快速产生细胞因子和细胞毒性效应分子[45][46],响应多种细菌和病毒(如结核分枝杆菌[17]和乙肝病毒)。我们发现CD8_MAIT_SLC4A10细胞在青春期呈现单峰型富集模式,18岁年龄组达到峰值(图5h)。质谱流式分析结果与单细胞RNA测序数据高度一致(图5i)。克隆分析显示CD8_MAIT_SLC4A10细胞的TCR多样性同样在青春期达到顶峰(图5j及扩展数据。
Fig.5|Aging features of naive T cells and CD8+MAIT cells.a,b,Scatter plot illustrating changes inthe proportion of $\mathrm{CD4^{+}}$ (CD 4 Naive C CR 7;left) and $\mathrm{CD8^{+}}$ naive T cells (CD 8 Naive LEF 1; right) in total T cells with aging from scRNA-seq (a) and CyTOF (b).c,d, Heatmap showing the row-scaled expression of M-DEGs in CD 4 Naive C CR 7 (c) and CD 8 Naive LEF 1(d) cells across eight life stages (left), and line plots showing the expression trajectories of M-DEGs up regulated with age (right). Expression z-scores are shown.e,Dot plot showing the representative enriched biological processes of M-DEGs up regulated with age in $\mathrm{CD4^{+}}$ and $\mathrm{CD8^{+}}$ naive T cells.f,g,Scatter plots showing TCR repertoire diversities in $\mathbf{CD4^{+}}(\mathbf{CD4}_{-}$ Naive_CCR7;left) and $\mathrm{CD8^{+}}$ naive T cells(CD 8 Naive LEF 1; right) across eight life stages, as measured by Shannon's entropy (f) and the inverse Simpson index $\mathbf{\Psi}(\mathbf{g})$ .h-j,Scatter plot illustrating the proportion changes of CD 8 MA IT S LC 4 A 10 cells in total T cells with aging from scRNA-seq (h) and CyTOF (i) analyses and their T CR repertoire diversities as measured by Shannon'sentropy (j) across
图 5: 初始T细胞与CD8+MAIT细胞的衰老特征。a,b, 散点图展示总αβ T细胞中CD4+ (CD4初始CCR7;左)与CD8+初始T细胞 (CD8初始LEF1;右)比例随衰老的变化,数据来自单细胞RNA测序 (a) 和质谱流式 (b)。c,d, 热图显示CD4初始CCR7 (c) 和CD8初始LEF1 (d) 细胞中M-DEGs在八个生命阶段的行标准化表达 (左),折线图展示随年龄上调的M-DEGs表达轨迹 (右)。图中显示表达z值。e, 点图展示CD4+和CD8+初始T细胞中随年龄上调的M-DEGs代表性富集生物过程。f,g, 散点图展示八个生命阶段中CD4+ (CD4初始_CCR7;左) 和CD8+初始T细胞 (CD8初始LEF1;右) 的TCR谱多样性,通过香农熵 (f) 和逆辛普森指数Ψ (g) 测量。h-j, 散点图展示总αβ T细胞中CD8 MAIT SLC4A10细胞比例随衰老的变化,数据来自单细胞RNA测序 (h) 和质谱流式 (i) 分析,以及通过香农熵 (j) 测量的TCR谱多样性。
Fig. 6d,e), implying that CD 8 MA IT S LC 4 A 10 cells at this stage might exhibit a stronger ability to fend off pathogen invasion than those at other life stages.
图 6d,e) 表明,这一阶段的 CD 8 MA IT S LC 4 A 10 细胞可能比其他生命阶段的细胞具有更强的抵御病原体入侵能力。
To characterize the function of CD8_MAIT SLC4A10 cells during maturation, we compared gene expression profiles of CD8_MAIT SLC4A10 cells in adolescents with those in other life periods, including children, adults and the elderly. Fig. 5k displays the expression of the up regulated and down regulated DEGs in CD 8 MA IT S LC 4 A 10 cells in adolescents; the expression of the up regulated DEGs reached a peak in adolescents (named'up regulated DEGs'; Fig. 5k). The interaction network analyses suggested that 19 out of the 29 up regulated DEGs closely interacted, with KL R Bl identified as ahub gene (Extended Data Fig.6f).Notably,CyTOF analyses showed that the expression ofCD161 protein (encoded by KLRB1) within MAIT cells also reached its peak in the adolescent period (Extended Data Fig. 6g).
为表征CD8_MAIT_SLC4A10细胞在成熟过程中的功能,我们比较了青少年与其他生命周期(包括儿童、成人和老年人)中该细胞的基因表达谱。图5k展示了青少年CD8_MAIT_SLC4A10细胞中上调与下调差异表达基因(DEGs)的表达情况;其中上调DEGs的表达在青少年时期达到峰值(命名为"上调DEGs";图5k)。相互作用网络分析表明,29个上调DEGs中有19个存在紧密互作,其中KL R Bl被确定为枢纽基因(扩展数据图6f)。值得注意的是,CyTOF分析显示MAIT细胞内CD161蛋白(由KLRB1编码)的表达量也在青少年时期达到峰值(扩展数据图6g)。
Further Gene Ontology enrichment analyses showed that the up regulated DEGs in CD 8 MA IT S LC 4 A 10 cells from individuals during adolescence were enriched in biological processes related to aerobic respiration, lymphocyte proliferation and positive regulation of inflammatory response (Fig. 51), whereas the down regulated DEGs were related to cell adhesion and the apoptotic pathway (Fig. 5l). We further depicted the functional properties of CD 8 MA IT S LC 4 A 10 cells in the adolescent group by gene set enrichment analysis with all expressed genes.Tuberculosis-related signaling displayed a significant down regulation in CD8_MAIT SLC4A10 cells at this stage (Fig. 5m). The incidence and prevalence of tuberculosis are the lowest in young people (15-24 years old) according to the World Health Organization 2021 Global Tuberculosis Report (https://www.who.int), in line with our findings of enhanced MAIT cell function during this period. In addition, antimicrobial effect or function assays showed that the antimicrobial activities of MAIT cells from young adults (aged 18 years) were significantly higher than those from middle-aged counterparts (aged 50 years) (Fig. 5n). Together, these results indicate that CD8_ MA IT S LC 4 A 10 cells at the maturation stage may have a greater ability to resist pathogen invasion than those in other age groups.
进一步的基因本体富集分析显示,青春期个体CD8_MAIT_SLC4A10细胞中上调的差异表达基因(DEGs)富集于有氧呼吸、淋巴细胞增殖和炎症反应正向调控等生物过程(图5i),而下调DEGs则与细胞粘附和凋亡通路相关(图5l)。我们通过全表达基因集富集分析进一步描绘了青春期组CD8_MAIT_SLC4A10细胞的功能特性。结核病相关信号通路在该阶段的CD8_MAIT_SLC4A10细胞中呈现显著下调(图5m)。根据世界卫生组织2021年全球结核病报告(https://www.who.int),青少年(15-24岁)的结核病发病率和患病率最低,这与我们发现的该时期MAIT细胞功能增强现象一致。此外,抗菌效应或功能检测表明,青年组(18岁)MAIT细胞的抗菌活性显著高于中年组(50岁)(图5n)。这些结果表明,成熟期CD8_MAIT_SLC4A10细胞可能比其他年龄段具有更强的抵抗病原体入侵能力。
A newly identified cytotoxic B cell subset
新发现的一种细胞毒性B细胞亚群
Immune cells with cytotoxic functions inthe peripheral blood of healthy individuals generally include NK and cytotoxic T cells+8. Interestingly, we detected a newly identified and unique B cellsubset(B_BCR+GNLY* cells) that simultaneously expressed B cell markers (such as BCR genes, including immunoglobulin heavy-chain or light-chain coding genes, andJCHAIN) and canonical cytotoxic genes (such as GNLY and NKG7) (Fig. $^{1\mathrm{b},\mathrm{c}}$ and Extended Data Fig. 1b,d). The expression of TCR genes, TRACand KLRF1 were hardly detected inthem,excluding the possibility of these cells being T cells or NK cells(Extended Data Fig.1b,d).Weruled out the possibility of potential technical artefacts (thatis,multiplets) because the number of detected genes and UMIs in B_BCR+GNLY+ cells were lower than those in other subsets(Extended DataFig.1e)and a stringent doublet filtration standard was used to exclude potential eight life stages.k, Heatmap showing the expression of all the DEGs (left) and zoom-in on up regulated genes(right) in CD 8 MA IT S LC 4 A 10 cells in adolescents (12-18 years old).1,Bar plots showing the representative enriched biological processes of up regulated (top) and down regulated (bottom) DEGs in CD8_MAIT_ SLC4A10 cells in adolescents.m,GSEA enrichment plot forgene set'tuberculosis in CD 8 MA IT S LC 4 A 10 cells in the adolescent group compared with other groups. n, Schematic diagram showing the design of MAIT cell's b act eric i dal capability assay(left)and bar plot showing the bacteria load afterMAIT cell coculture.The $P$ value was determined by two-sided unpaired Wilcoxon rank-sum test(right, $n=9$ each). $P$ values ine,Iand m were determined by hyper geometric test. $P<0.05$ was considered statistically significant ine,I,mand n.The black lines indicate LOESS regression ina,b,fand g. CFU, colony forming unit; GSEA, gene set enrichment analysis.
健康人外周血中具有细胞毒性的免疫细胞通常包括NK细胞和细胞毒性T细胞+8。有趣的是,我们检测到一个新发现且独特的B细胞亚群(B_BCR+GNLY*细胞),该亚群同时表达B细胞标志物(如BCR基因,包括免疫球蛋白重链或轻链编码基因,以及JCHAIN)和典型细胞毒性基因(如GNLY和NKG7)(图1b,c和扩展数据图1b,d)。这些细胞中几乎检测不到TCR基因TRAC和KLRF1的表达,排除了它们是T细胞或NK细胞的可能性(扩展数据图1b,d)。我们排除了潜在技术假象(即多重捕获)的可能性,因为B_BCR+GNLY+细胞中检测到的基因数和UMI数低于其他亚群(扩展数据图1e),并且采用了严格的双联体过滤标准来排除可能的八个生命阶段。k,热图显示青少年(12-18岁)CD8_MAIT_SLC4A10细胞中所有差异表达基因(DEG)的表达情况(左)及上调基因的放大视图(右)。1,条形图显示青少年CD8_MAIT_SLC4A10细胞中上调(上)和下调(下)DEG的代表性富集生物过程。m,青少年组CD8_MAIT_SLC4A10细胞与其他组相比,基因集"结核病"的GSEA富集图。n,MAIT细胞杀菌能力检测的设计示意图(左)及MAIT细胞共培养后细菌载量的条形图(右,每组n=9)。P值通过双尾非配对Wilcoxon秩和检验确定(右)。e、l和m中的P值通过超几何检验确定。e、l、m和n中P<0.05被认为具有统计学意义。a、b、f和g中的黑线表示LOESS回归。CFU,菌落形成单位;GSEA,基因集富集分析。
Fig.6|Function of the newly identified cytotoxic B cell subset.a, Heatmap showing the functional enrichment analyses of signature genes ineach B cell subset based on the scRNA-seq analyses. The top ten pathways in each cell subset (ranked by $P$ value)arelisted. $P$ valueswere determined by hyper geometric test. b, Immuno fluorescence of B cell markers(IgM,IgA andJ chain) and the cytotoxic protein granulysin (GNLY) in PBMCs collected from six healthy 6-year-olds. The experiment was repeated fourtimes with similar results.c,Bar plot showing the percentage of B ${\tt B C R}^{+}{\tt G N L Y}^{+}$ cells inPBMCs from three infants with longitudinal follow-up,examined at the ages of1year and 2 years by scRNA-seq analyses.R1, R2 and R3 represent different individuals.d,Flow cytometry analysis showing proportions of IgM*GNLY+ cells in children (4-9 years old), adults (24-30 years old)and the elderly(64-72yearsold),without stimulation(control,topleft) and day1after in vitro stimulation with an expansion mixture consisting of IL-21 $\mathrm{10:ngml^{-1}},$ ,II $\cdot2(50\mathrm{IU}\mathrm{m}\mathrm{I}^{-1})$ ,CD40L $(50\mathsf{n g}\mathsf{m l}^{-1})$ ,anti-BCR $(5\mathrm{mg}\mathrm{ml}^{-1})$ and
图 6 | 新发现的细胞毒性B细胞亚群功能。a、基于单细胞RNA测序分析的各B细胞亚群特征基因功能富集分析热图。列出了每个细胞亚群中排名前十的通路(按P值排序)。P值通过超几何检验确定。b、六名健康6岁儿童外周血单个核细胞中B细胞标志物(IgM、IgA和J链)与细胞毒性蛋白颗粒溶素(GNLY)的免疫荧光检测。实验重复四次,结果相似。c、条形图显示通过单细胞RNA测序分析的三名婴幼儿在1岁和2岁时外周血单个核细胞中B${\tt B C R}^{+}{\tt G N L Y}^{+}$细胞的百分比。R1、R2和R3代表不同个体。d、流式细胞术分析显示儿童(4-9岁)、成人(24-30岁)和老年人(64-72岁)中IgM*GNLY+细胞比例,包括未刺激组(对照,左上)及体外刺激1天后(刺激混合物含IL-21 $\mathrm{10:ngml^{-1}}$、IL-2 $(50\mathrm{IU}\mathrm{m}\mathrm{I}^{-1})$、CD40L $(50\mathsf{n g}\mathsf{m l}^{-1})$、抗BCR $(5\mathrm{mg}\mathrm{ml}^{-1})$)。
doublets.B $\mathbf{^{+}G N L Y^{+}}$ cells exhibited a unimodal pattern, with enrichment in children with a peak in the 6-year-old group. These cells were low in neonates and 1-year-old infants and increased in 2-6-year-old children, followed by a gradual decrease at 12 years, reaching a low level in adulthood (Fig.1d).
双阳性B细胞 $\mathbf{^{+}G N L Y^{+}}$ 呈现单峰分布模式,在儿童群体中富集且峰值出现在6岁组。这类细胞在新生儿和1岁婴儿中含量较低,2-6岁期间逐渐增加,12岁时开始缓慢下降,至成年期维持在较低水平 (图1d)。
Toexplore the functional differences between B ${\bf B}{\bf C}{\bf R}^{\mathrm{}}{\bf G}{\bf N}{\bf L}{\bf Y}^{\mathrm{*}}$ cells and the other four B cell subsets, we identified the signature genes of all B cell subsets (Supplementary Table 10). The top pathway enriched by the signature genes of E $\phantom{-}3_\mathrm{BCR^{+}G N L Y^{+}}$ cells was the cytolysis pathway (Fig. 6a). Double-positive cells (J chain and granulysin, IgM and granulysin or IgA and granulysin) were identified in PBMCs from six additional healthy 6-year-olds using immuno fluorescence (Fig.6b). Longitudinal scRNA-seqand scTCR/BCR-seq results showed the expansion of $\mathsf{B_B C R^{+}G N L Y^{+}}$ cells from age 1 $(0.02%$ $0.03%$ and $0.04%$ inPBMCs)toage 2 $(1.84%$ $10.96%$ and $3.09%$ in PBMCs) in these children (Fig. 6c). We performed flow cytometry to detect the
为了探究B ${\bf B}{\bf C}{\bf R}^{\mathrm{}}{\bf G}{\bf N}{\bf L}{\bf Y}^{\mathrm{*}}$ 细胞与其他四种B细胞亚群的功能差异,我们鉴定了所有B细胞亚群的特征基因 (补充表 10)。E $\phantom{-}3_\mathrm{BCR^{+}G N L Y^{+}}$ 细胞特征基因富集度最高的通路是细胞溶解通路 (图 6a)。通过免疫荧光技术,我们在另外六名健康6岁儿童的PBMCs中鉴定出双阳性细胞 (J链与颗粒溶素、IgM与颗粒溶素或IgA与颗粒溶素) (图 6b)。纵向单细胞RNA测序和单细胞TCR/BCR测序结果显示,这些儿童体内 $\mathsf{B_B C R^{+}G N L Y^{+}}$ 细胞从1岁时的占比 $(0.02%$ 、 $0.03%$ 和 $0.04%$ ) 扩张至2岁时的 $(1.84%$ 、 $10.96%$ 和 $3.09%$ ) (图 6c)。我们采用流式细胞术检测了
CpG0DN2006 $\mathrm{(1:mgml^{-1})}$ (bottomleft) $\mathit{\Omega}_{n=5}$ pergroup).Bar plots showing the percentage of IgM'GNLY+cells in indicated groups(rightpanel).e,Flow cytometry analysis showing the proportion of IlgM*GNLY* cells in cultured human B cells on days 1 and 3 after in vitro stimulation with an expansion mixture (left panel) $n=4$ per group). Bar plots showing the percentage of IgM*GNLY* cells in B cells from each group (right panel).f, Bar plots showing GNLY concentration in culture super nat ants of Bcells on days 1 and 3 of culture after in vitro stimulation with or without an expansion mixture. The concentrations of GN LY were determined byELISA $n=4$ pergroup).Ind,eandf,barplots show themean $\pm\mathsf{s}.\mathsf{e}.\mathsf{m}$ : $P<0.05$ was considered statistically significant.Pvaluesin a were determined by hyper geometric test. $P$ values in d were determined by one-way A NOvA followed bytheTukey post hoctest.Pvalues in e and f were determined by two-sided unpaired Wilcoxon rank-sum test.ELISA,enzymelinked immuno s or bent assay.
CpG0DN2006 $\mathrm{(1:mgml^{-1})}$ (左下) $\mathit{\Omega}_{n=5}$ 每组)。条形图显示指定组中IgM'GNLY+细胞百分比(右图)。e, 流式细胞术分析显示体外扩增混合物刺激后第1天和第3天培养人B细胞中IlgMGNLY细胞比例(左图) $n=4$ 每组)。条形图显示各组B细胞中IgM*GNLY*细胞百分比(右图)。f, 条形图显示有无扩增混合物体外刺激培养后第1天和第3天B细胞培养上清中GNLY浓度。GNLY浓度通过ELISA测定 $n=4$ 每组)。d、e和f中条形图显示均值 $\pm\mathsf{s}.\mathsf{e}.\mathsf{m}$ : $P<0.05$ 被认为具有统计学显著性。a中P值通过超几何检验确定。d中P值通过单因素方差分析(ANOVA)及Tukey事后检验确定。e和f中P值通过双侧非配对Wilcoxon秩和检验确定。ELISA, 酶联免疫吸附试验。
percentage of double-positive cells (IgM and granulysin) in peripheral blood. The proportions of double-positive cells (IgM and granulysin) were significantly higher in children (4-9 years old; $n=5$ )thanthose fromadults(24-30years old; $n=5.$ )or the elderly(64-72yearsold; $n=5,$ (Fig.6d and Supplementary Fig. 2).
外周血中双阳性细胞(IgM和颗粒溶素)的比例。儿童组(4-9岁;$n=5$)的双阳性细胞比例显著高于成人组(24-30岁;$n=5$)或老年组(64-72岁;$n=5$)(图6d和补充图2)。
Cytotoxic Tand B cells areinduced afterin vitro stimulation with expansion mixtures (including cytokines, chemokines, antibodies and adjuvants)49-51. Therefore, circulating B cells were isolated from healthy individuals and stimulated with expansion mixtures that activate B cells°. Flow cytometric analyses showed that the proportion of $\mathsf{I g M}^{+}\mathsf{G N L Y^{+}}$ double-positive B cells was significantly increased in cultured B cells exposed tothe expansion mixtures compared withthose with vehicle (Fig. 6e). Meanwhile, nanogram levels of gran u lys in were detected in the culture super nat ant of B cells (Fig. 6f). Furthermore, to determine whether the response to stimulation was age-dependent, PBMCs isolated from three different life stages (children, 4-9 years old;
细胞毒性T细胞和B细胞在体外经扩增混合物(包括细胞因子、趋化因子、抗体和佐剂)刺激后被诱导产生[49-51]。因此,我们从健康个体中分离出循环B细胞,并用能激活B细胞的扩增混合物进行刺激。流式细胞分析显示,与对照组相比,暴露于扩增混合物的培养B细胞中$\mathsf{I g M}^{+}\mathsf{G N L Y^{+}}$双阳性B细胞比例显著增加(图6e)。同时,在B细胞培养上清中检测到纳克水平的颗粒溶素(图6f)。此外,为确定这种刺激反应是否具有年龄依赖性,我们分离了三个不同生命阶段(儿童组4-9岁;
adults,24-30 years old;elderly,64-72 years old) were subjected tothe expansion mixtures. The proportion of IgM+GNLY+ double-positive B cells was significantly higher in children than in the adult and elderly groups (Fig. 6d), in accordance with our scRNA-seq results.
成人组(24-30岁)和老年组(64-72岁)接受了扩增混合物处理。与我们的单细胞RNA测序结果一致,儿童组中IgM+GNLY+双阳性B细胞的比例显著高于成人组和老年组(图6d)。
The role of granulysin in B cell-mediated functions has yet to be reported.Therefore,we sought to determine whether B_BCR+GNLY+ cells possess potential cytotoxic activities. Our results indicate that although B_BCR+GNLY+ cells are closest to plasma cells among all the B cell subsets with respect to gene expression profiles, the distribution of immunoglobulin isotypes, BCR clonotypes and transcriptional regula tory mechanisms,this newly identified B cell type has its own distinct characteristics and displays cytotoxic activities. Additional details concerning B_BCR+GNLY+ cells are provided in Supplementary Information Section1.3 (Extended Data Figs.8 and 9 and Supplementary Fig.3).
颗粒溶素(granulysin)在B细胞介导功能中的作用尚未见报道。因此,我们试图确定B_BCR+GNLY+细胞是否具有潜在的细胞毒性活性。结果表明,尽管B_BCR+GNLY+细胞在基因表达谱、免疫球蛋白同种型分布、BCR克隆型和转录调控机制方面最接近浆细胞,但这种新发现的B细胞类型具有自身独特特征并表现出细胞毒性活性。补充信息第1.3节提供了关于B_BCR+GNLY+细胞的更多细节 (扩展数据图8和9及补充图3)。
Single-cell immune age prediction model
单细胞免疫年龄预测模型
The aging immune system contributes to the morbidity and mortality of the elderly'. One significant obstacle in understanding immune system alterations involves the precise quant if i cation of aging20.52. Our unique lifecycle-wide single-cell data provided an opportunity to construct a single-cell immune age clock model (that is, siAge score) to quantify an individual's immune age and describe their immune status (Fig. 7a). We first built 25 models with data from just one cell subset and another model incorporating all cell subsets. The prediction model integrating all 25 cell subset datasets had the highest predictive score and achieved the best performance (Pearson's correlation coefficient $\left(\mathrm{PCC}\right)=0.9940$ $P<2.2\times10^{-16},$ among the 26 prediction models (Fig. 7b,c and Supplementary Fig. 4a) and was chosen as the final siAge prediction model.
衰老的免疫系统是导致老年人发病和死亡的重要因素之一。理解免疫系统变化的一个主要障碍在于如何精确量化衰老过程[20,52]。我们独特的全生命周期单细胞数据为构建单细胞免疫年龄时钟模型(即siAge评分)提供了机会,该模型可用于量化个体免疫年龄并描述其免疫状态(图7a)。我们首先利用单一细胞亚群数据构建了25个模型,另整合所有细胞亚群数据构建了一个综合模型。在26个预测模型中,整合全部25个细胞亚群数据集的预测模型具有最高的预测评分和最佳性能(Pearson相关系数$\left(\mathrm{PCC}\right)=0.9940$,$P<2.2\times10^{-16}$)(图7b、c和补充图4a),因此被选为最终的siAge预测模型。
Tofurther validate the prediction capability ofthe siAge prediction model, we measured the prediction model efficacy with scRNA-seq data using several independent validation cohorts (detailed in Methods). In the healthy validation cohort (33 individuals),there was a strong positive correlation between chronological age (cAge) and siAge $(\mathrm{PCC}=0.8894,P=4.557\times10^{-12})$ (Fig. 7d), indicating that the siAge prediction model achieved good performance in healthy individuals. However, in the validation cohort with a compromised immune function (56 PBMC samples), the correlation between cAge and siAge became weaker $\left(\mathrm{PCC}=0.6526\right)$ $P=5.008\times10^{-8})$ (Fig. 7e and Supplementary Fig.4b). The disrupted correlation between the cAge and siAge scores in the diseased indicated the potential ability of our prediction model to present the disrupted immune homeostasis in individuals with immune-related diseases. To better assess the predictive ability of the si Age model,we defined a fit index that was calculated from the ratio between siAge and cAge and used it toevaluate the deviations in siAge and cAge between healthy subjects and patients with a disturbed immune function. Our findings highlight that the siAge prediction model, driven by 21 keygenes across T cell subsets,serves as a potential marker to access immune system status and aging-associated changes across the human lifespan (Fig. 7f-i, detailed in Supplementary Information Section1.4).
为了进一步验证siAge预测模型的预测能力,我们使用多个独立验证队列(方法部分详述)通过单细胞RNA测序(scRNA-seq)数据评估了预测模型效能。在健康验证队列(33名个体)中,实际年龄(cAge)与siAge呈强正相关 $(\mathrm{PCC}=0.8894,P=4.557\times10^{-12})$ (图7d),表明该预测模型在健康人群中表现良好。然而在免疫功能受损的验证队列(56个PBMC样本)中,cAge与siAge的相关性减弱 $\left(\mathrm{PCC}=0.6526\right)$ $P=5.008\times10^{-8})$ (图7e和补充图4b)。疾病状态下cAge与siAge评分的相关性破坏,提示我们的预测模型可能具备反映免疫相关疾病个体免疫稳态失衡的能力。为更好评估siAge模型的预测能力,我们定义了拟合指数(siAge与cAge的比值)来评估健康受试者与免疫功能紊乱患者间的siAge-cAge偏差。研究结果表明,基于T细胞亚群21个关键基因构建的siAge预测模型,可作为评估人类生命周期中免疫系统状态和衰老相关变化的潜在标志物(图7f-i,详见补充信息章节1.4)。
Fig. 7|Construction ofthe single-cell immune age prediction model based on lifecycle-wide single-cell data. a, Workflow of the single-cellimmune age prediction model construction.b, Dot plot showing the predictive performance of the cell-type-specific immune age prediction models based on our lifecyclewidesingle-celldata.The top ten cell types with the highest scores are shown. c,Scatter plot showingthe correlation of single-cell immune age (siAge) with chronological age (cAge)inthe training set( $\scriptstyle n=56$ samples).d,Scatter plot showingthe correlation of siAge with cAge in the external healthy validation cohort $\scriptstyle{n=33}$ samples).e,Scatter plot showing the correlation of siAge with cAge in the external validation cohort of diseased patients with disturbed immune functions $\scriptstyle n=56$ samples).f,Box plot comparing the fit index distribution calculated by the si Age model in the healthy validation cohort withthose ofthe diseased validation cohorts,which includeNPC $\bf\Delta n=10$ samples),SLE( $\mathit{\Omega}_{n=40}$ samples)and KD( $n=6$ samples) cohorts.Box plots show median,first (lower
图 7: 基于全生命周期单细胞数据的单细胞免疫年龄预测模型构建。a, 单细胞免疫年龄预测模型构建流程。b, 点图展示基于我们全生命周期单细胞数据的细胞类型特异性免疫年龄预测模型的预测性能,显示了得分最高的前十种细胞类型。c, 散点图展示训练集中单细胞免疫年龄 (siAge) 与实际年龄 (cAge) 的相关性 (n=56 样本)。d, 散点图展示外部健康验证队列中 siAge 与 cAge 的相关性 (n=33 样本)。e, 散点图展示免疫功能紊乱疾病患者外部验证队列中 siAge 与 cAge 的相关性 (n=56 样本)。f, 箱线图比较健康验证队列与疾病验证队列 (包括 NPC (n=10 样本)、SLE (n=40 样本) 和 KD (n=6 样本) 队列) 中 siAge 模型计算的拟合指数分布,箱线图显示中位数、第一
Discussion
讨论
Our data represent a resource to simultaneously encompass both single-cell transcriptional and protein-level changes of human peripheral immune cells across the entire human lifespan, with the largest age range to date (13 age groups, ranging from age 0 to over 90 years). Using this unique resource, the current study provides important insights into the age-specific alterations in cell-type composition, transcription and protein profiles, CCls and immune repertoires of PBMCs over the lifespan at a fine temporal and single-cell resolution. Several important observations were made(Extended DataFig.10). First, the T cell subsets were the most susceptible to aging among all PBMCs and were characterized by increased pro-inflammatory pathways, reduced telomere organization and markedly altered immune checkpoints. Second, scRNA-seq and immune repertoire sequencing revealed unique age-specific transcriptional and clonal expansion featuresofT cell subsets:(1)high T cell cl on al expansions exhibited peaks at both ends of life:the early life developmental stage (2-12 years old) and the late-life elderly stage (70-90 years old); (2) naive $\mathrm{CD4^{+}}$ andnaive $\mathbf{CD8^{+}}$ T cells exhibited distinct aging-associated cell compositions, transcriptional patterns and repertoire dynamics from birth; and (3) CD 8 MA IT S LC 4 A 10 cells exhibited peak abundance and clonal diversity during adolescence, with greater antimicrobial ability than those in other age groups. The findings from scRNA-seq-based and scTCR/ BCR-seq-based analyses were further recapitulated by high-throughput CyTOF-based protein profiling. Importantly, we described a newly identified and unique'cytotoxic' B cell subset( $\mathbf{B_BCR^{+}G N L Y^{+}}$ cells) that simultaneously expressed BCR genes and cytotoxic genes and was enriched in children. Using this single-cell lifecycle-wide dataset, we developed an immune age prediction model (siAge score) that enabled the identification of a disturbed immune state. Overall, our work provides fine temporal resolution of peripheral immune cell transcriptomes and immune repertoires during development, maturation and senescence in humans across the entire lifespan and will facilitate in-depth exploration ofthe immune system at each age stage.
我们的数据资源首次实现了对人类外周免疫细胞在整个生命周期(迄今最大年龄跨度:13个年龄组,0岁至90岁以上)单细胞转录组与蛋白质组变化的同步覆盖。基于这一独特资源,本研究以精细时间分辨率和单细胞精度揭示了PBMC在生命周期中细胞类型组成、转录与蛋白特征、细胞周期及免疫库的年龄特异性变化,获得多项重要发现(扩展数据图10)。首先,T细胞亚群是所有PBMC中对衰老最敏感的群体,表现为促炎通路激活、端粒组织功能减退和免疫检查点显著改变。其次,单细胞RNA测序(scRNA-seq)与免疫库测序共同揭示了T细胞亚群特有的年龄依赖性转录特征与克隆扩增模式:(1)高克隆扩增T细胞在生命两端(幼儿发育期2-12岁与老年期70-90岁)呈现双峰分布;(2)初始型$\mathrm{CD4^{+}}$与$\mathbf{CD8^{+}}$T细胞从出生起就展现出截然不同的衰老相关细胞组成、转录模式及受体库动态;(3)CD8+ MAIT SLAMF7+细胞在青春期达到丰度与克隆多样性峰值,其抗菌能力显著高于其他年龄段。基于质谱流式(CyTOF)的高通量蛋白质组分析进一步验证了上述发现。值得注意的是,我们发现了一个同时表达BCR基因与细胞毒性基因的新型"细胞毒性"B细胞亚群($\mathbf{B_BCR^{+}GNLY^{+}}$细胞),该群体在儿童期显著富集。利用这一全生命周期单细胞数据集,我们开发了可识别免疫紊乱状态的免疫年龄预测模型(siAge评分)。本研究首次绘制了人类发育、成熟与衰老过程中外周免疫细胞转录组与免疫库的精细时间图谱,为深入探索各年龄段免疫系统特征提供了重要基础。
The immune system is immature at birth and gradually develops under postnatal environmental stimuli 3.Maternally derived antibodies taper off in the infant's circulation and are almost completely lost by 1 year after birth54.5, whereas self-derived immune mediators-such as T cells from the thymus, and B cells and innate immune cells from the bone marrow-augment over time?°, accompanied by external stimula tion including routine immunization and environmental microbe and food antigen exposures. Immunoglobulins in infants increase progressively and reach normal adult levels at age 5-9 years57.5s During this critical developmental period, children are more vulnerable to infectious diseases59.Wefound that certain cell subsets exhibited enrichment in certain age categories during development, including $\mathbf{CD14}^{+}$ monocytes in neonates and plasma cells in children.Importantly, we identified a unique'cytotoxic' B cell type (B_BCR+GNLY+ cells) that expanded significantly in children from 2to12 yearsold, with a peak in 6-year-olds. Given its low proportion in other age groups, it may have been overlooked in previous single-cell studies that did notexamine all hinge)andthird(upperhinge)quartiles;whiskers show $1.5\times$ the interquartile range. $P$ values were determined by two-sided unpaired Wilcox on rank-sum test.g,Bar plot showing the top 21 key genes identified by random forest regression. The genes are ranked in descending order of importance with respect to the accuracy ofthe model. The insert represents a tenfold cross-validation error as a function of the numberof input features used toregress against the chronological ages.h, Heatmap showing the row-scaled expression ofthe top 21key genes across age in corresponding cell subsets.i,Bar plot showing functional enrichment of the top 21 key genes. $P$ values were determined by hyper geometric test. Inc-e,the blue line indicates linear regression; PCC and $P$ values are indicated.The gray shadow covers the $95%$ confidence interval.In c-f $P<0.05$ was considered statistically significant.KD,Kawasaki diseases;NPC, na so pharyngeal carcinoma; SLE,systemic lupus erythematosus.
免疫系统在出生时尚未成熟,会随着出生后环境刺激逐步发育。母源抗体在婴儿血液循环中逐渐衰减,出生后54.5周时几乎完全消失,而自体免疫介质——如胸腺来源的T细胞、骨髓来源的B细胞与先天免疫细胞——会随时间推移逐渐增多,这一过程还伴随着常规免疫接种、环境微生物及食物抗原等外部刺激。婴儿免疫球蛋白水平持续上升,至5-9岁时达到正常成人水平。在这个关键发育期,儿童更易感染疾病。我们发现特定细胞亚群在发育过程中呈现年龄特异性富集,包括新生儿期的CD14+单核细胞与儿童期的浆细胞。值得注意的是,我们鉴定出一种独特的"细胞毒性"B细胞类型(B_BCR+GNLY+细胞),该细胞在2-12岁儿童中显著扩增,6岁时达到峰值。鉴于其在其他年龄组占比极低,过往未覆盖全年龄段的单细胞研究可能忽略了该细胞类型。
(箱线图元素说明:箱体表示四分位距,中线标记中位数,须线展示1.5倍四分位距。P值通过双样本Wilcoxon秩和检验获得)
g. 条形图展示随机森林回归模型识别的前21个关键基因,按模型精度重要性降序排列。插图显示十折交叉验证误差随输入特征数量的变化趋势。
h. 热图呈现前21个关键基因在各年龄组对应细胞亚群中的标准化表达水平。
i. 条形图展示前21个关键基因的功能富集分析结果(P值通过超几何检验计算)。
(图注说明:c-e图中蓝线表示线性回归,标注PCC与P值;灰色阴影为95%置信区间。c-f图中P<0.05视为具有统计学显著性。缩写说明:KD-川崎病;NPC-鼻咽癌;SLE-系统性红斑狼疮)
age categories or perform scBCR-seq analyses.B_BCR'GNLYt cells are characterized by the simultaneous expression of BCR genes and cytotoxic genes,such as GNLYand NKG7.Traditionally,B cells are thought to be immune cell populations incapable of cytotoxic functions.However, recent studies have reported a'cytotoxic' phenotype of B cell subsets expressing cytotoxic genes in diseased populations 51,60-62. The enrichment of the B ${\tt B C R^{+}G N L Y^{+}}$ cells with cytotoxic signatures before adolescence may represent a potential mechanism for protection against infection before immune maturation. The detailed biological functions of B_BCR+GNLY+ cells require further investigation (detailed in Supplementary Information Section 2.1).
B_BCR+GNLY+细胞的特征是同时表达BCR基因和细胞毒性基因,如GNLY和NKG7。传统上认为B细胞是不具备细胞毒性功能的免疫细胞群体。然而,最近的研究报道了在疾病人群中表达细胞毒性基因的B细胞亚群具有"细胞毒性"表型[51,60-62]。青春期前具有细胞毒性特征的B${\tt B C R^{+}G N L Y^{+}}$细胞富集,可能代表了免疫成熟前抵御感染的潜在机制。B_BCR+GNLY+细胞的具体生物学功能需要进一步研究(详见补充信息第2.1节)。
The immune system undergoes dramatic changes during and after adolescence and sexual maturity63,64. Both our scRNA-seq and CyTOF analyses showed that MAIT cells were enriched during adolescence, with a peak in the 18-year-old group, and MAIT cells in adolescents possessed higher TCR diversity and stronger antimicrobial effector function than in other age groups. MAlT cells are a unique T cell type that bridge innate and adaptive immunity44. The involvement of MAlT cells in a large spectrum of pathological conditions makes them attractive therapeutic targets+'.Commensal bacteria mediate MAIT cell maturation and re circulation into the thymus65. The dramatic hormonal changes that occur during adolescence lead to alteration ofthe mucosal membrane micro environment and variation in commensal bacterial species and numbers°, which may account for the MAIT cell expansion during the period of adolescence seen in our study. However, a systematic understanding of the peripheral circulating MA iT cells in healthy individuals across the life course was lacking at a single-cell resolution. Our findings delineate the potential functional significance of MAIT cells during adolescence by characterizing the transcript ome and immune repertoire of this cell type across the lifespan. The observed transcript omi c and expansion features and antimicrobial activities of MAIT cells during adolescence may represent a key component of immune maturation and competence for protection against infection.
免疫系统在青春期及性成熟期间和之后会发生显著变化[63,64]。我们的单细胞RNA测序(scRNA-seq)和质谱流式(CyTOF)分析均显示,MAIT细胞在青春期富集,并在18岁年龄组达到峰值,且青少年期的MAIT细胞比其他年龄组具有更高的TCR多样性和更强的抗菌效应功能。MAIT细胞是连接先天免疫与适应性免疫的特殊T细胞类型[44]。由于MAIT细胞参与多种病理过程,使其成为极具吸引力的治疗靶点。共生菌群介导MAIT细胞的成熟并促进其再循环至胸腺[65]。青春期发生的剧烈激素变化会导致黏膜微环境改变及共生菌种类与数量的变化,这可能是本研究中观察到的青春期MAIT细胞扩增的原因。然而,此前缺乏从单细胞分辨率系统理解健康个体外周循环MAIT细胞在整个生命历程中的变化。我们的研究通过表征MAIT细胞在整个生命周期的转录组和免疫组库,揭示了该细胞类型在青春期的潜在功能意义。观察到的青春期MAIT细胞转录组特征、扩增现象及抗菌活性,可能代表着免疫成熟和抗感染能力发展的关键组成部分。
Immune function declines with aging, termed 'immuno sen escence. However, different types of immune cells display differential susceptibility to aging. In line with a mouse aging study showing that T cells exhibited a more vulnerable phenotype to senescence than other immune cells°, our results in humans revealed that the functions of T cell subsets were the most susceptible to aging among all PBMCs across the entire lifespan. Previous studies on the adult population (25-88 years old) have concluded that cytotoxic T cells $(\mathbf{CD8}^{+})$ ,compared to helper T cells $(\mathbf{CD4}^{+})$ ,show a more pronounced decline with age36. However, with our full lifespan coverage, we found the aging trajectories of $\mathbf{CD4^{+}}$ and $\mathbf{CD8^{+}}$ T cells to diverge beginning at birth, based on both scRNA-seq and CyTOF analyses. During the early life developmental stage, naive $\mathbf{CD4}^{+}$ cells declined more rapidly than naive $\mathbf{CD8^{+}}$ cells, whereas during later life, naive $\mathbf{CD8^{+}}$ cells exhibited more drastic attrition than naive $\mathbf{CD4}^{+}$ cells in terms of frequency and TCR diversity. The collapse of TCR diversity in naive T cells has been considered a strong predictor of poor health in older people67. Our scRNA-seq dataset, covering each age category from birth to old age, revealed, at the single-cell level, the differential dynamics of the alteration of T CR diversity in naive $\mathrm{CD4^{+}}$ T cells and naive $\mathbf{CD8}^{+}$ T cells in both early and late life.
免疫功能随衰老而下降,称为"免疫衰老"。然而,不同类型的免疫细胞对衰老表现出不同的易感性。与小鼠衰老研究显示T细胞比其他免疫细胞表现出更易衰老的表型一致,我们在人类中的研究结果表明,在整个生命周期中,T细胞亚群的功能在所有外周血单个核细胞(PBMCs)中对衰老最为敏感。先前针对成人群体(25-88岁)的研究得出结论,与辅助性T细胞(CD4+)相比,细胞毒性T细胞(CD8+)随年龄增长表现出更明显的下降[36]。但通过我们覆盖全生命周期的研究发现,基于单细胞RNA测序(scRNA-seq)和质谱流式(CyTOF)分析,CD4+和CD8+T细胞的衰老轨迹从出生起就开始分化。在生命早期发育阶段,初始CD4+细胞比初始CD8+细胞下降更快,而在生命后期,就频率和TCR多样性而言,初始CD8+细胞比初始CD4+细胞表现出更剧烈的耗竭。初始T细胞中TCR多样性的崩溃被认为是老年人健康状况不佳的强预测指标[67]。我们的scRNA-seq数据集覆盖了从出生到老年的每个年龄阶段,在单细胞水平揭示了生命早期和晚期初始CD4+T细胞与初始CD8+T细胞在TCR多样性改变方面的差异动态。
Age-dependent changes in the immune response are the most obvious at the extremes of age, with immune malfunction or dysregulation occurring during the early life development and the late-life elderly stages56. Previous studies that were limited tothe adult population have shown elevated T cell clonal expansion only in elderly ages36. However, taking advantage ofour lifecycle-wide cohort from birth to old age, we revealed that high T cell clonal expansions actually exhibited two peaks at the extremes ofthe lifespan, with an early peak in the 2-12-year-old group and a late peak in the 70-90-year-old group. Our results provide insights into the similarities and differences in T cell functions between two age groups, representing immune development and immune aging, respectively (detailed in Supplementary Information Section 2.2). The human lifespan has increased in recent decades, which raises major health issues related to many age-associated path o logie s. Using our lifecycle-wide single-cell dataset covering 13 age groups over the extent of the human lifespan, we established a comprehensive cross-age index (siAge score) as a type of machine-learning-based immune aging clock to evaluate an individual's immune status across different life stages. We demonstrated that the siAge score is highly correlated with the calendar age in healthy individuals but not in those with disturbed immune functions in multiple diseased validation cohorts. Thus, our siAge model holds the potential to effectively quantify a person's immune status and distinguish a healthy individual from one with an unhealthy immune state. It has been transformed into user-friendly software (https://pu-lab.sjtu.edu.cn/shiny/lifespan), making it easily accessible to end users.Further research is needed to validate the generalization of our findings (detailed in Supplementary Information Section 2.3).
免疫反应的年龄依赖性变化在生命两端最为显著,表现为生命早期发育阶段和老年阶段的免疫功能障碍或失调[56]。既往局限于成年人群的研究仅发现老年群体存在T细胞克隆扩增水平升高[36]。但借助我们覆盖从出生到老年全生命周期的队列,我们首次揭示高T细胞克隆扩增实际上在生命两端呈现双峰分布:早期高峰出现在2-12岁组,晚期高峰出现在70-90岁组。该结果揭示了分别代表免疫发育和免疫衰老的两个年龄组在T细胞功能上的异同(详见补充信息第2.2节)。
近几十年来人类寿命的延长带来了诸多年龄相关疾病的重大健康挑战。基于覆盖人类生命周期13个年龄组的单细胞数据集,我们建立了一个综合性跨年龄指标(siAge评分),作为基于机器学习(machine learning)的免疫衰老时钟,用于评估个体不同生命阶段的免疫状态。我们在多个疾病验证队列中证实,siAge评分与健康个体的实际年龄高度相关,但在免疫功能紊乱的个体中无此相关性。因此,siAge模型有望量化个体免疫状态并区分健康与非健康免疫状态。该模型已转化为用户友好型软件(https://pu-lab.sjtu.edu.cn/shiny/lifespan)供终端用户使用。未来需进一步研究验证本发现的普适性(详见补充信息第2.3节)。
Online content
在线内容
Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements,peer review information; details of author contributions and competing interests; and statements of data and code availability are available at https://doi.0rg/10.1038/s41590-024-02059-6.
任何方法、补充参考文献、《自然》系列期刊报告摘要、源数据、扩展数据、补充信息、致谢、同行评审信息、作者贡献与利益竞争详情、以及数据和代码可用性声明,请访问 https://doi.0rg/10.1038/s41590-024-02059-6 获取。
References
参考文献
Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
出版方声明:施普林格·自然对出版地图和机构附属关系中的管辖权主张保持中立。
OpenAccess This article is licensed under a Creative Commons Attribution-NonCommercial-No Derivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material.You do not have permission under this licence to share
开放获取 本文遵循知识共享署名-非商业性使用-禁止演绎 4.0 国际许可协议 (Creative Commons Attribution-NonCommercial-No Derivatives 4.0 International License)。该协议允许在任何媒介或格式中进行非商业性使用、共享、分发和复制,但须满足以下条件:注明原作者及来源、提供知识共享许可协议链接,并说明是否对授权材料进行了修改。本许可协议不允许共享演绎作品。
adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.To view a copy of this licence, visit http://creative commons.org/licenses/by-nc-nd/4.0/.
本文改编自该文章或其部分内容。除非在材料署名中另有说明,否则本文中的图片或其他第三方材料均包含在文章的 Creative Commons 许可中。如果材料未包含在文章的 Creative Commons 许可中,且您的预期用途未经法定许可或超出允许范围,您需要直接获得版权持有者的授权。要查看该许可的副本,请访问 http://creativecommons.org/licenses/by-nc-nd/4.0/。
$\circledcirc$ The Author(s) 2025
$\circledcirc$ 作者 2025
Methods
方法
Participants and ethics
参与者和伦理规范
All human samples were obtained in accordance with protocols approved by the Institutional Review Board at Shanghai Jiao Tong University.All donors were enrolled with approval by the Ethics Committee of RenJi Hospital (KY2019-136),School of Medicine, Shanghai Jiao Tong University. After providing written informed consent in accordance with the criteria set by the Declaration of Helsinki (2013)68, 220 healthy volunteers agedOto ${\ge}90$ years were recruited from Shanghai Pudong Cohort (NC T 05206643), including 61 healthy participants aged 0 to $\scriptstyle\geq90$ years for scRNA and scTCR/scBCR sequencing; 7O healthy partic i pants age dO to $\scriptstyle\geq90$ years for independent CyTOF-based single-cell protein validation experiments; 34 healthy participants aged 0 to ${\ge}90$ years for independent bulk RNA-seq validation experiments; and 55 healthy participants aged 2-72 yearsfor validation experiments. The inclusion and exclusion criteria for participants in this study is provided in Supplementary Information Section 3. The participants spanned the entire lifespan, partitioned into 13 age groups (0-,1-,2-,6-,12-,18-,30-, 40-,50-,60-,70-,80-and over 90-year-olds). Toyield more biologically meaningful results, when performing statistical testing to determine the differences among groups,participants were categorized into eight life stages as suggested by developmental biologists 22-24 as follows: O years old, newborn (umbilical cord blood); 1-2 years old, toddler; 6-8 years old, childhood; 12-18 years old, adolescent; 30-40 years old, young adult, 50-60 years old, middle-aged adult; 70-80 years old, elderly; and over 90 years old, elderly-plus.
所有人体样本的获取均遵循上海交通大学机构审查委员会批准的实验方案。捐赠者入组经上海交通大学医学院附属仁济医院伦理委员会审批通过(KY2019-136)。根据《赫尔辛基宣言》(2013)[68]标准签署书面知情同意书后,从上海浦东队列(NCT05206643)招募了220名0岁至${\ge}90$岁的健康志愿者,包括:61名0-$\scriptstyle\geq90$岁健康参与者用于单细胞RNA测序(scRNA)及单细胞TCR/BCR测序(scTCR/scBCR);70名0-$\scriptstyle\geq90$岁健康参与者用于基于质谱流式(CyTOF)的单细胞蛋白质独立验证实验;34名0-${\ge}90$岁健康参与者用于批量RNA测序(bulk RNA-seq)独立验证实验;55名2-72岁健康参与者用于验证实验。本研究受试者纳入与排除标准详见补充信息第3节。参与者覆盖全生命周期,划分为13个年龄组(0岁、1岁、2岁、6岁、12岁、18岁、30岁、40岁、50岁、60岁、70岁、80岁及90岁以上)。为获得更具生物学意义的结果,在进行组间差异统计检验时,参照发育生物学家建议[22-24]将参与者分为八个生命阶段:0岁新生儿(脐带血)、1-2岁幼儿、6-8岁儿童、12-18岁青少年、30-40岁青年、50-60岁中年、70-80岁老年及90岁以上高龄老年。
Single-cell RNA-seq library preparation and sequencing
单细胞RNA测序文库制备与测序
To produce single-cell gel beads in emulsion, cell suspensions were quantified with 400-600 living cellsper determined by a Countstar automated cell counter.A Chromium single-cell controller $^{10\times}$ Genomics) was used to prepare cell suspensions as gel beads in emulsion, with a Single Cell $5^{\prime}$ Library and Gel Bead kit (1000006, $^{10\times}$ Genomics) and Chromium Single Cell A Chip kit (120236, $^{10\times}$ Genomics) following the manufacturer's protocols.Approximately 15,0oo cells were added to each channeland the recovered target cells were estimated to be approximately 8,ooO cells.scRNA-seq libraries were constructed using the Chromium Single Cell $5^{\prime}$ v.1.1 Reagent (HumanTCell,1000005;HumanB Cell,1000016; $^{10\times}$ Genomics), and a unique sample index was included in each sequencing library. Finally, sequencing was performed with a sequencing depth of at least 100,000 reads per cell with a paired-end 150 bp reading strategy using the Ilumina platform (CapitalBio Technology).
为制备单细胞乳液凝胶珠,使用Countstar自动细胞计数器对细胞悬液进行定量,确保每微升含有400-600个活细胞。采用Chromium单细胞控制器(10× Genomics)将细胞悬液制备为乳液凝胶珠,过程中使用Single Cell 5′ Library and Gel Bead试剂盒(1000006,10× Genomics)和Chromium Single Cell A Chip试剂盒(120236,10× Genomics),严格遵循制造商的操作流程。每个通道约加入15,000个细胞,预计回收目标细胞数约为8,000个。利用Chromium Single Cell 5′ v.1.1试剂(Human T Cell,1000005;Human B Cell,1000016;10× Genomics)构建scRNA-seq文库,每个测序文库均包含独特的样本索引。最终,在Ilumina平台(CapitalBio Technology)上采用双端150 bp读长策略进行测序,确保每个细胞至少获得100,000条reads的测序深度。
scRNA-seq data processing
scRNA-seq 数据处理
Cell Ranger (v.3.0.2) was used to align readsto reference(GRCh38-3.0.0) and generate feature-barcode matrices from the Chromium s cRNA-seq output.Filtering stepsfor high-quality single cells included removal of doublets using Scrub let"(doublet rate $=0.075$ ,doublet Score $<0.25,$ cells expressing ${<}400$ genes, cells with $>2{,}500$ detected genes and cells with a mitochondrial gene count percentage of ${\tt>}10%$ Genes appearing in lessthan three cellswere also excluded.
使用Cell Ranger (v.3.0.2)将测序读数比对至参考基因组(GRCh38-3.0.0),并从Chromium单细胞RNA测序输出中生成特征-条形码矩阵。高质量单细胞的筛选步骤包括:使用Scrublet去除双联体(双联体率=0.075,双联体评分<0.25),剔除表达基因数<400的细胞、检测基因数>2,500的细胞,以及线粒体基因占比>10%的细胞。同时排除在少于三个细胞中出现的基因。
Based on the clean scRNA-seq data after quality control, gene expression profiles were normalized for eachcell using the Log Normalize method with the Seurat NormalizeD at a function. The Find Variable Features function was then applied toidentify the highly variable genes using the default parameters. Next, we scaled the data using the ScaleData function and performed principal component analysis on the scaled data using the RunPCA function with default parameters. The s cV lln te g ration algorithm was used for batch correction?°. We constructed a shared nearest neighbor graph using the FindNeighbors function and clustered cells using the Louvain algorithm with the Find Clusters function based on the s cV lln te g ration reduction. Finally,the Run U MAP function facilitated the visualization of all cells. A total of 538,266 high-quality cells were derived from 61 samples.
基于质量控制后的洁净单细胞RNA测序(scRNA-seq)数据,我们使用Seurat包中的NormalizeData函数,采用Log Normalize方法对每个细胞的基因表达谱进行归一化处理。随后运用FindVariableFeatures函数,通过默认参数识别高变基因。接着使用ScaleData函数对数据进行标准化,并采用RunPCA函数以默认参数对标准化后的数据执行主成分分析。批次校正采用s cV lln te g ration算法[20]。基于s cV lln te g ration降维结果,我们通过FindNeighbors函数构建共享最近邻图,并利用FindClusters函数结合Louvain算法进行细胞聚类。最终通过RunUMAP函数实现所有细胞的可视化。从61个样本中共获得538,266个高质量细胞。
Cell-type annotation and cluster marker identification
细胞类型注释与聚类标记识别
The Find All Markers function was used to find marker genes for each cluster.Clusters were then classified and annotated based on the marker gene expression of each cell subset. We used a two-round hierarchical approach to cluster and annotate the cells71.72. In brief, the first round of clustering (resolution, 0.8) identified 11 major cell types, including $\mathbf{CD4^{+}}$ Tcells, $\mathrm{CD}8^{+}$ T cells, y T cells, NK cells, B cells, plasma cells, $\mathbf{CD14}^{+}$ monocytes, $\mathtt{C D16}^{+}$ monocytes,monocyte-derived dendritic cells, plasma cyto id dendritic cells and mega kary o cyte s. In round two, cells from these major types were subset out and the whole integration and clustering process was applied to them separately. The second round of clustering(resolution,0.3-1.2)resulted in 25 distinct cell subsets(shown in Fig.1b and Extended Data Fig.1a-d).Single cells expressing