说过好多次要汇总在分析单细胞数据时候用到的marker基因,因为不同组织中还是有些特异的marker基因的!
新年伊始决定改一下拖延的坏毛病,所以被朋友教育之后,决定从整理marker基因开始!慢慢改掉拖延的坏毛病,说过的事情就要着手去实行!
主要是去年分享的全部数据代码以及推文已经整理好了,收尾工作就是marker基因合集的整理,是真的不能再拖了!
关于去年直播分享的系列数据集可见——#单细胞实战
所有的marker基因基于文献以及个人经验进行简单的汇总,不作为任何参考标准!
这是基于曾老师整理好的代码中check-all-marker,摘录了一下我常用的一些marker基因
T Cells (CD3D, CD3E, CD8A,PTPRC,CD4,CD2),
B cells (CD19, CD79A, MS4A1 [CD20]),
Plasma cells (IGHG1, MZB1, SDC1, CD79A),
Monocytes and macrophages (CD68, CD163, CD14),
NK Cells (FGFBP2, FCG3RA, CX3CR1),
Fibroblasts (FGF7, MME,GSN,LUM,DCN),
Endothelial cells (PECAM1, VWF).
epi or tumor (EPCAM, KRT19, PROM1, ALDH1A1, CD24).
immune (CD45+,PTPRC),
epithelial/cancer (EpCAM+,EPCAM),
stromal (CD10+,MME,fibo or CD31+,PECAM1,endo)
Mast(CPA3, CST3, KIT, TPSAB1, TPSB2, MS4A2)
#髓系
macrophages (Adgre1, Cd14, and Fcgr3,LYZ,CD68,CD163),
cDCs (Xcr1, Flt3, Ccr7,CD1E),
pDCs (Siglech, Clec10a, Clec12a,CLEC4C),
monocytes (Ly6c2 , Spn,CD300E),
neutrophils (Csf3r, S100a8, and Cxcl3),
GSE212199_中枢神经系统免疫细胞
对应相关的marker基因:
astrocytes = c("AQP4", "ADGRV1", "GPC5", "RYR3")
endothelial = c("CLDN5", "ABCB1", "EBF1")
excitatory = c("CAMK2A", "CBLN2", "LDB2")
inhibitory = c("GAD1", "LHFPL3", "PCDH15")
microglia = c("C3", "LRMDA", "DOCK8")
oligodendrocytes = c("MBP", "PLP1", "ST18")
OPC='Tnr,Igsf21,Neu4,Gpr17'
Ependymal='Cfap126,Fam183b,Tmem212,pifo,Tekt1,Dnah12'
pericyte=c( 'DCN', 'LUM', 'GSN' ,'FGF7','MME', 'ACTA2','RGS5')
因为是神经元相关的marker基因,所以类似的还有:
GSE161045—人类纹状体胶质细胞
GSE184370-瘫痪后恢复行走能力的神经元
GSE136001-脑瘤小鼠模型
GSE162610-小鼠脊髓损伤——文章有提供对应的marker基因集,可作为参考
GSE185042—小鼠肝脏组织
对应相关的marker基因:
Kupffer = c("CD163", "CD206", "F4/80", "CD68")
endothelial = c("CLDN5", "ABCB1", "EBF1","CD31","VWF")
macrophage = c('Adgre1', 'Cd14', 'Fcgr3')
Cholangiocyte = c("CK7", "CK19", "SOX9","EpCAM","FYXD2","TM4SF4","ANXA4")
Hepatocyte = c("ALB", "AFP", "CYP2E1","HNF4A","ASGR1","APOC3","FABP1"," APOA1")
Dividing='Ki-67,PCNA,MCM2,AURKA'
PlasmaB='CD138,CD19,CD20,CD27,CD38,IRF4'
Hepatic_stellate=c( 'GFAP', 'PDGFRβ',"ACTA2","COL1A1")
cDCs=c('Xcr1', 'Flt3', 'Ccr7')
pDCs=c('Siglech', 'Clec10a', 'Clec12a')
GSE214611—心肌梗死心脏
对应相关的marker基因:
Macrophages = c("APOC1","HLA-DRB5","C1QA","C1QB")
CM=c("TTN","MYH7","MYH6","TNNT2") #心肌细胞
endothelial=c("VWF", "IFI27", "PECAM1","MGP")
Fibroblast=c("DCN", "GSN" ,"LUM","FBLN1","COL1A2")
SMC=c("ACTA2", "CALD1", "MYH11","Myo1b","RGS5')
monocytes=c('Ly6c2' , 'Spn')
neutrophils=c('Csf3r', 'S100a8', 'Cxcl3')
GSE163558——胃癌器官特异性转移
这篇文章比较特别,是咱们单细胞文献月更分享的第一篇文章,有详细的数据分析及代码,如果需要的话可以联系客服小助手进群交流。——承包你2025全部的单细胞转录组降维聚类分群
对应相关的marker基因:
genes_to_check = c('EPCAM','KRT19','CLDN4', #上皮
'PECAM1' , 'CLO1A2', 'VWF', #基质
'CD3D', 'CD3E', 'CD8A', 'CD4','CD2', #T
'CDH5', 'PECAM1', 'VWF', #内皮
'LUM' , 'FGF7', 'MME', #成纤维
'AIF1', 'C1QC','C1QB','LYZ', #巨噬
'MKI67', 'STMN1', 'PCNA', #增殖
'CPA3' ,'CST3', 'KIT', 'TPSAB1','TPSB2',#肥大
'GOS2', 'S100A9','S100A8','CXCL8', #中性粒细胞
'KLRD1', 'GNLY', 'KLRF1','AREG', 'XCL2','HSPA6', #NK
'MS4A1','CD19', 'CD79A','IGHG1','MZB1', 'SDC1', #B
'CSF1R', 'CSF3R', 'CD68') #髓系
GSE242889-肝细胞癌微血管侵袭
对应相关的marker基因:
last_markers = c('PTPRC', 'CD3D', 'CD3E', 'CD4','CD8A',
'CD19', 'CD79A', 'MS4A1' ,
'IGHG1', 'MZB1', 'SDC1',
'CD68', 'CD163', 'CD14',
'TPSAB1' , 'TPSB2', # mast cells,
'RCVRN','FPR1' , 'ITGAM' ,
'C1QA', 'C1QB', # mac
'S100A9', 'S100A8', 'MMP19',# monocyte
'FCGR3A',
'LAMP3', 'IDO1','IDO2',## DC3
'CD1E','CD1C', # DC2
'KLRB1','NCR1', # NK
'FGF7','MME', 'ACTA2', ## human fibo
'GJB2', 'RGS5',
'DCN', 'LUM', 'GSN' , ## mouse PDAC fibo
'MKI67' , 'TOP2A',
'PECAM1', 'VWF', ## endo
"PLVAP",'PROX1','ACKR1','CA4','HEY1',
'EPCAM' , 'KRT19','KRT7', # epi
'FYXD2', 'TM4SF4', 'ANXA4',# cholangiocytes
'APOC3', 'FABP1', 'APOA1', # hepatocytes
'Serpina1c',
'PROM1', 'ALDH1A1' )