세미나/행사

  • Home
  • 정보센터
  • 세미나/행사
  • 세미나/행사
세미나 담당교수 : 2024-2학기 김진홍 (금요세미나, 콜로퀴움, jinhkim@snu.ac.kr), 강찬희 (신진과학자세미나, chanhee.kang@snu.ac.kr), 윤태영 (10-10 project, tyyoon@snu.ac.kr)
조 교 : 장사라 (02-880-4431, jsarah@snu.ac.kr)
호암교수회관 : 5572, 교수회관: 5241, 두레미담: 9358, 라쿠치나: 1631.

[초청강연] Cross-protein transfer learning substantially improves zero-shot prediction of disease variant effects

2023-01-05l 조회수 2847

일시: 2023-01-03 14:00 ~ 16:00
발표자: Yun S. Song (UC Berkeley Dept. of Statistics & EECS)
담당교수: 생명과학부
장소: 대면 | 교수회의실(504-105)
Cross-protein transfer learning substantially improves zero-shot prediction of disease

variant effects

Yun S. Song, Department of Statistics & EECS, UC Berkeley

Genetic variation in the human genome is a major determinant of individual disease risk,
but the vast majority of missense variants have unknown etiological effects. Various
computational strategies have been proposed to predict the effects of missense variants
across the human proteome, using many different predictive signals. Here, we present a
robust learning framework for leveraging functional assay data to construct
computational predictors of disease variant effects. We train cross-protein transfer (CPT)

models using deep mutational scanning data from only five proteins and achieve state-
of-the-art performance on unseen proteins across the human proteome. On human

disease variants annotated in ClinVar, our model CPT-1 improves specificity at 95%
sensitivity to 64%, from 31% for ESM-1v and 50% for EVE. Our framework combines
general protein sequence models with vertebrate sequence alignments and AlphaFold2
structures, and it is adaptable to the future inclusion of other sources of information. We
release predictions for all missense variants in 90% of human genes. Our results
establish the utility of functional assay data for learning general properties of variants
that can transfer to unseen proteins.