Videos

Mohammed AlQuraishi - OpenFold: Lessons and insights from rebuilding and retraining AlphaFold2

January 23, 2023
Abstract
Recorded 23 January 2023. Mohammed AlQuraishi of Harvard Medical School, Systems Biology, presents "OpenFold: Lesson learned and insights gained from rebuilding and retraining AlphaFold2" at IPAM's Learning and Emergence in Molecular Systems Workshop. Abstract: AlphaFold2 revolutionized structural biology by accurately predicting protein structures from sequence. Its implementation however (i) lacks the code and data required to train models for new tasks, such as predicting alternate protein conformations or antibody structures, (ii) is unoptimized for commercially available computing hardware, making large-scale prediction campaigns impractical, and (iii) remains poorly understood with respect to how training data and regimen influence accuracy. Here we report OpenFold, an optimized and trainable version of AlphaFold2. We train OpenFold from scratch and demonstrate that it fully reproduces AlphaFold2’s accuracy. By analyzing OpenFold training, we find new relationships between data size/diversity and prediction accuracy and gain insights into how OpenFold learns to fold proteins during its training process. Learn more online at: http://www.ipam.ucla.edu/programs/workshops/learning-and-emergence-in-molecular-systems/