Generative design of sequence specific DNA binding proteins
Presentation

Generative design of sequence specific DNA binding proteins

Paper Author

Enisha Sehgal, Yuliya Politanska,Raktim Mitra, Paul T. Kim, Nayim González Rodríguez, David Baker Institute for Protein Design, University of Washington, Seattle, WA

Abstract

De novo protein design has advanced rapidly in recent years, yet the programmable recognition of specific DNA sequences remains a longstanding challenge. Here we describe a deep learning based approach for designing sequence selective DNA binding proteins. Our method combines structure generation using RFdiffusion3 with explicit screening against off-target interactions using AlphaFold3. We test this approach by generating 96 designs for each of 15 diverse DNA targets and identify specific binders for 7 targets, representing a ~100-fold improvement in success rates over previous approaches. We further characterize the binding landscape using variant competition assays and randomized library screening, revealing robust sequence discrimination across diverse targets. Together, these results represent a significant step forward in de novo sequence specific DNA binder design.

Research Paper

Previous Talks

50 talks

PathInHydro, a Set of Machine Learning Models to Identify Unbinding Pathways of Gas Molecules in [Ni

Oct 04, 2024 Ariane Nunes-Alves

Self-supervised graph neural networks for polymer property prediction

Feb 20, 2025 Jana M. Weber

Learning-Order Autoregressive Models with Application to Molecular Graph Generation

Aug 07, 2025 Michalis K. Titsias