We apologize for Proteopedia being slow to respond. For the past two years, a new implementation of Proteopedia has been being built. Soon, it will replace this 18-year old system. All existing content will be moved to the new system at a date that will be announced here.

7rgr

From Proteopedia

Revision as of 06:56, 8 February 2023 by OCA (Talk | contribs)

(diff) ←Older revision | Current revision (diff) | Newer revision→ (diff)

Jump to: navigation, search

Lysozyme 056 from Deep neural language modeling

Structural highlights

7rgr is a 2 chain structure with sequence from Synthetic construct. Full crystallographic information is available from OCA. For a guided tour on the structure components use FirstGlance.
Ligands:	,
Resources:	FirstGlance, OCA, PDBe, RCSB, PDBsum, ProSAT

Publication Abstract from PubMed

Deep-learning language models have shown promise in various biotechnological applications, including protein design and engineering. Here we describe ProGen, a language model that can generate protein sequences with a predictable function across large protein families, akin to generating grammatically and semantically correct natural language sentences on diverse topics. The model was trained on 280 million protein sequences from >19,000 families and is augmented with control tags specifying protein properties. ProGen can be further fine-tuned to curated sequences and tags to improve controllable generation performance of proteins from families with sufficient homologous samples. Artificial proteins fine-tuned to five distinct lysozyme families showed similar catalytic efficiencies as natural lysozymes, with sequence identity to natural proteins as low as 31.4%. ProGen is readily adapted to diverse protein families, as we demonstrate with chorismate mutase and malate dehydrogenase.

Large language models generate functional protein sequences across diverse families.,Madani A, Krause B, Greene ER, Subramanian S, Mohr BP, Holton JM, Olmos JL Jr, Xiong C, Sun ZZ, Socher R, Fraser JS, Naik N Nat Biotechnol. 2023 Jan 26. doi: 10.1038/s41587-022-01618-2. PMID:36702895^[1]

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.

References

↑ Madani A, Krause B, Greene ER, Subramanian S, Mohr BP, Holton JM, Olmos JL Jr, Xiong C, Sun ZZ, Socher R, Fraser JS, Naik N. Large language models generate functional protein sequences across diverse families. Nat Biotechnol. 2023 Jan 26. PMID:36702895 doi:10.1038/s41587-022-01618-2

1 Structural highlights
2 Publication Abstract from PubMed
3 References

Proteopedia Page Contributors and Editors (what is this?)

OCA

Retrieved from "http://52.214.119.220/wiki/index.php/7rgr"

7rgr

From Proteopedia

Lysozyme 056 from Deep neural language modeling

Structural highlights

Publication Abstract from PubMed

References

Contents

Proteopedia Page Contributors and Editors (what is this?)

Views

Personal tools

Navigation

Search

Toolbox