In the current COVID-19 pandemic, there are many problems that hackers have been happy to pounce on. From 3D printed face shields and homemade medical masks to replacing a complete mechanical ventilator, this stream of ideas has inspired and delighted the soul. At the same time, there were attempts to advance in another area: in research aimed at combating the virus itself.
By all appearances, the approach that tries to get to the very root of the problem has the greatest potential for stopping the current pandemic and getting ahead of all subsequent ones. This βknow your enemyβ approach is espoused by the Folding@Home computing project. Millions of people have signed up for the project and are donating some of their CPU and GPU processing power, creating the largest [distributed] supercomputer in history.
But what exactly are all these exaflops used for? Why is it necessary to throw such computing power at
To begin with, the most important thing: why do we need proteins?
Proteins are vital structures. They not only provide building material for cells, but also serve as enzyme catalysts for almost all biochemical reactions. Squirrels, be they
To understand how proteins get the structure that determines their function, you need to go over the basics of molecular biology and the information flow in the cell.
production, or
Ribosomes behave like assembly devices - they capture the mRNA template and match it to other small pieces of RNA,
This sequence of amino acids is the first level of the structural hierarchy of the protein, which is why it is called
Long-range bonds of protein parts
The next level of the three-dimensional structure, which goes beyond the primary, was given a clever name
Alpha helices and beta sheets in proteins. Hydrogen bonds are formed during protein expression.
These two structures and their combinations form the next level of protein structure β
Also, the stability of tertiary structures is provided by long-range bonds between amino acids. A classic example of such connections is
The tertiary structure is stabilized by long-range interactions such as hydrophobicity or disulfide bonds.
Disulfide bonds can form between
Modeling structures in search of a cure for disease
Polypeptide chains begin to fold into their final shape during translation, when the growing chain emerges from the ribosome, much like a piece of memory alloy wire can take on complex shapes when heated. However, as always in biology, things are not so simple.
In many cells, transcribed genes undergo extensive editing prior to translation, significantly altering the basic structure of the protein compared to the pure base sequence of the gene. At the same time, translational mechanisms are often enlisted with the help of molecular escorts, proteins that temporarily bind to the nascent polypeptide chain and prevent it from taking any intermediate form, from which they will then be unable to move to the final one.
This is all to the fact that predicting the final shape of a protein is not a trivial task. For decades, the only way to study the structure of proteins was through physical methods such as X-ray crystallography. It was not until the late 1960s that biophysical chemists began to build computational models of protein folding, mainly concentrating on modeling secondary structure. These methods and their descendants require huge amounts of input data in addition to the primary structure - for example, tables of amino acid bond angles, lists of hydrophobicity, charged states, and even the preservation of structure and function over evolutionary timescales - all in order to guess what will happen. look like the final protein.
Today's computational methods for predicting the secondary structure, working in particular in the Folding@Home network, work with about 80% accuracy - which is quite good, given the complexity of the problem. Data generated by predictive models for proteins such as the SARS-CoV-2 spike protein will be compared with data from physical studies of the virus. As a result, it will be possible to obtain the exact structure of the protein and, possibly, to understand how the virus attaches to the receptors.
Protein folding research is at the heart of our understanding of so many diseases and infections that even when we use the Folding@Home network to figure out how to beat COVID-19, which we've been seeing an explosion of late, the network won't be down for long without work. It's a research tool that's great for studying the protein models that underlie dozens of protein misfolding diseases, such as Alzheimer's disease or the often-inaccurately referred to as mad cow disease, a variant of Creutzfeldt-Jakob disease. And when the next virus inevitably appears, we will be ready to start fighting it again.
Source: habr.com