I'm sure it's more complex than I grasp as a layperson, but I'm utterly amazed at how simple this _appears_. I get the feeling that this is something I have a better chance of understanding than the average SaaS Terms and Conditions.
I expected to have to scroll through pages upon pages of indecipherable text. Instead it's no bigger than a large paragraph of text, and I can easily fit it on my screen.
The protein they're trying to manufacture is indeed quite simple - AFAIU both BioNTech and Moderna put together their sequences in a weekend. (Though there was a more involved process of winnowing down the sequences for the most effective ones.)
The technically challenging parts are:
- delivery mechanism: you need to take a very unstable molecule, protect it from the environment - both external, and when inside the patient - and insert it into a human cell. (This is called the "platform", and is usually developed independently from the specific payload.)
- manufacturing: both producing the mRNA itself at a large scale, and inserting it into the delivery mechanism, at a large scale and in low-temperature conditions
- testing: the newly-developed payload and the existing platform were integrated at small scales within weeks, but testing the thing for safety and efficacy took months
EDIT: As schoen pointed out, this was not actually released by Moderna, but reverse engineered by third-party researchers. Original text was: "Hence they feel safe releasing this. Their moat is not the gene sequence, their moat is everything else."
Reading the primary claim is fascinating: "A composition, comprising: a messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a betacoronavirus (BetaCoV) S protein or S protein subunit formulated in a lipid nanoparticle."
I have a "I'm sure that means something to somebody" feeling. It's also surprising that the remaining claims seem to describe the resulting bits of the sequence, and that that primary claim can stand on its own. Of course, I'm by no means an expert.
> I have a "I'm sure that means something to somebody" feeling.
Break it down! It's not so bad:
> A composition
A bunch of stuff
> a messenger ribonucleic acid (mRNA)
mRNA are cellular instructions on how to make proteins that are read by ribosomes that make those proteins as they read along.
> an open reading frame
This is something that starts with a "start codon" and ends with an "ending codon" and encodes valid instructions to make a protein between.
> encoding a betacoronavirus (BetaCoV) S protein or S protein subunit
The instructions refer to the spike protein of a betacoronavirus, or a fragment thereof, because this is what we want the immune system to pay attention to (and make antibodies to bind to and neutralize).
> in a lipid nanoparticle
The immune system gets pissed off about mRNA floating around, because that's one of the things that happens with active infection. So if you want this to get into cells and tell them to make your protein, you need to encase it so that it mostly escapes immune notice itself.
If you squint you could say that viruses are already doing that. A lipid bilayer is a key component of several kinds of membranes. All of our cells have a lipid bilayers separating the inside from the outside. The corona virus is also built from these same lipids, which is why it's vulnerable to soap.
Anyway, I think the important thing that the other commenter was saying is that mRNA needs to be carefully packaged to be medically useful. You can't inject pure RNA as a vaccine because not only is the mRNA going to be quickly degraded before it gets anywhere but if the immune system sees any RNA floating around by then it kicks itself into a frenzy because free-floating RNA is usually a sign that something nasty is afoot.
They have. That spike protein is part of their envelope ("nanoparticle").
The envelopes used in an RNA vaccine are generally simpler, because they're working under different constraints than viruses. For example, their envelopes don't need to be easily manufactured in a cell.
But some RNA and DNA vaccines do use viruses as their delivery mechanisms, eg the J&J COVID vaccine.
Doesn't this describe almost ANY vaccine - I think that it's probably bad public policy to allow anyone to patent ALL COVID vaccines - I think that patenting a particular vaccine (a particular mRNA string) should be allowed, but not effective wildcards in the RNA
No. It doesn't describe the viral vector vaccines (J&J/AstraZeneca). It doesn't describe the inactivated virus vaccines. It doesn't include the viral fragment (NovaVax) vaccines. And it wouldn't describe some possible mRNA vaccines because of differences in formulation or differences in targeting.
While this particular Moderna claim would likely affect BioNTech/Pfizer's mRNA vaccine, it's not clear whether it would survive in litigation, too.
As to a "specific string"-- if you could just pad a few codons onto the end and not be violating the patent, that's not too worthwhile.
This is a great animation of the life cycle of an HIV virus. It’s not exactly what happens with the pandemic virus but it gives you a good idea of the complexity of the process of viral reproduction (vaccine or immune response isn’t covered here):
Nice!!! I didn’t even notice that was uploaded! The differences between HIV and SARS-CoV-2 makes me wonder what reliability there is in this process, and if some viruses are more reliably able to enter the cell than others (presumably those that are more infectious?)
It blows my mind to see how life (and death) work at the molecular level--it's almost like some kind of manmade machine, but far more subtle and complex.
These might interest you then: 3d visualizations of cellular processes in real time. I was shown them in my intro to biology class, which filled me with the same interest.
It's been decades since I was in high school but I really hope these videos, or something similarly realistic and mind-bending, is in the modern biology curriculum. Learning about Darwin, Mendel, Watson and Crick and the experiments they did to develop an understanding of biology was informative, but it wasn't compelling to me. These and the work of Drew Berry and WEHI are just amazing:
I like the wehi videos because they take effort to make the molecular motions appear to be random, a result of stuff blundering about: https://youtu.be/7Hk9jct2ozY
Just remember that it's not really orchestrated. All of the molecules involve are kind of randomly blundering about and it's one-in-a-million collisions that are responsible for getting shit done (on membranes it's more like one in a thousand and on ropelike structures like dna or actin it's one in a hundred).
As a biochemist, that's one of the things that has kept me interested in the work - it's truly mind boggling what is going on at the molecular level every second of our lives.
I love reading articles around biology, micro/molecular biology in particular, and looking for references to agency in the text. 'selects', 'filters', 'checks', 'seeks', etc when the reality is that the whole thing is just a massive chemical reaction.
Yes.One must know all programming of molecules and all laws current in the universe. perfect hierarchy from atoms to the cells, from cells to plants, animals from animals to earth, from earth to stars
Coronavirus replication is pretty dramatically different from HIV replication-- coronaviruses are not retroviruses, and do not have a step where viral RNA is converted into DNA and integrated into the host cell's genome. Instead, coronavirus RNA is directly interpreted by the cell's ribosomes to make the proteins that ultimately build and comprise the replicated viruses.
The mRNA vaccines work in much the same way-- it's just that the mRNA vaccines only include the code for the spike protein and not the rest of the virus's machinery. So you get the vaccine and your body produces a bunch of spike protein by itself, which gives your immune system the opportunity to learn how to identify and react to the spike protein before it sees it on a real virus.
Claims are read in the context of the body of the patent and generally known and/or cited knowledge in the field in question. As long as you define precisely what you mean by your terms in the body, you can be as succinct as you want in the claims.
I think that's the question. I'm very much used to reading "A machine-readable medium, comprising..." I'm curious what bits are unique to COVID-19, and what bits are generally protecting the idea of using a carrier to send a specific protein. In this case, is it the "S" protein phrasing that protects the specific embodiment of COVID19's spike?
If it's just the spike protein encapsulated in a lipid nanoparticule (for isolation and transportation?), that looks like something not creative and quite established for people in the field of genetic material transport.
My layperson's understanding is that the actual spike protein and/or mRNA are modified from the natural versions. Both for stabilization (if either falls apart quickly, they're of no use) and for response optimization.
So somewhat like how a fishing fly differs from the insect it represents.
meh, I could do that over a weekend never sounded so scary, or impressive at the same time. That weekend just so happened to stand on the shoulders of prior decades of research though.
i guess this is big pharma's version of `apt-get install`
Not only did Moderna have a decade of experience with mRNA as 'drug', but the mechanism of coronavirus infection was well-understood from SARS research, namely the importance of the spike protein. All the parts were in place and just waiting for the specific SARS-CoV-2 sequence. They designed it as soon as they had access to the Wuhan sequence.
Modern a has been working on mRNA since 2010 and mRNA vaccines since 2012. They have the process down pretty solid, but vaccines do not bring in the bacon.
Now that they've spent a year developing the mass-production methods and infrastructure, it might bring in the bacon! e.g. there's a steady annual market for a flu vaccine for whatever the latest strain is, and being able to get that to market faster than the competition could give them an edge.
I just don't think it was every profitable enough for them to put in this enormous capital expenditure.
Yes, really. Typical vaccines are like $5 to $200 or so. And worst of all, usually just one or two doses.
For all the horror HIV has wrought, global spending on vaccine development for HIV has been around $1 billion a year for the last few years. In contrast, the USA federal government spends $3 billion a year for HIV antiviral drugs for low-income Americans. $20,000 per patient per year for life. Unsurprisingly, new antivirals are where most of the research is.
It can sound almost like a conspiracy if I put it like that, but it's just the economics incentives. Especially since the developed countries where most of the market for charging a decent markup is, have the smallest market for most new vaccines, while having the largest markets for therapeutics for chronic conditions.
Now HIV is genuinely devillish to develop a vaccine for despite our attempts. But vaccines for hepatitis C, gonorrhea, HSV, among others appear to be possible. We almost certainly could develop effective vaccines for these with existing techniques, if someone coughed up the funding. Maybe all the buzz about mRNA vaccines will spur some progress here.
> It can sound almost like a conspiracy if I put it like that, but it's just the economics incentives
Talk about market failures! It's completely obvious that this economical system is not placing the good of the whole human species as its first priority.
Yes, clearly. With COVID-19, there is pretty much guaranteed market for about ten billion doses. Along with direct investment by governments in wealthier countries. Most people, including politicians, want a COVID-19 vaccine real bad.
The parent poster was describing the situation with vaccine development in general, to which COVID-19 is quite the exception. A potential hepatitis C vaccine for example has very different economics, as it would not be deployed anywhere nearly as widely or quickly. Consider that, 40 years after hepatitis B immunization became available, the majority of Americans haven't been jabbed with it.
Yes, but a pandemic only comes around every hundred years or so. Moderna happened to be in the right place at the right time for this one, but delivering vaccines for a pandemic is not much of a solid business plan.
Current pricing for mRNA vaccines is something in the $4-7 dollars (that's $7.00 dollars, not $7,000.00 dollars) range. Compare that to one of the Hepatitis C treatments, which costs north of $350,000 by several accounts. Even remdesivir is something like $3000 for a course.
There are roughly 75 million HCV infections in the world.
That translates to a total cure cost of:
75M * $350k = $26.25T
There are other ARV treatments available which cost now roughly $50-100k and cure in 3 months.
Whereas immunizing everyone against coronaviruses currently costs:
8B * $7 = $56B
Clearly, the costs of the HCV cure are predatory and unreasonable because it doesn't lead to eradication and it's inaccessible to the poor and the third-world.
The true unit cost is probably $6 per dose. The COVID vaccines may break-even short-term. Long-term, it's probably worth keeping an extra 120M potential customers alive for what will probably result in a small profit.
The true unit cost of HCV cures is unknowable but possibly half of the current price.
There was already good work previously on SARS-1 and MERS spike protiens for use in vaccines. This is what enabled the "in a weekend" speed.
https://pubmed.ncbi.nlm.nih.gov/28807998/
from what I've gathered, the rate limiting step for production as of yet, is creating the lipid vesicles and getting the RNA inside of them. Only a few companies have a process for this, and the supply chain for the precursors is limited as well.
RNA would get thrashed by your immune system if it isn't encapsulated by something: liposome deliver of therapeutic RNA is really next-generation tech, and the fact that the RNA does what it's supposed inside the your cells is no small feat either.
From the CDC "ingredients" for the BioNTech/Pfizer vaccine, along with cholesterol (which modulates the stability of lipid membranes), they report using this molecule, which would form a phospholipid bylayer, just like our own cells use: https://www.sigmaaldrich.com/catalog/product/avanti/850365P?...
They're both encased in lipids. Pfizer didn't have long term data on long-term storage at standard freezer temperatures, but has since confirmed their vaccine can be stored at similar conditions to Moderna.
> delivery mechanism: you need to take a very unstable molecule, protect it from the environment - both external, and when inside the patient - and insert it into a human cell. (This is called the "platform", and is usually developed independently from the specific payload.)
Of note, the immune system is pretty good at destroying foreign mRNA so you also need to evade it.
I wouldn't even say the immune system, your body has a ton of nonspecific RNA-digesting enzymes floating around to patrol for exactly this sort of thing happening, even by accident, as cells can sometimes rupture. It's a problem enough that good RNA researchers have a reputation of being clean freaks. Some RNA labs I've been in had a lingering, slightly sweet smell, that's the nonspecific RNAase inhibitor that gets sprayed on everything.
RNA is also just generally fantastically unstable and reactive. You don't want any surface to be too alkaline, for example. There's a reason that basically every life form switched to DNA.
(Though RNA may have been more stable under the high-UV-exposure conditions the early Earth.)
> - delivery mechanism: you need to take a very unstable molecule, protect it from the environment - both external, and when inside the patient - and insert it into a human cell. (This is called the "platform", and is usually developed independently from the specific payload.)
The most amazing thing is that now that the platform is proven secure in dozens of millions of people, it should be be very easy and fast to get approval for other payloads. Biontech for example wants to go after cancers - a platform that can deliver payloads targeted to an individual's cancer is nothing short of a game changer in cancer treatment because the current standard of blasting the patient's body with a lot of highly toxic chemicals is arcane compared to letting the body's immune system do the cleanup.
Even if the platform is safe, the payload itself needs to have its safety proven. Remember, the payload is just instructions, and those instructions make your cells pump out oodles of arbitrary proteins. That in itself can cause health problems. see e.g. the AstraZeneca vaccine's safety issues, which were caused IIUC by immune responses to the manufactured proteins. DNA vaccine, not mRNA, but the principle is the same.
re: cancers, that is actually what this technology was originally developed for! Moderna has been spending about a decade getting this tested and proven out for the cancer role, and they're quite close. From my quick reading of the literature, there seems to be some regulatory confusion about how exactly to run approval for this kind of personalized drug design (testing the method of generating the individual drugs?), but the bar is usually much lower for cancers with high mortality rates.
> Hence they feel safe releasing this. Their moat is not the gene sequence, their moat is everything else.
One or more of the vaccine developers may have released such details, but this particular file is a reverse engineering effort by unaffiliated scientists based on analyzing the dregs of used vaccine vials (!).
> but testing the thing for safety and efficacy took months
What kind of tweaks were made from "the version they threw together in a weekend" to "the version that is in production now"? What's a typical "mRNA" feedback iteration loop like?
I'm not sure if there were changes in the sequence at all necessary during testing, however if you align the sequences given in that link from Biontech and Moderna, you see they encode the exact same protein (which is of course necessary). However the RNA sequence contains quite a few differences between the companies, they often use a different codons. This could be to make the translation more efficient, and can be a thing to optimize.
Almost certainly - e.g. the J&J vaccine (a DNA vaccine, not mRNA, but same principle) is using a viral delivery platform that they'd had sitting on the shelf for years and have used for other vaccines.
After the massive capex that has gone into mass-production of encapsulated mRNA delivery systems, I suspect this new technology will be very cost-competitive for the big markets like the annual flu vaccine.
- delivery mechanism: you need to take a very unstable molecule, protect it from the environment - both external, and when inside the patient - and insert it into a human cell. (This is called the "platform", and is usually developed independently from the specific payload.)
Sounds like a problem you solve once and for all, for any vaccine. And also that this problem was already solved since decades (e.g viral vectors)
- testing: the newly-developed payload and the existing platform were integrated at small scales within weeks, but testing the thing for safety and efficacy took months
And so many people have been killed by this overly conservative testing, phase ~<2.5 was enough
> Sounds like a problem you solve once and for all, for any vaccine.
I strongly doubt it. It's more like a problem you solve once for a particular class of payload and particular destination. Biology doesn't do packet switching - everything is just rapidly bumping into everything else at random, so your envelope needs to be designed in a way that's ignored by everything else than molecules at your target site, and it needs to not react with the payload it's carrying.
> And so many people have been killed by this overly conservative testing, phase ~<2.5 was enough
Overly conservative? That's what super-accelerated testing looks like. We're lucky it went well; had they screwed up, it would scare a lot more people away from vaccinating, lengthening the pandemic and increasing death toll.
a bad vaccine could kill much more than that. remember that RNA vaccines are developed for 10+ years now and COVID19 is the first time they actually worked without side effects.
Liken it to the 4kb demoscene: it's amazing what can be done with a little bit of information, as long as you don't have to describe the machine running it.
Or the distribution method, or even really invent the thing, since you're mostly just copying someone else's work. Plus it doesn't have to even do anything. In fact, doing anything might be a problem, so best to just sit there and look menacing (and spikey).
Getting it designed and building it is more difficult.
At its core, it’s a piece of mRNA that creates a protein. That code gets transcribed into a protein (often those are relatively short). That protein then triggers your bodies immune response, which trains it to attack covid19.
Inject this mRNA into a cell and it’ll create the protein. Anything can be injected at this point once the mechanism for injection is developed
Which makes me wonder. Could you place the entire virus genome in these liposomes and get them to hijack the machinery to make an entire virus? Like plasmid but for viral structures?
Yes, that's one of the concerns many have about this technology. Literally, anything can be injected and done at this point.
Not sure where the technology is exactly at, but I suspect we're no more than 5 years from major incident related to this.
Even this vaccine, we really don't know the long-term impacts or risks involved with this. For instance, this vaccine does appear more risky than the standard flu vaccine:
Presumably this is due to increased inflammation. It's not hard to imagine that we'll be doing genetic editing soon enough with this (if we aren't already).
Do you happen to have a more verifiable source for your claim that COVID-19 vaccines are more risky than the flu vaccine?
Excerpt from the disclaimer in your source:
> VAERS accepts reports of adverse events and reactions that occur following vaccination. Healthcare providers, vaccine manufacturers, and the public can submit reports to VAERS. While very important in monitoring vaccine safety, VAERS reports alone cannot be used to determine if a vaccine caused or contributed to an adverse event or illness. The reports may contain information that is incomplete, inaccurate, coincidental, or unverifiable. Most reports to VAERS are voluntary, which means they are subject to biases. This creates specific limitations on how the data can be used scientifically. Data from VAERS reports should always be interpreted with these limitations in mind.
Give the incentives no, however it should be noted VAERS is only going to undercount not over count. Physicians are required to fill it out if there’s adverse side effects at the hospital.
Wish there was more but the incentives aren’t really aligned for open research on this.
We can get the real cdc idc10 (billing) results in several months.
How do you edit genes with an mRNA vaccine? You'd need DNA, enzymes (maybe requiring post-translational modification) to splice them in, etc.
Also, you might not even be able to print full viruses with this platform. manufacturing mRNA is different from manufacturing all the random types of RNA in a virus, isn't it?
Sequencing technologies have improved immensely over the last decade and a half. And, in this particular case, getting the sample RNA is incredibly easy, since its purity and integrity in the vial is quite high.
I didn't look at the details of how they sequenced it, but given that there are chemically modified bases in the mRNA vaccines there is a chance the normal methods for sequencing (and the first step of translating to DNA) don't work. Well, I guess in practice they did.
While not completely equal to the naturally occuring bases, the modified bases in the vaccine mRNA need to be able to complement to the non-modified ones present in tRNA anticodons during translation. If they can pair to their corresponding natural bases, then the chemically modified RNA can be also used as a template by the reverse transcriptase to generate the complementary DNA needed for the sequencing reaction.
Generally I agree, but it could be the case that the modified bases work just well enough for tRNA matching in the ribosome, but not with the reverse transcriptase.
The mechanism of base complementarity is identical in both cases. If a modified uracil complements an adenine in tRNA, it will complement an adenine in the RT primer or an adenine being added to it.
I think it's a bit like a private key- the difficulty is in finding some combination that works in an absolutely massive space of possible proteins, not necessarily in the length of the protein.
"but I'm utterly amazed at how simple this _appears_."
Biology is a funny old thing. You can look at that concise description - the orange and so on blocks of a few letters and a few short groupings.
Now ATCG are basic building blocks but they consist of quite a lot of stuff. I think it's a bit more complex than that because this is RNA not DNA so ATCG might not be quite right. Each of those bases are horrifically complicated depending on scale. Search "ATCG" - this is a good start: https://en.wikipedia.org/wiki/Nucleobase
Now dive into one of those bases and decompose it to its constituent atoms. Now look at the maths around this stuff. It gets quite complicated, quite quickly.
That said, the fact that a bloody complicated thingie can be described so concisely is absolutely amazing and as you say it looks so simple.
> This is somewhat of a problem for our vaccine - it needs to sneak past our immune system. Over many years of experimentation, it was found that if the U in RNA is replaced by a slightly modified molecule, our immune system loses interest. For real.
> So in the BioNTech/Pfizer vaccine, every U has been replaced by 1-methyl-3’-pseudouridylyl, denoted by Ψ. The really clever bit is that although this replacement Ψ placates (calms) our immune system, it is accepted as a normal U by relevant parts of the cell.
In case others don't know this, the reason this is abbreviated Ψ (psi) is that Ψ is the first letter of Greek ψευδής 'false, lying', the origin of the prefix pseudo-.
It's part of an instruction to cells to make something. Viruses replicate by instructing cells to make viruses. Our cells don't know how to make Ψ, so the replicated virus would have the normal instruction.
No. For a number of reasons. First of all, the virus uses the nucleotides (A, G, C and U, for wich Ψ is used as a substitute) produced by the attached cell to create a copy of it's genome (RNA). The nucleotides are produced by the cell, the virus does not instruct the cell to produce them. It just tells the cell to produce and assemble the proteins AND the RNA.
Second, our immune system doesn't just attack and recognize free floating RNA, but the virus itself. And different parts of the virus. First and foremost it will recognize the surface proteins (like the spike protein) because those are the things that it can see while the virus is outside of the cell. Also these are the things that the infected cells present on their surface (MHC II sites, if I'm not mistaken) to the immune system. (As far as I can understand, cells have to present the proteins they produce to the immune system otherwise they get killed. If they produce alien virus proteins that get recognized by the immune system, they also get killed.)
Interesting enough, the immune system somehow also recognizes the so called nucleocapsid protein, which is the one used to wrap the viral RNA inside the virus. (But it gets produced by the cells, so I guess they get presented on the cell surface so the immune system can learn to recognize and counter them.) I didn't look into the details too much, but as far as I can understand it's not clear yet how those antibodies (the ones created against this protein) work, because antibodies are supposed to be used outside of the cells, but the nucleocapsids are only present inside the cells and then inside the virus.
To sum it up: the immune system is much more complex, has several recognition mechanisms, the viral RNA is mostly packed into the viruses (or are inside the cells) and the viruses don't have any way to produce Ψ (or any of the other nucleotides).
> I didn't look into the details too much, but as far as I can understand it's not clear yet how those antibodies (the ones created against this protein) work, because antibodies are supposed to be used outside of the cells, but the nucleocapsids are only present inside the cells and then inside the virus.
You are correct, the antibodies are made and they end up not recognizing the nucleocapsid protein while it is in the virus but when the infected cell displays internal proteins with MHC I, which helps T-Cells target the infected cell. MHC I displays self and MHC II displays proteins that have been "eaten" by the surveillance cells of the immune system.
Maybe I'm thinking too simplified here, but wouldn't this only work on the first iteration? After all the virus would replicate with Us in your cells and then the replicas wouldn't have the advantage anymore.
But that's what I mean, viruses need to replicate. They do this by injecting their RNA into your cells and hijacking your ribosomes for their replication. So the first viruses would definitely get past your immune system, but the replicated viruses would then be produced by your cells, so they wouldn't have the U->psi replacement that the first generation had. So every subsequent generation of the virus could be fought by your immune system. Effectively giving the virus a head start of one replication cycle. I'm guessing this wouldn't change much. But I'm not a biologist.
1) As someone else pointed out, this molecule substitution would not persist during replication. New viruses being produced in your cells would be made with a normal "U".
2) Your immune system does not usually attack the RNA housed inside a virus, but rather protein fixtures on its "body".
"Many people have asked, could viruses also use the Ψ technique to beat our immune systems? In short, this is extremely unlikely. Life simply does not have the machinery to build 1-methyl-3’-pseudouridylyl nucleotides. Viruses rely on the machinery of life to reproduce themselves, and this facility is simply not there. The mRNA vaccines quickly degrade in the human body, and there is no possibility of the Ψ-modified RNA replicating with the Ψ still in there. “No, Really, mRNA Vaccines Are Not Going To Affect Your DNA[2]“ is also a good read."
As far as I could tell, this would work well for getting a synthetic virus into the human body, but without the necessary mechanics within our cell, the special Ψ chemical won't be reproduced by the virus. That'd mean the replicated virus would get snatched up by the immune system as soon as it'd get released from the cell.
Theoretically, a complex enough RNA string could be used to have our cells build the necessary cellular machinery to properly reproduce the virus, but that's a kind of altering DNA that's a whole different can of worms. There's probably cheaper and easier way of defeating the immune system, for example by simply "enhancing" ebola or HIV to make them more infectious and more resistant to our current drugs.
RNA is genetic material, but it encodes instructions to make proteins, which form the physical shell of the virus crucial to its function. As a very rough analogy, the RNA is source code and the proteins are the compiled program.
It's often the protein molecules that the immune system learns to recognise and attack.
RNA vaccines work because your body automatically translates them into some recognisable part of the viral protein, and then develops an immune reaction to that.
If a virus had Ψ instead of U in its RNA, it's still going to be making the same type of proteins. I can't see why it would be more likely to evade an immune response.
We are really quite fortunate that there was a ton of work done on coronaviruses, mRNA vaccines, adenovirus vaccines, etc prior to the pandemic. It seems like a pandemic even a year or three prior would have made the vaccine rollout considerably slower.
No they have allergies to the polyethylene glycol PEG compound in the lipid nanoparticles. It is also used in skin creams, toothpastes, condom lubricants and in larger quantities as a laxative. Some people are just allergic to it.
Any individual protein doesn't seem that complex since it's just a combination of some 20 amino acids, but the variations are endless:
"Since each of the 20 amino acids is chemically distinct and each can, in principle, occur at any position in a protein chain, there are 20 × 20 × 20 × 20 = 160,000 different possible polypeptide chains four amino acids long, or 20n different possible polypeptide chains n amino acids long. For a typical protein length of about 300 amino acids, more than 10^390 (20^300) different polypeptide chains could theoretically be made. This is such an enormous number that to produce just one molecule of each kind would require many more atoms than exist in the universe."
proteins are also unique in that not just their sequence matters, but also their physical shape. 2 proteins can have the same sequence but a different physical shape, and therefore have different impacts on the body's chemistry. I started a PhD researching DSP methods for matching protein sequences and locations of amino acids. Fun stuff.
Then there are also post translational modifications, like addition of acetyl or phosphate groups, and sugars to the protein (glycoproteins).
I mean, I can understand how an eye or a brain can evolve by natural selection, but I’m still stunned by abiogenesis. I guess we’ll never know for sure how it all started.
People tend to think of genetic code as a sort of assembly language which is very verbose, but I wonder if the correct way to view it is in fact a very terse domain-specific language, because it actually depends on the entire complex machinery of the cell to be present in order to work, which in itself contains a lot of information?
> I wonder if the correct way to view it is in fact a very terse domain-specific language
Honestly, na. It's pretty verbose. There's a lot of weird ass things in there like "Skip basepairs until you find the matching terminating sequence" (I think it's AG .* GA but its been a decade since my bioinformatics course), but you still have to include the non-AA-coding basepairs in the middle of that.
Compensating for that is the fact that there are like, multiple independent programs; if a ribosome is offset by a single base pair, the result is entirely different. If it runs the other strand, the result is different. And instead of crashing like any program would, biology just learns to use all of those possible encodings. In part, this works because there are 64 possible codons but only 20 amino acids, and the redundancy allows a substitution to affect only some of the offsets.
Yes. Another important metaphor is that the common idea of DNA as blueprints is entirely wrong. It's not blueprints, it's a recipe. A blueprint describes what something is. A recipe describes the steps needed to make something, making use of a lot of complex existing machinery and parts with only a reference to them.
The nucleotide sequence is obviously important, but people also sometimes forget that DNA and RNA are real things with 3D structure too. That matters too: it’s as if builders make errors where the blueprint rolls up or pages stick together.
The whole thing is absolutely fascinating and wild.
Interesting reasoning. But isn't it true to say that the "complex existing machinery and parts" which interprets the DNA was itself put together from instructions found in other DNA? I suppose that metaphors are rarely entirely comparable.
Some of that machinery and parts isn't directly represented by DNA. As an example, DNA codes some proteins that help extend cell walls, but those only work if you already have cell walls. If you have only the full DNA for a cell, and no other knowledge, you cannot build that cell out of that.
Heh. Technically, there isn't even GATTACA in there since it's RNA and hence all the T's are actually U's. It's just convention to use the T's. GAUUACA doesn't have the same ring to it.
I'm estimating roughly 90-ish characters in a row, roughly 40 rows encoding the spike protein. So about 3600 base pairs. There are 3 base pairs per amino acid, so That's 1200 amino acids.
For comparison, the smallest chain that they technically call a protein is 100 amino acids that's an arbitrary limit to separate proteins from enzymes. So this thing isn't tiny tiny.
But Titin (also called connectin), a giant protein responsible for passive elasticity in mucles, is ~27,000-35,000 amino acids. So this thing isn't even close to the biggest proteins out there.
> that's an arbitrary limit to separate proteins from enzymes
Do you mean “to separate polypeptides from proteins”? Enzymatic activity has nothing to do with size. For example, one of the smallest enzymes in humans has 62 amino acid residues. And, under certain conditions, even single amino acids can be catalytic.
But yeah, the polypeptide-protein threshold can get fuzzy, especially with the recent advances in miniprotein characterization.
yes, that is what I meant. It's been a long time since I've used that info.
The story I remember was that Insulin was the first protein that was sequenced, which is funny because it was before they made the distinction. It's actually too small to be considered a protein now.
the way I see it we're just at the beginning, and we're mainly copy/pasting a lot of code, we understand some small parts, and generally in the teenage years of genome programming.
I don't know how long it will be before we get a bit more serious with it, but geneticists have a big obstacle in their understanding, any change might needs a thousand strong lifelong population study to be understood. That's way crappier than dumping the assembly or only having the documentation in Chinese.
I will add that moreover the developers might have been even more conservative in their code because they knew it was going for large scale deployment, they probably avoided the cutting edge as much as they could.
Great quote from Maurice Hilleman, creator of many (most?) of our childhood vaccines goes something like “Don’t be smart. Instead be careful and accurate”
Lots of these things aren’t complicated. It’s the careful systematic testing and public trust building that’s the hard part.
The genetic code itself is reasonably comparable to ASCII in complexity - every 6 bits is the code for one amino acid in a string, which will fold itself into the required protein.
I remember a lot of features and especially bug fixes where I had to change one line of code, it took hours to figure out how exactly though.
I guess this is kinda similar?
The New York Times published an article last year with the entire genome of the SARS-Cov-2 virus, with a breakdown of different sections to explain what protein the RNA codes for and what that protein does. Like you said it was amazing that it all fit within an [albeit long] newspaper article. It doesn't surprise me that the RNA for the vaccine, which only targets a single protein, is even smaller than that. Here's the NY Times article I was referring too:
It appears simple, but a whole lot of work went in to producing that string even pte-COVID. Some of it is generic in the sense that it might apply to any mRNA vaccine. Some is quite specific:
There’s also (IIRC, no citation right now) prior work suggesting that coronavirus vaccines against the spike are likely to be effective and that vaccines against the N protein might be counterproductive.
True, most pharmaceuticals can't do it now but given the right knowledge, which is known, it can be done relatively fast. I suspect in the next few years there will be many companies that will be able to replicate and advance the process.
It’s like looking at the binary file and saying “that’s pretty simple” while ignoring the massive amount of machinery that allows us to run that file and use it (CPUs, Motherboards, computers, etc).
I presume a whole bunch goes into making vaccine and this is just the top of the iceberg.
In the case the bioreactor “compiler” is actually our own cells which read out the mRNA “source code” and translate it into protein. The lipid encapsulation delivers the mRNA to our cells, so perhaps it’s more analogous to a network protocol that delivers source code intact across firewalls and other defenses.
"The authorization server is down, no longer signing this version of this medication, or this medication has expired. Please contact your supplier for more information."
Shit, I'm going to have to google for a working hex edit or look for a bpatch.
I absolutely do not have a link, but I remember reading that the lipid nanoparticles are actually created by mechanical action (possibly fluid dynamics/turbulence). I thought that was pretty neat.
My first thought was `wdiff pdizer moderna`. It's short enough to post here in its entirity, but I guess I had better not, anyway it's easy enough to extract from the pdf. Add a space after every letter and wdiff can find the common sequences nicely.
Short except for flavor, this is from near the beginning:
Knowing nothing about biotech – if Moderna and Pfizer were working from the same sequencing data, why would their resulting vaccine mRNA sequences be different? Even slightly?
Edit: I guess what I'm asking is: presumably these vaccines both target the spike protein. Do both of these sequences express the same protein? Or is there a "close enough!" thing in the immune system, where it can be a little different and still be targeted by the immune system?
The sequence can be changed and optimized for several reasons:
* There are untranslated regions (UTR) that could influence the regulation or stability of the mRNA.
* Since most amino acids are encoded by more than codon, the coding region for the spike protein can be codon optimized. Altering the codon composition can improve protein expression.
* Likewise, enrichment of G:C content in the mRNA sequence might result in increased mRNA and expressed protein yields in vivo.
> Do both of these sequences express the same protein?
In this case both vaccines express exactly the same amino acid sequence.
> Or is there a "close enough!" thing in the immune system, where it can be a little different and still be targeted by the immune system?
It depends on how different the sequence is. For instance, if it is a little different the immune response should be very similar because, for example, the three-dimensional conformation of the spike protein chain should remain very similar as well. This is why the vaccines can be effective against several SARS-CoV2 variants.
Sequences are different because they are differently codon optimized. See https://en.wikipedia.org/wiki/Codon_usage_bias, especially "Effect on transcription or gene expression" section.
That is a super interesting article, thank you for posting it!
But, I guess my question is more about why the abstraction of "protein chunks" doesn't fall apart when there are relatively significant "diffs" in the RNA sequence.
The most significant diffs between both vaccines occur in the untranslated regions located around the protein coding sequence and will never be present in the actual spike protein.
Regarding the protein coding region, because of the degeneracy/redundancy of the genetic code, all changes within it are synonymous and code for identical amino acids.
That is a fascinating read (and the perfect level of depth in this field for me). How did you happen across it? Always looking to add a good source to my RSS feed list
Excel finally has a facility for manipulating data that keeps it where you put it. It also incorporates a fairly decent functional programming language. It's called Power Query, not to be confused with all the other things that MS has named starting with "Power" and have no relationship at all and are mostly awful.
The only real annoyance I have with it is that the editor window is modal, like it blocks all the spreadsheets you have open on your machine, and it's primitive even compared to VBA, especially for debugging.
It's not just that it's given me the experience of "this is the way a spreadsheet or BI tool should work" but also "this is the way SQL should work". It's a little cumbersome to do the standard SQL-type operations, but the clean integration of functions means you can implement anything that's missing. Like say, Oracle has grouping sets - you can, and I did, just write a function to do that. I always felt that having a separate procedural language in your database was wrong, but I'd never seen the alternative until now. And I've been falling in love with higher order functions.
Power query is one of the best things to be added to Excel in recent years. I especially like how it makes import/ cleanups easier to reproduce vs the old ways.
Exactly. FAIR (Findable, Accessible, Interoperable and Reusable) principles are at a loss here [1]. The "Reusable" part seems to be especially problematic as the sequence is buried in a PDF file though all aspects of FAIR are compromised here. Edit: It looks like there is now a PR to address this issue [2]
Things are getting better, but it still so so bad. The funny thing about that Nature article is that I recently had to parse a html table from a recent Nature article. Thankgod pd.read_html did a decent job and I then only needed another hour to hunt down all the typos and weird text issues.
Despite how complex this really is, and how many "gotchas" there might be when using this repository, it's nice that it gets a shitload of attention. As a united humanity we should strive to solve our common problems.
The convention of genomic research is to present all RNA sequences as equivalent cDNA sequences. As this will be the output of most common sequencing platforms.
DNA is way more stable than RNA. Since you can easily synthesize RNA from DNA, and DNA synthesis technology is much more mature, folks normally synthesize DNA and then derive/make the RNA from it. That makes most researches default to DNA 5' to 3', even when talking about RNA.
Note that independently of the notation used the mRNA of those vaccines use even more "weird" bases, such as 1-methyl-3’-pseudouridylyl, to make the vaccine mRNA not be detected by the immune system [1].
DNA uses base pairs [A,T] and [G,C], this code is for a piece of DNA,. if you keep a DNA sequence in vials for later use, that is much more stable and easier to manipulate, and repair when corrupted.
normally RNA in vivo is complexed with protiens that prevent RNA from folding, and annealing into structure that is not compatible with translation to protien. In the vaccine this isnt happening, this is why RNA is hard to work with and the vaccine must be kept so cold.
This is not to say that DNA is simple to work with, but it solves problems if you dont need direct access to RNA.
RNA uses uracil/uridine rather than thymine, but uridine is actually quite immunogenic. That's what has prevented people from using mRNA as a therapy until recently, when the founders of BioNTech figured out that they could use pseudouridine (abbreviated as Ψ) instead. See [1] for more information.
Wow Looks like it is analogous to having a header on a TCP packet. [0] Here is an animation of mRNA encoding translated to proteins inside a ribosome. [1]
"The ribosome is composed of one large and one small sub unit that assemble around the messenger RNA, which then passes through the ribosome like a computer tape. The amino acid building blocks, that's the small glowing red molecules, are carried into the ribosome attached to specific transfer RNAs; that's the larger green molecules also referred to as tRNA. The small sub unit of the ribosome positions the mRNA so that it can be read in groups of three letters known as a codon."
Some parts of gene transcription are so straightforward one can almost be tricked into thinking it has the logic of a computer program. It may be an illusion. To stretch the metaphor, TCP parsers don't match probabilistically along the entire length of the packet in parallel, and they don't interpret the same part of a packet as data in some contexts, and a header in others.
I ended up majoring in biochemistry and molecular biology in my undergrad because I was browsing on Wikipedia one day and came across an article written on an E. Coli variant that had sentences like:
01J3 e. Coli has a DNA Polymerase that contains 3k’-5’ proofreading capability and 5’-3’ error correcting with a polymerisation rate of 50bps
I’ve made the above up because I have never been able to find a Wikipedia page winxe that as succinctly pointed out to me that biology was a machine and I was hooked
A given combination of 7 bases has a probability of occurring of 1/16,384. Since the COVID genome is about 22k bases long I guess you have pretty good chance of it appearing in there somewhere. This assumes uniformity, which of course is not true. COVID’s genome is under crazy intense selection pressure!
Yep, the usual coding tools aren't ideal for bioinformatics. We have our own set of tools that work well with the various "standard" formats for sequence data.
If I remember correctly "solving" protein folding was essentially some high probability prediction that state A transform to state B with some reasonably high chance, on a big dataset. Or something like that anyway. It's as far from high level work with genetics as creating nanotubes a few molecules long in lab manually is away from industrial production.
The most fundamental reason for that is that it's just not amenable to human mind. We are quite primitive actually, being able to hold only a handful of "things" in our mind at any one time and relying on abstraction to think of more complex things. However, you can't abstract much in biology; there is no locality or separation of concerns, everything affects everything.
Take that piece of RNA. An intuitive mental model is that it's some form of "instruction" or a bunch of instruction, isn't it? It's also wrong, because it just encodes a protein that acts the way it does only because of its shape (that is, one of its potential energy local minimums) and the shape of other proteins around it. That shape is only weakly local, it can be affected by far-away sections of peptide sequence. So it's almost impossible to systematically break it down, you have to consider and model things as a whole , which is insanely complex both computationally and cognitively.
If you want a good mental model of how it works, imagine you assemble a thing from metal balls and springs. You take a few thousands balls and connect most of them with springs of different strengths. You then take this thing and throw it on the floor; it will assume a shape that is implicitly encoded in spring strengths, its environment, and the way you've assembled it. You can even make it change shape if you poke on it the right way. That's how biology works in a nutshell; it's a nightmare to design anything for systems like that. Again, you can't simplify and break down and encapsulate and abstract like you do in programming.
My thought exactly. So this thing is like a VM with a bunch of primitive opcode, why can’t someone write a higher-level language or at least some gadgets
The problem with trying to program genetics is that there is a bunch of code already running on the system and every variable is a global. You can't just start up a new program with minimal impact on the stuff that is already running, like you can in most human-made computers. Also don't forget that the extremely simplified version of the running system looks like this: https://www.sigmaaldrich.com/technical-documents/articles/bi...
I’m a little confused by the title? Looking at the document, it seems to me (knowing next to nothing about this field) it includes both Pfizer and Moderna’s protein spike sequence in figures 1 and 2, respectively. Is that correct?
It’s also interesting the way it’s worded: that the sequence was “assembled from $vaccine”. Does that mean whoever published this has backed into these sequences rather than having gathered this information directly from the source(s)?
You are correct. The researchers here sequenced each vaccine starting with the bit of vaccine left in the vial after administration. The goal was to get a raw sequence of the Moderna mRNA component so it can be easily filtered out as being a signal of therapeutic origin. Pfizer's sequence has already been published; it's incldued here to confirm that the result achieved experimentally matches the published sequence.
And reverse engineering only sounds dramatic until you take a step back and acknowledge that it's what they literally do all the time. Only that usually the sequences they read are not the outcome of some human development effort but of naturally occurring evolutionary processes.
Sure, but if you took that 30k of data and dropped it on a planet just like earth it would still take 10k years or so for us to build civilizations as we know it again.
Not 10k year as it needs to go through the million years scale - rna, hot, uv then dna ... with no oxygen to oxygen etc. Then million of years of evolving ... scale is a bit off.
Yep exactly, but also that's a fun problem to think about how long it would take if we sent our DNA on an asteroid/space probe to another earth-like planet. :)
we wrote some code last year to build a big Trie of the whole transcriptome -- you could use it to fuzzy-search to see if this mRNA is within some edit distance of any piece of normal human RNA, because then it could theoretically cause side effects via RNA interference. stopped the project because I can't afford to develop a gene therapy right now, but the fuzzy search worked
to make the trie use the function here. the variable K is the length of the Kmers (runs of RNA). Larger values are gonna take a lot longer. ( warning: big job, uses multiprocessing...pypy recommended for speed )
https://github.com/bionicles/coronavirus/blob/b6f0db9dd8aaf7...
What are the purple and blue sections after the stop codon for? I read a little about the 3' region, but for the vaccine, are these sections taken from a particular natural human sequence, or specially engineered for something else?
The 3' and 5' untranslated regions are the parts of the mRNA directly before and after the part that encodes the actual protein. So they are themselves not translated into amino acids.
What they actually do can vary, but essentially they can provide places for other things to bind and influence what happens with the mRNA. There are some fancier cases like riboswitches, but you don't see those in humans. The stuff at the start and end of the mRNA also determines stability of the mRNA.
> The injection contains volatile genetic material that describes the famous SARS-CoV-2 ‘Spike’ protein. Through clever chemical means, the vaccine manages to get this genetic material into some of our cells.
> These then dutifully start producing SARS-CoV-2 Spike proteins in large enough quantities that our immune system springs into action. Confronted with Spike proteins, and (importantly) tell-tale signs that cells have been taken over, our immune system develops a powerful response against multiple aspects of the Spike protein AND the production process.
What happens to the "volatile genetic material" at the end of this? Does it just linger in the body indefinitely? Or does it somehow get destroyed (and what does that mean)? From my reading of the above excerpt, it's the produced spike proteins that get destroyed but not the original genetic material that's injected. The reason I'm asking is to understand how the vaccine designers determine if there are any long-term effects of having this artificial material inside your body. They couldn't have tested it over a long time frame given how quickly all this moved.
The mRNA is stable for a few hours or so, it is both chemically unstable in solution under the conditions in a cell and also actively degraded by various mechanisms.
Read the article, it answers your question in detail:
> The very end of mRNA is polyadenylated. This is a fancy way of saying it ends on a lot of AAAAAAAAAAAAAAAAAAA. Even mRNA has had enough of 2020 it appears.
> mRNA can be reused many times, but as this happens, it also loses some of the A’s at the end. Once the A’s run out, the mRNA is no longer functional and gets discarded. In this way, the ‘poly-A’ tail is protection from degradation.
Also, your cells continuously make mRNAs, depending on what proteins they need to synthesize. And those (have to) get discarded too. And also this is what happens to the actual viral RNA when the virus attacks you for real.
> The reason I'm asking is to understand how the vaccine designers determine if there are any long-term effects of having this artificial material inside your body
The properties of mRNA are well known and have been for decades. Your cells are constantly producing more from the nucleus. It degrades, even more so when it gets transcribed. That's the beauty of this, it's self-limiting.
The only 'artificial' thing about it is the special base that's added to avoid detection by the immune system. Everything else is the exact same compounds present in your cells.
What happens once it degrades? Does it eventually get decomposed into constituent molecules? Or removed from your body somehow? And what happens to that base?
I highly recommend reading about Ribosomes. They are made up of two pieces that were likely independent at some time. It becomes quite clear that "life" began as a machine that all it could do was replicate itself:
You can think of RNA as a copy of a section of DNA. They look very much like computer programs except rather than producing code, the Ribosome can read them and translate each codon for an amino acid into its corresponding actual amino acid that it then binds together into a protein. The execution engine is the environment of the cell. All highly probabilistic rather than deterministic. I can't imagine any programmer not finding them completely fascinating.
I imagine we have tools in that direction, but nothing complete. Unlike math and computers, biological systems don’t really go from a uniform set of simple rules to emergent complexity - there is a whole lot of sideways complexity thrown in.
Something that might fit the computation vision of your comment are the various Ontologies for bioinformatics. The Gene Ontology is probably the most complete, although it lags many years behind the literature.
What this does, as a non-biotech person, I believe I understand at a high level: plonk this code into a ribosome and out comes the desired protein.
What I don't understand is:
a) how the m-RNA code relates to the produced protein (i.e I can read C-code and get an idea of what is does fairly quickly, but can the same be said of m-RNA and the resulting protein)?
b) how did they get their hands on that code in the first place? Do the coronaviruses use m-RNA as well? Was then a coronavirus somehow "dissected" to get at the spike protein "source code"?
a) From the mRNA you can learn the amino acid sequence of the protein very quickly. You absolutely cannot (yet) learn the function of the protein from that sequence - normally, people just do comparisons with proteins whose functions ARE known. Oftentimes in enzymes there are "domains" or little functional regions that stay consistent over long periods of time, so that's a good way to assign function (given knowledge of other proteins in the same family)
b) Yep. Every virus at some point in their lifecycle use mRNA. You can just sequence the virus and get all that data (I've done that on SARS-COV-2, it's honestly pretty easy). Then you just do homology alignment (as stated above) and you can figure out approximately what each gene does.
The problem of de-novo protein prediction is ONE OF THE HARDEST PROBLEMS IN BIOTECH, but just like getting amino acid sequence, doing homology searches, sequencing viruses, etc, is basic biotech and I'd expect an eager high schooler or undergrad to be able to do them.
a) I don't know if protein-folding software is good enough to figure out the exact structure of the resulting protein given just the gene, but I suspect you could figure out through the equivalent of the strings command - looking for sub-chains of the protein, and looking for matching sequences in the gene
b) Coronaviruses happen to be RNA viruses; that is, their genomes are RNA rather than DNA. DNA viruses also exist and are common. We got full genomes from sequencing early in the pandemic, and continue to use it to monitor the evolution of the virus (see e.g. [1], where the results are available for download). Sequencing is very cheap and easy these days - you take a sample from a patient, use chemicals to break down all the cell membranes and such, sequence all of the DNA and RNA in it, and look through the results for a virus genome (i.e. something that isn't a human chromosome and isn't a known virus or bacterial genome). "m"RNA is more a description of the function than the chemical - tRNA and rRNA are short snippets of RNA used for manufacturing purposes inside the cell, while mRNA is the long chunks that actually carry information from the DNA to the protein manufacturing sites. Virus RNA basically functions as imposter mRNA, getting those manufacturing systems to make more viruses.
a) Yes, you can translate a mature[1] mRNA sequence, codon by codon, from the start until the stop codon, and it will give you the sequence of the protein it encodes.
b) Coronaviruses have a RNA genome. Researchers extracted it from wild-type viruses and then sequenced it.
[1]: mRNAs can undergo several maturation steps, such as splicing, which removes regions that won’t be translated into protein.
Everyone else has had good answers, but I'm also going to note that we knew a ton about covid's general molecular biology well before it ever came into existence. Covid (more properly, SARS-CoV-2) is a cronovirus. Cronoviruses have been studied for some time since some of them cause common colds, and studied very intensely since 2002 when SARS showed pandemic potential. So when Covid showed up, there was a ton of prestablished information and expertise avaliable to help every element of the pandemic response.
I compared the spike encoding regions, and it looks like they're quite different...I wonder if the codons wind up coding for different amino acids. And who got it right?
The lipid container is weird to me. Is that all it takes to send instructions inside a cell? Seems like a security hole. Why haven’t viruses evolved to have a lipid container?
People joked a lot about "injectible source code / machine code" but it is kind of interesting injecting yourself with something that has the source on github.
Note that this is a sequencing result, so it will lack a lot of nonstandard RNA tricks that these companies are or might be using, like pseudouridines, or fluorobases. I think those would have to be disclosed in the patent.
> We aren’t that different from machines, we just need to know more about the CPU and all the co-processors and how the logic gates interact
Except it is unfortunately not that simple, because it assumes that distinct components such as CPU, co-processors and even logic gates exist in that context, as is totally reasonable to assume on devices created by humans. Abstracting complex machines into distinct components is a proven strategy to engineer a system, but it's not a necessity for functioning systems to exist.
In the case of natural organism, they "just" need to work. They don't have a blueprint, and they don't need to be organized in a way that allows for easy understanding by looking at individual parts in separation.
Consider also the difference between machine learning through neural networks ("we stuff a lot of training data in there and get what we want eventually, we hardly understand what the model does or why it fails"), and a QR code reader ("we carefully designed the format from the top down, including e.g. framing, error correction, and several invariants like rotation; if a QR code does not get recognized, we can usually tell exactly where and why it failed").
I am not able to draw the distinction you are trying to make. The more we make machines, especially ones to interact with inputs from our world, the more easy it is to understand our bodies.
Correct, because there is no blueprint then we don't know about how the brains and neurons interact. But if there is a problem with a heart valve we know exactly where and why it failed.
I expect greater convergence in these fields, and as such I can't agree with you.
> But if there is a problem with a heart valve we know exactly where and why it failed.
I highly doubt that. Even for heart valves, which seem less complicated than plenty of other body parts (at first glance, I'm sure they are plenty complicated in detail), because they are comparatively mechanical. For example, Wikipedia says (with citation): "Causes of aortic insufficiency in the majority of cases are unknown, or idiopathic."
Try a kidney or something related to the nervous system next.
> The more we make machines, especially ones to interact with inputs from our world, the more easy it is to understand our bodies.
FWIW I am making machines, and the more I do, the more amazed I am about how intensely complicated our body in general and our nervous system specifically is.
“Imagine a being like nature, wasteful beyond measure, indifferent beyond measure, without mercy and justice, fertile and desolate and uncertain at the same time; imagine indifference itself as a power, how could you live according to this indifference?"
There is a theory "endosymbiotic hypothesis" that mitochondria were wholesale absorbed bacteria into our DNA line. Which I think is pretty wild, and explains how neatly they're compartmentalized.
No doubt, and separate organs with (sometimes rough, sometimes very clear) separate functions are also a thing, so evolution seems to favor compartmentalization for more complex systems. But that does not mean that there are not complex overarching interactions that make full understanding really hard. The sort of interactions you would stay away from when designing a computer, not because it would not work, but because it would make design, debugging, and iteration upon your system prohibitively hard.
Just because you see the RNA/DNA sequence on Github doesn't mean anything, DNA sequencing has been around since at least the early 70s [0]. Many pharmaceutical drugs already employ such techniques.
> So how different is the mRNA in the Moderna, BioNTech/Pfizer & CureVac vaccines? There are 1274 codon positions. 808 are identical across all 3 vaccines. 103 are unique to Moderna, 249 unique to BioNTech, 230 to CureVac
Thanks. So what makes that happen in this case? Is it because internally the cell doesn’t recognize the protein? Or it does this for all proteins it makes? Does say some hemoglobin get transported to the cell surface?
Yes, the even healthy cells are constantly transporting protein fragments to the cell surface but the immune system learns to ignore these as it is developing.
In addition, B and T cells only detect fragments on the surface of active Dendric cells. Dendric cells become active in response to alternate and less specific indications of infection such as an unusually high amount of mRNA translation or families of protein that occur only in bacteria and viruses.
So you have a header/footer sequence that we sort of know is required (remember the MZ and chksum for .EXE files) but we have no idea what that bits in between does except we can read the letters and copied it in part from the actual virus.
'A group of Stanford researchers has hacked Moderna’s messenger RNA (mRNA) vaccine for the novel coronavirus, Motherboard first reported on Monday, and published its entire genetic sequence on the open-source code repository Github.'
What does "hacked" mean here? The article makes it sound like this wasn't something illegal:
> Fire and Shoura told Motherboard that they had received permission from the FDA to collect scraps of vaccines that wouldn’t have otherwise been used from empty vials and that they’d notified Moderna in advance of their plans to publish the sequence without receiving any objection in turn.
Also:
> The research team told Motherboard that they didn’t “reverse engineer” the vaccine, they simply “posted the putative sequence of two synthetic RNA molecules that have become sufficiently prevalent in the general environment of medicine and human biology in 2021.”
I'm not familiar enough with how these sequences to work to understand what's being discussed. Is it simply that they took a sample of the vaccine and studied its composition using some standard machine/process?
So I guess Josiah Zayner has to pick up on this now and do a DIY Moderna COVID vaccine video. He already did a DIY vaccine video with full open source documentation on how to do it yourself.
if you have understanding of how the sequence mutates then you can predict what the next strain is going to be and design spike protein that matches it.
ELI5 could this be used by "evil governments" to make designer pathogens to release during doomsday situations (say by North Korean leaders in their suicide bunkers if things went badly) ?
I believe only Sputnik vaccine has different first and second dose, but their vaccine is of different category (it belongs to the same as AstraZeneca and Johnson and Johnson). The reason is that these vaccine use a vector (adenovirus) and there's a risk that body will develop antibodies for the vector and the second dose might not be as effective.
"Some vaccines require two doses because the immune response to the first dose is rather weak. The second dose helps to better reinforce this immune response." - I would have to think over time that could be optimized somehow to just require one w/ ML and test results, etc.
tangential: do biologists sometimes use some form of base 64 encoding for their triplets? so instead of AAG.TCA.GGA just g5F or something?
other than the obvious advantage of being shorter, it would also be easier to read: the boundaries would be unambiguous and each char would correspond directly to and amino acid (if applicable/coding)
Proteins are written in standardised IUPAC amino acid codes that carry some semantic meaning, e.g. Alanine: A, Glycine: G etc. Also viral genomes often have overlapping transcription with shifted open reading frames. Biology is not as simple as you think.
This is amazing. It appears quite "simple" - of course I know nothing about this part of the sciences.
I do think back to the early days of Covid when there were all these predictions around when a vaccine would show up. It seemed like there was knowledge that the mRNA platform would be the likely solution and probably by April we knew a vaccine would be possible - it just took 6+ months to test.
One of Modernas cofounders, MIT Prof Robert Langer, was profiled on 60 Minutes a few years back as MITs most prolific patent holder. He specialized in nanoparticle delivery systems to any desired internal tissue. One can deliver medicine, nutrients, diagnostics, etc where and when they want. Vaccines are just a small of subset of these applications.
As a software/hardware guy who knows less than zero about the subject: is this something that (given the right resources) makes possible to replicate the vaccines? I mean in countries where they can't afford enough vaccines but already have or could invest in the ability to replicate them without caring about patents.
Is this all another medical company needs to start manufacturing and selling the vaccine themselves? Or is this sequence licensed/proprietary in some way?
i'm a dna noob: is it possible to do the growing and sampling thing to get the sequence from a sample of the vaccine or does the bubble of fat get in the way?
My question is does the Johnson & Johnson DNA-based vaccine encode for the exact same spike protein, or a different one they chose to target? From this PDF I conclude both the moderna and Pfizer vaccines target the same protein.
No reason you can't make your own NFT of it. Heck, if you promise to pay at least 1.337 ETH I'll figure out how to do it myself and make it for you. :-P
read the article linked in the thread. it actually does not work against all of the covid virus. it works against the spikes.
so, the virus is sort of like a ball with these spikes on top (that’s where the corona name comes from) and the vaccine helps your body develop antibodies against the spikes. so when the virus gets in your body, it actually receives a “haircut” which leads to it no longer being able to enter the cells and hijack their internals to produce more viruses.
it’s extremely clever, but it also means that your code is wrong ;))
I expected to have to scroll through pages upon pages of indecipherable text. Instead it's no bigger than a large paragraph of text, and I can easily fit it on my screen.