Simulation of a 2B-atom cell that metabolizes and grows like a living cell

vasili111 · on Jan 28, 2022

Is this simulation on the atomic level with full all interatomic physics processes simulation or there were made some simplifications?

All interatomic interactions are simulated separately for each atom or they made statistical estimations and used some assumptions? Those two are absolutely two different types of simulation.

Frost1x · on Jan 28, 2022

It doesn't appear to be ab initio simulated (e.g. QED up) if that's what you're asking. They appear to swoop in at higher scales (molecular level) and simulate molecular interactions across "hundreds of molecular species" and "thousands of reactions."

Apparently the interface between molecules uses the Chemical Master Equations (CME) and Reaction-Diffusion Master Equations (RDME) both of which I'm unfamiliar with: http://faculty.scs.illinois.edu/schulten/lm/download/lm23/Us...

dahart · on Jan 28, 2022

Yes, this appears to be the underlying simulation software. Here’s a home page link to the project as well: http://faculty.scs.illinois.edu/schulten/Software2.0.html

“Lattice Microbes is a software package for efficiently sampling trajectories from the chemical and reaction-diffusion master equations (CME/RDME) on high performance computing (HPC) infrastructure using both exact and approximate methods.”

vasili111 · on Jan 28, 2022

For anyone who is wondering what QED is: Quantum electrodynamics (QED) https://en.wikipedia.org/wiki/Quantum_electrodynamics

kingcharles · on Jan 28, 2022

Ah, should have realized when Quad Erat Demonstradum made no sense...!

tobmlt · on Jan 29, 2022

I've always imagined Feynman making a QED, QED joke sometime in the late 1940's for this very reason. Like he could end his Shelter Island Conference talk with the joke and then look up from the blackboard... to a bunch of confused and deeply furrowed brows, lol.

marcosdumay · on Jan 28, 2022

The paper (well, the abstract) calls it "fully dynamical kinetic model".

Or, in other words, it doesn't solve the Schrodinger equation at all, but uses well known solutions for parts of the molecules, and focuses on simulating how the molecules interact with one another using mostly classical dynamics.

blix · on Jan 28, 2022

I do classical molecular dynamics simulations for a living, and I feel the model using in this paper is pretty dramatically different than what would typically be described as classical dynamics. 2B atoms would be absolutely insane for any sort of simulation that resolves forces between atoms of even groups of atoms, especially in organic systems.

As far as I can tell from their model, molecules don't interact with each other ~at all~ through classical dynamics. Rather, they define concentrations of various molecules on a voxel grid, assign diffusion coffecients for molecules and define reaction rates between each pair of molecules. Within each voxel, concentrations are assumed constant and evolve through a stochastic Monte-Carlo type simulation. Diffusion is solved as a system of ODEs.

This is a cool large scale simulation using this method, but this is a far cry from an actual atomic-level simulation of a cell, even using the crude approximations of classical molecular dynamics. IMO it is kind of disingenuous for them to say 2B atoms simulation when atoms don't really exist in their model, but it's a press release so it should be expected.

chermi · on Jan 28, 2022

Excuse formatting, one phone. Was gonna put more refs... But phone.

Yes, this is not the standard "force field" pairwise stuff you're used to when you heard "simulation" of biomolecular systems. I don't know if it's quite disingenuous, just not what we expect based on what the vast, vast majority of the field does! It does represent that many atoms. We shouldn't include atoms for the sake of having them, right? It should depend on what questions we're asking of the system.

I like seeing other (simulation or analytic) methods get attention. Lattice methods --(HP models[0], for hydrophobicity[1], lattice-boltzmann even. field theoretic (see polymer physics, melts, old theory[2], new theory, and even newer simulation[3]). Even the simplest shit like springs[4]!

[0] https://scholar.google.com/citations?view_op=view_citation&h... [1] https://journals.aps.org/prl/abstract/10.1103/PhysRevLett.11... [2]https://www.google.com/books/edition/Renormalization_Group_T... [3]The Equilibrium Theory of Inhomogeneous Polymers (International Series of Monographs on Physics) https://www.amazon.com/dp/0199673799/ref=cm_sw_r_apan_glt_i_... (numerical scft) [4] https://en.m.wikipedia.org/wiki/Gaussian_network_model

blix · on Jan 29, 2022

> It does represent that many atoms.

It also represents even more electrons and even more quarks than that. I think it would be silly to characterize this system by the number of constituent quarks, but that's just me. To me, the important number is the number of degrees of freedom within the model. In a force-field model this scales linearly in with the number of atoms. In the model presented in the paper, this depends on voxel resolution and number of molecular species. Sadly, this is omitted in the press release.

Thanks for your links :) I work in inorganic materials but I really should understand more about models for more complex systems.

markwkw · on Jan 29, 2022

Do you know how they deal with DNA? Does it fit in a single voxel? (Probably no). Are strands of DNA and RNA treated in chunks in a chain of voxels? Does the simulation perform transcription?

blix · on Jan 29, 2022

I am not an expert on this type of model but after reading their methods section a little more thoroughly, this is my sense:

I don't believe there is any case where a molecule in one voxel knows about molecules in other voxels. DNA and RNA is coarse grained where a 'molecule' might represent a specific sequence of interest rather than a full chain. Transcription and translation are modeled, essentially by saying if the neccessary ingredients are present in a voxel (DNA/RNA sequences, enzymes, raw materials, etc) there is some chance of forming mRNA or a protein as a function of the molecular concentrations present in the voxel. DNA and RNA reactions are treated with somewhat different equations than the rest of the molecules, I think to handle the coarse graining.

tmearnest · on Jan 29, 2022

This is all correct. These simulations actually can model molecular crowding by using a diffusive propensity proportional to the particles in the adjacent cells but I don’t think it was used here. I developed this methodology in grad school, but it didn’t go any where.

blix · on Jan 29, 2022

Thanks for confirming my take. This is a little out of my comfort zone.

Do you see a future for this simulation method with increased computing power, or do you think limitations of the method might still limit its applicability. Maybe this is naivete coming from atomistic perspective, but it seems to me that the inability to model reactions that aren't explicitly predefined would be a significant challenge.

tmearnest · on Jan 29, 2022

Dna does not fit in a voxel. Transcription is modeled by particles fixed in space which produce transcripts at a constant rate. The mRNA is treated the same as the other discrete particles in the simulation, though they are likely a bit bigger than the voxel size.

gfd · on Jan 28, 2022

In theory, I don't think there's such a thing as simulation without simplifications. The world seems to be continuous but our computers are discrete. There's a small set of things we know how to solve exactly with math but in general we have no ways to deal infinity. Any given variable you're calculating will be truncated at 32 or 64 bits when in reality they have an infinite number of digits, changing at continuous timesteps, interacting with every other atom in the universe.

In practice, none of this matters though and we can still get very useful results at the resolution we care about.

qboltz · on Jan 28, 2022

All simulations have to make the Born-Oppenheimer approximation, nuclei have to be treated as frozen, otherwise electrons don't have a reference point.

There will never be true knowledge of both a particle's location and momentum a la uncertainty principle, and will always have to be estimated.

drdeca · on Jan 28, 2022

But, for a system of two quantum particles which interact according to a central potential, you can express this using two quantum non-interacting particles one of which corresponds to the center of mass of the two, and the other of which corresponds to the relative position, I think?

And, like, there is still uncertainty about the position of the "center of mass" pretend particle, as well as for the position of the "displacement" pretend particle.

(the operators describing these pretend particles can be constructed in terms of the operators describing the actual particles, and visa versa.)

I don't know for sure if this works for many electrons around a nucleus, but I think it is rather likely that it should work as well.

Main thing that seems unclear to me is what the mass of the pretend particles would be in the many electrons case. Oh, also, presumably the different pretend particles would be interacting in this case (though probably just the ones that don't correspond to the center of mass interacting with each-other, not interacting with the one that does represent the center of mass?)

So, I'm not convinced of the "nuclei have to be treated as frozen, otherwise electrons don't have a reference point" claim.

chermi · on Jan 28, 2022

You are right not be convinced, because it is entirely incorrect.

aeternum · on Jan 28, 2022

With a quantum computer could one theoretically input the super position of possible locations and momenta and run the simulation based on that?

chermi · on Jan 28, 2022

What? This is simply untrue.

phkahler · on Jan 28, 2022

A simulation can have both.

Tagbert · on Jan 28, 2022

Then is it an accurate simulation without the uncertainty?

contravariant · on Jan 29, 2022

The pilot wave theory works perfectly fine with both exact position and momentum, but in other interpretations such particles simply don't exist.

dekhn · on Jan 28, 2022

I doubt it makes sense to assume the unverise is continuous (I'm glad you said "seems"). In particular, space could be spatially quantized (say, around the planck length) or any number of other details.

People have done simulations with quad precision (very slow) but very few terms in molecular dynamics would benefit from that. In fact, most variables in MD can be single precision, exceptt for certain terms like the virial.

mensetmanusman · on Jan 28, 2022

It’s definitely fun to think about.

If the universe is discrete, how does one voxel communicate to the neighboring voxel what to update without passage through ‘stuff in between’ that doesn’t exist? Heh

It seems physics is going the opposite way with infinite universes and multiple dimensions to smooth out this information transfer problem and make the discrete go away.

javajosh · on Jan 28, 2022

As someone with keen interest in physics (and a bit of training) I find speculation about "discrete space" disquieting. It's the level of abstraction where intuition about space breaks down, and you have to be very careful. Remember that coordinate systems are short-hand for measurement. It's one thing to admit fundamental limits on measurement resolution, and quite another to say that space itself is quantized! Mostly I get around this by not thinking about it; most of these theories are only testable in atrocious and unattainable conditions, doing things like performing delicate QED experiments at the edge of a black hole.

I don't think your "voxel" intuition can be right because it's a small jump from that to (re)introducing an absolute reference frame.

joshmarlow · on Jan 28, 2022

> how does one voxel communicate to the neighboring voxel what to update without passage through ‘stuff in between’ that doesn’t exist? Heh

That kind of reminds me of the 'aether' that was once hypothesized as a medium of transmission for light and radio waves [0].

Also, voxel's communicating sounds an awful lot like a higher-dimensioned cellular automata.

[0] - https://en.wikipedia.org/wiki/Aether_theories

Yajirobe · on Jan 28, 2022

Stephen Wolfram was right all along

whatshisface · on Jan 28, 2022

All of our current theories are set in continuous spacetime. At the present, there's no reason to assume anything else.

feoren · on Jan 28, 2022

I wouldn't say that "all our current theories" are set in continuous spacetime. For example, Quantum chromodynamics is set in SU(3), an 8-dimensional group of rotation-like matrices. Electric charge is discrete, spin is discrete, electron orbitals are discrete. In fact position and momentum would seem to be the outlier if they were not also discrete. I hardly call that "no reason".

whatshisface · on Jan 28, 2022

SU(3) is a continuous group.

freemint · on Jan 28, 2022

Yeah but it is very much not in space time.

whatshisface · on Jan 28, 2022

But it is. SU(3) is the group for swapping colors around. It still has spacetime.

freemint · on Jan 28, 2022

You can Cartesian product it with space time, yes. But that is possible for any system.

whatshisface · on Jan 28, 2022

I'm having a hard time imagining quantum chromodynamics set in a single point. :)

drdeca · on Jan 28, 2022

It is based on SU(3), but, does it really make sense to say that it isn't still set in spacetime? Like, quarks still have position operators, yes?

jacquesm · on Jan 28, 2022

True, but we do not actually know this for sure. There is a (small) possibility that we are simply looking at this at a scale where all we see is macro effects. It would require the quanta to be much smaller than the Planck distance though.

webmaven · on Jan 28, 2022

> There is a (small) possibility that we are simply looking at this at a scale where all we see is macro effects. It would require the quanta to be much smaller than the Planck distance though.

How much smaller?

jacquesm · on Jan 28, 2022

Many orders of magnitude. How many? I do not know, I don't think anybody does.

But photons resulting from the same event but with different energies arrive at detectors an appreciable distance away to all intents and purposes simultaneously, something that would not happen if spacetime were discrete at a level close to the Planck length. So it would have to be quite a big difference for an effect not to show up as a difference in time-of-flight.

aeternum · on Jan 30, 2022

Is the idea that the two photons would traverse slightly different voxels due to the lower frequency wave being more spread out?

What accounts for the expectation that they do not arrive simultaneously in a voxel-based universe?

dekhn · on Jan 28, 2022

the issue is that there are no theories based on experimental evidence at very small scales. I agree that in most situations, it would be silly to violate this assumption, unless you were working on advanced physics experiments.

merely-unlikely · on Jan 28, 2022

There's this concept that causation moves at the speed of light. When I first heard that, it sounded very much like a fixed refresh rate to me. Or maybe the "real world" is just another simulation

Filligree · on Jan 28, 2022

It does if you put it that way, but another way of putting is that spacetime is hyperbolic (...well, lorentzian), and all (lightspeed) interactions are zero-ranged in 4D.

As in, photons that leave the surface of the sun always strike those specific points in space-time which are at a zero spacetime interval from said surface. If you take the described geometry seriously, then "spacetime interval" is just the square of the physical distance between the events.

(And any FTL path has a negative spacetime interval. If that's still the square of the distance, then I think we can confidently state that FTL is imaginary.)

perihelions · on Jan 28, 2022

>"full all interatomic physics"

It's certainly not that -- that's a hideously difficult algorithm with exponential complexity.

https://en.wikipedia.org/wiki/Full_configuration_interaction

sseagull · on Jan 28, 2022

It’s worse than exponential, it’s factorial :)

unemphysbro · on Jan 28, 2022

verlet list is the standard algo used to reduce the complexity in the number of interatomic calculations

https://en.wikipedia.org/wiki/Verlet_list

dekhn · on Jan 28, 2022

that's old tech, these days it's usually some sort of PPPM (particle-particle particle-mesh) which parallelizes better.

But that's for classical simulations. Full configuration interaction is effecftively computing the schrodinger equation at unlimited precision, in principle if you could scale it up you could compute any molecular property desired, assuming QM is an accurate model for reality.

unemphysbro · on Jan 28, 2022

p3m, well pme, is exactly what we used for our calculations ;)

i never did any qm work beyond basic parameterization

i'm guessing you are/were also computational physics guy :)

dekhn · on Jan 28, 2022

I was a computational biologist for many years, which included a bunch of biophysics. I did extensive work with PME about 20 years ago, on supercomputers. It's a pretty neat technique (https://en.wikipedia.org/wiki/Ewald_summation), once you wrap your head around it!

unemphysbro · on Jan 28, 2022

yup, we used PME for non-bonded calculations in our simulations and to calculate things like electric potentials. I finished a biophysics phd back in 2020 and focused mainly on fluid flow.

Pretty cool, what're you up to now?

dekhn · on Jan 29, 2022

helping genentech scientists move to the cloud. I stopped being a scientist a long time ago and now I just sort of help scientists with the stuff I'm already good at.

unemphysbro · on Jan 29, 2022

funny enough I'm doing the exact same thing in public sector education. I'm always curious where people in our field end-up.

i saw some of your other comments about being at google. did you touch jax-md at all?

https://github.com/google/jax-md

dekhn · on Jan 29, 2022

I talked to the team, but unfortunately, jax-md at the ttime didn't do bond angles or torsions, so it wasn't good for biomolecular simulations.

My work mostly predated tensorflow and was much more about massive-scale embarassingly parallel computing, and produced some interesting large-scale results from MD and protein folding.

https://www.nature.com/articles/nchem.1821 https://onlinelibrary.wiley.com/doi/full/10.1002/pro.2389

unemphysbro · on Jan 29, 2022

yup, i noticed that when i saw the first commits; in fact i thought it was someone's pet project. however, when i read that first odenet paper, it's clear keeping track of the gradients is extremely useful.

I'm very familiar with the first paper, the second author was on my committee.

so what does a cloud migration at a biotech company mean?

is it sort of a standard orchestrator + warehouse/lakehouse + distributed compute + cicd tools stack?

dekhn · on Jan 29, 2022

"is it sort of a standard orchestrator + warehouse/lakehouse + distributed compute + cicd tools stack?"

Ideally, yes, exactly. Except there are 100 orchestrators, 100 small local warehouses, and CI/CD is mostly jenkins.

Some things get forklifted over. I'm not trying to push people to adopt cloud native practices, just move them off physical onprem resources. Even that is a challenge because of data gravity.

CorrectHorseBat · on Jan 28, 2022

Of course not, we can't even simulate how one protein folds.

dekhn · on Jan 28, 2022

Small proteins (one to two alpha helices) can now be routinely folded (that is, starting form a fully unfolded state, to getting stick in the minimum around the final structure) using ab initio simulations that last several multiples of the folding time.

Larger proteins (a few alpha helices and beta sheets), the folding process can be studied if you start with structures near the native state.

None of this means to say that we can routinely take any protein and fold it from unfolded state using simulations and expect any sort of accuracy for the final structure.

qboltz · on Jan 28, 2022

When you say ab initio calculations, could you cite the level of theory? I think there could be some ambiguity given differences in scope.

dekhn · on Jan 28, 2022

When I say ab initio I mean "classical newtonian force field with approximate classical terms derived from QM", AKA something like https://ambermd.org/AmberModels.php

Other people use ab initio very differently (for example, since you said "level of theory" I think you mean basis set). I don't think something like QM levels of theory provide a great deal of value on top of classical (and at a significant computational cost), but I do like 6-31g* as a simple set.

Other people use ab initio very differently. For example, CASP, the protein structure prediction, uses ab initio very loosely to me: "some level of classicial force field, not using any explicit constraints derived from homology or fragment similarity" which typically involves a really simplified or parameterized function (ROSETTA).

Personally I don't think atomistic simulations of cells really provide a lot of extra value for the detail. I would isntead treat cell objects as centroids with mass and "agent properties" ("sticks to this other type of protein for ~1 microsecond"). A single ribosome is a single entity, even if in reality it's made up of 100 proteins and RNAs, and the cell membrane is modelled as a stretchy sheet enclosing an incompressible liquid.

qboltz · on Jan 28, 2022

Level of theory as it relates to an-initio QM calculations usually indicates Hartee Fock, MP2 and so on, then the basis set gets specified after.

I also agree that QM doesn't provide much for the cost at this scale, I just wish the term ab initio would be left to QM folks, as everything else is largely just the parameterization you mentioned.

dekhn · on Jan 28, 2022

The systemn I work with, AMBER, explains how individual classical terms are derived: https://ambermd.org/tutorials/advanced/tutorial1/section1.ht... which appears te be MP2/6-31g* (sorry, I never delved deeply into the QM parts). Once those terms are derived, along with various approximated charges (classical fields usually just treat any charge as point-centered on the nucleus, which isn't great for stuff like polarizable bonds), everything is purely classical springs and dihedrals and interatomic potentials based on distance.

I am more than happy to use "ab initio" purely for QM, but unfortunately the term is used widely in protein folding and structure prediction. I've talked exdtensively with David Baker and John Moulton to get them to stop, but they won't.

blix · on Jan 28, 2022

I would not describe AMBER, or anything using a newtonian force field, as ab initio.

In inorganic materials ab initio means you actually solve Schrodinger's equation (though obviously with aggressive simplifications e.g. Hartree-Fock).

dekhn · on Jan 29, 2022

Sure. But in the protein structure prediction field, "ab initio" is used to mean "structure was predicted with no homology or other similarity information" even though the force fields incorporate an enormous amount of protein structural knowledge.

blix · on Jan 29, 2022

I guess it's just wierd to me to see ab initio used to describe a class of methods than in my field it explicitly precludes.

Having developed a few newtonian force fields, calling them "derived from QM" is very, very generous :P

Cthulhu_ · on Jan 28, 2022

What does https://foldingathome.org/ do then? That's been going on for nearly two decades.

orangepurple · on Jan 28, 2022

Full list of achievements https://foldingathome.org/category/fah-achievements/?lng=en

This is the only real update of the year: https://foldingathome.org/2022/01/03/2021-in-review-and-happ...

SARS-CoV-2 has intricate mechanisms for initiating infection, immune evasion/suppression and replication that depend on the structure and dynamics of its constituent proteins. Many protein structures have been solved, but far less is known about their relevant conformational changes. To address this challenge, over a million citizen scientists banded together through the Folding@home distributed computing project to create the first exascale computer and simulate 0.1 seconds of the viral proteome. Our adaptive sampling simulations predict dramatic opening of the apo spike complex, far beyond that seen experimentally, explaining and predicting the existence of ‘cryptic’ epitopes. Different spike variants modulate the probabilities of open versus closed structures, balancing receptor binding and immune evasion. We also discover dramatic conformational changes across the proteome, which reveal over 50 ‘cryptic’ pockets that expand targeting options for the design of antivirals. All data and models are freely available online, providing a quantitative structural atlas.

echelon · on Jan 28, 2022

Simulating very expensive to compute protein dynamics. These aren't guaranteed solutions, but it's still useful information.

vasili111 · on Jan 28, 2022

So, even one protein cannot be simulated as in real world?

JabavuAdams · on Jan 28, 2022

Even one atom of a heavier element cannot be simulated in the real, depending on what level of detail you want. Multi-atom simulations usually treat them as little non-quantum balls moving around in a force-field that may have been approximated from quantum mechanics.

qboltz · on Jan 28, 2022

Not if you're going off of ab initio theory such as Hartee Fock, MP2, CC, etc. We're talking amounts of matrix multiplication that wouldn't be enough to finish calculating this decade, even if you had parallel access to all top 500 supercomputers, you get bigger than a single protein, it's beyond universal time scales with current implementations.

dekhn · on Jan 28, 2022

Every time some computer scientist interviews me and shows off their O(n) knowledge (it's always an o(n) solution to a naive o(n**2) problem!) I mention that in the Real World, engineers routinely do O(n**7) calculations (n==number of basis functions) on tiny systems (up to about 50 atoms, maybe 100 now?) and if they'd like to help it would be nice to have better, faster approximations that are n**2 or better. Unfrotunately, the process of going from computer scientist to expert in QM is entirely nontrivial so most of them do ads ML instead

drsnow · on Jan 29, 2022

How on Earth? I can't imagine the difference between the computational power of all top 500 supercomputers is THAT many orders of magnitude far off from the computational power of all the folding@home computational power donated by the general public.

dekhn · on Jan 29, 2022

supercomputers are specialized products with fast networking to enable real-time updates between nodes. The total node count is limited by the cost of the interconnect to get near-peak performance. You typically run one very large simulation for a long period of simulation time. folding@home doesn't have the luxury of fast networks, jut lots of CPU/GPU. They run many smaller simulations for shorter times, then collect the results and do stats on them.

I looked at the various approaches and sided with folding@home. At one point I had 1 million fast CPU cores running gromacs.

CorrectHorseBat · on Jan 29, 2022

It's not, foldingathome doesn't do those calculations either but uses a simplified model too.

beecafe · on Jan 28, 2022

One single iron atom's electrons - 26 of them - contain more degrees of freedom than atoms in the solar system.

dekhn · on Jan 28, 2022

A custom supercomputer dedicated to simulating folding proteins (two-state folders with nontrivial secondary and tertiary structure) from unfolded to correctly folded state using only classical force fields probably could work, and DE Shaw has invested a lot of money in that idea: https://en.wikipedia.org/wiki/Anton_(computer)

but, as I pointed out elsewhere, this would not be particularly helpful as it would use an enormous amount of resources to compute something we could probably approximate with a well-trained ML model.

It also wouldn't address questions like biochemistry, enzymatic reactions, and probably wouldn't be able to probe the energetics of interactions accurately enough to do drug discovery.

chermi · on Jan 28, 2022

Nope. Don't dare ask how they treated water/solvation. Lolz.

Now, the question is, does it matter? When do you EVER need to know the exact atomistic, yet alone electronic, trajectory of a single protein starting from a given position within a cell surrounded by waters in a given configuration?

It doesn't really matter. This is the beauty of noise and averages and -- dear to me-- statistical mechanics. At finite temperatures, AKA most everything we experience as living things, quantum details (or precise classical trajectories for that matter) aren't that important for the vast majority of questions we tend to have about a system.

vasili111 · on Jan 29, 2022

I think it will be extremely useful to be able to simulate cell at real atomic level with all physics involved. That means that you can introduce changes at atomic level in DNA and see its effects. Introduce experimental medications in cell and see its effects. You can actually study cell processes (many of which are not known yet) in simulation. And many, many more. It will be revolution in medicine and in biology in general!

dekhn · on Jan 29, 2022

Any atomic level change in DNA is going to have maybe a 1-2 kcal energy difference. To properly calculate that (by running simulations) would require an economically impractical amount of computer time, and doesn't actually really change any of the hard problems in medicine.

Why am I saying this? Because I thought the same as you and it took me 20+ years to realize MD doesn't affect medicine at all.

immibis · on Jan 29, 2022

then [why is my computer fan whirring?](https://foldingathome.org/)

alpineidyll3 · on Jan 28, 2022

Absolutely definitely not. It's not even possible to simulate a single protein-molecule interaction to an accuracy such that reaction rates are reproduced at room temperature. Small effects such as the quantum nature of H-motion prevent this from happening with present computational resources.

This research is something like a pixar movie, or one of those blender demos with a lot of balls :P

capableweb · on Jan 28, 2022

It seems that the researchers did use NVIDIA GPUs to perform the work, but it's not clear what sets the GPUs apart from others and why this research wouldn't be possible without NVIDIA's GPUs, as the article title and body implies.

01100011 · on Jan 28, 2022

Nvidia is pushing vertical integration hard. There are all sorts of libraries from Nvidia which build on top of CUDA, from simple cuBLAS to smart cities, autonomous driving, robotics and 5G.

They also provide acceleration of open source libraries like GROMACS, used for molecular dynamics simulation.

immibis · on Jan 29, 2022

As you do, when you're the market leader. Wouldn't want people to be able to swap their GPUs to AMD. This is a real problem at the moment.

Of course AMD is making their own system that works on AMD and nVidia and Intel GPUs. It'll probably get locked down again if AMD gets market dominance.

01100011 · on Jan 29, 2022

Reminds me of how people used to think MS was absolutely evil while Apple was an altruistic company and Steve Jobs was some kind of saint. Turns out Apple just needed to gain some market dominance.

tmearnest · on Jan 28, 2022

There are two main reasons to take advantage of the Gpu in lattice microbes. It can simulate the stochastic chemical reaction and diffusion dynamics in parallel: one thread per voxel. For instance, an E. coli sized cell would have ~40000 voxels. It’s not quite embarrassing parallel, but close. Second, the simulation is totally memory bound so we can take advantage of fast gpu memory. The decision to use CUDA over OpenCL was made in like 2009 or so. Things have changed a lot since then. I don’t think anyone has the time or interest to port it over, unfortunately.

Tenoke · on Jan 28, 2022

Most likely because the software they use uses CUDA.

capableweb · on Jan 28, 2022

I know that CUDA is faster than OpenCL for many tasks, but is there something that is not possible to achieve in OpenCL but possible in CUDA?

TomVDB · on Jan 28, 2022

I’m subscribed to some CUDA email list with weekly updates.

One thing that strikes me is how it evolves with new features. Not just higher level libraries, but also more fundamental, low level stuff, such as virtual memory, standard memory models, c++ libraries, new compilers, communication with other GPUs, launching dependent kernels, etc.

At their core, OpenCL and CUDA both enable running parallel computing algorithms on a GPU, but CUDA strikes me as much more advanced in terms of peripheral features.

Every few years, I think about writing a CUDA program (it never actually happens), and investigate how to do things, and it’s interesting how the old ways of doing things has been superseded by better ways.

None of this should be surprising. As I understand it, OpenCL has been put on life support by the industry in general for years now.

4gotunameagain · on Jan 28, 2022

If you ever need to reap the benefits of CUDA & GPU computations without getting into the details, check out JAX by our corporate overlords™ (https://github.com/google/jax), it has a NumPy like syntax and super fast to get started

p1esk · on Jan 28, 2022

Why would you suggest JAX? CuPy seems like an obvious choice here (simpler and a lot more mature). Jax is only needed if you want automatic differentiation.

hamilyon2 · on Jan 29, 2022

TIL CuPy exists and is stable and mature. This is why i read this forum, every now and then there is serendipitous connection, new angle or discovery.

dekhn · on Jan 29, 2022

I just want pyXLA, a tool to directly construct XLA programs.

belval · on Jan 28, 2022

> is there something that is not possible to achieve in OpenCL but possible in CUDA

Developing fast... OpenCL is much harder to learn than CUDA. Take someone who did some programming classes, explain how CUDA works and they'll probably get somewhere. Do the same thing with OpenCL and they'll probably quit.

immibis · on Jan 29, 2022

What makes it harder?

Cthulhu_ · on Jan 28, 2022

Possibly, but that's not really the point, the article is part marketing push from nvidia for their HPC department.

capableweb · on Jan 28, 2022

> but that's not really the point

That's what I thought as well, so the title on the website ("NVIDIA GPUs Enable Simulation of a Living Cell") is not really truthful then.

Symmetry · on Jan 28, 2022

My understanding is that CUDA has a lot of optimized libraries for common tasks, think BLAS, that don't currently exist in OpenCL/Vulkan Compute.

Gehoti · on Jan 28, 2022

I'm much more aware of slot of things research is doing with Nvidia.

Due to cuda, tools, SDKs etc Nvidia is providing.

I'm not aware of anything similar at any other GPU company

CoastalCoder · on Jan 28, 2022

Can someone comment on the legality of a 3rd party providing an unauthorized implemention of the CUDA API?

I would think that Oracle's loss of a similar lawsuit with Java would be related.

my123 · on Jan 28, 2022

> Can someone comment on the legality of a 3rd party providing an unauthorized implemention of the CUDA API?

NVIDIA said that they would be fine with it in the past, and ROCm HIP is just a (bad) CUDA API clone.

jabbany · on Jan 28, 2022

Interesting question. Press article aside, GPGPU applications like scientific compute, ML etc. have all mostly gravitated to Nvidia / CUDA.

Not working in this space, I'm curious why this is the case. Is there something inherently better about CUDA? Or is it that Nvidia's performance is somehow better for these tasks? Or maybe something else?

mattkrause · on Jan 28, 2022

The products are good, but NVidia also cleverly bootstrapped a whole ecosystem around them.

One of the other posts mentions 2014 as a turning point. At that time, GPGPU stuff was entering the (scientific) mainstream and NVidia was all over academia, convincing people to try it out in their research. They handed out demo accounts on boxes with beefy GPUs, and ran an extremely generous hardware grant proposal. There was tons of (free) training available: online CUDA MOOCs and in-person training sessions. The first-party tools were pretty decent. As a result, people built a lot of stuff using CUDA. Others, wanting to use those programs, basically had to buy NVidia. Lather, rinse, repeat.

This is in stark contrast to the other “accelerator” vendors. Around the same time, I looked into using Intel Xenon Phi and they were way less aggressive: “here are some benchmarks, here’s how it works, your institution has one or two somewhere if you want to contact them and try it out.” As for OpenCL…crickets. I don’t even remember any AMD events, and the very broad standard made it hard to figure out what would work/work well and you might end up needing to port it too!

jjoonathan · on Jan 28, 2022

AMD's GPU OpenCL wasn't just not marketed, it was also a bad product, even for relatively tame scientific purposes, even when AMD made loud, repeated statements to the contrary. Hopefully now that AMD has money they can do better.

I'm sure that NVidia's ecosystem building played a role (I remember events in 2009 and books before that), perhaps even a big role, but it wasn't the only factor. I paid a steep price in 2014 and 2016 for incorrectly assuming that it was.

jjoonathan · on Jan 28, 2022

Back in 2014 or so I made the unfortunate mistake of buying AMD cards with the thought that I'd just use OpenCL. I knew that some codes wouldn't run, but I had catalogued the ones I really cared about and thought I was up for the challenge. I was so, so wrong.

First of all, software that advertised OpenCL or AMD compatibility often worked poorly or not at all in that mode. Adobe creative suite just rendered solid black outputs whenever acceleration was enabled and forums revealed that it had been that way for years and nobody cared to fix it. Blender supported OpenCL for a while, but it was slower than CPU rendering and for a sticky reason (nvidia did the work to support big kernels with heavy branching and AMD didn't). Ironically, OpenCL mode had decent performance but only if you used it on an nvidia card.

The situation was even worse in scientific codes, where "OpenCL version" typically meant "a half-finished blob of code that was abandoned before ever becoming functional, let alone anywhere near feature-parity."

I quickly learned why this was the case: the OpenCL tooling and drivers weren't just a little behind their CUDA counterparts in terms of features, they were almost unusably bad. For instance, the OpenCL drivers didn't do memory (or other context object?) cleanup, so if your program was less than perfect you would be headed for a hard crash every few runs. Debugging never worked -- hard crashes all around. Basic examples didn't compile, documentation was scattered, and at the end of the day, it was also leagues behind CUDA in terms of features.

After months of putting up with this, I finally bit the bullet, sold my AMD card, bought an NVidia card, ate the spread, the shipping, the eBay fees, and the green tax itself. It hurt, but it meant I was able to start shipping code.

I'm a stubborn bastard so I didn't learn my lesson and repeated this process two years later on the next generation of cards. The second time, the lesson stuck.

naavis · on Jan 28, 2022

NVIDIA provides tools that just mostly do not exist for other GPUs, making it easier to build on CUDA instead of something else.

hiptobecubic · on Jan 28, 2022

Absolutely this. When cuda was first making headway it was the only thing even remotely close to a "developer environment" and made things significantly easier than any of the alternatives.

It might be different now, but at that time, many of the users were not computer scientists, they were scientists with computers. Having an easier to use programming model and decent debugging tools means publishing more results, more quickly.

carlmr · on Jan 28, 2022

From my cursory knowledge of the topic, there are competitors like ROCm, but CUDA was the first that had a useable solution here. Also last time I checked ROCm doesn't have broad support on consumer cards, which makes it harder for people to try it out at home.

But it seems ROCm is getting better and it has tensorflow and pytorch support, so there's reasons to be hopeful to see some competition here.

erwincoumans · on Jan 28, 2022

The fine grain parallelism of this simulation suits the GPU well. It would be possible on multicore CPUs, but possibly slower.

Jeff_Brown · on Jan 28, 2022

How much emergent behavior arises from the model? The only passage I see describing any of it is this one:

> The model showed that the cell dedicated most of its energy to transporting molecules across the cell membrane, which fits its profile as a parasitic cell.

Whether it mimics the behavior of real cells seems like the right test. We'll never be able to get it to parallel the outcome of a real system, thanks to chaos theory. But if it does lots of things that real cells do -- eating, immune system battles, reproduction -- we should be pretty happy.

VikingCoder · on Jan 28, 2022

How long did it take to simulate 20 minutes?

Looks like one NVIDIA Titan V took 10 hours to do it, and one NVIDIA Tesla Volta V100 GPU took 8 hours to do it?

Am I reading that right?

So the NVIDIA Tesla Volta V100 is 24 times slower than real life? Pretty cool.

Koshkin · on Jan 28, 2022

Generally speaking, this depends on the size (in terms of the number of constituents) of the piece of "real life" you are simulating.

rsfern · on Jan 28, 2022

This is really cool, but I don’t think it’s an atomistic simulation so I’m not sure where the title is coming from.

It seems to be some kind of a (truly impressive) kinetic model

The paper in Cell is open access https://doi.org/10.1016/j.cell.2021.12.025

amelius · on Jan 28, 2022

A two-billion atom cell ... isn't that a bit small for a cell?

wcoenen · on Jan 28, 2022

Yes. The cell was first created in the real world as part of research about the minimal set of genes required for life.[1] It is known as "JCVI-syn3.0" or "Mycoplasma laboratorium".'[2]

Still amazing that it can now be fully simulated "in silico".

[1] https://www.science.org/doi/10.1126/science.aad6253

[2] https://en.wikipedia.org/wiki/Mycoplasma_laboratorium

Traubenfuchs · on Jan 28, 2022

It says

"In this new organism, the number of genes can only be pared down to 473, 149 of which have functions that are completely unknown."

But if we now can simulate this cell completely, shouldn't it be easy to figure out what those genes are doing? Just start the simulation with them knocked out.

WJW · on Jan 28, 2022

Presumably if the number of genes cannot be pared down below 473, it dies very quickly if one of the 149 genes is knocked out. But "it doesn't work without it" is not a very satisfactory answer to "what does it do".

amelius · on Jan 28, 2022

Yes, this is similar to opening a radio and saying "I don't know what this transistor does; let's take it out and see what the radio does".

dekhn · on Jan 28, 2022

See also "Can a biologist fix a radio" https://www.cell.com/cancer-cell/pdf/S1535-6108(02)00133-2.p... "Doug & Bill"(http://www2.biology.ualberta.ca/locke.hp/dougandbill.htm) "Could a neuroscientist understand a microprocessor"? https://journals.plos.org/ploscompbiol/article?id=10.1371/jo...

The funny thing is if you read the history of Feynman and others, most of them grew up opening up radios and learning how they worked by removing things. fixing them. It's a very common theme (sort of falls off post-transistor tho). I opened up radios as a kid, tried to figure out what parts did what, and eventually gave up.

breck · on Jan 28, 2022

That is a great read. Thanks :)

Jeff_Brown · on Jan 28, 2022

Before attempting to crack the copy-protection on a game, one might think something similar.

quickthrower2 · on Jan 28, 2022

Valgrind that cell!

vital101 · on Jan 28, 2022

The article mentions that they use minimal cells. "Minimal cells are simpler than naturally occurring ones, making them easier to recreate digitally."

agentultra · on Jan 28, 2022

Permutation City, here we come.

I wonder if Greg Egan had the foresight to predict this for the story or if he invented that part for narrative purposes.

chinathrow · on Jan 28, 2022

When I was like 12 or so, I had a thought that if we can calculate everything, we could be living in a full blown simulation.

To be honest, like 30y later, I still go back to that nagging thought _a lot_.

benlivengood · on Jan 28, 2022

The thought that sticks in my mind is mathematical realism; if we can prove the mathematical existence of the outcome of a simulation (nothing harder than inductively showing that the state of a simulation is well-defined at state S for the first and all successive S) then what's the difference between things in the simulation actually existing v.s. possibly existing? All of the relationships that matter between states of the simulation are already proven to exist if we looked at (calculated) them, so what necessary property can we imagine our Universe having that the possible simulation does not?

visarga · on Jan 28, 2022

> so what necessary property can we imagine our Universe having that the possible simulation does not?

It lacks the magical spark, the qualia, the spirit, the transcendent. Or what people like to imagine makes our own reality special. Our own reality cannot be understood because it's such a hard problem, and it "feels like something" (maybe like a bat?), while a simulation is just math on CPUs. Consciousness is a hard problem because it transcends physical sciences, it's so great that it can exist even outside the realm of verification. /s

Hope you forgive the rant, it's just amazing how much philosophy can come from the desire to fly above the mechanics of life. But what they missed is that the reality of what happens inside of us is even more amazing than their imaginary hard problem and special subjective experience. The should look at the whole system, the whole game, not just the neural correlates. What does it take to exist like us?

VikingCoder · on Jan 28, 2022

A simulated hurricane doesn't kill anyone.

But it may be possible that there's no such thing as "simulating" intelligence. If you do certain calculations, that is "intelligent." Same for consciousness, etc.

wrinkl3 · on Jan 28, 2022

A simulated hurricane would kill simulated people.

tsol · on Jan 28, 2022

Think of simulated children! Oh the simulated pain..

disease · on Jan 28, 2022

"We live inside a dream."

kingofclams · on Jan 28, 2022

https://qntm.org/responsibility

afshin · on Jan 28, 2022

This idea has been formalized: https://www.simulation-argument.com/

sva_ · on Jan 28, 2022

This idea has also existed for at least 200 years

https://en.m.wikipedia.org/wiki/Laplace%27s_demon

dekhn · on Jan 28, 2022

I read that as a teenager, thought it sounded nice, went to grad school and did molecular dynamics simulations (like folding at home) for a decade, then went to google and built the world's largest simulation system (basically, the largest group of nodes running folding at home). Eventually we shut the system down because it was an inefficient way to predict protein structure and sample folding processes (although I got 3-4 excellent papers from it).

The idea is great, it was a wonderful narrative to run my life for a while, but eventually, the more I learned, the more impractical using full atomistic simulations seem for solving any problem. It seems more likely we can train far more efficient networks that encapsulate all the salient rules of folding in a much smaller space, and use far less CPU time to produce useful results.

sva_ · on Jan 28, 2022

Yeah, I think the idea of Laplace's Demon is mostly just useful to make a philosophical argument about whether or not the universe is deterministic, and it's implication on free will.

dekhn · on Jan 28, 2022

I dunno, I wonder what Laplace would have made of the argument over the meaning of wavefunction collapse. It took me a very long time to come to terms with the idea of a non-deterministic universe.

sva_ · on Jan 28, 2022

That's peculiar. Most people probably struggle more with the idea of a deterministic universe, as it'd leave no room for free will, which would make everything kind of meaningless.

I'm also more in the camp of "quantum effects making the universe non-determinstic." It's a nicer way to live.

dekhn · on Jan 28, 2022

I've evolved over the years from "determinism implies no free will" to roughly being a compatibilist (https://en.wikipedia.org/wiki/Compatibilism, see also Daniel Dennett). I don't particularly spend much time thinking that (for example) a nondeterministic universe is required for free will. I do think from an objective sense the universe is "meaningless", but that as humans with agency we can make our own meaning.

However, most importantly, we simply have no experimental data around any of this for me to decide. Instead I enjoy my subjective life with apparent free will, regardless of how the machinery of the actual implementation works.

mensetmanusman · on Jan 28, 2022

It’s interesting that many things are deterministic to human-relevant time/length scales. If the small stuff is non-deterministic, it’s interesting that large ensembles of them are quite deterministic.

It’s maddening :)

lelandfe · on Jan 28, 2022

https://en.wikipedia.org/wiki/Evil_demon

Going further back to the 1600's, Descartes' idea of an evil demon deceiving one's mind with a perfect, fake reality made me think often of simulations in my undergrad philosophy classes

tsol · on Jan 28, 2022

Looking at it from that view, we're just as likely to be a simulation as we are to have been created by God. I mean I'm a theist, but I don't see many huge differences except the cultural aspect where the theism/atheism debate is something most people have an emotional connection to.

KarlKemp · on Jan 28, 2022

A God, not being out for her own amusement, will likely create only one universe.

A player with a simulator will create dozens.

tsol · on Jan 28, 2022

>A God, not being out for her own amusement, will likely create only one universe.

Why would that be? I see no reason why God might not create parallel universes

coolspot · on Jan 28, 2022

Electrical bill and GPU shortages in God’s reality could be a reason.

Taylor_OD · on Jan 28, 2022

It's a bit naive... But the best argument for me that we are living in a simulation is that we went from Pong to pretty good VR (good enough that if you have a beer or two before using you can forget its VR for some period of time) in 50 years. In another 50 years it seems fair to assume that we will be able to create VR that fully immersive and impossible to distinguish from real life.

Even with no other arguments about the benefits of WHY one would want to live in a fully simulated world... It seems probable to me that we are based on the idea that it could be possible.

iamstupidsimple · on Jan 28, 2022

> In another 50 years it seems fair to assume that we will be able to create VR that fully immersive and impossible to distinguish from real life.

Technology growth is always non-linear. it's also fair to assume we could stagnate for 50 years also.

joseluis · on Jan 28, 2022

we don't even need to be able to calculate everything, we just need to fool you! The Truman's show meets the Matrix.

reasonabl_human · on Jan 28, 2022

If you want to solve that nagging thought, pick up Griffith‘s intro to quantum mechanics textbook. Goes through the philosophical implications of qm alongside learning the physics. The world as we know it is non-deterministic thanks to wave functions and their random collapsing!

amself · on Jan 28, 2022

I went through the same phase at 12. I am nearing 18 now, and I am very thankful for nondeterminism.

agentultra · on Jan 28, 2022

Specifically, I'm referring to Autoverse, the artificial simulation of a single bacterium down to the atomic level.

It was such a fascinating idea that I found myself more than once trying to mimic the atomic part at a much smaller scale over the years.

webmaven · on Jan 28, 2022

> Permutation City, here we come.