Ex-Meta scientists debut gigantic AI protein design model
EvolutionaryScale introduces ESM3, a powerful AI protein design model trained on billions of sequences. Secured $142 million funding for drug development. Addresses concerns about AI-designed proteins. Researchers anticipate its impact.
Read original articleEvolutionaryScale, founded by ex-Meta scientists, has introduced a large AI protein design model called ESM3. This model, trained on over 2.7 billion protein sequences and structures, has the capability to create new proteins based on user specifications. The company secured $142 million in funding to expand its applications in drug development and sustainability. By leveraging AI technology, EvolutionaryScale aims to make biology programmable and has already demonstrated the creation of new fluorescent proteins. The model has the potential to revolutionize medicine by designing entirely new proteins. Despite concerns about the potential weaponization of AI-designed proteins, EvolutionaryScale has taken measures to mitigate risks, including excluding certain sequences from training data. Researchers are excited about the possibilities ESM3 offers in designing proteins and anticipate its impact on various fields, including drug development and sustainability. The model's open-source version allows for collaboration and experimentation, although the largest version requires significant computing resources to replicate independently.
Related
Are AlphaFold's new results a miracle?
AlphaFold 3 by DeepMind excels in predicting molecule-protein binding, surpassing AutoDock Vina. Concerns about data redundancy, generalization, and molecular interaction understanding prompt scrutiny for drug discovery reliability.
ESM3, EsmGFP, and EvolutionaryScale
EvolutionaryScale introduces ESM3, a language model simulating 500 million years of evolution. ESM3 designs proteins with atomic precision, including esmGFP, a novel fluorescent protein, showcasing its potential for innovative protein engineering.
How AI Revolutionized Protein Science, but Didn't End It
Artificial intelligence, exemplified by AlphaFold2 and AlphaFold3, revolutionized protein science by accurately predicting protein structures. Despite advancements, AI complements rather than replaces biological experiments, highlighting the complexity of simulating protein dynamics.
AI Revolutionized Protein Science, but Didn't End It
Artificial intelligence, exemplified by AlphaFold2 and its successor AlphaFold3, revolutionized protein science by predicting structures accurately. AI complements but doesn't replace traditional methods, emphasizing collaboration for deeper insights.
New technique opens the door to large-scale DNA editing to cure diseases
Researchers have described a new genetic editing mechanism using jumping genes to insert DNA sequences accurately. This system shows promise in overcoming CRISPR limitations, with 94% accuracy and 60% efficiency in bacteria. Optimizations are needed for mammalian cell use.
The classical approach was to understand how genes transcribe to mRNA, and how mRNA translates to polypeptides; how those are cleaved by the cell, and fold in 3D space; and how those 3D shapes results in actual biological function. It required real-world measurement, experiment, and modeling in silico using biophysical models. Those are all hard research efforts. And it seems like the mindset now is: we've done enough hard research, let's feed what we know into a model, hope we've chosen the right hyperparameters, and see what we get. Hidden in the weights and biases of the model will be that deeper map of the real world that we have not yet fully grasped through research.
But the AI cannot provide a 'why'. Its network of weights and biases are as unintelligible to us as the underlying scientific principles of the real world we gave up trying to understand along the way. When AI produces a result that is surprising, we still have to validate it in the real world, and work backwards through the hard research to understand why we are surprised.
If AI is just a tool for a shotgun approach to discovery, that may be fine. However, I fear it is sucking a lot of air out of the room from the classical approaches. When 'AI' produces incorrect, misleading, or underwhelming results? Well, throw more GPUs at it; more tokens; more joules; more parameters. We have blind faith it'll work itself out.
But because the AI can never provide a guarantee of correctness, it is only useful to those with the infrastructure to carry out those real-world validations on its output, so it's not really going to create a paradigm shift. It can provide only a marginal improvement at the top of the funnel for existing discovery pipelines. And because AI is very expensive and getting more so, there's a pretty hard cap on how valuable it would be to a drugmaker.
I know I'm not the only one worried about a bubble here.
> When the researchers made around 100 of the resulting designs, several were as bright as natural GFPs, which are still vastly dimmer than lab-engineered variants.
So they didn't come up with better functionality, unlike what some commentators imply. They basically introduced a bunch of mutations while preserving the overall function.
Relevant: https://en.wikipedia.org/wiki/Conservative_replacement
That sounds like a glove being thrown down.
Not to downplay this achievement, but 60% sequence identity is nowhere near “vastly different”.
"It's just three guys and a model and they think it's worth X hundred million"
we just had to wait for this approach to be apparent
Except for the part where a sequence is actually deemed more fit, ie natural selection? And the part where mutations are random, instead of sampled from the training data manifold, so much more constrained?
...so really it's a worse version of random search?
Related
Are AlphaFold's new results a miracle?
AlphaFold 3 by DeepMind excels in predicting molecule-protein binding, surpassing AutoDock Vina. Concerns about data redundancy, generalization, and molecular interaction understanding prompt scrutiny for drug discovery reliability.
ESM3, EsmGFP, and EvolutionaryScale
EvolutionaryScale introduces ESM3, a language model simulating 500 million years of evolution. ESM3 designs proteins with atomic precision, including esmGFP, a novel fluorescent protein, showcasing its potential for innovative protein engineering.
How AI Revolutionized Protein Science, but Didn't End It
Artificial intelligence, exemplified by AlphaFold2 and AlphaFold3, revolutionized protein science by accurately predicting protein structures. Despite advancements, AI complements rather than replaces biological experiments, highlighting the complexity of simulating protein dynamics.
AI Revolutionized Protein Science, but Didn't End It
Artificial intelligence, exemplified by AlphaFold2 and its successor AlphaFold3, revolutionized protein science by predicting structures accurately. AI complements but doesn't replace traditional methods, emphasizing collaboration for deeper insights.
New technique opens the door to large-scale DNA editing to cure diseases
Researchers have described a new genetic editing mechanism using jumping genes to insert DNA sequences accurately. This system shows promise in overcoming CRISPR limitations, with 94% accuracy and 60% efficiency in bacteria. Optimizations are needed for mammalian cell use.