New AI Blood Test Predicts ALS Diagnosis and Survival

New AI Blood Test Predicts ALS Diagnosis and Survival

With the devastating journey of an amyotrophic lateral sclerosis (ALS) diagnosis often taking more than a year, the search for a faster, more accurate method is a top priority in neuroscience. Biopharma expert Ivan Kairatov joins us to discuss a groundbreaking new approach that uses the power of machine learning to analyze blood samples. We’ll explore how this technology can distinguish the unique genetic signature of ALS with remarkable accuracy, how it could revolutionize patient eligibility for clinical trials, and how it’s already pointing the way toward potential new therapies, offering a much-needed beacon of hope for patients and their families.

Your findings suggest a machine learning model can diagnose ALS from blood with up to 91% accuracy. Could you walk us through the process of how the XGBoost model analyzes RNA sequencing data to distinguish between the 2,500 unique genes found in ALS patients versus controls?

It’s a process that moves from immense complexity to refined precision. We start by taking a simple blood sample and performing RNA sequencing. This gives us a snapshot of all the gene activity in the blood cells. What we discovered was staggering: we identified over 2,500 unique genes that behave differently—or “express” differently—in individuals with ALS compared to healthy controls. Many of these were surprisingly linked to the immune system. The challenge then becomes, how do you make sense of 2,500 signals? That’s where the XGBoost machine learning model comes in. You can’t just look at one or two genes; you have to see the whole pattern. We trained the model on this vast dataset, teaching it to recognize the subtle, collective signature of ALS. The model learns which combinations of genes are the most powerful predictors, ultimately narrowing that massive list down to a focused panel of just 27 to 46 key genes. It’s this multi-gene panel that allows it to achieve that remarkable 91% accuracy, a level of certainty that was previously unheard of for an ALS biomarker.

The article notes that a typical ALS diagnosis can take over a year. Based on your research, what steps would be involved in implementing this gene expression biomarker panel in a clinical setting, and what specific impact could it have on a patient’s eligibility for clinical trials?

The current diagnostic odyssey for patients is agonizing. They face a year, sometimes more, of uncertainty, undergoing numerous tests while the disease progresses. Implementing our panel could fundamentally change this. The first step in a clinical setting would be to validate these findings in larger, more diverse patient populations. Once validated, the process for a patient would be as simple as a routine blood draw. That sample would undergo RNA sequencing, and the data would be fed into our predictive model. Instead of a year-long process of elimination, a physician could get a highly accurate, data-driven assessment much earlier. This has a monumental impact on clinical trial eligibility. Many trials have strict enrollment criteria, often excluding patients whose disease is too advanced. By diagnosing someone earlier, we open a crucial window of opportunity. They can enroll in trials for promising new treatments when their motor neurons are more intact and the therapy has the greatest chance of making a meaningful difference. It shifts the entire paradigm from reactive to proactive care.

You contrast your multi-gene approach with the single biomarker, neurofilament light chain (NfL). Can you elaborate on the key limitations of relying on NfL and provide an example of how your model more accurately differentiates ALS from conditions like Alzheimer’s or Parkinson’s disease?

Neurofilament light chain, or NfL, has been a useful tool, but it has a significant flaw: it’s a general alarm, not a specific one. NfL is a protein that gets released when neurons are damaged. So yes, its levels are elevated in ALS, but they are also elevated in Alzheimer’s, Parkinson’s, and multiple sclerosis. If a patient comes in with early neurological symptoms and has high NfL levels, you still don’t know for sure what the underlying cause is. It tells you there’s a fire, but not where or what kind. Our multi-gene model is fundamentally different. It’s not looking for general damage; it’s searching for a specific genetic fingerprint. For example, the pattern of gene expression it detects is unique to the biological cascade of ALS. So, when our model flags a sample as positive, it’s not just saying “there is neurodegeneration.” It’s saying, “the specific biological processes we see active in these blood cells are characteristic of ALS and not of Parkinson’s or Alzheimer’s.” This specificity is the key to providing a confident, early diagnosis and avoiding the misdirection that a non-specific marker like NfL can cause.

Beyond diagnosis, you developed models to predict patient survival by adding clinical information. Could you share some details on what “core genes” you identified and how their discovery led your team to pinpoint eight potential drugs, such as trifluoperazine, for future therapeutic research?

That’s right, diagnosis was just the beginning. We wanted to see if we could also predict prognosis, as patient survival can vary from two to four years, and sometimes longer. By incorporating clinical data alongside the gene expression levels, our models could better differentiate between patients with shorter or longer survival times. During this analysis, we identified a set of “core genes” that were particularly influential. What’s fascinating is that these genes found in the blood share features with the very motor neurons in the spinal cord that are devastated by ALS. This provided a powerful biological link. It meant these core genes weren’t just random markers; they were likely central to the disease mechanism itself. This discovery became a launchpad for therapeutic exploration. We used this genetic information to search databases of existing drugs to see which compounds might influence these core genes. This computational approach flagged eight potential candidates for repurposing, including drugs like trifluoperazine. It’s a beautiful example of how a diagnostic tool can directly illuminate a path toward treatment.

What is your forecast for the future of ALS diagnostics and treatment, considering the potential of machine learning tools like the one you’ve developed?

My forecast is one of cautious but profound optimism. We are on the cusp of a major shift in how we approach ALS. In the next decade, I believe machine learning-driven blood tests will become a standard part of the diagnostic toolkit, drastically cutting down the painful diagnostic delay. This will not only bring relief to patients and families but will also accelerate research by getting the right patients into the right clinical trials faster. But it goes beyond just a faster diagnosis. The future is about precision. We’ll use these tools not only to diagnose but to stratify patients, predicting their disease course and understanding their unique biology. This will lead to more personalized treatments. The fact that our model could identify potential drug targets like ibrutinib is a testament to this. We’re moving away from a one-size-fits-all approach and toward a future where we can analyze a patient’s genetic signature to predict their prognosis and select the most promising therapies for them. It’s an incredibly exciting time.

Subscribe to our weekly news digest.

Join now and become a part of our fast-growing community.

Invalid Email Address
Thanks for Subscribing!
We'll be sending you our best soon!
Something went wrong, please try again later