As a kid growing up in Hyderabad, India, Pullannagari found himself drawn to technology and its possibilities.
He liked the idea of researching a problem and applying that knowledge to create a technological solution to address it. It’s ultimately what led him to the United States to study computer science with the intention of getting involved in app development.
Pullannagari, now a senior at the , hadn’t initially considered health care as an area to deploy those skills, but he made it a point to start going to hackathons around the country, at places like Harvard, MIT and the University of California, Berkeley, and he kept noticing it as one possible track for exploration.
“It always intrigued me,” Pullannagari says. “Especially AI in health care.”
His curiosity was similarly piqued last year when Climer told him about a Biological Data Science course she was teaching in the fall semester, designed to give students an introduction to key areas of biological data science and provide them hands-on experience processing and analyzing genetic data.
Pullannagari eagerly enrolled, joining five graduate students and nine other undergraduates in the class. Over the course of the semester, they downloaded data sets associated with a disease of their choosing from the Gene Expression Omnibus, a public functional genomics data repository, learned how to clean and process the data, built network models that captured correlation patterns, and tested those patterns for association with a trait of interest, such as the presence of a disease. They then validated their results with independent data.
“I had built apps that were basically cutting down the gap between doctors and patients and using AI so they could have better care,” Pullannagari says. “But this is a completely different side of it. It’s network modeling, which I’ve been really interested to try. There’s been a boom recently in how it can be used for lead creating and lead generating. I thought, ‘Why not explore this in health care? Let’s see where we would go with this.’”
Pullannagari and his classmates followed the same approach that Climer has used in her own research, which is often focused on genetic data associated with Alzheimer’s.
But they used it to investigate data sets connected to a variety of other conditions. They were able to identify and validate significant associations involving gene expression levels in cells lining the colon for colon cancer patients, blood samples for rheumatoid arthritis patients and skin biopsies for psoriasis patients. There were other relevant findings involving gene expression levels in blood samples for lupus patients and microRNA levels in blood samples for cancer patients.
Some of Climer’s students have been continuing their analyses this semester and are examining traits such as DNA methylation levels for COVID-19, DNA methylation levels for hepatocellular carcinoma and gene expression levels for ALS.
“Each of the validated patterns,” Climer says, “represents a biomarker signature with potential to identify individuals exhibiting a subtype of the given trait.”
One size doesn’t fit all
The word “subtype” is an important one because conditions and diseases, including most types of cancer, diabetes, rheumatoid arthritis, lupus, Alzheimer’s and countless others, are heterogeneous.
That means they have multiple forms, often arising from varying genetic, molecular or environmental factors, such as diet, exercise and exposure to toxins. In general, complex diseases have multiple genetic variants working together in complicated biological processes, possibly augmented by those environmental factors, that create a distinctive multifactorial genetic and environmental signature for each subtype.
It all can lead to different clinical presentation. One subtype might show up earlier in a person’s life than another or seem to act more aggressively, increasing the urgency to diagnose the problem and begin treatment.
Precision medicine aims to account for those differences while creating a more tailored approach to health that customizes disease prevention and treatment based on each patient’s unique genetic profile, environment and lifestyle.
But Climer says common tests for diseases often still don’t account for those differences.
“The methods that other people are using are really good if there’s only one type, if it’s homogenous, but they don’t work when there are subtypes,” she says. “The correlation measures don’t work because they’re looking for the correlations for everybody, not just for a small group.”
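The dilution Climer describes can be illustrated with a small simulation. This is only a sketch under stated assumptions, not her method: the two "genes," the 10% subtype fraction, the noise levels and the helper names `pearson` and `simulate` are all hypothetical choices made for illustration. Two measurements are tightly coupled inside a small subgroup and unrelated in everyone else, and the ordinary Pearson correlation is computed first over everybody and then over the subgroup alone.

```python
import random


def pearson(xs, ys):
    """Plain Pearson correlation coefficient for two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def simulate(n_people=1000, subtype_fraction=0.10, seed=0):
    """Hypothetical data: gene B tracks gene A only within the subtype."""
    rng = random.Random(seed)
    n_subtype = int(n_people * subtype_fraction)
    gene_a, gene_b, in_subtype = [], [], []
    for i in range(n_people):
        a = rng.gauss(0, 1)
        if i < n_subtype:
            b = a + rng.gauss(0, 0.1)  # tightly coupled inside the subtype
        else:
            b = rng.gauss(0, 1)        # unrelated for everyone else
        gene_a.append(a)
        gene_b.append(b)
        in_subtype.append(i < n_subtype)
    return gene_a, gene_b, in_subtype


gene_a, gene_b, in_subtype = simulate()

# Correlation measured across everybody.
r_everyone = pearson(gene_a, gene_b)

# Correlation measured only within the small subgroup.
sub_a = [a for a, flag in zip(gene_a, in_subtype) if flag]
sub_b = [b for b, flag in zip(gene_b, in_subtype) if flag]
r_subtype = pearson(sub_a, sub_b)

print(f"correlation across everyone: {r_everyone:.2f}")
print(f"correlation within subtype:  {r_subtype:.2f}")
```

In this toy setup the relationship is nearly perfect inside the subgroup but looks weak when averaged over the whole population, which is the pattern the quote points to.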
The statistical measures used to evaluate a test’s effectiveness often rely on examining the true positive, true negative, false positive and/or false negative rates. But if the test is looking for a single biomarker associated with a subtype of a disease that represents only about 10% of all cases of that condition, the true positive rates will be inherently small, rendering the test ineffective.
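The arithmetic behind that point can be worked through in a few lines. The numbers here are assumptions chosen to match the 10% figure above (1,000 hypothetical cases and a marker that is perfect within its subtype); they are not data from the course.

```python
def true_positive_rate(true_positives, false_negatives):
    """Sensitivity: the share of actual cases that the test flags."""
    return true_positives / (true_positives + false_negatives)


total_cases = 1000                                   # hypothetical patient count
subtype_fraction = 0.10                              # subtype is ~10% of all cases
subtype_cases = round(total_cases * subtype_fraction)

# Assume the biomarker flags every patient in the subtype and nobody else.
true_positives = subtype_cases
false_negatives = total_cases - subtype_cases

rate_all_cases = true_positive_rate(true_positives, false_negatives)
rate_within_subtype = true_positive_rate(subtype_cases, 0)

print(f"true positive rate over all cases:   {rate_all_cases:.0%}")
print(f"true positive rate within subtype:   {rate_within_subtype:.0%}")
```

Even a marker that never misses within its subtype is scored as missing roughly nine of every 10 cases when it is judged against the disease as a whole.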
The computational tools Climer has developed, and the ones Pullannagari and his fellow students learned to deploy, are geared not only at recognizing a combination of biomarkers associated with a disease but looking at how closely correlated those biomarkers are with particular subtypes, which reveal themselves as clusters in the network models.
“Identifying the various subtypes is essential for advancing science on multiple fronts,” Climer says. “First, the specific analytes present in the biomarker pattern of each subtype offer insights that can help formulate hypotheses regarding the pathogenesis for that group, as well as highlight potential drug development targets. Second, the capacity to categorize individuals into subtypes during drug trials can be critical; a drug may work effectively for one subtype but may fail due to the inclusion of other subtypes. Lastly, the realization of successful precision medicine hinges on the ability to accurately diagnose an individual’s subtype and tailor treatment to their specific needs.”
A better way
Climer has tried to advocate for a new approach to biomarker research to improve health care prevention and treatment.
“She came up with a general method for estimating networks of interacting or associating elements that blew away all the alternative analyses,” said Alan Templeton, the Charles Rebstock Professor of Biology Emeritus at Washington University in St. Louis and one of Climer’s PhD advisors. “Genomics allows us to look at literally millions of genetic variants at a time, but most analyses could only analyze them one by one. Even looking at pairs was extremely difficult. But genes rarely work in isolation; they interact with one another and with the environment to produce the traits that people have.
“This is the reality that all geneticists know, but this reality was a computationally difficult problem (indeed, seemingly impossible), so the field was dominated by single-factor analyses with multiple factor data bases. Sharlee came up with an algorithm that could estimate networks of interacting or associated elements that broke this field wide open.”
In 2024, Climer highlighted her methods in a presentation to the Hope Center Neurogenetics and Transcriptomics Group at Washington University in St. Louis.
“To get people to come, I titled it: ‘We’re using the wrong statistics for precision medicine,’” Climer says. “I had a packed room, and I showed them, ‘These popular association testing methods are wrong, and these prevalent correlation measures are wrong.’ And then I explained how I’m doing it. I’ve developed tools for capturing subtypes by using data distributions for association testing and correlation metrics that evaluate each type of alignment. I convinced them all of it. But I still don’t see the change out there.
“The current methods are obviously wrong, and yet editors are reluctant to send my papers out. It seems like they’re looking for incremental advancements, not for a whole different way to approach things.”
She’s hoping the results from last fall’s biological data science course can help change that.
Seeing is believing
Pullannagari has continued working with Climer through an independent study course this semester and has been creating visualizations for each of the validated results produced by him or his classmates. He’ll then work to develop a manuscript that demonstrates the effectiveness of Climer’s methods across a variety of conditions or diseases.
“They show how the code or the approach that we took is universal and would be useful in all of the use cases that we have already done,” Pullannagari says. “It provides a broad spectrum to show how it can be used in different sides of biology, especially for gene expression and other diseases. It works perfectly fine.”
Climer intends to submit the paper for journal publication with each of the students serving as a co-author.
“I think we’ll get something good because there are quite a few really strong results,” Climer says. “What we have in the discovery set is almost identical in the validation, and that’s completely independent data. There’s got to be something there. It can’t just be by chance.”
The work could find an audience among other computer science researchers or clinicians who might use specific results to guide their research on a particular disease or a drug that might alleviate it.
Pullannagari feels fortunate to have had the opportunity to be part of the work, and he’s moved into other areas of health care-connected study while serving as a research assistant investigating lung cancer drug binding affinity at UMSL’s Center for Nanoscience.
“It’s been amazing,” he says. “I would really want to explore this field. I didn’t expect it to be this impactful and exciting to work on.”

