New data-mining effort launched to study mental disorders

New data-mining effort launched to study mental disorders

$13.75 million in grants to fund multi-institutional project based in Chicago

October 6, 2011

Chicago will be home to a new $13.75 million project that will apply data mining methods to better understand the genetic and environmental factors behind neuropsychiatric disorders.

The Silvio O. Conte Center, a multi-institutional effort based at the University of Chicago, will combine the statistical power of pre-existing genetics, pharmacogenomics, text-mining, and clinical record databases to confront diseases that have so far frustrated researchers.

"There are multiple communities looking at the same problem," said Andrey Rzhetsky, PhD, Professor of Medicine and Human Genetics, Senior Fellow of the Computation Institute at the University of Chicago and Institute for Genomics and Systems Biology, and Director of the Conte Center. "We are trying to combine them all to model and analyze those data types jointly, looking at multiple phenotypes simultaneously and looking for possible environmental factors."

The center will initially be funded by an $11.75 million grant from the National Institute of Mental Health and $2 million from the Chicago Biomedical Consortium. Researchers and datasets from the University of Chicago, Northwestern University, University of Illinois at Chicago, Stanford University, Children's Hospital Boston, Columbia University, and the University of Haifa will be involved in the project.

To determine the biological basis of mental disorders such as schizophrenia and depression, scientists have tried many technical approaches -- each with their own strengths and weaknesses.

For example, genetic association studies of psychiatric disorders have located gene variants associated with the disorders, but have been able to explain only a small percentage of their heritability. Researchers have also collected detailed clinical records on psychiatric patients and the efficacy and side effects of available treatments, but the potentially valuable information within those records remains largely untapped.

Rather than focusing on just one of these methods, the Conte Center will apply computational analysis to data from all of them to discover new network relationships between genes, environmental factors, and clinical phenotypes. The results will create novel, testable hypotheses that could alter how experts define and treat neuropsychiatric disorders.

"There's more data than we know what to do with at this point," said Edwin H. Cook, Jr., MD, Professor of Psychiatry at the University of Illinois at Chicago. "The analytic, informational, and data management approaches in this very forward-thinking Conte Center should allow us to find things that we couldn't before."

Neuropsychiatric disorders are particularly well suited for this approach due to the hazy diagnostic and biological borders between conditions. A 2007 study by Rzhetsky and colleagues that applied statistical modeling methods to patient records alone found a significant overlap between autism, schizophrenia, and bipolar disorder that implied a genetic relationship.

"Most studies are done one disorder at a time, and that's like studying the trunk or the hoof or the tail of an elephant; you might miss the big picture," said Benjamin Lahey, PhD, Irving B. Harris Professor of Epidemiology at the University of Chicago. "This project will enable us to look at things in a way that has never been done before, at a scale that dwarfs anything that's ever been done."

Russ Altman, MD, PhD, Professor of Bioengineering, Genetics, and Medicine at Stanford University said, "Diagnosis and treatment of these disorders is incredibly challenging. These data-driven approaches have a real chance to uncover new models for not only the pathogenesis of the individual diseases, but perhaps even a new way to think about the constellation of related diseases."

The center will operate similarly to a "large software project," the investigators said, with four simultaneous projects and three core centers working in parallel to produce integrated results. An advanced, cloud-based computing system will be used to share data among investigators and with the public.

The data-mining efforts will generate models of the interaction between genes, environmental factors, and phenotypes that can then be tested in collaboration with the Institute for Genomics and Systems Biology at the University of Chicago.

"The molecular basis for diseases such as autism, schizophrenia and bipolar disorder has been tremendously difficult to resolve," said Kevin White, PhD, James and Karen Frank Family Professor of Ecology & Evolution and Director of the IGSB. "By bringing together experts from multiple domains, from computational biologists to clinicians, the team Rzhetsky has assembled will be focusing on high-risk, high-reward research in an attempt to propel the field forward."

If successful, the approach of mining existing data from several different methods could potentially be applied to other types of disorders, including the overlap between mental and physical disorders such as schizophrenia and diabetes.

"We definitely have one of the strongest genomics groups in the country, we have probably one of the strongest statistical genetics groups, and we have excellent world-renowned experts in phenotypes," Rzhetsky said. "It's exciting because there is potential, but now we have to work hard to get there."

Funding for this project was provided by the National Institute of Mental Health and the Chicago Biomedical Consortium. In addition to Rzhetsky, Cook, Lahey, Altman, and White, the Conte Center investigators include Dan Nicolae, Nancy Cox, Robert Grossman, Barry Aprison, and Elliot Gershon of the University of Chicago; Isaac Kohane of Children's Hospital Boston; Raul Rabadan of Columbia University; Karl Deisseroth of Stanford University; Richard Morimoto of Northwestern University; and Mor Peleg of University of Haifa.