Schizophrenia is another one of those conditions (like alcoholism) that we're constantly being told is highly heritable, based on the results of twin studies and various other findings (from studies that pre-date the DNA-sequencing era). Studies "proving" the heritability of schizophrenia are, in general, rarely questioned, or even reviewed critically, due to the fact that a genetic basis for schizophrenia makes so much intuitive sense and agrees so well with ordinary experience. Everybody knows someone who "has schizophrenia in their family," and it very much does seem to travel in families.
The fact that something travels in families is not automatically indicative of a genetic connection, but most people aren't ready to believe such a thing since it goes against common sense. Nevetheless, in the past few decades, a great deal of work (much of it quite compelling) has gone into the study of things like intergenerational trauma, with researchers finding, for example, that the course of PTSD in modern-day Israeli military veterans is much different for soldiers whose parents or grandparents were Holocaust survivors. (I talk about intergenerational trauma in Of Two Minds. In fact, there's a whole chapter on Trauma.)
Social workers and criminologists have known for years that childhood physical and sexual abuse tend to be intergenerational. Yet no one seriously suggests there's a "child abuse gene" or a "child molestation gene." A lot of things that aren't purely genetic are trans-generational; familial. Schizophrenia itself may be in this category. The links between childhood trauma and schizophrenia are quite strong and well known. Indeed, the literature shows that the associations between childhood trauma and schizophrenia are actually as strong as or stronger than the associations between childhood trauma and other disorders that we know have a high correspondence to childhood trauma (including depression, PTSD, anxiety disorders, dissociative disorders, eating disorders, personality disorders, substance abuse, and sexual dysfunction, among others).
Still, the search for "schizophrenia genes" goes on, and every so often we hear claims for this or that group of scientists having discovered some new genetic feature that confers high risk for schizophrenia. The latest of these reports comes from the Schizophrenia Working Group of the Psychiatric Genomics Consortium, which published a piece in Nature Letters in July 2014 called "Biological insights from 108 schizophrenia-associated genetic loci." The report trumpets the finding of "108 conservatively defined loci that meet genome-wide significance, 83 of which have not been previously reported." This is a genome-wide association study, subject to the usual GWAS caveats.
Of the 108 loci, 75% were associated with protein-coding genes, a very high number for a GWAS study, and quite a few genes involve physiologically relevant functions.
On the surface, it sounds promising.
Where things start to fall down is in determining how much heritability risk the 108 loci can account for. The authors said: "Assuming a liability-threshold model, a lifetime risk of 1%, independent SNP effects, and adjusting for case-control ascertainment, RPS" (risk profile scores) "now explain about 7% of variation on the liability scale to schizophrenia across the samples, about half of which (3.4%) is explained by genome-wide significant loci." To parse that: We know the lifetime risk of schizophrenia is 1%. If we total up the disease risk attributable to all of the 128 markers found in the study, we can explain 7% of schizophrenia risk. But if we limit ourselves to the 108 markers that met criteria for genome-wide significance (a stringent test intended to reduce the risk of false positives), we can explain only a little more than 3% of schizophrenia risk.
So once again, it's a mixed bag: a genome-wide association study (GWAS) that's filled with interesting findings (some of which may prove to be important), but that fails, quite miserably, to explain heritability.
Many people have commented on the possible reasons why genome-wide association studies have been hit-or-miss in their ability to explain heritability, why they so often find SNPs (single nucleotide polymorphisms) that don't map to genes, why the results often don't replicate well, etc. I think if we're honest, we have to admit that GWAS isn't always the right tool for the job. It's a great tool, but you have to remember how it works. GWAS attempts to match fairly common (i.e., occurring in 1% or more of the population) single nucleotide polymorphisms (which you can think of as point mutations) in databases that have been specially built to accommodate the most common polymorphisms. In a GWAS, you aren't doing deep sequencing of people's genes; you're looking for markers that cover a rather tiny percent of the genome. These markers occur commonly, which almost by definition prevents them from being very serious defects, because serious genetic defects are rendered rare by evolution (natural selection removes them from the gene pool over time). Therefore, in a technique that, by design, looks mainly at commonly occurring SNPs, which are overwhelmingly neutral (from an evolutionary standpoint), we should not be surprised to find that the markers that get spotted in GWAS (as genome-wide significant) are essentially neutral mutations without much phenotypic effect.
Unfortunately, we're at an awkward point in the history of science, because technology has given us some powerful DNA analysis tools (and the computers needed to crunch the data), but we're not quite yet at the point where detailed, deep genetic sequencing of whole human genomes is economically feasible for a study that includes, say, a thousand cases and a thousand controls. (Right now, it costs about $10,000 to do the necessary sequencing.) When a human genome can be reliably sequenced for under $2000, we'll have the tools to do really serious, deep genetic studies of hard-to-pin-down diseases. Whether we'll have the computer power needed to do the necessary statistical analysis on the scale of large (N > 1000) case-control studies is another matter; remember that in GWAS we're dealing with perhaps 500,000 data points per genome whereas in deep sequencing we're generating billions of data points per genome. It'll probably be doable using Amazon AWS/EC2 infrastructure (or Google's compute-cloud). Another reason to invest in Amazon, probably.
I'm in the process of writing a 100,000-word book on mental illness. It's evidence-based; part science, part memoir, with over 200 footnotes (so you can refer to the scientific literature yourself). To follow the progress of the book, check back here often, and also consider adding your name to our mailing list. Thanks!
Showing posts with label GWAS. Show all posts
Showing posts with label GWAS. Show all posts
Saturday, January 10, 2015
Thursday, January 08, 2015
Where Are the Alcoholism Genes?
Alcohol abuse is the third leading cause of preventable death in the U.S. (behind smoking and obesity) creating a burden to the U.S. economy of about $200 billion a year. Data from twin studies (and other studies) have long suggested a strong genetic component to alcoholism. It's widely claimed that the heritability of alcoholism is about 50%; this number is supported by the National Institute on Alcohol Abuse and Alcoholism, for example, as well as at least one meta-analysis of twin and adoption studies. (For a dissenting meta-analysis, see this paper.)
No one expects that alcoholism will turn out to be a simple matter of inheriting one or two "alcoholism genes" (although at one time, this was in fact the hope). Instead, it's expected that multiple genetic features probably interact to produce the propensity to develop alcohol dependence. Even so, if alcoholism is as heritable as people say it is, and if it involves a tractable number of genes (dozens or scores, rather than hundreds or thousands), candidate genes should be fairly easy to discover using the powerful new tools of DNA analysis. Candidate SNPs (single nucleotide polymorphisms) should jump right off the page and hit you in the face. If gene copy number variants are involved, it should be readily apparent. If genes are involved at all, it should be pretty obvious pretty fast.
That's not what we find.
Genome-wide association studies (GWAS) are able to account for only a small proportion of the "known" heritability of alcoholism. This "small proportion of heritability" problem is true also for GWAS investigations of things like body height (a trait that's even more heritable than alcoholism), where only 10% of the variability of human body height can be accounted for by the more than 100 genetic features thought to control it. In fact, the general failure of GWAS to account for heritable conditions is so widespread, papers have been written about it.
The general idea behind GWAS is the "common disease, common variant" hypothesis, which says that common diseases are attributable to potentially many allelic variants (genetic differences) present in a sizable percentage of the population. In GWAS, several hundred thousand to more than a million single nucleotide polymorphisms (SNPs) are assayed, per person, times hundreds (or often, thousands) of individuals, and statistics for the expected (versus actual) rate of occurrence of various SNPs are calculated. SNPs that appear to "travel together" down through the generations are assessed for linkage disequilibrium to determine if they are associated more often than normal population statistics would predict. The data sets generated in GWAS are enormous and the statistical calculations challenging. For a description of the technique, see this article; also see this piece for criticisms of the technique (and a spirited rebuttal of criticisms).
One of the best known GWAS alcoholism studies is the so-called German study, by Treutlein et al. (2009). This study (which was likely underpowered, statistically; and the authors admit this) included 1460 study subjects with DSM-IV alcohol dependence and 2332 control individuals. The study claimed to have detected and replicated evidence for association of 15 markers to alcoholism, although only 9 SNPs were in genes (the rest were in what used to be called "junk DNA") and only two met criteria for genome-wide significance. The two SNPs that met criteria for genome-wide significance were colocated (5000 base-pairs apart) in chromosomal region 2q35 (the long arm of chromosome 2).
Less well known is the 2012 GWAS conducted on two samples (N=1721 and N=1113) of Korean men by Baik et al., which found that a dozen SNPs on chromosome 12 had genome-wide significant associations with alcohol consumption. Most of the SNPs were in intronic regions of genes C12orf51, CCDC63, MYL2, OAS3, CUX2, and RPH3A. Three of these successfully replicated in the second test arm. These locations were not the same as the ones found in the German study (nor the studies below). On the other hand, it's not clear whether results applicable to Korean men will be applicable to non-Korean men, or women. Also, this was not a case-control study, but an imputation-based study using an Asian subset of HapMap baseline data.
Another study, often called the SAGE study, by Bierut et al. (2010), looked at 1897 alcohol-abuse subjects and 1932 controls. Here, 15 SNPs of interest were identified, but only one SNP (identified as rs13160562) was among the 15 identified in the German study, and it did not reach genome-wide significance. Said the researchers: "Our top 15 association signals were not replicated in the independent datasets nor did the present study replicate the two genome-wide significant results reported by Treutlein and colleagues." (One of the independent datasets was a large family-based study of 258 families with more than 2,000 genotyped individuals, while the second was a study of alcohol-dependent men and community-based comparison subjects of German descent.)
In 2012, Heath et al. looked at 8754 individuals (2062 of whom were alcohol dependent) from the Australian Twin Registry. It's hard to interpret the Heath writeup, but it appears some of the study population were twins and some were parents of the twins, while all of the controls were non-alcohol-dependent twins. (So imagine that. A twin study that's also a GWAS.) No findings reached genome-wide significance (p=8.4×10−8 for this study). Of the top 400 SNPs, 65% had effect sizes of less than 0.25% of the variance (when ranked by p-value for association with heavy drinking), and only 7.5% had effect sizes greater than 0.3%, with a median effect size of 0.23%. So in other words, the main finding was one of small effect sizes. The Heath writeup was quite clear on this point, stating: "The primary conclusion from these analyses is that, as for many other complex phenotypes (e.g. body-mass index), effect sizes for the contribution of individual genetic variants to differences in heaviness of alcohol consumption and alcoholism risk are small, perhaps accounting for as little as one-tenth of one percent of the variance."
The fact that Heath et al. found 30 SNPs with effect size 0.3% or more is not terribly helpful from an alcoholism-prediction standpoint, because while in theory those 30 SNPs, put together, might account for 10% of phenotypic variance (if their effects are addictive), in reality it's unlikely you'd inherit all 30 defects together (and it's not clear whether you'd have to inherit one copy of each, or two). But even if you did, it raises your alcoholism risk 10%—not the 50% that's so often claimed.
The Heath group also reported: "No SNPs or tagging SNPs confirmed in the German Alcoholism GWAS study were replicated in our analyses, nor were any SNPs or tagging SNPs identified as the most strongly associated SNPs in the SAGE study." In other words, the German results didn't replicate in the Heath study, nor did the results of Bierut et al.
Edenberg et al. (2010), in a case-control study involving 1399 people, found somewhat more encouraging results (arguably). Although no single SNP met genome-wide criteria for significance, the researchers found that 14 out of 140 top SNPs from the German study "were significant in our primary analysis of alcohol dependence" (no p-value stated). So in some sense, the German results replicated for Edenberg (although it's by no means clear that Edenberg's 14 SNPs were among the German group's top 15 SNPs; read the study carefully; it just says 14 of the top 140). Also, using a variety of types of evidence, Edenberg et al. argue that a cluster of 6 genes on chromosome 11 are associated with alcohol dependence, but no effect sizes were given.
It should be noted that genome-wide significance (mentioned several times above) is a fairly stringent criterion. The need for it arises from the fact that any time you look at frequency numbers for a million data points, some are statistically likely to jump out (producing false positives) unless you correct for the sheer number of data points.
My reading of the studies is that no smoking pistols were found; some of the SNPs are suggestive of a relationship to alcohol dependence, but I wouldn't exactly call the results strong, because in general, the studies all found different SNPs in different genes (with only a couple of exceptions), and many of the SNPs were not in genes. (Some of those that were, were in introns.) The studies were not terribly robust in terms of replication (neither within studies nor between them). Certainly, any hope of finding an "alcoholism gene" (singular) are dashed. In fact, it appears unlikely that any gene, of any kind, will be found that can explain more than a few tenths of a percent of your likelihood to have gotten "alcoholism" from your parents or grandparents.
How, then, is alcoholism heritable? We know it runs in families, yes. But that doesn't mean it's more than a few percent genetic. Lots of things that aren't genetic run in families: poverty, political beliefs, the Tooth Fairy myth, etc. (An affirmative answer to the question “Did you have your back rubbed?” has been shown to be 92% heritable for males and 21% heritable for females.)
Environment, in all likelihood, is the great multiplier, the amplifier that brings out the weak signal in DNA; that's what future research will probably end up telling us. That's what the research tells us now.
I'm in the process of finishing a book on mental illness. It's an evidence-based book, part science, part memoir. (It's not as technical as this post, though! It's meant for non-technical readers.) For more info, sign up for the mailing list, and return here often. Thanks!
No one expects that alcoholism will turn out to be a simple matter of inheriting one or two "alcoholism genes" (although at one time, this was in fact the hope). Instead, it's expected that multiple genetic features probably interact to produce the propensity to develop alcohol dependence. Even so, if alcoholism is as heritable as people say it is, and if it involves a tractable number of genes (dozens or scores, rather than hundreds or thousands), candidate genes should be fairly easy to discover using the powerful new tools of DNA analysis. Candidate SNPs (single nucleotide polymorphisms) should jump right off the page and hit you in the face. If gene copy number variants are involved, it should be readily apparent. If genes are involved at all, it should be pretty obvious pretty fast.
That's not what we find.
Genome-wide association studies (GWAS) are able to account for only a small proportion of the "known" heritability of alcoholism. This "small proportion of heritability" problem is true also for GWAS investigations of things like body height (a trait that's even more heritable than alcoholism), where only 10% of the variability of human body height can be accounted for by the more than 100 genetic features thought to control it. In fact, the general failure of GWAS to account for heritable conditions is so widespread, papers have been written about it.
The general idea behind GWAS is the "common disease, common variant" hypothesis, which says that common diseases are attributable to potentially many allelic variants (genetic differences) present in a sizable percentage of the population. In GWAS, several hundred thousand to more than a million single nucleotide polymorphisms (SNPs) are assayed, per person, times hundreds (or often, thousands) of individuals, and statistics for the expected (versus actual) rate of occurrence of various SNPs are calculated. SNPs that appear to "travel together" down through the generations are assessed for linkage disequilibrium to determine if they are associated more often than normal population statistics would predict. The data sets generated in GWAS are enormous and the statistical calculations challenging. For a description of the technique, see this article; also see this piece for criticisms of the technique (and a spirited rebuttal of criticisms).
One of the best known GWAS alcoholism studies is the so-called German study, by Treutlein et al. (2009). This study (which was likely underpowered, statistically; and the authors admit this) included 1460 study subjects with DSM-IV alcohol dependence and 2332 control individuals. The study claimed to have detected and replicated evidence for association of 15 markers to alcoholism, although only 9 SNPs were in genes (the rest were in what used to be called "junk DNA") and only two met criteria for genome-wide significance. The two SNPs that met criteria for genome-wide significance were colocated (5000 base-pairs apart) in chromosomal region 2q35 (the long arm of chromosome 2).
Less well known is the 2012 GWAS conducted on two samples (N=1721 and N=1113) of Korean men by Baik et al., which found that a dozen SNPs on chromosome 12 had genome-wide significant associations with alcohol consumption. Most of the SNPs were in intronic regions of genes C12orf51, CCDC63, MYL2, OAS3, CUX2, and RPH3A. Three of these successfully replicated in the second test arm. These locations were not the same as the ones found in the German study (nor the studies below). On the other hand, it's not clear whether results applicable to Korean men will be applicable to non-Korean men, or women. Also, this was not a case-control study, but an imputation-based study using an Asian subset of HapMap baseline data.
Another study, often called the SAGE study, by Bierut et al. (2010), looked at 1897 alcohol-abuse subjects and 1932 controls. Here, 15 SNPs of interest were identified, but only one SNP (identified as rs13160562) was among the 15 identified in the German study, and it did not reach genome-wide significance. Said the researchers: "Our top 15 association signals were not replicated in the independent datasets nor did the present study replicate the two genome-wide significant results reported by Treutlein and colleagues." (One of the independent datasets was a large family-based study of 258 families with more than 2,000 genotyped individuals, while the second was a study of alcohol-dependent men and community-based comparison subjects of German descent.)
In 2012, Heath et al. looked at 8754 individuals (2062 of whom were alcohol dependent) from the Australian Twin Registry. It's hard to interpret the Heath writeup, but it appears some of the study population were twins and some were parents of the twins, while all of the controls were non-alcohol-dependent twins. (So imagine that. A twin study that's also a GWAS.) No findings reached genome-wide significance (p=8.4×10−8 for this study). Of the top 400 SNPs, 65% had effect sizes of less than 0.25% of the variance (when ranked by p-value for association with heavy drinking), and only 7.5% had effect sizes greater than 0.3%, with a median effect size of 0.23%. So in other words, the main finding was one of small effect sizes. The Heath writeup was quite clear on this point, stating: "The primary conclusion from these analyses is that, as for many other complex phenotypes (e.g. body-mass index), effect sizes for the contribution of individual genetic variants to differences in heaviness of alcohol consumption and alcoholism risk are small, perhaps accounting for as little as one-tenth of one percent of the variance."
The fact that Heath et al. found 30 SNPs with effect size 0.3% or more is not terribly helpful from an alcoholism-prediction standpoint, because while in theory those 30 SNPs, put together, might account for 10% of phenotypic variance (if their effects are addictive), in reality it's unlikely you'd inherit all 30 defects together (and it's not clear whether you'd have to inherit one copy of each, or two). But even if you did, it raises your alcoholism risk 10%—not the 50% that's so often claimed.
The Heath group also reported: "No SNPs or tagging SNPs confirmed in the German Alcoholism GWAS study were replicated in our analyses, nor were any SNPs or tagging SNPs identified as the most strongly associated SNPs in the SAGE study." In other words, the German results didn't replicate in the Heath study, nor did the results of Bierut et al.
Edenberg et al. (2010), in a case-control study involving 1399 people, found somewhat more encouraging results (arguably). Although no single SNP met genome-wide criteria for significance, the researchers found that 14 out of 140 top SNPs from the German study "were significant in our primary analysis of alcohol dependence" (no p-value stated). So in some sense, the German results replicated for Edenberg (although it's by no means clear that Edenberg's 14 SNPs were among the German group's top 15 SNPs; read the study carefully; it just says 14 of the top 140). Also, using a variety of types of evidence, Edenberg et al. argue that a cluster of 6 genes on chromosome 11 are associated with alcohol dependence, but no effect sizes were given.
It should be noted that genome-wide significance (mentioned several times above) is a fairly stringent criterion. The need for it arises from the fact that any time you look at frequency numbers for a million data points, some are statistically likely to jump out (producing false positives) unless you correct for the sheer number of data points.
My reading of the studies is that no smoking pistols were found; some of the SNPs are suggestive of a relationship to alcohol dependence, but I wouldn't exactly call the results strong, because in general, the studies all found different SNPs in different genes (with only a couple of exceptions), and many of the SNPs were not in genes. (Some of those that were, were in introns.) The studies were not terribly robust in terms of replication (neither within studies nor between them). Certainly, any hope of finding an "alcoholism gene" (singular) are dashed. In fact, it appears unlikely that any gene, of any kind, will be found that can explain more than a few tenths of a percent of your likelihood to have gotten "alcoholism" from your parents or grandparents.
How, then, is alcoholism heritable? We know it runs in families, yes. But that doesn't mean it's more than a few percent genetic. Lots of things that aren't genetic run in families: poverty, political beliefs, the Tooth Fairy myth, etc. (An affirmative answer to the question “Did you have your back rubbed?” has been shown to be 92% heritable for males and 21% heritable for females.)
Environment, in all likelihood, is the great multiplier, the amplifier that brings out the weak signal in DNA; that's what future research will probably end up telling us. That's what the research tells us now.
I'm in the process of finishing a book on mental illness. It's an evidence-based book, part science, part memoir. (It's not as technical as this post, though! It's meant for non-technical readers.) For more info, sign up for the mailing list, and return here often. Thanks!
Tuesday, January 06, 2015
The Problem of "Missing Heritability"
When the human genome was sequenced in 2003, the expectation was that scientists, armed with a powerful array of new genetic techniques, would very quickly identify the genetic correlates of things like schizophrenia, depression, autism, and alcoholism, which we supposedly "know" have a large genetic component. Extremely large, well-funded genome-wide association studies (GWAS) were begun. (See this PLoS paper to learn how such studies are conducted.) Surprisingly, the studies failed, for the most part, to converge on genes, gene copy number variants, SNPs (single nucleotide polymorphisms), gene rearrangements, mutations, or other peculiarities of allelic architecture whose presence could predict important diseases. When genetic loci of interest were found, the findings often didn't replicate in followup studies; or if they did, the relative odds ratios (a measure of the ability of genetic features to predict the prevalence of a trait) fell far short of explaining the known magnitudes of various traits in target populations.
If things like schizophrenia and alcoholism truly do have a strong genetic basis (as we've been told), and if they were to involve tractable numbers of genes (say, dozens or scores of genes, rather than hundreds or thousands), DNA studies of the GWAS kind should produce immediate, strong, recognizable genetic signatures of disease. The results should leap off the page. But they don't. Time and again, the very few candidate alleles that are found are either found in low numbers, or have low "penetrance" (low capacity to predict disease), or both, and quite often the candidate alleles that are found are not confirmed by followup studies.
This has given rise to major crisis in genetics, summed up in a paper called "Finding the missing heritability of complex diseases" that appeared in Nature in 2009. Scientists are desperate to explain the "missing heratibility" of disorders we "know" are genetic.
It's assumed, by most scientists, that the failure of genome-wide association studies to find genetic explanations of complex diseases can be attributed to such studies' low power and resolution for genetic variations of modest effect. (So the quest has begun to increase the study size of future trials, in hopes of seeing more robust results.) In addition, there's a reasonable expectation that many complex diseases will doubtless be found to involve numerous genes, each contributing only a small effect to the total. It's also been suggested that some traits are dictated by extremely rare genetic features with high "penetrance." There's also an awareness that current laboratory techniques have low power to detect gene-gene interactions. What no scientist wants to say, however—and what none of the 24 co-authors of the Nature article (above) would say—is that maybe the genetic component(s) of schizophrenia, alcoholism, depression, etc. are simply vanishingly small to begin with. Yet that's exactly what the data are telling us. But we won't listen to the data because it doesn't fit our preconception of how the world should work.
We "know" that a person's height is largely controlled by genes. Studies going back almost a century have determined that body height is 80% to 90% heritable; no one seriously questions this fact. (Height is heritable—it "runs in families.") However, at least three large, modern genetic studies have been done to find "height genes"; the largest involved over 180,000 study subjects (and 291 co-authors). In all, some 180 genetic loci were identified that play a role in determining a person's height. But the 180 genomic features, put together, accounted for only 10% of observed variations in height. The rest appears to be environment.
Are we now supposed to enlarge our "study population" from 180,000 to several million, in order to find the genetic explanation for body height, just because we "know" one exists?
At some point, don't we have to just admit "the data's the data"?
Shouldn't we be willing (at least provisionally) to entertain the idea that maybe the twin studies and the data showing that certain things "run in families" (pre-GWAS-era data, mostly) are in need of reevaluation? Shouldn't we at least consider the idea that prior studies misjudged the importance of uncontrolled-for environmental variables? Now that we have powerful DNA-analytic techniques for investigating heritability, and the techniques aren't giving us the results we want, must we "increase the size of the microscopic" to make the results look bigger? Or shouldn't we just accept what the data are telling us (as painful as that may be)?
In a future post, I'll talk about some of the GWAS results for alcoholism. Stay tuned.
This post is, in part, derived from material for a forthcoming book on mental illness I'm writing. Please come back often to find out how to get free sample chapters.
If things like schizophrenia and alcoholism truly do have a strong genetic basis (as we've been told), and if they were to involve tractable numbers of genes (say, dozens or scores of genes, rather than hundreds or thousands), DNA studies of the GWAS kind should produce immediate, strong, recognizable genetic signatures of disease. The results should leap off the page. But they don't. Time and again, the very few candidate alleles that are found are either found in low numbers, or have low "penetrance" (low capacity to predict disease), or both, and quite often the candidate alleles that are found are not confirmed by followup studies.
This has given rise to major crisis in genetics, summed up in a paper called "Finding the missing heritability of complex diseases" that appeared in Nature in 2009. Scientists are desperate to explain the "missing heratibility" of disorders we "know" are genetic.
It's assumed, by most scientists, that the failure of genome-wide association studies to find genetic explanations of complex diseases can be attributed to such studies' low power and resolution for genetic variations of modest effect. (So the quest has begun to increase the study size of future trials, in hopes of seeing more robust results.) In addition, there's a reasonable expectation that many complex diseases will doubtless be found to involve numerous genes, each contributing only a small effect to the total. It's also been suggested that some traits are dictated by extremely rare genetic features with high "penetrance." There's also an awareness that current laboratory techniques have low power to detect gene-gene interactions. What no scientist wants to say, however—and what none of the 24 co-authors of the Nature article (above) would say—is that maybe the genetic component(s) of schizophrenia, alcoholism, depression, etc. are simply vanishingly small to begin with. Yet that's exactly what the data are telling us. But we won't listen to the data because it doesn't fit our preconception of how the world should work.
We "know" that a person's height is largely controlled by genes. Studies going back almost a century have determined that body height is 80% to 90% heritable; no one seriously questions this fact. (Height is heritable—it "runs in families.") However, at least three large, modern genetic studies have been done to find "height genes"; the largest involved over 180,000 study subjects (and 291 co-authors). In all, some 180 genetic loci were identified that play a role in determining a person's height. But the 180 genomic features, put together, accounted for only 10% of observed variations in height. The rest appears to be environment.
Are we now supposed to enlarge our "study population" from 180,000 to several million, in order to find the genetic explanation for body height, just because we "know" one exists?
At some point, don't we have to just admit "the data's the data"?
Shouldn't we be willing (at least provisionally) to entertain the idea that maybe the twin studies and the data showing that certain things "run in families" (pre-GWAS-era data, mostly) are in need of reevaluation? Shouldn't we at least consider the idea that prior studies misjudged the importance of uncontrolled-for environmental variables? Now that we have powerful DNA-analytic techniques for investigating heritability, and the techniques aren't giving us the results we want, must we "increase the size of the microscopic" to make the results look bigger? Or shouldn't we just accept what the data are telling us (as painful as that may be)?
In a future post, I'll talk about some of the GWAS results for alcoholism. Stay tuned.
This post is, in part, derived from material for a forthcoming book on mental illness I'm writing. Please come back often to find out how to get free sample chapters.
Subscribe to:
Posts (Atom)