Genome-wide association studies have identified a large number of single nucleotide polymorphisms (SNPs) associated with a wide array of cancer sites. Several of these variants demonstrate associations with multiple cancers, suggesting pleiotropic effects and shared biological mechanisms across some cancers. We hypothesised that SNPs previously associated with other cancers may additionally be associated with colorectal […]
Automated extraction of clinical traits of multiple sclerosis in electronic medical records.
The clinical course of multiple sclerosis (MS) is highly variable, and research data collection is costly and time consuming. We evaluated natural language processing techniques applied to electronic medical records (EMR) to identify MS patients and the key clinical traits of their disease course.We used four algorithms based on ICD-9 codes, text keywords, and medications […]
Utilization of an EMR-biorepository to identify the genetic predictors of calcineurin-inhibitor toxicity in heart transplant recipients.
Calcineurin-inhibitors CI are immunosuppressive agents prescribed to patients after solid organ transplant to prevent rejection. Although these drugs have been transformative for allograft survival, long-term use is complicated by side effects including nephrotoxicity. Given the narrow therapeutic index of CI, therapeutic drug monitoring is used to prevent acute rejection from underdosing and acute toxicity from […]
Associations between KCNJ6 (GIRK2) gene polymorphisms and pain-related phenotypes.
G-protein coupled inwardly rectifying potassium (GIRK) channels are effectors determining degree of analgesia experienced upon opioid receptor activation by endogenous and exogenous opioids. The impact of GIRK-related genetic variation on human pain responses has received little research attention. We used a tag single nucleotide polymorphism (SNP) approach to comprehensively examine pain-related effects of KCNJ3 (GIRK1) […]
The somatic genomic landscape of glioblastoma.
We describe the landscape of somatic genomic alterations based on multidimensional and comprehensive characterization of more than 500 glioblastoma tumors (GBMs). We identify several novel mutated genes as well as complex rearrangements of signature receptors, including EGFR and PDGFRA. TERT promoter mutations are shown to correlate with elevated mRNA expression, supporting a role in telomerase […]
Rapid storage and retrieval of genomic intervals from a relational database system using nested containment lists.
Efficient storage and retrieval of genomic annotations based on range intervals is necessary, given the amount of data produced by next-generation sequencing studies. The indexing strategies of relational database systems (such as MySQL) greatly inhibit their use in genomic annotation tasks. This has led to the development of stand-alone applications that are dependent on flat-file […]
Putting pleiotropy and selection into context defines a new paradigm for interpreting genetic data.
Natural selection shapes many human genes, including some related to complex diseases. Understanding how selection affects genes, especially pleiotropic ones, may be important in evaluating disease associations and the role played by environmental variation. This may be of particular interest for genes with antagonistic roles that cause divergent patterns of selection. The lectin-like low-density lipoprotein […]
ICD-9 tobacco use codes are effective identifiers of smoking status.
To evaluate the validity of, characterize the usage of, and propose potential research applications for International Classification of Diseases, Ninth Revision (ICD-9) tobacco codes in clinical populations.Using data on cancer cases and cancer-free controls from Vanderbilt’s biorepository, BioVU, we evaluated the utility of ICD-9 tobacco use codes to identify ever-smokers in general and high smoking […]
Enabling high-throughput genotype-phenotype associations in the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) project as part of the Population Architecture using Genomics and Epidemiology (PAGE) study.
Genetic association studies have rapidly become a major tool for identifying the genetic basis of common human diseases. The advent of cost-effective genotyping coupled with large collections of samples linked to clinical outcomes and quantitative traits now make it possible to systematically characterize genotype-phenotype relationships in diverse populations and extensive datasets. To capitalize on these […]
Characterization of the Metabochip in diverse populations from the International HapMap Project in the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) project.
Genome-wide association studies (GWAS) have identified hundreds of genomic regions associated with common human disease and quantitative traits. A major research avenue for mature genotype-phenotype associations is the identification of the true risk or functional variant for downstream molecular studies or personalized medicine applications. As part of the Population Architecture using Genomics and Epidemiology (PAGE) […]
A small number of candidate gene SNPs reveal continental ancestry in African Americans.
Using genetic data from an obesity candidate gene study of self-reported African Americans and European Americans, we investigated the number of Ancestry Informative Markers (AIMs) and candidate gene SNPs necessary to infer continental ancestry. Proportions of African and European ancestry were assessed with STRUCTURE (K 2), using 276 AIMs. These reference values were compared to […]
A comparison of cataloged variation between International HapMap Consortium and 1000 Genomes Project data.
Since publication of the human genome in 2003, geneticists have been interested in risk variant associations to resolve the etiology of traits and complex diseases. The International HapMap Consortium undertook an effort to catalog all common variation across the genome (variants with a minor allele frequency (MAF) of at least 5% in one or more […]