[PDF] Statistical Methods For Detecting Expression Quantitative Trait Loci Eqtl eBook

Statistical Methods For Detecting Expression Quantitative Trait Loci Eqtl Book in PDF, ePub and Kindle version is available to download in english. Read online anytime anywhere directly from your device. Click on the download button below to get a free pdf file of Statistical Methods For Detecting Expression Quantitative Trait Loci Eqtl book. This book definitely worth reading, it is an incredibly well-written.

Statistical Methods for Detecting Expression Quantitative Trait Loci (EQTL)

Author : Wei Zhang
Publisher :
Page : 224 pages
File Size : 45,37 MB
Release : 2009
Category :
ISBN :

GET BOOK

We applied the procedure to a data set consisting of gene expression and genotypes for 112 segregants of S. cerevisiae (Brem and Kruglyak 2005). Our method identified modules containing genes mapped to previously reported eQTL hot spots, and dissected these large eQTL hot spots into several modules corresponding to different causal regulators or primary and secondary responses to causal perturbations. In addition, we identified nine modules associated with pairs of eQTLs, of which two have been previously reported, including the mating module (Brem et al. 2005) and the ZAP1 target module (Lee et al. 2006). We demonstrated that one of the novel modules containing many daughter-cell expressed genes is regulated by AMN1 and BPH1 .

New Statistical Methods in Bioinformatics

Author : Rhonda DeCook
Publisher :
Page : 270 pages
File Size : 33,35 MB
Release : 2006
Category :
ISBN :

GET BOOK

This thesis focuses on new statistical methods in the area of bioinformatics which uses computers and statistics to solve biological problems. The first study discusses a method for detecting a quantitative trait locus (QTL) when the trait of interest has a zero-inflated Poisson (ZIP) distribution. Though existing methods based on normality may be reasonably applied to some ZIP distributions, the characteristics of other ZIP distributions make such an application inappropriate. We compare our method to an existing non-parametric approach, and we illustrate our method using QTL data collected on two ecotypes of the Arabidopsis thaliana plant where the trait of interest is shoot count. The second study discusses a method to detect differentially expressed genes in an unreplicated multiple-treatment microarray timecourse experiment. In a two-sample setting, differential expression is well defined as non-equal means, but in the present setting, there are numerous expression patterns that may qualify as differential expression, and that may be of interest to the researcher. This method provides the researcher with a list of significant genes, an associated false discovery rate for that list, and a 'best model' choice for every gene. The model choice component is relevant because the alternative hypothesis of differential expression does not dictate one specific alternative expression pattern. In fact, in this type of experiment, there are many possible expression patterns of interest to the researcher. Using simulations, we provide information on the specificity and sensitivity of detection under a variety of true expression patterns using receiver operating characteristic curves. The method is illustrated using an Arabidopsis thaliana microarray experiment with five time points and three treatment groups. The third study discusses a new type of analysis, called eQTL analysis. This analysis brings together the methods of microarray and QTL analyses in order to detect locations on the genome that control gene expression. These controlling loci are called expression QTL, or eQTL. Locating eQTL can help researchers uncover complex networks in biological systems. The method is illustrated using an Arabidopsis thaliana eQTL experiment with 22,787 genes and 288 markers.

Statistical Methods for QTL Mapping

Author : Zehua Chen
Publisher : CRC Press
Page : 308 pages
File Size : 24,72 MB
Release : 2016-04-19
Category : Mathematics
ISBN : 143986831X

GET BOOK

While numerous advanced statistical approaches have recently been developed for quantitative trait loci (QTL) mapping, the methods are scattered throughout the literature. Statistical Methods for QTL Mapping brings together many recent statistical techniques that address the data complexity of QTL mapping. After introducing basic genetics topics an

Quantitative Trait Loci

Author : Nicola J. Camp
Publisher : Springer Science & Business Media
Page : 362 pages
File Size : 37,36 MB
Release : 2008-02-03
Category : Medical
ISBN : 1592591760

GET BOOK

In Quantitative Trait Loci: Methods and Protocols, a panel of highly experienced statistical geneticists demonstrate in a step-by-step fashion how to successfully analyze quantitative trait data using a variety of methods and software for the detection and fine mapping of quantitative trait loci (QTL). Writing for the nonmathematician, these experts guide the investigator from the design stage of a project onwards, providing detailed explanations of how best to proceed with each specific analysis, to find and use appropriate software, and to interpret results. Worked examples, citations to key papers, and variations in method ease the way to understanding and successful studies. Among the cutting-edge techniques presented are QTDT methods, variance components methods, and the Markov Chain Monte Carlo method for joint linkage and segregation analysis.

Statistical Genetics of Quantitative Traits

Author : Rongling Wu
Publisher : Springer Science & Business Media
Page : 371 pages
File Size : 50,60 MB
Release : 2007-07-17
Category : Science
ISBN : 038768154X

GET BOOK

This book introduces the basic concepts and methods that are useful in the statistical analysis and modeling of the DNA-based marker and phenotypic data that arise in agriculture, forestry, experimental biology, and other fields. It concentrates on the linkage analysis of markers, map construction and quantitative trait locus (QTL) mapping, and assumes a background in regression analysis and maximum likelihood approaches. The strength of this book lies in the construction of general models and algorithms for linkage analysis, as well as in QTL mapping in any kind of crossed pedigrees initiated with inbred lines of crops.

Statistical Methods for Genetic Variants Detection with Epigenomic Information

Author : Maria Constanza Rojo
Publisher :
Page : 158 pages
File Size : 35,14 MB
Release : 2019
Category :
ISBN :

GET BOOK

Genome-wide association studies (GWAS) have successfully identified thousands of genetic variants contributing to disease and other phenotypes. However, significant obstacles hamper our ability to elucidate causal variants, identify genes affected by causal variants, and characterize the mechanisms by which genotypes influence phenotypes. The increasing availability of genome-wide functional annotation data provides unique opportunities to incorporate prior information into the analysis of GWAS to better understand the impact of variants on disease etiology. Regulatory genomic information has been recognized as a potential source that can improve the detection and biological interpretation of single-nucleotide polymorphisms (SNPs) in GWAS. Although there have been many advances in incorporating prior information into the prioritization of trait-associated variants in GWAS, functional annotation data has played a secondary role in the joint analysis of GWAS and molecular (i.e., expression) quantitative trait loci (eQTL) data in assessing evidence of association. Moreover, current methodologies that aim to integrate such annotation information focus mainly on fine-mapping and overlook the importance of its usage in earlier stages of GWAS analysis. Equally important, there is a lack of development in proper statistical frameworks that can perform selection of annotations and SNPs jointly. To address these shortcomings, we develop two statistical models: iFunMed and GRAD. iFunMed is a novel mediation framework to integrate GWAS and eQTL data with the utilization of publicly available functional annotation data. iFunMed extends the scope of standard mediation analysis by incorporating information from multiple genetic variants at a time and leveraging variant-level summary statistics. GRAD integrates high-dimensional auxiliary information into high-dimensional regression. This method allows annotation information to assist the detection of important genetic variants while identifying relevant annotation simultaneously. We provide an upper bound for the estimation error of the SNP effect sizes to gain insights on what factors affect estimation accuracy. For iFunMed, data-driven computational experiments convey how informative annotations improve SNP selection performance while emphasizing the robustness of the model to non-informative annotations. Applications to the Framingham Heart Study data indicate that iFunMed is able to boost the detection of SNPs with mediation effects that can be attributed to regulatory mechanisms. Simulation experiments indicate that GRAD can improve the identification of genetic variants by increasing the average area under the precision-recall curve by up to 60\%. Real data applications to the Framingham Heart Study show that GRAD can select relevant genetic variants while detecting several transcription factors involved in specific phenotypical changes.

Statistical Methods for Molecular Quantitative Trait Locus Analysis

Author : Heather J. Zhou
Publisher :
Page : 0 pages
File Size : 38,86 MB
Release : 2023
Category :
ISBN :

GET BOOK

Molecular quantitative trait locus (molecular QTL, henceforth "QTL") analysis investigates the relationship between genetic variants and molecular traits, helping explain findings in genome-wide association studies. This dissertation addresses two major problems in QTL analysis: hidden variable inference problem and eGene identification problem. Estimating and accounting for hidden variables is widely practiced as an important step in QTL analysis for improving the power of QTL identification. However, few benchmark studies have been performed to evaluate the efficacy of the various methods developed for this purpose. In my first project, I benchmark popular hidden variable inference methods including surrogate variable analysis (SVA), probabilistic estimation of expression residuals (PEER), and hidden covariates with prior (HCP) against principal component analysis (PCA)-a well-established dimension reduction and factor discovery method-via 362 synthetic and 110 real data sets. I show that PCA not only underlies the statistical methodology behind the popular methods but is also orders of magnitude faster, better performing, and much easier to interpret and use. To help researchers use PCA in their QTL analysis, I provide an R package PCAForQTL along with a detailed guide, both of which are available at httpss://github.com/heatherjzhou/PCAForQTL. I believe that using PCA rather than SVA, PEER, or HCP will substantially improve and simplify hidden variable inference in QTL mapping as well as increase the transparency and reproducibility of QTL research. A central task in expression quantitative trait locus (eQTL) analysis is to identify cis-eGenes (henceforth "eGenes"), i.e., genes whose expression levels are regulated by at least one local genetic variant. Among the existing eGene identification methods, FastQTL is considered the gold standard but is computationally expensive as it requires thousands of permutations for each gene. Alternative methods such as eigenMT and TreeQTL have lower power than FastQTL. In my second project, I propose ClipperQTL, which reduces the number of permutations needed from thousands to 20 for data sets with large sample sizes (>450) by using the contrastive strategy developed in Clipper; for data sets with smaller sample sizes, it uses the same permutation-based approach as FastQTL. I show that ClipperQTL performs as well as FastQTL and runs about 500 times faster if the contrastive strategy is used and 50 times faster if the conventional permutation-based approach is used. The R package ClipperQTL is available at httpss://github.com/heatherjzhou/ClipperQTL. This project demonstrates the potential of the contrastive strategy developed in Clipper and provides a simpler and more efficient way of identifying eGenes.

Statistical Methods to Understand the Genetic Architecture of Complex Traits

Author : Farhad Hormozdiari
Publisher :
Page : 239 pages
File Size : 49,33 MB
Release : 2016
Category :
ISBN :

GET BOOK

Genome-wide association studies (GWAS) have successfully identified thousands of risk loci for complex traits. Identifying these variants requires annotating all possible variations between any two individuals, followed by detecting the variants that affect the disease status or traits. High-throughput sequencing (HTS) advancements have made it possible to sequence cohort of individuals in an efficient manner both in term of cost and time. However, HTS technologies have raised many computational challenges. I first propose an efficient method to recover dense genotype data by leveraging low sequencing and imputation techniques. Then, I introduce a novel statistical method (CNVeM) to identify Copy-number variations (CNVs) loci using HTS data. CNVeM was the first method that incorporates multi-mapped reads, which are discarded by all existing methods. Unfortunately, among all GWAS variants only a handful of them have been successfully validated to be biologically causal variants. Identifying causal variants can aid us to understand the biological mechanism of traits or diseases. However, detecting the causal variants is challenging due to linkage disequilibrium (LD) and the fact that some loci contain more than one causal variant. In my thesis, I will introduce CAVIAR (CAusal Variants Identification in Associated Regions) that is a new statistical method for fine mapping. The main advantage of CAVIAR is that we predict a set of variants for each locus that will contain all of the true causal variants with a high confidence level (e.g. 95%) even when the locus contains multiple causal variants. Next, I aim to understand the underlying mechanism of GWAS risk loci. A standard approach to uncover the mechanism of GWAS risk loci is to integrate results of GWAS and expression quantitative trait loci (eQTL) studies; we attempt to identify whether or not a significant GWAS variant also influences expression at a nearby gene in a specific tissue. However, detecting the same variant being causal in both GWAS and eQTL is challenging due to complex LD structure. I will introduce eCAVIAR (eQTL and GWAS CAusal Variants Identification in Associated Regions), a statistical method to compute the probability that the same variant is responsible for both the GWAS and eQTL signal, while accounting for complex LD structure. We integrate Glucose and Insulin-related traits meta-analysis with GTEx to detect the target genes and the most relevant tissues. Interestingly, we observe that most loci do not colocalize between GWAS and eQTL. Lastly, I propose an approach called phenotype imputation that allows one to perform GWAS on a phenotype that is difficult to collect. In our approach, we leverage the correlation structure between multiple phenotypes to impute the uncollected phenotype. I demonstrate that we can analytically calculate the statistical power of association test using imputed phenotype, which can be helpful for study design purposes

Quantitative Trait Loci Analysis in Animals

Author : Joel Ira Weller
Publisher : CABI
Page : 288 pages
File Size : 42,78 MB
Release : 2009
Category : Technology & Engineering
ISBN : 1845937341

GET BOOK

Quantitative Trait Loci (QTL) is a topic of major agricultural significance for efficient livestock production. This book covers various statistical methods that have been used or proposed for detection and analysis of QTL and marker-and gene-assisted selection in animal genetics and breeding.