GeneNetwork
GeneNetwork is a combined database and open-source bioinformatics data analysis software resource for systems genetics. This resource is used to study gene regulatory networks that link DNA sequence differences to corresponding differences in gene and protein expression and to variation in traits such as health and disease risk. Data sets in GeneNetwork are typically made up of large collections of genotypes and phenotypes from groups of individuals, including humans, strains of mice and rats, and organisms as diverse as Drosophila melanogaster, Arabidopsis thaliana, and barley. The inclusion of genotypes makes it practical to carry out web-based gene mapping to discover those regions of genomes that contribute to differences among individuals in mRNA, protein, and metabolite levels, as well as differences in cell function, anatomy, physiology, and behavior.
History
Development of GeneNetwork started at the University of Tennessee Health Science Center in 1994 as a web-based version of the . GeneNetwork is both the first and the longest continuously operating web service in biomedical research . In 1999 the Portable Gene Dictionary was combined with Kenneth F. Manly's QT mapping program to produce an online system for real-time genetic analysis. In early 2003, the first large Affymetrix gene expression data sets were incorporated and the system was renamed WebQTL. GeneNetwork is now developed by an international group of developers and has mirror and development sites in Europe, Asia, and Australia. Production services are hosted on systems at University of Tennessee Health Science Center with a backup instance in Europe.A the current production version of GeneNetwork was released in 2016. The current version of GeneNetwork uses the same database as its predecessor, GN1, but has much more modular and maintainable open source code. GeneNetwork now also has significant new features including support for:
- Genetically complex populations using linear mixed model implemented with an updated version of ,
- modules with many mapping options, including mapping of 4-way intercrosses and heterogeneous stock
- Weighted correlation network analysis, also known as WGCNA
- Cytoscape network display
- A genome browser to display genetic and genomic data that is based on Biodalliance
- Linked modules to the , for causal modeling
Organization and use
- Massive collections of genetic, genomic, and phenotype data for large cohorts of individuals
- Sophisticated statistical analysis and gene mapping software that enable analysis of molecular and cellular networks and genotype-to-phenotype relations
- DNA sequences and genotypes
- Molecular expression data often generated using arrays, RNA-seq, epigenomic, proteomic, metabolomic, and metagenomic methods
- Standard quantitative phenotypes that are often parts of a typical medical record
- Annotation files and metadata for traits and data sets
GeneNetwork is primarily used by researchers, but has also been adopted successfully for undergraduate and graduate courses in genetics and bioinformatics, bioinformatics, physiology, and psychology. Researchers and students typically retrieve sets of genotypes and phenotypes from one or more families and use built-in statistical and mapping functions to explore relations among variables and to assemble networks of associations. Key steps include the analysis of these factors:
- The range of variation of traits
- Covariation among traits
- Architecture of larger networks of traits
- Quantitative trait locus mapping and causal models of the linkage between sequence differences and phenotype differences
Data sources
Tools and features
There are tools on the site for a wide range of functions that range from simple graphical displays of variation in gene expression or other phenotypes, scatter plots of pairs of traits, construction of both simple and complex network graphs, analysis of principal components and synthetic traits, QTL mapping using marker regression, interval mapping, and pair scans for epistatic interactions. Most functions work with up to 100 traits and several functions work with an entire transcriptome.The database can be browsed and searched at the main page. An on-line is available. Users can also the primary data sets as text files, Excel, or in the case of network graphs, as SBML. As of 2017, is available as a beta release.