Omics Analysis Portal: A Centralized Platform for Analysis, Integration & Visualization Applications
EMSL Project ID
51675
Abstract
There are many scientific discoveries pertinent to DOE-BER that are waiting to emerge from high-throughput biological data, which are now being generated at an unprecedented rate. Analysis of these datasets can be challenging due to the size and complexity of the data. While there has been much growth in the development of software tools to aid in the analysis of these datasets, analysis tools are most often specific to one or two data types (e.g. Metaboanalyst allows for MS or NMR metabolomics data) or a specific type of analysis (e.g. DAVID does enrichment analysis for gene lists). This results in several complications: 1) in order to accomplish an end-to-end data processing pipeline, users must often string together several tools, 2) many of these tools require statistical programming knowledge, and 3) a user must be very familiar with the valid steps for analyzing their data. Additionally, researchers frequently generate data for multiple omics data types for the same study. Efficient integration of these complex and disparate datasets requires access to and understanding of databases of known biological pathways, understanding of robust data preprocessing techniques and often statistical methods, and programming capability. Gaps in any one of these abilities can lead to unreproducible results or increased timelines. We propose to develop the framework for a one-stop omics data analysis web portal for EMSL users to perform quality control (QC), visually explore, statistically analyze, and integrate data after the identification and quantification of biomolecules by CoreMS and other relevant software. The portal will be modular in nature to allow the addition of new tools and algorithms in future years and will aid users in constructing valid and reproducible workflows. Further, we propose to populate the portal with three initial applications: PMart, iPMart, and MODE. This portal will fill a clear technological gap in omics data exploration, integration, user-guided visualization, and the need for clear guidance based on analysis objectives.
Project Details
Start Date
2020-10-26
End Date
2022-12-31
Status
Closed
Released Data Link
Team
Principal Investigator
Team Members