Centralized Metabolomics Molecular Reference Database
EMSL Project ID
60086
Abstract
Reference databases, both in-silico and molecular standard-based, are vital components in all metabolomics data annotation workflows. However, these databases are often decentralized or have proprietary access structures, which causes inconsistent results between analysts and is challenging to maintain. The development of an internal centralized reference database architecture will enhance our ability to achieve annotation reproducibility and FAIR standard compliance by centralizing data management and access by connecting with EMSL analysis services. Although there are several options for external databases, such as MassBank, HMDB, GNPS, multiple factors restrict their usability and integration with our software solutions. These factors include the lack of local deployment, untidy and immutable datasets, and, more importantly, inconsistent data curation. The proposed framework's standardized and flexible architecture will provide the tools to easily parse and access existing data while laying the foundation for the inclusion of new data. Thus, decreasing the development complexity for the inclusion of new data types while ensuring compatibility with our software products and scientific demands.
Project Details
Start Date
2021-10-01
End Date
2024-01-31
Status
Closed
Released Data Link
Team
Principal Investigator
Team Members