Bioinformatics tools for biological interpretation and data visualization
Our group has developed tools such as the Food-Biomarker Ontology (FOBI) (Castellano-Escuder P, et al., 2020), the first ontology designed to integrate metabolomics and nutrition data, and POMAShiny (Castellano-Escuder P, et al., 2021), which offers univariate and multivariate statistical methods, dimensionality reduction techniques, feature selection approaches, regularized regression analysis, machine learning–based classification algorithms, predictive modeling strategies, and various high-quality interactive visualization options.
Following FAIR principles, both the source codes and data files are available through public GitHub repositories.
The Food-Biomarker Ontology (FOBI) is the first ontology developed to integrate metabolomics and nutrition data (Castellano-Escuder P, et al., 2020). This ontology aims to link different types of foods with their associated metabolites or dietary intake biomarkers.
FOBI comprises 1,197 terms, 4 different properties, 13 top-level food classes, 11 top-level biomarker classes, and over 4,500 relationships. Additionally, FOBI is part of the OBO Foundry project, and FOBI identifiers have been indexed in the HMDB and FooDB databases to facilitate interoperability and data exchange.

Food-Biomarker Ontology (FOBI)
FOBI architecture using the apple as an example.

-
Chemical class enrichment analysis: ORA and MSEA using FOBI chemical classes as metabolite sets.
-
Food enrichment analysis: ORA and MSEA using FOBI food groups as metabolite sets.


POMAShiny
POMAShiny requires two input files in CSV format: a metadata file (target) and a features file. The metadata file should include sample names in the first column, group labels (e.g., control and case) in the second, and optionally, relevant covariates from the third column onward. The features file contains the quantified features from the experiment, with one feature per column. The row order must be the same in both files. Once uploaded, POMAShiny converts the files into an MSnSet object, following the MSnbase package from R/Bioconductor.
Users can select specific samples from the metadata file to create data subsets for analysis. Additionally, POMAShiny offers an optional function to combine features that belong to the same entity (such as peptides from a protein or ions from a compound). To use this feature, a “group” CSV file is required, indicating which features should be combined. It also allows users to download a table with the coefficient of variation for the combined features.


EDA helps identify uncontrolled factors and potential outliers, and it is recommended to perform it before statistical analysis. Moreover, in the absence of significant biases, EDA can provide an initial overview of the most relevant features of the study.
POMAShiny offers interactive and customizable visualizations for EDA, including:
-
Volcano plots (for two-group comparisons)
-
Boxplots
-
Density plots
-
Clustered heatmaps
It also includes options for Principal Component Analysis (PCA) and cluster analysis.

This panel includes a variety of statistical methods, ranging from the most commonly used approaches in metabolomics and proteomics data analysis to other methodologies that are less frequent in these fields. All statistical methods offered by POMAShiny are implemented in a highly intuitive way for the user and generate both downloadable tables and interactive plots as results. The available analyses include:
-
Univariate analysis
-
Limma (Linear Models for Microarray Data)
-
Multivariate analysis
-
Cluster analysis
-
Correlation analysis
-
Regularized regression
-
Random forests
-
Odds ratio
-
Rank products




