.abstract img { width:300px !important; height:auto; display:block; text-align:center; margin-top:10px } .abstract { overflow-x:scroll } .abstract table { width:100%; display:block; border:hidden; border-collapse: collapse; margin-top:10px } .abstract td, th { border-top: 1px solid #ddd; padding: 4px 8px; } .abstract tbody tr:nth-child(even) td { background-color: #efefef; } .abstract a { overflow-wrap: break-word; word-wrap: break-word; }
A7686 - PulmonDB: A Gene Expression Lung Diseases
Author Block: A. B. Villaseñor-Altamirano1, Y. I. Balderas-Martínez2, M. Moretto3, A. Zayas-Del Moral4, M. Maldonado2, A. Munguía-Reyes2, J. S. García-Sotelo1, L. A. Aguilar Bautista5, L. Ramirez-Navarro1, O. Santillan4, K. Engelen3, M. Selman2, A. Medina-Rivera1, J. Collado-Vides4; 1International Laboratory of Human Genome, UNAM, Juriquilla, Mexico, 2Instituto Nacional de Enfermedades Respiratorias Ismael Cosío Villegas, CONACYT-INER, Mexico city, Mexico, 3Fondazione Edmund Mach, Trento, Italy, 4Center for Genomic Scinces, UNAM, Cuernavaca, Mexico, 5Laboratorio Nacional de Visualización Científica Avanzada, LAVIS, UNAM, Juriquilla, Mexico.
RATIONALE: There is a massive amount of transcriptomic data coming from microarrays and RNA-seq experiments, accumulated since the development of these technologies. Analyzing this data and integrate it to study a complex disease can be an overwhelming task, principally because it would require combining data from different technologies and platforms. Moreover, the lack of uniformity on experimental annotations in public databases such as Gene Expression Omnibus (GEO) adds to the challenge.
By integrating transcriptomic datasets from different sources and their curated annotations, we developed an online web resource to facilitate the exploration of gene expression profiles of two respiratory diseases: Idiopathic Pulmonary Fibrosis (IPF) and Chronic Obstructive Pulmonary Disease (COPD). This resource can be used to identify differentially expressed genes that replicate in different experiments. This project sets the foundation to integrate transcriptomics data of other respiratory diseases and smoker phenotypes facilitating the identification of common and divergent pathways that lead to a pathological state.
METHODS: To build the compendium we used COMMAND (Engelen K. et al, 2011), a software that allows the comparison and integration of transcriptomics data from different sources and platforms into a compendium. COMMAND has successfully been used to build transcriptome data compendia in bacteria (Engelen K. et al., 2011) and grapevine (Moretto M. et al., 2016). Using COMMAND we created PulmonDB, a human lung database that allows us to integrate, analyze and explore gene expression data from different sources by contrasting controls and patients using available clinical phenotypes (i.e. age, gender, the status of the disease, FEV1, etc.). We selected transcriptome experiments for IPF and COPD by querying in GEO and ArrayExpress with chosen keywords. Each experiment was downloaded, imported to COMMAND and the experimental conditions were annotated, the contrast group was selected, and data was normalized homogeneously to create PulmonDB.
RESULTS: PulmonDB is an exploratory web interface that contains IPF and COPD gene expression data; the platform will be expanded to include other abnormal lung phenotypes. This resource facilitates the exploration of gene expression profiles under different pathological conditions and allows the identification of co-expression patterns.
CONCLUSIONS: PulmonDB can help the scientific community to study which genes have a distinct expression profile related with a disease, explore the reproducibility across technologies and platforms, identify interesting co-expression patterns across diseases and to find relationships among distinct clinical or experimental variables.