Skip to main content

Bioinformatics News post

I will start with a monthly post about bioinformatics. It might contain news from Spain, or whatever I heard about.

  • Changed the machine room of Embl

Read Ewan's blog post to learn about this silent move.

I would also share this discussion from biostar:

Why the bioinformatics field don't move to the "leading edge technology for biology big data analysis"?
I personally agree with this long answer, which states that the problem is not the how we process information but, what we do with information. In genomics, metabolomics, interactomics, we don't need to process live streams of data, so technolgoy developed by Facebook, and Google towards solving this issues cannot be applied in -omics field.
But I must disagree with the author, I don't think GO or other efforts to understand the data add noise, only if wrongly used they add noise. But I must agree that there is such diversity of teams and purpose that it is very difficult to set a goal or create a standard procedure followed by everyone.

What do you think?  Bioinformatics should use leading edge technology or invest more in how we can use information we already have?


Popular posts from this blog

Functional enrichment methods and pathways

For some time I have been working on one topic. I am not sure if this is how it started but I fail to see other reasons. So I'll describe why I'm now working with gene sets collections.

The trigger
I usually try to help others in Biostars, Bioconductor, and in the StackExchange network (specially in Bioinformatics). On one of these sites I was trying to help some person, and in one of the comments ( Jun 21 '17 ) it says:
You don't build pathway maps from bioinformatics data, you build them from wet-lab experiments. And I was : "Why not? We already know (kind of) the number of genes, and we have an idea or the number of metabolites in a cell. We have many data, why can't we build pathways?" But I did a brief literature search and I couldn't find anything (if there is something let me know in the comments).
The background
Let me explain why this comment got me puzzled: in my work I am usually asked what is the relevance of the lists of gen…

GSEA in Bioconductor

Gene Set Enrichment Analysis is a test thought to find if the position of a group along a list implies some difference. The most know method is the one maintained by the Broad Institute. As it was the first widely used in biology and holds several collection of gene sets. A gene set is a collection of genes related, by either a function or an experiment, it is as fuzzy described as a pathway.

In Bioconductor there is the under used tool of BiocViews, a topic for package classificacion. We can find a category for GSEAs under Software>BiologicalQuestion>GeneSetEnrichment.

This category list 74 packages at the time of writing, which provide function for Gene Set Enrichment Analysis. It will be too long (and too hard for me) to describe all the packages in that category. However, it doesn't include all the packages that perform gene set enrichment.

The first package for GSEA in Bioconductor one should look is GSEABase which provides with tools for reading files from the Br…

RISK cohort

Since some time I am working with Crohn's Disease. One of the problems with the disease is that it is not known what happens. People has found associations with microorganisms, but the relationship between those microorganisms and the patient is still unknown. Also the risk factors for complications is largely unknown.

This post follows up the use of a patient cohort data enrolled for identifying the risk factors of complications and health-care costs in pediatric and adult onset Crohn’s disease. Where we can see some usage of the data and the problems of unclear descriptions when using the same data.

Articles describing the RISK cohort The first mention to the RISK cohort I could found is in this article [1] where they describe a cohort as:
an observational research program that enrolled patients younger than age 17 diagnosed with in flammatory (nonpenetrating, nonstricturing) CD from 2008 through 2012 at 28 pediatric gastroenterology centers in North America. I had 552 patients …