Archive for the ‘Uncategorized’ Category

Martini: using literature keywords to compare gene sets

Monday, January 4th, 2010

martini I have been having a big push over the last couple of months to consider the needs of biocontextualisation of gene list data within a commercial setting. A bioinformatician is not always capable of making a rational decision based on gene names, and even dedicated biologists are swamped by genes that fall outside of their specialist domains – does a collection of differentially expressed genes relate to something good, to something bad, or to something unexpected?

Martini is a pretty good looking solution that was published in Nucleic Acids Research (NAR 2010, Vol 38, p26-38) by researchers from EMBL in Heidelberg. The solution is based on a somewhat clumsy web interface, but the data behind is based on Medline abstract data and more importantly Medline keywords. Some bioinformatics magic, a little creativity and clearly a lot of hard work are then used to condense a set of differentially expressed genes into something that appears (at least superficially thus far) pretty useful and clearly demonstrates superiority over other solutions that are pretty tightly bound to the Gene Ontology.

What about commercial usage? The authors state pretty clearly that the solution is free-for-all! This comes of course with a caveat; is it really wise to upload proprietary gene list data to an external server? While we wouldn’t lose knowledge on molecular structures, there is certainly a risk element here and I imagine that most commercial bioinformaticians will avoid this tool. I feel that the tool also suffers from the need to work with the somewhat loveless web interface. A cleaner R based API would be lovely – perhaps a task for the weekend?

In conclusion Martini looks very appealing, have a clear reason for existence and is certainly something that I will evaluate. I am already tempted to see how I might use the system within an R integration project – straight to paper reporting is the way to go! Have a look at http://martini.embl.de to see what can be done.

Bioinformatics-for-dummies

Wednesday, December 30th, 2009

What is the point of this blog? In the past it was intended as a forum to transfer knowledge across the academic-industrial divide, but I now suspect that I am a firm industrial-bioinformatician and a couple of failed attempts to move back (to academia) in the last 6 months seem to have concreted my intentions to stay on this side of the divide.

Now is I feel, the right time to reestablish the goals of this blog, and the aim should be literature review (for industrial bioinformaticians) and to set out flows, thoughts and logic that may aid and assist best-of-breed bioinformatics workflows in industry, but using open- and academic driven solutions.

Back in business …

Wednesday, December 30th, 2009

I have neglected the blog for quite a long time, a few engines read the pages daily, but the whole business was looking a little tired and I lacked the enthusiasm to continue without some form of reward. Finally a new year, and my chance to establish the BioinformaticsBlog as somewhere relevant? I’m on the case!

google discovers the arse of Finland!

Saturday, August 22nd, 2009


Turku may not be the centre of the known Universe, but 3 google camera cars in downtown Turku suggests that this place may be more important than usually given credit for!

Posted by ShoZu

Polished boat!

Friday, June 12th, 2009


Bioinformaticians need hobbies, along with coding, family, racing bikes and motorcycles I love yachts and sailing. Here the boat is ready for launch and well attached to a Saab. Who said that bioinformatics can’t be the best job in the world?

Posted by ShoZu

ping …

Friday, June 12th, 2009

There has been rather high latency associated with the blog! Interest remains, but other duties have been at the fore! I’ll focus again, bioinformatics remains my life!

Posted by ShoZu

The silence of the bioinformaticians

Wednesday, April 1st, 2009

I have been silent and rather occupied with a small project that is swallowing most of my free time; the development of a server::client interface for the analysis of Affymetrix GeneChips utilising the aroma.affymetrix package. Things are going pretty well; but I have introduced rather more complexity into the system by moving my logging routines from something home made into a more formal log4j schema.

At the moment I have a lovely struggle with the system – something in my code base appears to have some “deprecated” log4j code – and running the methods through Tomcat yields some rather vague and uninterpretable error messages.

java.lang.NoSuchFieldError: level

WTF? This problem appears to be pretty well documented across the WWW – but still resolving the problem down into a .jar file that should be upgraded, deleted or something is a little trying!

Alas – this is bioinformatics!

The problem has been solved, and bioinformatics was again the issue. Within my code I have the martj .jar file that provides some connectivity with the BioMart infrastructure. Within this martj.jar is a copy of log4j; and this version collides with the version that I was trying to use! What a great use of rather too many hours – but at least a load of code has now been refactored and the .jar dependencies are now a little cleaner!

crashing visitor stats – not enough new content

Monday, March 23rd, 2009

distraction.jpg

I have been distracted – after a rather busy week on the road, and a week of catching up with paperwork and a few rather critical tasks within the corporate bioinformatics environment and the bioinformaticsblog is left feeling a little blue and rather unloved. There appear to be pretty good numbers of visitors; but the numbers have crashed over the last couple of weeks – there is a dearth of new content.

A quick check of the server logs show that “aroma.affymetrix” is again one of the top search strings that bring people to this place – my planned “aroma.affymetrix tutorials for bioinformatics dummies” is still in progress, and awaiting release. It is pretty crazy, but the next most popular theme for the bioinformaticsblog search is “bioinformatics iPhone”. I am not sure what I should be reading into this; but it seems that this is a hot topic for those of you out there writing BSc and MSc projects in bioinformatics at the moment.

I am really worried by some of the terms that bring people here – who got here through a search for “I love bioinformatics”? A good sentiment, but one that you should be careful in announcing – I will not name and shame (yet)! An equally good query is the “what to do with my life bioinformatics”. I guess that many of us have thought about this, but someone has actually used Google to solve their problem! Related queries include “what to do bioinformatics in industry” and “best working practice bioinformatics”.

Yikes – is this a good thing or something I should be very afraid of?

Bioinformatics and best working practices are somehow interlinked within industrial bioinformatics; and this is likely to include enterprise biocomputing solutions, but will probably not involve any aroma.affymetrix or iPhone. There does appear to be a collective angst in bioinformatics; but it really is a great place to work in-between the corporate reorganisations ;-)

Bioinformatician on the road

Friday, March 13th, 2009

fail-owned-laundry-fail
see more pwn and owned pictures

I have had a pretty amazing week in Munich and Freising and have learned things that I needed to know (and unfortunately a few things I wish I didn’t need to know – qPCR really is a bizarre technology). Some former colleagues working with a pharmacogenomics CRO on the edge of Starnberger see recommended the failblog as a good place to waste some time! So winding knowledge acquisition down and exploring their suggestion has yielded a little understanding as to the site- and some pretty hearty laughs! Excellent!

I’ll be back in Finland tomorrow – I can start blogging again then and we should start having a look at the bioinformatics of quantitative PCR – a horrible subject! This has at least given me some enthusiasm to implement an R-based RDML parser – another contribution to head to Bioconductor!

Preclinical drug development explained?

Tuesday, March 10th, 2009


A crash course for bioinformaticians presented by the Deutsches Museum in Munich.

Posted by ShoZu