Archive for the ‘web applications’ Category

fogbugz?

Wednesday, April 8th, 2009

fogbugz.png

FogBugz is a pretty clean and simple bug tracking software that is pretty well integrated in Eclipse. Not perfect since it is reliant upon some form of live connection, but this is a pretty good solution for something.

I have created an evaluation account with FogBugz and can create a collection of incidents, WIKIs and documentation. The code is hosted on their servers, and everything is presented through a rich web page GUI. There is more potential through the availability of a FogBugz GUI, and this could be used to solve many of my requirements of a platform independent off-line bug reporting and analysis environment.

One issue that is worth considering is that FogBugz is not free; it is a commercial solution costing $25 per user per month. For a small company such as Mnemosyne BioSciences this is acceptable, but for academic environments this becomes complicated…

This is certainly something that I intend to evaluate more fully, and I will report back on my experiences as I reach the end of the evaluation period and make the decision as to whether the FogBugz solution could provide an answer for the problem of tracking software and development and bugs within a bioinformatics data analysis environment project.

EMAAS: An extensible grid-based Rich Internet Application for microarray data analysis and management

Monday, March 23rd, 2009

G Barton et al., BMC Bioinformatics 2008, 9:493doi:10.1186/1471-2105-9-493

emaas.jpgEMAAS is another environment for handling and analysis of gene expression data. The authors have set about the development of a distributed e-support system for the management and analysis of microarray data; to provide access to complex methods and to apply (from a biologist’s POV) non-trivial technologies to handle large multi-variate datasets.

Whilst other solutions have missed the point and taken an easy approach to solving the problem, the EMAAS approach is rather more complicated and relies instead on integration of internet accessible tools, standard statistical packages (R/Bioconductor) and web-resources (CELSIUS, GEO). The decision to aim for a modular and flexible framework is excellent and makes this in my opinion a very much more interesting project. The completeness with which tools and environments has been included is breathtaking; the depth of IT and analytical platforms required is rather daunting.

In contrast to the manuscript reviewed in the last post, this resource’s source is available under a suitable GPL license, and some of the demo server also works. I have some problems with the resource (Flash for a start), but this is one smooth implementation and is packaged in such a way that I could take it for a spin if I so wished!

This manuscript is heavy to read, but a damned fine resource is described underneath the technical fluff. This is a great resource and this earns a great recommendation from the bioinformaticsblog.

Bioinformatics, academia, open-source and proxy servers

Friday, January 9th, 2009

proxyServer

I have a pet peeve! Proxy servers and filtering gateway servers! As an academic in a OK university on the periphery of the civilised world, proxy and filtering gateway servers are something that exist, but are largely invisible to the user. Network connections just work! Bittorrent, ssh, ftp, telnet whatever. There may be throttling on some ports, and blocks of addresses may be banned, but the process is rather hidden. Writing bioinformatics software is clean and simple. Project X requires access to some data in XML format. Create a quick java script, dump into Tomcat and volia, a working XML dumper running on an obscure port of a production server!

In industry things are a little different. Companies maintain a largely healthy lockdown on the network; you are allowed to access what is identified as necessary. In my corporate environment this includes the non-blocked http pages on ports 80 and 8080. https connections on 443 work fine, and external ssh connections pose no problem. ftp is usually fine, but that is it! The reason is simple, companies need to protect IP, and the network is locked down to prevent the installation of non-approved software, and to prevent any “trojan” applications from leaking anything to the largely untrusted outside world.

Bioinformatics developers please note that connections outside of these ports is not possible. ENSEMBL is a fantastic resource and the ENSEMBL MySQL databases are a trove of mission valuable data. A database may appear on ports 3306, 3316, 5306 or 5316 depending on which version of ENSEMBL is required and whether it is required within the BioMart framework. BioMart itself has a registry, which again requires 3306, and in turn describes a collection of other servers with typically blocked ports.

In an industrial workplace to open a port is possible, but it requires paperwork. A justification as to who needs access to the resource, which ports on which server, the reason for the resource and different levels of approval. The job is actioned by an outsourced service provider and things work, but it all takes time!

If you are writing a really cool java WebStart application, deploying a BioMart, Taverna, MySQL, … resource, please consider the needs of the small (but potentially fee-paying) community who would love to evaluate your resource, but may skip the evaluation if the communication port is blocked. I could provide a long list of resources that really cannot be accessed (ArrayTrack, Chipster, BioMart, MySQL, KEGG …) but this is perhaps not fair…

A more valuable comment would be a suggestion as to how this can be avoided – subdomains are always very simple to implement on the server side, and you could consider mapping key resouces that use non-standard ports to a more standard port. If a resource looks good, then we also have our tricks that can work – the use of external servers that translate requests works, but is a hassle that no-one really needs!

Bioinformatics, task management and support projects

Wednesday, January 7th, 2009

I am quite happy getting into this blog writing drive. In addition to organising myself sufficiently to start writing this blog, I have also setup a web accessible WIKI for some thoughts, ideas and project documentation. I have also started to organise the plethora of academic projects, problems and activities that are running in the background.

In the past I have evaluated solutions such as dotProject, PHPProject and several others. These have provided rather too much duplication of Outlook functionality (shudder – did I really write that?) and have left me rather cold. I am now on a drive to use something rather much more useful, NetOffice. ProjectPier looks like another candidate, but for now I’ll stick with NetOffice that scores very highly with my specific needs.

The NetOffice project is pretty old, the pages are SourceForge are horrible, and the project hasn’t been updated in ages, but it has everything that I need and seems to work just fine. In addition to defining “customer accounts” and “customer users”, I can create projects and share workflows, tasks and forums with the users – this is a really cool application set and something that I guess we should be evaluating more in the future.

I know that the BioinformaticsBlog is read by users in small bioinformatics companies, CRO and academic core facilities. How do you manage your projects, customers and the development and workflow of accounting, experimentation and reporting?