Yet another challenge to informatics, well does it have the answer this time?

- April 30, 2008

Among number of challenges faced by informatics one of the long standing and critical challenge has been the Biodiversity informatics: the challenge of linking data and the role of shared identifiers.

A major challenge facing biodiversity informatics is integratingdata stored in widely distributed databases. Initial effortshave relied on taxonomic names as the shared identifier linkingrecords in different databases. However, taxonomic names havelimitations as identifiers, being neither stable nor globallyunique, and the pace of molecular taxonomic and phylogeneticresearch means that a lot of information in public sequencedatabases is not linked to formal taxonomic names. This reviewexplores the use of other identifiers, such as specimen codesand GenBank accession numbers, to link otherwise disconnectedfacts in different databases. The structure of these links canalso be exploited using the PageRank algorithm to rank the resultsof searches on biodiversity databases. The key to rich integrationis a commitment to deploy and reuse globally unique, sharedidentifiers [such as Digital Object Identifiers (DOIs) and LifeScience Identifiers (LSIDs)], and the implementation of servicesthat link those identifiers.

Search This Blog

Bio Saga Blog - A Chronicle of Life Sciences & Informatics

Yet another challenge to informatics, well does it have the answer this time?

Comments

Popular posts from this blog

Top 100 Cutting-Edge Science Blogs

India, UK based Anuva ties up with US genomics major

Top 25 Indian Bioinformatics Companies