Search
Meeting report: a workshop on Best Practices in Genome Annotation.
Efforts to annotate the genomes of a wide variety of model organisms are currently carried out by sequencing centers, model organism databases and academic/institutional laboratories around the world. Different annotation methods and tools have been developed over time to meet the needs of biologists faced with the task of annotating biological data. While standardized methods are essential for consistent curation within each annotation group, methods and tools can differ between groups,...
A genomic survey of positive selection in Burkholderia pseudomallei provides insights into the evolution of accidental virulence.
Certain environmental microorganisms can cause severe human infections, even in the absence of an obvious requirement for transition through an animal host for replication ("accidental virulence"). To understand this process, we compared eleven isolate genomes of Burkholderia pseudomallei (Bp), a tropical soil microbe and causative agent of the human and animal disease melioidosis. We found evidence for the existence of several new genes in the Bp reference genome, identifying 282 novel genes...
The comprehensive microbial resource.
The Comprehensive Microbial Resource or CMR (http://cmr.jcvi.org) provides a web-based central resource for the display, search and analysis of the sequence and annotation for complete and publicly available bacterial and archaeal genomes. In addition to displaying the original annotation from GenBank, the CMR makes available secondary automated structural and functional annotation across all genomes to provide consistent data types necessary for effective mining of genomic data. Precomputed...
Diversity, function and evolution of genes coding for putative Ni-containing superoxide dismutases.
We examined the phylogenetic distribution, functionality and evolution of the sodN gene family, which has been shown to code for a unique Ni-containing isoform of superoxide dismutase (Ni-SOD) in Streptomyces. Many of the putative sodN sequences retrieved from public domain genomic and metagenomic databases are quite divergent from structurally and functionally characterized Ni-SOD. Structural bioinformatics studies verified that the divergent members of the sodN protein family code for similar...
Global approaches to study protein-protein interactions among viruses and hosts.
While high-throughput protein-protein interaction screens were first published approximately 10 years ago, systematic attempts to map interactions among viruses and hosts started only a few years ago. HIV-human interactions dominate host-pathogen interaction databases (with approximately 2000 interactions) despite the fact that probably none of these interactions have been identified in systematic interaction screens. Recently, combinations of protein interaction data with RNAi and other...
The Protein Naming Utility: a rules database for protein nomenclature.
Generation of syntactically correct and unambiguous names for proteins is a challenging, yet vital task for functional annotation processes. Proteins are often named based on homology to known proteins, many of which have problematic names. To address the need to generate high-quality protein names, and capture our significant experience correcting protein names manually, we have developed the Protein Naming Utility (PNU, /pn-utility). The PNU is a web-based database for storing and applying...
Pathema: a clade-specific bioinformatics resource center for pathogen research.
Pathema (http://pathema.jcvi.org) is one of the eight Bioinformatics Resource Centers (BRCs) funded by the National Institute of Allergy and Infectious Disease (NIAID) designed to serve as a core resource for the bio-defense and infectious disease research community. Pathema strives to support basic research and accelerate scientific progress for understanding, detecting, diagnosing and treating an established set of six target NIAID Category A-C pathogens: Category A priority pathogens;...
EcoCyc: a comprehensive view of Escherichia coli biology.
EcoCyc (http://EcoCyc.org) provides a comprehensive encyclopedia of Escherichia coli biology. EcoCyc integrates information about the genome, genes and gene products; the metabolic network; and the regulatory network of E. coli. Recent EcoCyc developments include a new initiative to represent and curate all types of E. coli regulatory processes such as attenuation and regulation by small RNAs. EcoCyc has started to curate Gene Ontology (GO) terms for E. coli and has made a dataset of E. coli GO...
A newly-developed community microarray resource for transcriptome profiling in Brassica species enables the confirmation of Brassica-specific expressed sequences.
The Brassica species include an important group of crops and provide opportunities for studying the evolutionary consequences of polyploidy. They are related to Arabidopsis thaliana, for which the first complete plant genome sequence was obtained and their genomes show extensive, although imperfect, conserved synteny with that of A. thaliana. A large number of EST sequences, derived from a range of different Brassica species, are available in the public database, but no public microarray...