Search
Enterococcus faecalis antigens in human infections.
Genomic libraries of two Enterococcus faecalis strains, OG1RF and TX52 (an isolate from an endocarditis patient), were constructed in cosmid vectors pBeloBAC11 and pLAFRx, and screened with a serum from a rabbit immunized with surface proteins of an E. faecalis endocarditis isolate and sera from four patients with enterococcal endocarditis. Seventy-five cosmid clones reacted with at least two of the sera. Thirty-eight of the 75 immunopositive clones were considered to contain distinct inserts...
The TIGR human cDNA database.
A tool for analyzing and annotating genomic sequences.
We describe a tool for analyzing and annotating large genomic sequences containing introns. The analysis and annotation tool (AAT) includes two sets of programs, one for comparing the query sequence with a protein database and the other for comparing the query with a cDNA database. Each set contains a fast database search program and a rigorous alignment program. The database search program quickly identifies regions of the query sequence that are similar to a database sequence. Then the...
Identification of a gene within the tandem array of red and green color pigment genes.
The tandem array of color pigment genes on chromosome Xq28 contains nested exons of a distinct gene. This gene (termed TEX28) is composed of five exons that span almost the entire distance between the protein-coding regions of the color pigment genes and a transketolase-related gene. Although most of the TEX28 gene is repeated within the color pigment gene array, the exclusion of exon 1 from the array is predicted to restrict transcription to a single copy of the gene. The TEX28 gene encodes a...
A decision tree system for finding genes in DNA.
MORGAN is an integrated system for finding genes in vertebrate DNA sequences. MORGAN uses a variety of techniques to accomplish this task, the most distinctive of which is a decision tree classifier. The decision tree system is combined with new methods for identifying start codons, donor sites, and acceptor sites, and these are brought together in a frame-sensitive dynamic programming algorithm that finds the optimal segmentation of a DNA sequence into coding and noncoding regions (exons and...
Large-scale sequencing of plant genomes.
The large number of ESTs generated for Arabidopsis and rice in recent years now act as an important complement to whole genome sequencing projects. The Arabidopsis Genome Initiative has begun a coordinated effort to sequence the entire genome and, as a result, increasing numbers of large sequence entries can be found in the public databases. In addition, the mitochondrial genome of Arabidopsis has been completely sequenced. Genome sequencing studies and the public sequence databases have begun...
Sequence-tagged connectors: a sequence approach to mapping and scanning the human genome.
The sequence-tagged connector (STC) strategy proposes to generate sequence tags densely scattered (every 3.3 kilobases) across the human genome by arraying 450,000 bacterial artificial chromosomes (BACs) with randomly cleaved inserts, sequencing both ends of each, and preparing a restriction enzyme fingerprint of each. The STC resource, containing end sequences, fingerprints, and arrayed BACs, creates a map where the interrelationships of the individual BAC clones are resolved through their...
Comparing protein sequence-based and predicted secondary structure-based methods for identification of remote homologs.
We have compared a novel sequence-structure matching technique, FORESST, for detecting remote homologs to three existing sequence based methods, including local amino acid sequence similarity by BLASTP, hidden Markov models (HMMs) of sequences of protein families using SAM, HMMs based on sequence motifs identified using meta-MEME. FORESST compares predicted secondary structures to a library of structural families of proteins, using HMMs. Altogether 45 proteins from nine structural families in...
Human BAC ends.
The Human BAC Ends database includes all non-redundant human BAC end sequences (BESs) generated by The Institute for Genomic Research (TIGR), the University of Washington (UW) and California Institute of Technology (CalTech). It incorporates the available BAC mapping data from different genome centers and the annotation results of each end sequence for the contents of repeats, ESTs and STS markers. For each BAC end the database integrates the sequence, the phred quality scores, the map and the...
Anchoring of rice BAC clones to the rice genetic map in silico.
A wealth of molecular resources have been developed for rice genomics, including dense genetic maps, expressed sequence tags (ESTs), yeast artificial chromosome maps, bacterial artificial chromosome (BAC) libraries and BAC end sequence databases. Integration of genetic and physical maps involves labor-intensive empirical experiments. To accelerate the integration of the bacterial clone resources with the genetic map for the International Rice Genome Sequencing Project, we cleaned and filtered...