Abstract
Free full text
Compilation of tRNA sequences and sequences of tRNA genes
Abstract
Maintained at the Universität Bayreuth, Bayreuth, Germany, the Compilation of tRNA Sequences and Sequences of tRNA Genes is accessible at the URL http://www.tRNA.uni-bayreuth.de with mirror site located at the Institute of Protein Research, Pushchino, Russia (http://alpha.protres.ru/trnadbase). The compilation is a searchable, periodically updated database of currently available tRNA sequences. The present version of the database contains a new Genomic tRNA Compilation including the sequences of tRNA genes from genomic sequences published up to July 2003. It consists of about 5800 tRNA gene sequences from 111 organisms covering archaea, bacteria, higher and lower eukarya. The former Compilation of tRNA Genes (up to the end of 1998) and the updated Compilation tRNA Sequences (561 entries) are also supported by the new software. The database can be explored by using multiple search criteria and sequence templates. The database provides a service that allows to obtain statistical information on the occurrences of certain bases at given positions of the tRNA sequences. This allows phylogenic studies and search for identity elements in respect to interactions of tRNAs with various enzymes.
INTRODUCTION
The remarkable progress in large-scale automated DNA sequencing and the development of computer algorithms for fast automatic identification of the location of tRNA genes in sequenced genomes lead to the exponential increase of the available tRNA sequence information. To date, more than a hundred genomes from Eukarya, Bacteria and Archaea have been completely sequenced and the sequencing of many genomes is in progress. The comprehensive information on the sequences, structure and occurrences of tRNA genes in different genomes allows reliable systematic analysis of phylogenic dependencies, anticodon usage, characteristic structural features of tRNA, etc. In the light of this, the tRNA compilation was completely renewed. New version of the database now includes the separate collection of tRNA genes from published complete genomes. Thus the current compilation keeping all the advantages of the previous versions was extended to cover the needs of modern genomics.
DATABASE CONTENT AND ORGANIZATION
The new Compilation of tRNA Sequences and Sequences of tRNA genes contains in addition to 3279 sequences of the last edition from 1998 (1) the completely new Genomic tRNA Compilation including the sequences of tRNA genes from complete genomes published up to July 2003. The current database consists of three parts:
Genomic tRNA Compilation is a new addition to the database. This is the most complete compilation of the sequences of cytoplasmic tRNA genes derived from complete genome sequences included into DNA databases. Since sequences of tRNA genes originating from cellular organelles frequently cannot be processed to the general cloverleaf scheme, they were not included in the Genomic tRNA Compilation. There are specialized databases dealing with these sequences (2,3). Current Genomic tRNA Compilation consists of about 5800 tRNA gene sequences from 111 organisms covering archaea, bacteria, higher and lower eukarya. The database includes the tRNA gene sequences collected in GtRDB (4) as well as those from the additional complete genomes found in DNA databases. If the genomes of the different strains of the same organism were sequenced, the corresponding tRNA genes were added to the database independently.
Compilation of tRNA Sequences, is a summary of tRNA sequences, including modified bases and references of the publications. The references are restricted to the first publication of the complete sequence unless additional information (e.g. base modification, corrections, etc.) was later obtained. In such cases, additional references were added. This compilation is updated up to December 2002. The table contains the known tRNA sequences of all organisms including organelles. This is the continuation of the original tRNA compilation first published in 1978.
Compilation of tRNA Genes, is a summary of the sequences of tRNA genes published in the literature and databases up to the end of 1998. It contains tRNA genes of all organisms and organelles, but is not updated since January 1999. This table contains about 350 sequences of cytoplasmic tRNA genes that are not included in the Genomic tRNA Database. Most of the tRNA gene entries in this table have references of the publications in which the sequence was communicated.
The database is organized as MS Excel® workbooks. All the information collected is split into different indexed tables according to the type of data (specificity, sequence, organism, etc.) and the descriptions of certain genes are summarized in the main worksheet that includes the relations between the data tables. The information can be obtained by filling the query form that allows to enter the simple search criteria and to select the type of data to be displayed. The result of search is presented as a table containing the description of the sequences found. This includes amino acid specificity, anticodon sequence, organism name, strain, literature reference, PubMed ID, sequence, base-pairing and additional comments. Description of tRNA genes in Genomic tRNA Compilation also includes full organism taxonomy, position of the gene in the genome and index of general database record used as a source of the data.
In order to facilitate a computer analysis, an alignment of sequences is used, which is most compatible with the tRNA phylogeny and known three-dimensional structures of tRNA (5,6). The corresponding numbering system is described in (1). Positions in particular sequence which are not filled (gaps in the generalized structure) are indicated by a dash. The compilations use a one-letter code for all nucleotides including modified ones. To designate modified nucleotides, the other ASCII signs other than A, C, G, T and U are employed. Terminology and structure of the modified nucleosides occurring in tRNAs were used according to (7) and (8). All nucleotide insertions are commented and denoted by underlining at the place of insertion.
In addition to the plain text table one can explore the result of search by presenting the sequences in a cloverleaf form. It is possible to scroll the found sequences one by one or to select directly the sequence of interest from the result table. The presentation supports colour code for different structural features in the canonical cloverleaf model.
Simple statistical information on the occurrences of certain bases at given positions and the preferences in base-pairing also can be obtained on a special data sheet.
Compilation of tRNA Sequences and Compilation of tRNA Genes also exist in the classical tabular form described in (1). For user convenience, tables were converted to MS Excel® format.
ACCESS
The data are freely accessible for research purposes at http://www.tRNA.uni-bayreuth.de and http://alpha.protres.ru/trnadbase. This article should be cited in research projects assisted by the use of the compilation. Comments, corrections and new entries are welcome.
ACKNOWLEDGEMENTS
This project was supported by Fonds der Chemischen Industrie and Universität Bayreuth. We are grateful for advise, cooperation and help with data collection to Todd Michael Johnson Lowe, Genetics, Stanford University, California, Dr Carlos Hoyo-Vadillo, Departamento de Farmacologia y Toxicologia, Cinvestav, Mexico City, Mexico, Yvonne Baberowski, Mark Dürr and Bernhard Thielen, Universität Bayreuth.
REFERENCES
Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press
Full text links
Read article at publisher's site: https://doi.org/10.1093/nar/gki012
Read article for free, from open access legal sources, via Unpaywall: https://academic.oup.com/nar/article-pdf/33/suppl_1/D139/7621475/gki012.pdf
Citations & impact
Impact metrics
Citations of article over time
Smart citations by scite.ai
Explore citation contexts and check if this article has been
supported or disputed.
https://scite.ai/reports/10.1093/nar/gki012
Article citations
A ligation-independent sequencing method reveals tRNA-derived RNAs with blocked 3' termini.
Mol Cell, 84(19):3843-3859.e8, 02 Aug 2024
Cited by: 1 article | PMID: 39096899
Temperature-Dependent tRNA Modifications in Bacillales.
Int J Mol Sci, 25(16):8823, 13 Aug 2024
Cited by: 0 articles | PMID: 39201508 | PMCID: PMC11354880
The complete mitochondrial genome of Pontederia crassipes: using HiFi reads to investigate genome recombination and gene transfer from chloroplast genome.
Front Plant Sci, 15:1407309, 28 Jun 2024
Cited by: 0 articles | PMID: 39006960
Meta-2OM: A multi-classifier meta-model for the accurate prediction of RNA 2'-O-methylation sites in human RNA.
PLoS One, 19(6):e0305406, 26 Jun 2024
Cited by: 0 articles | PMID: 38924058 | PMCID: PMC11207182
NSUN2/YBX1 promotes the progression of breast cancer by enhancing HGH1 mRNA stability through m5C methylation.
Breast Cancer Res, 26(1):94, 06 Jun 2024
Cited by: 0 articles | PMID: 38844963
Go to all (266) article citations
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
tRNAdb 2009: compilation of tRNA sequences and tRNA genes.
Nucleic Acids Res, 37(database issue):D159-62, 28 Oct 2008
Cited by: 574 articles | PMID: 18957446 | PMCID: PMC2686557
Compilation of tRNA sequences and sequences of tRNA genes.
Nucleic Acids Res, 26(1):148-153, 01 Jan 1998
Cited by: 601 articles | PMID: 9399820 | PMCID: PMC147216
Mamit-tRNA, a database of mammalian mitochondrial tRNA primary and secondary structures.
RNA, 13(8):1184-1190, 21 Jun 2007
Cited by: 95 articles | PMID: 17585048 | PMCID: PMC1924894
Evolution from possible primitive tRNA-viroids to early poly-tRNA-derived mRNAs: a new approach from the poly-tRNA theory.
Genome Inform, 16(2):94-103, 01 Jan 2005
Cited by: 3 articles | PMID: 16901093
Review