The carbohydrate-active enzymes database (CAZy) in 2013.

Lombard V; Golaconda Ramulu H; Drula E; Coutinho PM; Henrissat B

doi:10.1093/nar/gkt1178

The carbohydrate-active enzymes database (CAZy) in 2013.

Affiliations

1. Centre National de la Recherche Scientifique, CNRS UMR 7257, 13288 Marseille, France and Aix-Marseille Université, AFMB, 163 Avenue de Luminy, 13288 Marseille, France.
Authors
Lombard V¹
(1 author)

ORCIDs linked to this article

Nucleic Acids Research, 21 Nov 2013, 42(Database issue):D490-5
https://doi.org/10.1093/nar/gkt1178 PMID: 24270786 PMCID: PMC3965031

This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.

Free full text in Europe PMC

Abstract

The Carbohydrate-Active Enzymes database (CAZy; http://www.cazy.org) provides online and continuously updated access to a sequence-based family classification linking the sequence to the specificity and 3D structure of the enzymes that assemble, modify and breakdown oligo- and polysaccharides. Functional and 3D structural information is added and curated on a regular basis based on the available literature. In addition to the use of the database by enzymologists seeking curated information on CAZymes, the dissemination of a stable nomenclature for these enzymes is probably a major contribution of CAZy. The past few years have seen the expansion of the CAZy classification scheme to new families, the development of subfamilies in several families and the power of CAZy for the analysis of genomes and metagenomes. This article outlines the changes that have occurred in CAZy during the past 5 years and presents our novel effort to display the resolution and the carbohydrate ligands in crystallographic complexes of CAZymes.

Free full text

Nucleic Acids Res. 2014 Jan 1; 42(Database issue): D490–D495.

Published online 2013 Nov 21. https://doi.org/10.1093/nar/gkt1178

PMCID: PMC3965031

PMID: 24270786

The carbohydrate-active enzymes database (CAZy) in 2013

Vincent Lombard,^1,² Hemalatha Golaconda Ramulu,^1,² Elodie Drula,^1,² Pedro M. Coutinho,^1,² and Bernard Henrissat^1,^*

Author information Article notes Copyright and License information Disclaimer

This article has been cited by other articles in PMC.

Abstract

INTRODUCTION

Despite their similar chemical composition, carbohydrates can form an enormous number of combinations through the stereochemical variety of the hydroxyl groups that they carry, through the many possibilities to assemble monosaccharides one to another, and through the wealth of noncarbohydrate substituents that can decorate the resulting oligo- and polysaccharides. Complex carbohydrates are widely distributed in nature, where they mediate a multitude of biological functions, from carbon reserve, to structural molecules, or as the mediators of intra- and intercellular recognition within one organism or between organisms. The diversity of complex carbohydrates is controlled by a panel of enzymes involved in their assembly (glycosyltransferases) and their breakdown (glycoside hydrolases, polysaccharide lyases, carbohydrate esterases), collectively designated as Carbohydrate-Active enZymes (CAZymes). CAZymes have been classified in sequence-based families for >22 years (1–6) and this classification has become the standard of the field (7).

The first defining feature of CAZyme classification is that the families are defined based on significant amino acid sequence similarity with at least one biochemically characterized founding member (1). The consequence is that sequences that display too little similarity to ensure a significant alignment are not included, nor used to form putative families, as distant relatives of CAZymes may have other functions. Borderline cases are stored in the nonclassified section of each CAZyme category, awaiting biochemical characterization. A second defining feature is that our classification is made module by module. CAZymes are frequently modular proteins with a catalytic module harbouring a variable number of other discrete modules, which can be either catalytic or not. Thus a modular CAZyme can be assigned to several families if its constitutive modules belong to separate families. The third important feature is that we only analyse systematically protein sequences released in the daily releases of GenBank (ftp://ftp.ncbi.nih.gov/genbank/daily-nc), to avoid analysing unfinished sequences that may change accession number.

As early as 1991, it was noted that the sequence-based families of glycoside hydrolases grouped together enzymes of different substrate specificities (i.e. enzymes with ‘different’ EC numbers) (1) demonstrating that the acquisition of novel specificity has been commonplace during evolution. This feature was subsequently noted for the other classes of CAZymes (4,6). The processes by which a novel substrate specificity was acquired from a common ancestor leave detectable traces in the sequence of contemporary proteins. Thus, unexpectedly, the usual drawback of carbohydrates (their chemical resemblance) is at the origin of their success in the postgenomic era: CAZymes need to be specific to perform their biological functions. While the precise specificity of DNAses, RNAses, proteases and esterases is difficult or impossible to derive from their sequence alone, the CAZyme classification system allows in some cases the prediction of the broad category of carbohydrate substrate, based on the assignment to a family (8). This carries the potential to infer the glycobiological profile of an organism (or a community thereof) based on DNA sequence. However, the occurrence of enzymes that act on different substrates in the same family is a significant problem for the automated functional annotation of CAZyme-related genes. This can sometimes be overcome by the definition of subfamilies within families (9,10) (see below), but our current knowledge of the sequence-to-specificity relationships in CAZymes families is still largely insufficient and unevenly distributed for many families to allow unsupervised automated substrate prediction.

The Carbohydrate-Active Enzymes database (CAZy; http://www.cazy.org) was launched in 1999 to provide online and constantly updated access to the family classification of CAZymes. Coupled to the CAZypedia encyclopaedic resource (http://www.cazypedia.org), CAZy is the only comprehensive resource that correlates the sequence, structure and molecular mechanism of CAZymes. CAZy was presented in this journal in 2009 (11) and the present article outlines the changes that have been implemented in CAZy during the past 5 years.

WEBSITE DESIGN

In March 2011, the website interface was deeply redesigned both in appearance (new layout, new colours and new logo) and in content. Thus new sections and new links have been added to commercial providers that list their products following the CAZy nomenclature. Other additions cover scientific meetings relevant to CAZymes, positions available and a ‘what’s new’ section that provides news on changes in the CAZy database. More interactivity in the display of information associated with each family was introduced (Figure 1). In particular, each family has now a specific tab, which lists those individual CAZymes that we believe have been experimentally characterized. Because the number of entries in several families had become impractical, the display was modified to just show the header for each family along with a series of tabs for access to subsets (All, Archea, Bacteria, Eukaryota, unclassified, Structure, Characterized). Each tab displays 1000 entries per page, except for the tab listing the characterized enzymes, where only 100 entries are shown per page. The search tool was also revisited and one can now search the entire site or specific fields such as CAZy family, taxonomic identifier, organism name, protein name, accessions in different databases (GenBank, Uniprot and Protein Data Bank (PDB)), known activities, EC number, mechanisms or clan.

An external file that holds a picture, illustration, etc.
Object name is gkt1178f1p.jpg

Figure 1.

A view of the GH13 page showing the newly available 3D structural information (carbohydrate ligands and resolution) in the Structure tab.

NOVEL ENZYME CLASS

Because lignin is invariably found together with polysaccharides in the plant cell wall and because lignin fragments are likely to act in concert with polysaccharide lytic mono-oxygenases (LPMO), families of lignin degradation enzymes and of LPMOs have been used to define a new CAZy class that we have named ‘Auxiliary Activities’ to accommodate a broad range of enzyme mechanisms and substrates related to lignocellulose conversion (12).

DATABASE GROWTH

At the date of submission of this article, CAZy reports sequence information on almost 340 000 CAZymes, a staggering 225% increase compared with 5 years ago (Table 1). During the same period, the number of biochemically characterized CAZymes has grown by only 30% to 12 700 and the number of CAZymes with 3D structures has grown by ~78% (Table 1). Despite this growth, only ~1400 (0.4%) of the 340 000 CAZymes have a 3D structure solved to date. The past 5 years have seen the number of families covered by CAZy grow slowly to >330 at present. Five years ago the number of genome sequences analysed in CAZy was 750 (11). This number is now greater than 2800 (see below), representing a 3.8-fold increase. The continuously growing gap between the number of sequences and the number of biochemically or structurally characterized CAZymes is a direct consequence of the avalanche of genome sequences resulting from modern sequencing technologies combined with the much lower pace of experimental characterization of gene products. This gap would even be more considerable if one was to search and list CAZymes in nonfinished genomes.

Table 1.

Growth of the CAZy database during the past 5 years

Protein class	Sequences Sept-2013	Dec-2008	Characterized Sept-2013	Dec-2008	With structure Sept-2013	Dec-2008
GH	159 274	46 654	9221	6805	817	475
GT	119 910	40 863	1936	1846	139	83
PL	4043	1301	336	262	51	34
CE	15 856	5083	275	212	74	43
CBM	32 259	9210	663	570	280	166
AA	5801	464^a	299	71^a	58	3^a
Total	337 143	103 111	12 730	9695	1419	801

^aNumbers estimated from the literature: the AA category did not exist in December 2008.

DATABASE CONTENT: SUBFAMILIES

The occurrence of enzymes that act on different substrates in the same family prevents the straightforward functional annotation of CAZyme-related genes. The division of CAZyme families into subfamilies based on phylogenetic analysis has been explored as a possible approach to improve the relationship between sequence and specificity. Subfamily classification of GH5, GH13, GH30 and all of the PL families has shown that the majority of the defined subfamilies are monospecific, thus indicating that the correlation of substrate specificity with sequences is significantly better at the subfamily level than the family level (9,10,13). An additional benefit of the division into subfamilies is the identification of currently uncharacterized subfamilies that can subsequently be analysed experimentally to unveil potential new activities. Subfamilies are currently displayed for families GH5, GH13, GH30, AA1–AA5 and for all PL families. Many more families are currently evaluated for subfamily definitions. Care is taken that the subfamilies are defined in a robust manner to avoid confusion that would arise from constant redefinitions and resulting different naming conventions. We prefer to let the subfamilies ‘mature’ until we feel that the subfamily quality and stability is sufficient for public release.

DATABASE CONTENT: GENOMES

The collection of carbohydrate-active enzymes encoded by the genome of an organism (‘CAZome’) provides an insight into the nature and extent of the metabolism of complex carbohydrates of the species. The CAZomes of free-living organisms typically correspond to 1–5% of the predicted coding sequences. Extremely reduced CAZomes are characteristic of species with a strict intracellular parasitic lifestyle. Because of the massive chemical, structural and functional variability of carbohydrates, CAZome comparisons can highlight the adaptation of the CAZymes repertoire of species to their environment (14,15).

Since 2011, in addition to giving the family distribution, the new CAZy website displays the complete list of putative CAZymes (with accession numbers) of each genome that was analysed. At present, CAZy covers >2800 genomes in the following kingdoms: Bacteria (2351), Archea (158), Eukaryota (73), Viruses (240). The CAZomes listed in the CAZy website correspond to protein models of finished genomes, i.e. with proteins released in the daily releases of GenBank (ftp://ftp.ncbi.nih.gov/genbank/daily-nc). In a few cases, genomes with protein models not released as finished entries in GenBank but publically available, have been analysed and are presented in CAZy. However, for these few cases, the display only shows the number of proteins in each family, but does not feature the actual list of proteins.

Genomes are analysed using the CAZy pipeline, which combines Blast and HMM tools to compare protein models, respectively, with sequence and profile libraries created from the sequences of the catalytic and noncatalytic modules of the CAZy database. This is followed by a manual inspection by expert curators to resolve borderline cases (11). Our methodology provides coherent, expert and comparable sets of annotations. In this respect, one should note that the correspondence between CAZy families and those in PFAM (16)/INTERPRO (17) or DBCAN (18) is far from perfect. This is due to a variety of reasons that include different strategies, different thresholds, different goals, different methods, different training sets and different degrees of expert curation. An unfortunate consequence is that the CAZyme analysis of a genome performed with one method usually cannot be compared with that done with another.

There are two ways to get a genome analysed by CAZy: if the genome and encoded proteins are deposited as finished entries in GenBank (or EMBL or DDBJ) they will be analysed by our daily routines. Alternatively, if one wishes to perform a CAZy analysis before deposition to GenBank (or EMBL or DDBJ), one should approach us for collaboration. Metagenomic data are analysed exclusively in collaboration due to their usual large size.

DISPLAY OF STRUCTURAL INFORMATION

The CAZy database is not only used by those who wish to analyse genomes, but also by structural biologists who study the molecular details of substrate recognition by CAZymes. Until September 2013, the only information available in the structure pages of CAZy was the accession and macromolecule chain name(s) in the PDB (http://www.rcsb.org) (19). We have made a series of developments to provide additional information relevant to the 3D structure of CAZymes such as the resolution (for crystal structures) and a description of the carbohydrate ligands found in the CAZyme binding sites.

The resolution information is straightforward to generate, as it is present in the PDB files of structures solved by x-ray crystallography. When the resolution information is unavailable in the PDB file, the type of experimental method by which the structure was solved is given instead (powder diffraction or nuclear magnetic resonance).

On the other hand, the PDB does not provide any option to perform a comprehensive search for carbohydrate structures found in CAZyme binding sites and, unlike proteins or nucleic acids, the nomenclature for carbohydrate residues within PDB files is not standardized (20). In addition, the information on how the isolated carbohydrate residues are linked to each other is not described in PDB files. We thus extract the carbohydrate ligand information from PDB files using PDB-care (http://www.glycosciences.de/tools/pdb-care/) (21,22). The carbohydrate molecules covalently linked to an Asn or a Ser/Thr residue were discarded to eliminate N- and O-glycans to identify the carbohydrate ligands bound to CAZyme active sites. The latter are shown in the structure pages of CAZy following their IUPAC nomenclature.

Not all carbohydrate structures are susceptible to automated description by PDB-care. In a number of cases, we have manually curated and provided IUPAC descriptions for structures that are unsuitable to PDB-care: (i) nonreducing glycans (cyclodextrins, sucrose and sucrose derivatives, trehalose, kestose, raffinose, nystose, etc.), (ii) ligands that contain both carbohydrate and noncarbohydrate moieties such as acarbose and acarbose derivatives, (iii) sulfur-containing oligosaccharides, (iv) fluorine-containing carbohydrates and (v) oligosaccharides containing 3,6-anhydro bridges. Table 2 displays examples of the manually handled cases. In addition, automated scripts have been devised to handle ~180 carbohydrate analogues that we denote <carb_like_ligandref> where ligandref corresponds to the three-letter ligand name given by the PDB. For instance, the carbohydrate-like inhibitor 1-deoxynojirimycin appears as <carb_like_NOJ>. The structural biology community is invited to contact us to report the possible errors that might have slipped through our curation process.

Table 2.

Examples of carbohydrate ligands treated manually

Category	Common name	Display in CAZy structure pages	Example of PDB file
Nonreducing oligosaccharides	α-cyclodextrin	α-cyclodextrin	3EDF
	β-cyclodextrin	β-cyclodextrin	3CGT
	Sucrose	α-D-Glcp-(1-2)-β-D-Fruf	4FFH
	Raffinose	α-D-Galp-(1-6)-α-D-Glcp-(1-2)-β-D-Fruf	1W2T
	Kestose	α-D-Glcp-(1-2)-β-D-Fruf-(1-2)-β-D-Fruf	3LDR
	Nystose	α-D-Glcp-(1-2)-β-D-Fruf-(1-2)-β-D-Fruf-(1-2)-β-D-Fruf	3LEM
Thio-oligosaccharides	Thio-cellobiose	β-D-Glcp-(1-4)-β-D-Glcp4S	4IPM
	Thio-laminaribiose	β-D-Glcp-(1-3)-β-D-Glcp3S	1J8V
	Thio-xylopentaose	β-D-Xylp-(1-4)-β-D-Xylp4S-(1-4)-β-D-Xylp4S-(1-4)-β-D-Xylp4S-(1-4)- β-D-Xylp4S	3CUJ
	α-methyl-thio-cellopentaoside	β-D-Glcp-(1-4)-β-D-Glcp4S-(1-4)-β-D-Glcp4S-(1-4)-β-D-Glcp4S-(1-4)- α-D-Glcp4S-(1-1)-methyl	1H5V
Fluoro-oligosaccharides	5-fluoro-β-D-glucose	β-D-Glcp5F	4AMX
	2-deoxy-2-fluoro-α-D-glucose	α-D-Glcp2F	1UYQ
	5-fluoro-β-D-xylose	β-D-Xylp5F	2XVK
3,6-anhydro oligosaccharides	Neoagarohexaose	α-L-3,6-anhydro-Galp-(1-3)-β-D-Galp-(1-4)-α-L-3,6-anhydro-Galp-(1-3)- β-D-Galp-(1-4)-α-L-3,6-anhydro-Galp-(1-3)-β-D-Galp	2CDO
	Porphyran/agarose hexasaccharide	α-L-Galp6SO3-(1-3)-α-D-Galp-(1-4)-α-L-3,6-anhydro-Galp-(1-3)- β-D-Galp-(1-4)-α-L-Galp6SO3-(1-3)-α-D-Galp	4AW7
	Agarooctaose	α-L-3,6-anhydro-Galp-(1-3)-β-D-Galp-(1-4)-α-L-3,6-anhydro-Galp-(1-3)- β-D-Galp-(1-4)-α-L-3,6-anhydro-Galp-(1-3)-β-D-Galp-(1-4)- α-L-3,6-anhydro-Galp-(1-3)-β-D-Galp	4ATF
Acarbose and its derivatives	Acarbose	<non_carb>-(1-4)-α-D-6-deoxy-Glcp4N-(1-4)-α-D-Glcp-(1-4)-β-D-Glcp	3ZOA
	Acarbose-derived trisaccharide	<non_carb>-(1-4)-α-D-6-deoxy-Glcp4N-(1-4)-α-D-Glcp	1XCW
	Acarbose-derived pentasaccharide	α-D-6-deoxy-Glcp4N-(1-4)-α-D-Glcp-(1-4)-<non_carb>-(0-4)- α-D-6-deoxy-Glcp4N-(1-4)-α-D-Glcp-(1-4)-β-D-Glcp	1PIG

As of September 2013, >1400 CAZymes and modules thereof have a known 3D structure, corresponding to almost 6000 PDB entries out of which ~1500 carbohydrate (or carbohydrate analogue) ligands are now identified and presented in the structure tab of each CAZy family.

FUTURE DIRECTIONS

CAZy is a knowledge-based resource that aims to link the sequence, the specificity and the 3D structural features of CAZymes. How these enzymes achieve selective recognition of target substrates that display only subtle stereochemical differences is key to prediction of substrate specificity. While this is already achievable for a few subfamilies, we are still a long way from a reliable automated substrate (and/or product) prediction for all CAZymes encoded by a genome. We believe that subfamily-based target selection for experimental investigation of CAZymes will progressively fill the knowledge gap that will allow reliability in future functional predictions.

FUNDING

Agence Nationale de la Recherche, grant BIP:BIP [ANR-10-BINF-03-04]. Funding for open access charge: Waived by Oxford University Press.

Conflict of interest statement. None declared.

ACKNOWLEDGMENTS

We thank Thomas Lütteke (Justus-Liebig University Gießen, Institute of Veterinary Physiology and Biochemistry, Germany) for providing us the PDB-care program, and Kirk M. Schnorr (Novozymes, Bagsvaerd, Denmark) for his help with proof reading this manuscript. Bernard Henrissat is a Honorary Professor of Glycomics at the Faculty of Health and Medical Sciences, University of Copenhagen, Denmark.

REFERENCES

1. Henrissat B. A classification of glycosyl hydrolases based on amino acid sequence similarities. Biochem. J. 1991;280:309–316. [Europe PMC free article] [Abstract] [Google Scholar]

2. Henrissat B, Bairoch A. New families in the classification of glycosyl hydrolases based on amino acid sequence similarities. Biochem. J. 1993;293:781–788. [Europe PMC free article] [Abstract] [Google Scholar]

3. Henrissat B, Davies G. Structural and sequence-based classification of glycoside hydrolases. Curr. Opin. Struct. Biol. 1997;7:637–644. [Abstract] [Google Scholar]

4. Campbell JA, Davies GJ, Bulone V, Henrissat B. A classification of nucleotide-diphospho-sugar glycosyltransferases based on amino acid sequence similarities. Biochem. J. 1997;326:929–939. [Europe PMC free article] [Abstract] [Google Scholar]

5. Coutinho PM, Deleury E, Davies GJ, Henrissat B. An evolving hierarchical family classification for glycosyltransferases. J. Mol. Biol. 2003;328:307–317. [Abstract] [Google Scholar]

6. Lombard V, Bernard T, Rancurel C, Brumer H, Coutinho PM, Henrissat B. A hierarchical classification of polysaccharide lyases for glycogenomics. Biochem. J. 2010;432:437–444. [Abstract] [Google Scholar]

7. Hart GW, Copeland RJ. Glycomics hits the big time. Cell. 2010;143:672–676. [Europe PMC free article] [Abstract] [Google Scholar]

8. Cantarel BL, Lombard V, Henrissat B. Complex carbohydrate utilization by the healthy human microbiome. PLoS One. 2012;7:e28742. [Europe PMC free article] [Abstract] [Google Scholar]

9. Stam MR, Danchin EG, Rancurel C, Coutinho PM, Henrissat B. Dividing the large glycoside hydrolase family 13 into subfamilies: towards improved functional annotations of alpha-amylase-related proteins. Protein Eng. Des. Sel. 2006;19:555–562. [Abstract] [Google Scholar]

10. Aspeborg H, Coutinho PM, Wang Y, Brumer H, III, Henrissat B. ) Evolution, substrate specificity and subfamily classification of glycoside hydrolase family 5 (GH5) BMC Evol. Biol. 2012;12:186. [Europe PMC free article] [Abstract] [Google Scholar]

11. Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V, Henrissat B. The carbohydrate-active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Res. 2009;37:D233–D238. [Europe PMC free article] [Abstract] [Google Scholar]

12. Levasseur A, Drula E, Lombard V, Coutinho PM, Henrissat B. Expansion of the enzymatic repertoire of the CAZy database to integrate auxiliary redox enzymes. Biotechnol. Biofuels. 2013;6:41. [Europe PMC free article] [Abstract] [Google Scholar]

13. St John FJ, González JM, Pozharski E. Consolidation of glycosyl hydrolase family 30: a dual domain 4/7 hydrolase family consisting of two structurally distinct groups. FEBS Lett. 2010;584:4435–4441. [Abstract] [Google Scholar]

14. Ohm RA, Feau F, Henrissat B, Schoch CL, Horwitz BA, Barry KW, Condon BJ, Copeland AC, Dhillon B, Glaser F, et al. Diverse lifestyles and strategies of plant pathogenesis encoded in the genomes of eighteen Dothideomycetes fungi. PLoS Pathogens. 2012;8:e1003037. [Europe PMC free article] [Abstract] [Google Scholar]

15. Lozupone C, Hamady M, Cantarel BL, Coutinho PM, Henrissat B, Gordon JI, Knight R. The convergence of carbohydrate active gene repertoires in human gut microbes. Proc. Natl Acad. Sci. USA. 2008;105:15076–15081. [Abstract] [Google Scholar]

16. Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, et al. The Pfam protein families database. Nucleic Acids Res. 2012;40:D290–D301. [Europe PMC free article] [Abstract] [Google Scholar]

17. Hunter S, Jones P, Mitchell A, Apweiler R, Attwood TK, Bateman A, Bernard T, Binns D, Bork P, Burge S, et al. InterPro in 2011: new developments in the family and domain prediction database. Nucleic Acids Res. 2012;40:D306–D312. [Europe PMC free article] [Abstract] [Google Scholar]

18. Yin Y, Mao X, Yang J, Chen X, Mao F, Xu Y. dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2012;40:W445–W451. [Europe PMC free article] [Abstract] [Google Scholar]

19. Rose PW, Bi C, Bluhm WF, Christie CH, Dimitropoulos D, Dutta S, Green RK, Goodsell DS, Prlic A, Quesada M, et al. The RCSB protein data bank: new resources for research and education. Nucleic Acids Res. 2013;41:D475–D482. [Europe PMC free article] [Abstract] [Google Scholar]

20. Petrescu AJ, Petrescu SM, Dwek RA, Wormald MR. A statistical analysis of N- and O-glycan linkage conformations from crystallographic data. Glycobiology. 1999;9:343–352. [Abstract] [Google Scholar]

21. Lütteke T, Bohne-Lang A, Loss A, Goetz T, Frank M, von der Lieth CW. GLYCOSCIENCES.de: an Internet portal to support glycomics and glycobiology research. Glycobiology. 2006;16:71R–81R. [Abstract] [Google Scholar]

22. Lütteke T, von der Lieth CW. PDB-care (PDB carbohydrate residue check): a program to support annotation of complex carbohydrate structures in PDB files. BMC Bioinformatics. 2004;4:69. [Europe PMC free article] [Abstract] [Google Scholar]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

Full text links

Read article at publisher's site: https://doi.org/10.1093/nar/gkt1178

Read article for free, from open access legal sources, via Unpaywall: https://academic.oup.com/nar/article-pdf/42/D1/D490/3617205/gkt1178.pdf

Citations & impact

Impact metrics

3,036

Citations

Jump to Citations

Data citations

Jump to Data

Citations of article over time

Alternative metrics

Altmetric item for https://www.altmetric.com/details/3682525

Altmetric
Discover the attention surrounding your research
https://www.altmetric.com/details/3682525

Article citations

Unique Fn3-like biosensor in σ<sup>I</sup>/anti-σ<sup>I</sup> factors for regulatory expression of major cellulosomal scaffoldins in Pseudobacteroides cellulosolvens.
Dong S, Chen C, Li J, Liu YJ, Bayer EA, Lamed R, Mizrahi I, Cui Q, Feng Y
Protein Sci, 33(11):e5193, 01 Nov 2024
Cited by: 1 article | PMID: 39470320
Genome-wide analysis of the Amorphophallus konjac AkCSLA gene family and its functional characterization in drought tolerance of transgenic arabidopsis.
Luo C, Luo S, Chen Z, Yang R, He X, Chu H, Li Z, Li W, Shi Y
BMC Plant Biol, 24(1):1033, 31 Oct 2024
Cited by: 0 articles | PMID: 39478464 | PMCID: PMC11526714
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
Environmental Driving of Adaptation Mechanism on Rumen Microorganisms of Sheep Based on Metagenomics and Metabolomics Data Analysis.
He H, Fang C, Liu L, Li M, Liu W
Int J Mol Sci, 25(20):10957, 11 Oct 2024
Cited by: 0 articles | PMID: 39456741 | PMCID: PMC11508146
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
Comparative genomic analysis of pathogenic factors of Listeria spp. using whole-genome sequencing.
Qi Y, Cao Q, Zhao X, Tian C, Li T, Shi W, Wei H, Song C, Xue H, Gou H
BMC Genomics, 25(1):935, 07 Oct 2024
Cited by: 0 articles | PMID: 39375592 | PMCID: PMC11457443
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
Elucidation of the noncovalent interactions driving enzyme activity guides branching enzyme engineering for α-glucan modification.
Zong Z, Zhang X, Chen P, Fu Z, Zeng Y, Wang Q, Chipot C, Leggio LL, Sun Y
Nat Commun, 15(1):8760, 09 Oct 2024
Cited by: 0 articles | PMID: 39384762 | PMCID: PMC11464733
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC

Go to all (3,036) article citations

Other citations

Wikipedia (Showing 10 of 78)

Data

Data behind the article

This data has been text mined from the article, or deposited into data resources.

BioStudies: supplemental material and supporting data

http://www.ebi.ac.uk/biostudies/studies/S-EPMC3965031?xr=true

Protein structures in PDBe (Showing 16 of 16)

(1 citation) PDBe - 1J8V
View structure
(1 citation) PDBe - 4AMX
View structure
(1 citation) PDBe - 3ZOA
View structure
(1 citation) PDBe - 2XVK
View structure
(1 citation) PDBe - 3CGT
View structure
(1 citation) PDBe - 4ATF
View structure
(1 citation) PDBe - 4IPM
View structure
(1 citation) PDBe - 1PIG
View structure
(1 citation) PDBe - 1UYQ
View structure
(1 citation) PDBe - 2CDO
View structure
(1 citation) PDBe - 3EDF
View structure
(1 citation) PDBe - 4FFH
View structure
(1 citation) PDBe - 1XCW
View structure
(1 citation) PDBe - 1H5V
View structure
(1 citation) PDBe - 4AW7
View structure
(1 citation) PDBe - 1W2T
View structure

Show less

Data that cites the article

This data has been provided by curated databases and other sources that have cited the article.

Protein families in InterPro (2)

GH62_arabinosidase(InterPro - IPR005193)
Glyco_hydro_68(InterPro - IPR003469)

Search life-sciences literature (45,103,589 articles, preprints and more)

The carbohydrate-active enzymes database (CAZy) in 2013.

Author information

Affiliations

Authors

ORCIDs linked to this article

Abstract

Free full text

The carbohydrate-active enzymes database (CAZy) in 2013

Vincent Lombard

Hemalatha Golaconda Ramulu

Elodie Drula

Pedro M. Coutinho

Bernard Henrissat

Abstract

INTRODUCTION

WEBSITE DESIGN

NOVEL ENZYME CLASS

DATABASE GROWTH

Table 1.

DATABASE CONTENT: SUBFAMILIES

DATABASE CONTENT: GENOMES

DISPLAY OF STRUCTURAL INFORMATION

Table 2.

FUTURE DIRECTIONS

FUNDING

ACKNOWLEDGMENTS

REFERENCES

Full text links

Citations & impact

Impact metrics

Citations of article over time

Alternative metrics

Article citations

Other citations

Wikipedia (Showing 10 of 78)

Data

Data behind the article

BioStudies: supplemental material and supporting data

Protein structures in PDBe (Showing 16 of 16)

Data that cites the article

Protein families in InterPro (2)

Similar Articles

Partnerships & funding