Rsrc
J-GLOBAL ID:202010003245708661
Research Resource code:NBDC01957
Update date:Jul. 09, 2020
RepeatsDB
RepeatsDB
Owning Organization:
-
BioComputing laboratory, University of Padua
-
Institue of Molecular Biology, gGmbH
-
Structural Bioinformatics and Molecular Modelling Group, CRBM
Resource classification:
Data,Database
Tag (subject) (1):
Protein
Tag (data type) (2):
Sequence
, 3D structure
Species (1):
All (NCBI Taxonomy ID: 1)
Overview:
RepeatsDB (http://repeatsdb.bio.unipd.it/) is a database of annotated tandem repeat protein structures. Tandem repeats pose a difficult problem for the analysis of protein structures, as the underlying sequence can be highly degenerate. Several repeat types haven been studied over the years, but their annotation was done in a case-by-case basis, thus making large-scale analysis difficult. We developed RepeatsDB to fill this gap. Using state-of-the-art repeat detection methods and manual curation, we systematically annotated the Protein Data Bank, predicting 10 745 repeat structures. In all, 2797 structures were classified according to a recently proposed classification schema, which was expanded to accommodate new findings. In addition, detailed annotations were performed in a subset of 321 proteins. These annotations feature information on start and end positions for the repeat regions and units. RepeatsDB is an ongoing effort to systematically classify and annotate structural protein repeats in a consistent way. It provides users with the possibility to access and download high-quality datasets either interactively or programmatically through web services.
Source:
FAIRsharing
Record maintainer:
See the corresponding part in the FAIRsharing record
Record license:
Creative Commons Attribution and Share-alike (CC-BY-SA) International 4.0 license
Return to Previous Page