TY - JOUR
T1 - The Protein Naming Utility
T2 - A rules database for protein nomenclature
AU - Goll, Johannes
AU - Montgomery, Robert
AU - Brinkac, Lauren M.
AU - Schobel, Seth
AU - Harkins, Derek M.
AU - Sebastian, Yinong
AU - Shrivastava, Susmita
AU - Durkin, Scott
AU - Sutton, Granger
PY - 2009/12/8
Y1 - 2009/12/8
N2 - Generation of syntactically correct and unambiguous names for proteins is a challenging, yet vital task for functional annotation processes. Proteins are often named based on homology to known proteins, many of which have problematic names. To address the need to generate high-quality protein names, and capture our significant experience correcting protein names manually, we have developed the Protein Naming Utility (PNU, http://www.jcvi.org/pn-utility). The PNU is a web-based database for storing and applying naming rules to identify and correct syntactically incorrect protein names, or to replace synonyms with their preferred name. The PNU allows users to generate and manage collections of naming rules, optionally building upon the growing body of rules generated at the J. Craig Venter Institute (JCVI). Since communities often enforce disparate conventions for naming proteins, the PNU supports grouping rules into user-managed collections. Users can check their protein names against a selected PNU rule collection, generating both statistics and corrected names. The PNU can also be used to correct GenBank table files prior to submission to GenBank. Currently, the database features 3080 manual rules that have been entered by JCVI Bioinformatics Analysts as well as 7458 automatically imported names.
UR - http://www.scopus.com/inward/record.url?scp=75549087774&partnerID=8YFLogxK
U2 - 10.1093/nar/gkp958
DO - 10.1093/nar/gkp958
M3 - Article
C2 - 20007151
AN - SCOPUS:75549087774
SN - 0305-1048
VL - 38
SP - D336-D339
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - SUPPL.1
M1 - gkp958
ER -