In the NCBI curation-supported pipeline, how is the GenBank source sequence initially selected?
There are several factors used in selecting the source GenBank sequence that is first used to generate the PROVISIONAL mRNA RefSeq record, but quite often the record used is selected primarily because it includes more complete UTR sequence data. Reference sequence records are not intended to represent the historical ‘first sequenced’ record (although for genes with very limited available sequence data they may at times do so). PROVISIONAL records may be updated before being fully reviewed to use a longer GenBank source sequence that becomes available. While the PROVISIONAL RefSeq records do represent a single GenBank source sequence, the REVIEWED RefSeq records are intended to represent the current state of knowledge as provided by the whole research community rather than by any one laboratory.
Related Questions
- What is the reference database for the ADT (e.g., source and version for sequence, Minor Allele Frequency [MAF], validation)?
- May a student take selected courses from PALCS and take the rest from a private or homeschool source?
- In the NCBI curation-supported pipeline, how is the GenBank source sequence initially selected?