Why aren't RefSeq records made for all organisms, or for all of the loci available in Entrez Gene?
RefSeq records are provided for identified complete genomes and for identified genome sequencing projects as collaborations are established or as the sequence data becomes available. For the NCBI curation-supported pipeline, they are made under the following conditions:
• The locus in question represents a functional gene that encodes a protein, a structural RNA, or another RNA product. In addition, RNA and genomic RefSeq records are provided to represent identified pseudogenes. RefSeq records are not provided for Gene records that represent a chromosomal region rather than a gene.
• At least one representative accession number has been identified for the locus. The starting point can be either an mRNA or a genomic sequence record.
• For protein-coding genes, the identified sequence has a full-length coding region annotated. RefSeq records are not made for loci where only partial coding region sequence data is available (as annotated on the GenBank source record). In addition, th