How well does the software perform on occupation narratives?
To identify and categorize SOIC errors, unique industry and occupation narratives were reviewed separately. SOIC-assigned and manually assigned codes were compared, using Bureau of the Census occupation divisions as a guide. Of the 48,067 total cases, 9,808 contained unique occupation narratives once all duplicates were removed (e.g., the term “carpenter” appears in the occupation narrative field of the deduplicated file only once, however many times it appeared in the original file of 48,067 cases). Almost half (4,852) were coded correctly. The software incorrectly coded 2,020 narratives and could not code 2,936 narratives. The problems identified are characterized below.
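The breakdown of the 9,808 unique occupation narratives can be checked with a few lines of arithmetic. The sketch below is illustrative only: the counts are taken from the text, while the variable names and outcome labels are assumptions introduced here, not part of SOIC.

```python
# Counts for unique occupation narratives, as reported in the text.
# The dictionary keys are illustrative labels, not SOIC terminology.
counts = {
    "coded correctly": 4852,
    "coded incorrectly": 2020,
    "not coded": 2936,
}
total_unique = 9808

# The three outcomes should account for every unique narrative.
assert sum(counts.values()) == total_unique

# Share of each outcome, as a percentage rounded to one decimal place.
shares = {
    outcome: round(100 * n / total_unique, 1)
    for outcome, n in counts.items()
}

for outcome, pct in shares.items():
    print(f"{outcome}: {counts[outcome]:,} of {total_unique:,} ({pct}%)")
```

The three counts sum to the 9,808 unique narratives, so each narrative falls into exactly one outcome.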