What is linguistic profiling?
Linguistic profiling is using linguistic characteristics or dialect to identify an author’s characteristics, such as social origin or native tongue. Carabao provides basic linguistic profiling capabilities by detecting words and expressions peculiar to a specific social group. For example, if a text contains a lot of verbs ending with “-ise” and not “-ize” (initialize, recognize, etc.), and nouns ending with “-our” rather than “-or” (flavor, neighbor, etc.), or certain words like “indeed” or “rather”, it is likely to be authored by a British or Australian English speaker. Using the sequence mechanism, it is possible to detect specific constructs favoured by certain groups, or characteristic mistakes.