How are emoji encoded in Unicode?
114 characters in the core emoji set are mapped to sequences of one or more characters available in Unicode before Version 6.0. The other 608 characters in the core emoji set are mapped to sequences of one or more characters added in Unicode 6.0, primarily in the blocks for Miscellaneous Symbols and Pictographs, Emoticons, Transport and Map Symbols, but also in blocks such as Dingbats and Technical Symbols. There is no block set aside specifically for emoji. Characters that are separate in the extended JIS X 0208 sets used by the three major cell phone carriers in Japan are mapped to separate characters in Unicode in what is known as the Emoji Source Separation Rule. For example, The emoji core set includes a character mapped to U+1F3B5 MUSICAL NOTE; this could not be unified with U+226A EIGHTH NOTE, because both exist as separate characters in the extended JIS sets used by all three of the major cell phone carriers in Japan. Because characters in the core emoji set are treated as pict