Specialised JIS character set mappings (index)
Except for NEC-X-0201.txt
, all of the below make greater or lesser use of Apple-designated encoding hints.
- NEC-X-0201.txt: JIS X 0201, NEC variant.
- JIS C 6226:1978, NEC variant. I’m actually sourcing 78JIS variant characters from the x0212 repertoire where appropriate. For instance, 16-02 唖, the second kanji in Level 1, is given as 啞 in 78JIS and hence the bitmap font used by the PC98, and that’s how I’m mapping it. If you need more conventional mapping with NEC extensions, by all means probe a more industry standard mapping (such as from the WHATWG) first, and fall back to this one.
- NEC-C-6226-visual.txt: my version of 2019-02. Changes I made then made it make full use of the x0212 repertoire to obtain more 1978-appropriate mappings. However, 25-23 remained U+6602 (昂) and 90-22 remained U+663B (昻), while actual NEC fonts (to say nothing of IBM-942) have them the other way around (I’ll be honest, this was an oversight on my part: I knew about the itaiji swaps between JIS C 6226 levels 1 and 2 in 1983, but didn’t realise that there was also one between the IBM Extensions repertoire and the JIS C 6226 repertoire). Not that all fonts on my system draw them differently (for those that do, U+663B’s bottom-left component looks somewhat like a katakana エ).
- NEC-C-6226-visual3.txt: my version of 2021-08-12. 25-23 is U+663B, 90-22 is U+6602, and several other characters the CID map for NEC 78JIS maps differently have been changed, where the targets were not compatibility ideographs and the NEC bitmap font concords. Some of these are x0213 but not x0212 characters, while others are neither.
- JAPAN_PS.TXT: MacJapanese, PostScript variant (for some reason, Apple only seem to have supplied the regular variant mapping to Unicode, despite using mappings for both regular and PostScript variants in their system libraries – this is the latter).
- ARIBPicto.txt: ARIB STD B24 extensions to JIS X 0208, in a format akin to
EmojiSources.txt
(Semicolon Separated Values, Unicode representation first, hexadecimal codes, code point sequences delimited by spaces, Shift_JIS form of encoded forms), but with only the one source. - HD-X-0201.txt: JIS X 0201, Hitachi HD44780 variant.