# # Name: Extended PUA for Deseret to Unicode table # Unicode version: 10.0 # Table version: 2.0 # Date: 2019-04-11 # Author: HarJIT , based on existing table. # # Mapping of an extended Deseret PUA encoding extending the obsolete # CSUR mapping to address shortcomings in the existing Unicode mapping # (and also to allow Deseret use in BMP-only apps, *cough* TeXmacs). # # Characters without Unicode mappings are mapped to themselves. # # Helpfully, the CSUR mapping differs from the Unicode mapping in that # both cases begin at the start of a line, leaving far more space for # expansion than the (subsequently filled) two per case left within the # Unicode block. (The Unicode block used as few lines as possible, by # setting the minuscules back by eight cells relative to the majuscules # compared to the pre-existing CSUR mapping - this made the block one # line or sixteen cells shorter, eight cells per case.) # # --- Referenced: -------------------------------------------------------- # # Sources of Shelton's repertoire (similar to that accepted in Unicode # but with long oi, hooked ew, left-facing long e and schwa): # - https://commons.wikimedia.org/wiki/File:Early_Deseret_printed_hand\ # written_lineup_based_on_a_document_written_by_Mormon_pioneer_Mario\ # n_J._Shelton.svg (omits schwa) # - https://commons.wikimedia.org/wiki/File:The_word_broken_in_MJ_Shel\ # ton%27s_Deseret_handwriting.png (with schwa) # # Sources of Watts' pre-1856 alternate variant (rotating glyphs in the # pattern Er -> Short O (u) -> Short E -> Short A -> En -> Schwa), note # left-facing Long E and mutually illegible alternative Ow in both: # - https://commons.wikimedia.org/wiki/File:Early_Deseret_cursive_line\ # up_based_on_a_document_written_by_its_creator_George_D_Watt.svg # - https://commons.wikimedia.org/wiki/File:Early_Deseret_alphabet_cha\ # rt_found_in_Jules_Remy_and_Julius_Brenchley%27s_A_Journey_to_Great\ # -Salt-Lake_City_(1855).png (print form of cursive variant) # # --- Sidenote (Unicode/UCS divisions): ---------------------------------- # # - Line: 16 cells (points) # - Ward (UCS row): 16 lines or 256 cells # - Block: designated non-overlapping region of consecutive complete # lines, can span multiple wards but not multiple planes. # - Plane: 256 wards # - Group: in theory, 256 planes; in reality, limited at 17 planes. # - UCS-4: designed as 128 groups, later limited to one. # # --- Original blurb: ---------------------------------------------------- # # Name: Deseret ConScript to Unicode table # Unicode version: 3.1 # Table version: 1.00 # Table format: Format A # Date: 2001-02-02 # Author: Doug Ewell # # General notes: # # This table provides a mapping for the Deseret Alphabet between the # encoding registered in the ConScript Unicode Registry and the encoding # adopted as part of Unicode 3.1. The Deseret Alphabet was registered # in ConScript in 1997, but the registration was withdrawn in 2001 # following the publication of draft tables for Unicode 3.1. # # This table is provided to allow implementations based on the ConScript # encoding to migrate to Unicode 3.1. # # Format: Three tab-separated columns: # Column #1 is the ConScript code (in hex as 0xXXXX) # Column #2 is the Unicode (in hex as 0xXXXXX) # Column #3 is the Unicode name (follows a comment sign, '#') # # The entries are in ConScript order (compatible with Unicode order). # # ------------------------------------------------------------------- # # ConScript captials: 0xE830 0x10400 # DESERET CAPITAL LETTER LONG I 0xE831 0x10401 # DESERET CAPITAL LETTER LONG E 0xE832 0x10402 # DESERET CAPITAL LETTER LONG A 0xE833 0x10403 # DESERET CAPITAL LETTER LONG AH 0xE834 0x10404 # DESERET CAPITAL LETTER LONG O 0xE835 0x10405 # DESERET CAPITAL LETTER LONG OO 0xE836 0x10406 # DESERET CAPITAL LETTER SHORT I 0xE837 0x10407 # DESERET CAPITAL LETTER SHORT E # like an early short o 0xE838 0x10408 # DESERET CAPITAL LETTER SHORT A # like an early short e 0xE839 0x10409 # DESERET CAPITAL LETTER SHORT AH 0xE83A 0x1040A # DESERET CAPITAL LETTER SHORT O # like an early er 0xE83B 0x1040B # DESERET CAPITAL LETTER SHORT OO 0xE83C 0x1040C # DESERET CAPITAL LETTER AY 0xE83D 0x1040D # DESERET CAPITAL LETTER OW 0xE83E 0x1040E # DESERET CAPITAL LETTER WU 0xE83F 0x1040F # DESERET CAPITAL LETTER YEE 0xE840 0x10410 # DESERET CAPITAL LETTER H 0xE841 0x10411 # DESERET CAPITAL LETTER PEE 0xE842 0x10412 # DESERET CAPITAL LETTER BEE 0xE843 0x10413 # DESERET CAPITAL LETTER TEE 0xE844 0x10414 # DESERET CAPITAL LETTER DEE 0xE845 0x10415 # DESERET CAPITAL LETTER CHEE 0xE846 0x10416 # DESERET CAPITAL LETTER JEE 0xE847 0x10417 # DESERET CAPITAL LETTER KAY 0xE848 0x10418 # DESERET CAPITAL LETTER GAY 0xE849 0x10419 # DESERET CAPITAL LETTER EF 0xE84A 0x1041A # DESERET CAPITAL LETTER VEE 0xE84B 0x1041B # DESERET CAPITAL LETTER ETH 0xE84C 0x1041C # DESERET CAPITAL LETTER THEE 0xE84D 0x1041D # DESERET CAPITAL LETTER ES 0xE84E 0x1041E # DESERET CAPITAL LETTER ZEE 0xE84F 0x1041F # DESERET CAPITAL LETTER ESH 0xE850 0x10420 # DESERET CAPITAL LETTER ZHEE 0xE851 0x10421 # DESERET CAPITAL LETTER ER 0xE852 0x10422 # DESERET CAPITAL LETTER EL 0xE853 0x10423 # DESERET CAPITAL LETTER EM 0xE854 0x10424 # DESERET CAPITAL LETTER EN # like an early short a 0xE855 0x10425 # DESERET CAPITAL LETTER ENG # # Non-ConScript Unicode capitals: 0xE856 0x10426 # DESERET CAPITAL LETTER OI # xxo "short ah" + i "short i" 0xE857 0x10427 # DESERET CAPITAL LETTER EW # i "short i" + xu "long oo" # # Non-ConScript non-Unicode capitals: 0xE858 0xE858 # = AdamicBee U+E019 # long oi (o "long ah" + i "short i") 0xE859 0xE859 # = AdamicBee U+E017 # hook shaped ew 0xE85A 0xE85A # = AdamicBee U+E013 # vertical stroke schwa (like an early en) 0xE85B 0xE85B # = AdamicBee U+E023 # left-facing long e (resembling 3, or erroneous Noto Vee) 0xE85C 0xE85C # = AdamicBee U+E021 # alternate ow (xxo with loop) # # Capital phonetic extensions (other vowels that exist in my accent but not in Deseret): # 0xE85D 0xE85D # just use xxo + w 0xE85E 0xE85E # = AdamicBee U+E005 # air (hybrid of e "short e" and r) 0xE85F 0xE85F # ear (hybrid of two "short i") # # ConScript minuscules: 0xE860 0x10428 # DESERET SMALL LETTER LONG I 0xE861 0x10429 # DESERET SMALL LETTER LONG E 0xE862 0x1042A # DESERET SMALL LETTER LONG A 0xE863 0x1042B # DESERET SMALL LETTER LONG AH 0xE864 0x1042C # DESERET SMALL LETTER LONG O 0xE865 0x1042D # DESERET SMALL LETTER LONG OO 0xE866 0x1042E # DESERET SMALL LETTER SHORT I 0xE867 0x1042F # DESERET SMALL LETTER SHORT E 0xE868 0x10430 # DESERET SMALL LETTER SHORT A 0xE869 0x10431 # DESERET SMALL LETTER SHORT AH 0xE86A 0x10432 # DESERET SMALL LETTER SHORT O 0xE86B 0x10433 # DESERET SMALL LETTER SHORT OO 0xE86C 0x10434 # DESERET SMALL LETTER AY 0xE86D 0x10435 # DESERET SMALL LETTER OW 0xE86E 0x10436 # DESERET SMALL LETTER WU 0xE86F 0x10437 # DESERET SMALL LETTER YEE 0xE870 0x10438 # DESERET SMALL LETTER H 0xE871 0x10439 # DESERET SMALL LETTER PEE 0xE872 0x1043A # DESERET SMALL LETTER BEE 0xE873 0x1043B # DESERET SMALL LETTER TEE 0xE874 0x1043C # DESERET SMALL LETTER DEE 0xE875 0x1043D # DESERET SMALL LETTER CHEE 0xE876 0x1043E # DESERET SMALL LETTER JEE 0xE877 0x1043F # DESERET SMALL LETTER KAY 0xE878 0x10440 # DESERET SMALL LETTER GAY 0xE879 0x10441 # DESERET SMALL LETTER EF 0xE87A 0x10442 # DESERET SMALL LETTER VEE 0xE87B 0x10443 # DESERET SMALL LETTER ETH 0xE87C 0x10444 # DESERET SMALL LETTER THEE 0xE87D 0x10445 # DESERET SMALL LETTER ES 0xE87E 0x10446 # DESERET SMALL LETTER ZEE 0xE87F 0x10447 # DESERET SMALL LETTER ESH 0xE880 0x10448 # DESERET SMALL LETTER ZHEE 0xE881 0x10449 # DESERET SMALL LETTER ER 0xE882 0x1044A # DESERET SMALL LETTER EL 0xE883 0x1044B # DESERET SMALL LETTER EM 0xE884 0x1044C # DESERET SMALL LETTER EN 0xE885 0x1044D # DESERET SMALL LETTER ENG # # Non-ConScript Unicode minuscules: 0xE886 0x1044E # DESERET SMALL LETTER OI # xxo "short ah" + i "short i" 0xE887 0x1044F # DESERET SMALL LETTER EW # i "short i" + xu "long oo" # # Non-ConScript non-Unicode minuscules: 0xE888 0xE888 # = AdamicBee U+E018 # long oi (o "long ah" + i "short i") 0xE889 0xE889 # = AdamicBee U+E016 # hook shaped ew 0xE88A 0xE88A # = AdamicBee U+E012 # vertical stroke schwa (like an early en) 0xE88B 0xE88B # = AdamicBee U+E022 # left-facing long e (resembling 3, or erroneous Noto Vee) 0xE88C 0xE88C # = AdamicBee U+E020 # alternate ow (xxo with loop) # # Minuscule phonetic extensions (other vowels that exist in my accent but not in Deseret): # 0xE88D 0xE88D # just use xxo + w 0xE88E 0xE88E # = AdamicBee U+E004 # air (hybrid of e "short e" and r) 0xE88F 0xE88F # ear (hybrid of two "short i")