Ethiopic Character Classes

From Regexp::Ethiopic

Equivalence of Family (row) and Equivalence of Form (column)

  [#1#] [#2#] [#3#] [#4#] [#5#] [#6#] [#7#] [#8#] [#9#] [#10#] [#11#] [#12#]
[#ሀ#]          
[#ለ#]        
[#ሐ#]        
[#መ#]        
[#ሠ#]        
[#ረ#]        
[#ሰ#]        
[#ሸ#]        
[#ቀ#]
[#ቐ#]
[#በ#]        
[#ቨ#]        
[#ተ#]        
[#ቸ#]        
[#ኀ#]
[#ነ#]        
[#ኘ#]        
[#አ#]        
[#ከ#]
[#ኸ#]
[#ወ#]          
[#ዐ#]          
[#ዘ#]        
[#ዠ#]        
[#የ#]          
[#ደ#]        
[#ዸ#]        
[#ጀ#]        
[#ገ#]
[#ጘ#]          
[#ጠ#]        
[#ጨ#]        
[#ጰ#]        
[#ጸ#]        
[#ፀ#]          
[#ፈ#]        
[#ፐ#]        

 

Character Class Aliases

Alias Equivalence of Form
[:ግዕዝ:]
[:geez:]
[#1#]
[:ኃምስ:]
[:hamis:]
[#2#]
[:ካዕብ:]
[:kaib:]
[#3#]
[:ራብዕ:]
[:rabi:]
[#4#]
[:ሳብዕ:]
[:sabi:]
[#5#]
[:ሳድስ:]
[:sadis:]
[#6#]
[:ሣልስ:]
[:salis:]
[#7#]
[:ዘመደ:ግዕዝ:]
[:zemede:geez:]
[#8#]
[:ዘመደ:ካዕብ:]
[:zemede:kaib:]
[#9#]
[:ዘመደ:ሣልስ:]
[:zemede:salis:]
[#10#]
[:ዘመደ:ራብዕ:]
[:zemede:rabi:]
[#11#]
[:ዘመደ:ኃምስ:]
[:zemede:hamis:]
[#12#]
 
Other Class Expansion
[:አኃዝ:]
[:ahaz:]
[፩-፼]

 


From Regexp::Ethiopic::Amharic

Equivalence in Phono-Orthography

Class Perl RE Expansion
[=ሀ=]
[=ሐ=]
[=ሃ=]
[=ሓ=]
[=ኀ=]
[=ኃ=]
[=ኻ=]
[ሀሃሐሓኀኃኻ]
[=ሁ=]
[=ሑ=]
[=ኁ=]
[=ኹ=]
[ሁሑኁኹ]
[=ሂ=]
[=ሒ=]
[=ኂ=]
[=ኺ=]
[ሂሒኂኺ]
[=ሄ=]
[=ሔ=]
[=ኄ=]
[=ኼ=]
[ሄሔኄኼ]
[=ህ=]
[=ሕ=]
[=ኅ=]
[=ኽ=]
[ህሕኅኽ]
[=ሆ=]
[=ሖ=]
[=ኆ=]
[=ኈ=]
[=ኾ=]
[=ዀ=]
[ሆሖኆኈኾዀ]
[=ኋ=]
[=ሗ=]
[=ዃ=]
[ሗኋዃ]
[=ሰ=]
[=ሠ=]
[ሰሠ]
[=ሱ=]
[=ሡ=]
[ሱሡ]
[=ሲ=]
[=ሢ=]
[ሲሢ]
[=ሳ=]
[=ሣ=]
[ሳሣ]
[=ሴ=]
[=ሤ=]
[ሴሤ]
[=ስ=]
[=ሥ=]
[ስሥ]
[=ሶ=]
[=ሦ=]
[ሶሦ]
[=ሷ=]
[=ሧ=]
[ሷሧ]
[=ቁ=]
[=ቍ=]
[ቁቍ]
[=ቆ=]
[=ቈ=]
[ቆቈ]
[=አ=]
[=ዐ=]
[=ኣ=]
[=ዓ=]
[አኣዐዓ]
[=ኡ=]
[=ዑ=]
[ኡዑ]
[=ኢ=]
[=ዒ=]
[ኢዒ]
[=ኤ=]
[=ዔ=]
[ኤዔ]
[=እ=]
[=ዕ=]
[እዕ]
[=ኦ=]
[=ዖ=]
[ኦዖ]
[=ኮ=]
[=ኰ=]
[ኮኰ]
[=ጎ=]
[=ጐ=]
[ጎጐ]
[=ጸ=]
[=ፀ=]
[ጸፀ]
[=ጹ=]
[=ፁ=]
[ጹፁ]
[=ጺ=]
[=ፂ=]
[ጺፂ]
[=ጻ=]
[=ፃ=]
[ጻፃ]
[=ጼ=]
[=ፄ=]
[ጼፄ]
[=ጽ=]
[=ፅ=]
[ጽፅ]
[=ጾ=]
[=ፆ=]
[ጾፆ]

Equivalence of Families

Class Perl RE Expansion
[=#ሀ#=]
[=#ሐ#=]
[=#ኀ#=]
[=#ኸ#=]
[ሀ-ሆሐ-ሗኀ-ኆኈ-ኍኸ-ኾዀ-ዅ]
[=#ሰ#=]
[=#ሠ#=]
[ሰ-ሷሠ-ሧ]
[=#አ#=]
[=#ዐ#=]
[አ-ኧዐ-ዖ]
[=#ጸ#=]
[=#ፀ#=]
[ጸ-ጿፀ-ፆ]

 


Notes on Notation

The initial philosophy applied to syllabic character class development was to stick with existing POSIX definitions and notation ([=x=], [:x:], etc) and simply apply them in the context of a syllabary. Shoe-horning syllabic classes into POSIX norms has proven at times to be both awkward and confusing. As this package is experimental, a clean break is made at this time from previously proposed notations and class symbols are used that appear to be intuitive and easy to type.

In large part, a complication for working with Ethiopic character classes easily has been the difference between greater number of Ethiopic classes and available (while somewhat applicable) POSIX abstractions. There are four types of character equivalence that are of interest in Ethiopic regular expressions:

The syllable x is:

The choice of # has been made at this time for no other reason than that symbol itself looks like the grid that the syllables are invariably presented in. The interpretation of the character between #s is made by the character's context as either a letter or numeral. This may prove to be a good neumanic device.

Notation is of course subject to change. Putting theory into practice (and code!) and experimentation is the only real way to shake out the wilburries. Once settled GUS notation will be updated accordingly and I'll get back to IPA based pattern matching with GUS (aka "folding script"). Honest, for real this time... ;-)


References