What is this character?
On 3/24/2023 at 3:59 PM, My_Computer_Is_Trash said:The Tamil vowel sign is this: ெ
This is not the same thing as ௌௌௌௌௌௌௌௌௌௌௌௌௌௌௌௌௌௌௌௌௌௌௌ
Its an artifact of normalization and that language
That language(Tamil) you can normalize a bunch of sounds together into one character. Unicode just has no idea how to do it for that sequence so it puts out each individual part of it. you notice that circle in the middle is its failed attempt to combine them. Its a fall back/fail safe when building unknown sounds to just have it remain expanded.
Simple example to explain what I mean
https://en.wikipedia.org/wiki/Hangul_Jamo_(Unicode_block)
This block is a bunch of sounds and you can normalize them/combine them into the entire Hangul (korean) dictionary.
So like 가 is ACTUALLY two characters in one normalized character of ᄀ and ᅡ and if you ever put those two syllables together they form the initial word as it normalizes it.
Simpler example are accented letters
Á can be made with two Unicode characters, A Normalized with the accent
when normalization fails it will display as A◌́
However because Á is such a common character it just exists. however you can also just put ◌́ on almost any character.
https://unicode.org/reports/tr15/ For more in-depth information, have fun. Learned all this when having an import bug with Hangul and normalization broke
Important part to highlight as you can see what I mean (god have mercy on whoever wrote that webpage jumping between normalization protocols mid sentence constantly)

Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now