Avec @MoritzBrouhaha, découvrez l'histoire du standard informatique Unicode, utilisé par tout le monde à travers le globe dans nos communications quotidiennes.
https://www.paris-web.fr/2025/conference/a-la-decouverte-du-monde-au-travers-de-lunicode
🜰^ᯣ⥿ᯣ^🜰
there is an #Unicode proposal to make the cat paws bigger
https://www.unicode.org/L2/L2025/25125r-alchemical-glyphs.pdf
The recycling symbol in a git branch name, what a time to be alive
Also, nice of #github to warn about possibly hidden characters, but not sure it applies in this case
Got a bug report for @novelwriter from someone who uses Cuneiform text in their work. These are 4 byte Unicode symbols, and turned out to be very tricky to handle.
The app is built with Python, which will switch a string to UCS-4 when it contains such characters, so the characters always have a single index in the string.
However, the Qt library uses UTF-16. That means 4-byte characters use two slots, creating a mismatch in indices between the two representations.
Interesting to see letters like ,
, and
proposed for inclusion in Unicode!
Did you know that new #Emoji can be proposed by anyone, simply by following some guidelines laid out by the #Unicode consortium? There's a time window each year where they accept proposals, and a select few might make it into future sets.
This year I turned one in: "Circuit Board", which I was surprised to find 1. didn't exist and 2. had not been proposed before (though CPU and Microchip have both been submitted and declined in the last 5 years)
You can read my proposal here:
https://storage.googleapis.com/greg-kennedy.com/Proposal%20for%20Emoji%20%E2%80%9CCircuit%20Board%E2%80%9D.pdf
and you can see the Unicode emoji proposal guidelines here:
https://www.unicode.org/emoji/proposals.html
Anyway, the odds aren't great of getting accepted, but if it IS then you can say "hey! I know the guy who submitted that one!"
Attached are the sample images I drew up for the proposal - which, incidentally, are now Public Domain as well. Enjoy!
The UNICODE OF THE YEAR is U+2025 TWO DOT LEADER, ‥ ‥ ‥ ‥
#Unicode
TIL that in #Unicode, U+23BE through U+23CC are a series of symbols dedicated to the notation (?!) of dentistry
#TIL #Unicode 有计划在第三平面收录小篆、甲骨文、金文等古代文种。 https://www.unicode.org/roadmaps/tip/
In there a #Unicode way to represent 100 in a single fullwidth character?
The macOS Character Viewer previews all Unicode space characters as a blank space, so finding any particular space can be a bit of a chore. Here's me trying to find PUNCTUATION SPACE.
The iOS 18.5 SDK finally came out and the only change for Unicode coverage is the *removal* of a bunch of Sinhala codepoints:
ඁ෦෧෨෩෪෫෬෭෮෯𑇡𑇢𑇣𑇤𑇥𑇦𑇧𑇨𑇩𑇪𑇫𑇬𑇭𑇮𑇯𑇰𑇱𑇲𑇳𑇴
(Those of you on iOS 18.4: Enjoy seeing those glyphs while you can!)
I was doing some particular work on spaces for my font-in-progress, and as a result i extended the research on the 5 year old draft of my Unicode Spaces article and published it: https://home.octetfont.com/blog/unicode-space.html
#Unicode
(i now realise i should've made a click-baity title: Unicode’s top-ten space codepoints, you won’t believe number 7!)
It sucks that #Linux tech is fragmented in terms of fonts. Like, the most sensible behavior for fonts is using one main fonts and falling back to other fonts when a given #Unicode codepoint is not found in the main one. But! my window manager uses raw #X11 APIs for text output, and these are primitive enough to just display ??? when they encounter an unfamiliar glyph. Which is... too often, because I have a lot of #Russian and a bit of #Armenian in the stuff I window manage. So I have to guess what all these question marks mean. Not cool.
nonsense machines, derisive snark
I just found out that #Unicode has segment-display digit characters. The below screenshot is all in one font (#JuliaMono). The characters are U+1FBF0 to U+1FBF9. Unicode is gorgeous