BrianHicks/elm-string-graphemes 1.0.0

brian · July 5, 2019, 5:39am

Hello all! I’ve just released a new library: BrianHicks/elm-string-graphemes. It does everything String does, except it operates on graphemes instead of bytes or characters. Observe:

import String.Graphemes

String.toList "🦸🏽‍♂️" --> [ '🦸', '🏽', '\u{200D}', '♂', '\u{FE0F}' ]

String.Graphemes.toList "🦸🏽‍♂️" --> [ "🦸🏽‍♂️" ]

Check it out at https://package.elm-lang.org/packages/BrianHicks/elm-string-graphemes/latest/. In particular, I’ve included a primer in the README on why this library is necessary, in case you haven’t worked a lot with different levels of text (e.g. the emoji above is one grapheme, but five characters and 17 bytes. If that doesn’t make sense yet, go read it!)
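To make those levels concrete, here is a minimal sketch that counts the same string at each level. It assumes String.Graphemes.length mirrors String.length (as "does everything String does" suggests) and that elm/bytes is installed for the UTF-8 width. The module name GraphemeLevels and the levels function are made up for illustration and are not part of the library.

```elm
module GraphemeLevels exposing (levels)

import Bytes.Encode
import String.Graphemes


{-| Count one string at four levels of text.

`String.Graphemes.length` is assumed to exist with the same shape as
`String.length`. Note that Elm's `String.length` counts UTF-16 code units,
which is why it disagrees with the length of `String.toList`.

-}
levels : String -> { graphemes : Int, codePoints : Int, utf16Units : Int, utf8Bytes : Int }
levels text =
    { graphemes = String.Graphemes.length text
    , codePoints = List.length (String.toList text)
    , utf16Units = String.length text
    , utf8Bytes = Bytes.Encode.getStringWidth text
    }
```

For the emoji above, this should report 1 grapheme, 5 code points (the five Chars that String.toList returns), 7 UTF-16 code units, and 17 UTF-8 bytes (4 + 4 + 3 + 3 + 3).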

If you find any issues with the grapheme segmentation (e.g. places where it breaks improperly), please open an issue! I would also love it if we could get the parser to go even faster. I already took it from 0.1% of String.toList’s speed to between 1% and 2%, but can we get higher? Probably!
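If you want to measure the gap yourself, something like the following could be a starting point. It is only a sketch using elm-explorations/benchmark, not the benchmark used for the numbers above. The module name GraphemeBenchmark and the sample string are arbitrary choices for illustration.

```elm
module GraphemeBenchmark exposing (main)

import Benchmark exposing (Benchmark)
import Benchmark.Runner exposing (BenchmarkProgram, program)
import String.Graphemes


{-| Hypothetical input: a mix of a multi-code-point emoji and plain ASCII. -}
sample : String
sample =
    String.repeat 100 "🦸🏽‍♂️ hello "


{-| Run both functions on the same input so the runner can report
their relative speed.
-}
suite : Benchmark
suite =
    Benchmark.compare "toList"
        "String.toList"
        (\_ -> String.toList sample)
        "String.Graphemes.toList"
        (\_ -> String.Graphemes.toList sample)


main : BenchmarkProgram
main =
    program suite
```

Compiling this with elm make and opening the result in a browser should show runs per second for each side. Results will vary with the input, so it is worth trying text that looks like your real data.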

