Real UTF-8 functions #66
Labels
No Label
accepted
bug
clj
documentation
enhancement
errors
infrastructure
later
next
now
optimization
proposal
question
research
semantics
syntax
ux
vm
wontfix
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: twc/ludus#66
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
The Prelude should have some real UTF-8 functions, especially now that we're using UTF-8 under the hood.
We have some things bundled under this (#91), but this is a general review of string manipulation function. Currently we have:
upcase
,downcase
,strip
,words
,sentence
,trim
,split
, andjoin
. We wantchars
.length
will give the number of bytes in a string, not the number of characters.What do we want/need beyond this?
Principles of caution, here:
cf: https://stackoverflow.com/questions/27331819/whats-the-difference-between-a-character-a-code-point-a-glyph-and-a-grapheme, https://news.ycombinator.com/item?id=20054745