• 0 Posts
  • 4 Comments
Joined 1 year ago
cake
Cake day: July 8th, 2023

help-circle

  • alvvayson@lemmy.worldtoProgrammer Humor@lemmy.mlWhat a time to be alive
    link
    fedilink
    arrow-up
    128
    arrow-down
    1
    ·
    edit-2
    11 months ago

    It’s a joke.

    UTF-16 already exists, which doesn’t favor Roman characters as much, but UTF-8 is more popular because it is backword compatible with the legacy ASCII.

    UTF-32 also exists which has exactly equal length representation for every character.

    But the thing that equalizes languages is compression.

    Yes, a text written in Cyrillic with UTF-8 will take more space than a Roman language, easily double. However this extra space is much more easily compressed by an algorithm like GZIP.

    So after compression, the two compressed texts will then be similarly sized and much smaller than UTF-16 or UTF-32.



  • Classic forums still exist.

    Voting does allow the cream to rise to the top, which is why reddit was much better than a forum.

    Honestly, I think part of the problem is that companies don’t have an incentive to fight bots or spam: higher numbers of users and engagement make them look better to investors and advertisers.

    I don’t think it’s that difficult of a problem to solve. It should be quite possible to detect patterns between real users and bots.

    We will see how the fediverse handles it.