From: "Martin J. Dürst"
Date: 2012-11-05T16:57:46+09:00
Subject: [ruby-core:48891] Re: [ruby-trunk - Bug #7267] Dir.glob on Mac OS X returns unexpected string encodings for unicode file names

On 2012/11/03 3:54, meta (mathew murphy) wrote:
>
> Issue #7267 has been updated by meta (mathew murphy).
>
>
> Relevant links:
>
> http://search.cpan.org/~tomita/Encode-UTF8Mac-0.03/lib/Encode/UTF8Mac.pm
>
> Seems to me Ruby should pick one of the standard normalization forms for all UTF-8 data, and convert when necessary.

That wouldn't work because we want Ruby to be able to work on different
normalizations (because there is different data out there, or data in
different forms has to be produced).

Regards,   Martin.

> Apparently there are OS X library calls to assist:
> http://developer.apple.com/library/mac/qa/qa1235/_index.html

There are quite a few implementations of these in pure Ruby, too. That's
not the problem. The problem is figuring out where and when to apply them.

Regards,   Martin.
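
To illustrate the "where and when" question, here is a minimal sketch of normalizing only at the point of comparison, leaving the stored strings in whatever form they arrived in. It assumes a Ruby version that provides String#unicode_normalize (2.2 or later, so not available when this thread was written). The helper name same_name? and the sample strings are hypothetical, chosen to mimic a precomposed name typed by a user and a decomposed name as a file system might return it.

```ruby
# Two spellings of the same visible text "café":
nfc = "caf\u00E9"    # precomposed: U+00E9 (LATIN SMALL LETTER E WITH ACUTE)
nfd = "cafe\u0301"   # decomposed: "e" followed by U+0301 (COMBINING ACUTE ACCENT)

# Hypothetical helper: compare two names in a common form (NFC here)
# without changing either input string.
def same_name?(a, b)
  a.unicode_normalize(:nfc) == b.unicode_normalize(:nfc)
end

puts nfc == nfd             # byte-wise comparison sees two different strings
puts same_name?(nfc, nfd)   # comparison after normalization treats them as equal
puts nfc.bytes.length       # the two forms differ in length as well
puts nfd.bytes.length
```

Normalizing at comparison time, rather than on every string that enters the program, keeps data in other normalization forms intact, which is the concern raised above.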