Feature #14802 closed
Update Unicode data to Unicode Version 11.0.0
Added by duerst (Martin Dürst) about 7 years ago.
Updated over 6 years ago.
Description
Unicode Version 11.0.0 will be published sometime later this year, probably in late June. This issue tracks updating Ruby to Unicode 11.0.0. Details to follow.
Related to Feature #13685 : Update Unicode data to Unicode Version 10.0.0 added
All power to the emoji. \o/
Blocked by Feature #14839 : How to deal with capitalizing Georgian in Unicode 11.0.0 added
Just a note: the definition of extended grapheme cluster changed in Unicode 11 (Unicode® Standard Annex #29, UNICODE TEXT SEGMENTATION, revision 33: https://www.unicode.org/reports/tr29/tr29-33.html).
This affects Regexp /\X/ which is hardcoded in node_extended_grapheme_cluster() in regparse.c.
Previous definition:
( CRLF
| Prepend*
  ( RI-sequence | Hangul-Syllable | !Control )
  ( Grapheme_Extend | SpacingMark )*
| . )
Definition in Unicode 11:
crlf
| Control
| precore* core postcore*
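A minimal sketch of how the effect of `/\X/` can be observed from Ruby code, assuming a Ruby version that provides `String#grapheme_clusters` (2.5 or later). The helper name `clusters_via_regexp` and the sample strings are illustrative only and are not taken from Ruby's test suite. The samples were picked because each should form a single cluster under both the older and the Unicode 11 definition.

```ruby
# Split a string into extended grapheme clusters using the \X regexp escape.
# (Hypothetical helper name, used here only for illustration.)
def clusters_via_regexp(str)
  str.scan(/\X/)
end

# Sample inputs that are expected to form exactly one cluster each.
samples = {
  "CRLF"                => "\r\n",                  # crlf
  "combining accent"    => "e\u0301",               # core followed by an Extend character
  "regional indicators" => "\u{1F1EF}\u{1F1F5}",    # a pair of regional indicators
  "Hangul jamo"         => "\u1100\u1161\u11A8",    # L V T syllable sequence
}

samples.each do |label, str|
  via_regexp = clusters_via_regexp(str)
  via_string = str.grapheme_clusters
  puts format("%-20s code points: %d, clusters: %d, String#grapheme_clusters agrees: %s",
              label, str.each_char.count, via_regexp.size, via_regexp == via_string)
end
```

Comparing the regexp result with `String#grapheme_clusters` is a cheap way to notice when the two code paths diverge after a data update.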
Blocked by Feature #15182 : Update extended grapheme cluster implementation for Unicode 11 added
Blocked by Feature #15317 : How to deal with obsolete property values in Unicode 11.0.0 added
Blocks Feature #15321 : Update Unicode data to Unicode Version 12.0.0 added
Blocked by Bug #15337 : String#each_grapheme_cluster wrongly splits "\r\n" added
Status changed from Open to Closed
Some hints for future Unicode updates:
Check early whether modifications to algorithms, etc. are necessary.
For tests, these are the main ones:
test/test_unicode_normalize.rb
test/ruby/enc
test/ruby/test_m17n*
test/ruby/test_regexp.rb
test/ruby/test_string*
There are also some specs involved, so make sure to check them, too.
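Before running the full test files listed above, a quick smoke check along the following lines may help confirm which Unicode data a given build uses. This is only a sketch: it assumes `RbConfig::CONFIG["UNICODE_VERSION"]` is populated (it may be `nil` on older or alternative Ruby implementations), and the helper names `unicode_version` and `unicode_smoke_checks` are hypothetical. The checks are deliberately version-independent and do not replace the real tests.

```ruby
require "rbconfig"

# Unicode version the interpreter's data tables were generated from,
# or nil if this build does not record it.
def unicode_version
  RbConfig::CONFIG["UNICODE_VERSION"]
end

# A few version-independent sanity checks touching normalization,
# case mapping, and regexp character properties.
# (Hypothetical helper, not part of Ruby's test suite.)
def unicode_smoke_checks
  {
    nfc_composes:     "e\u0301".unicode_normalize(:nfc) == "\u00E9",
    nfd_decomposes:   "\u00E9".unicode_normalize(:nfd) == "e\u0301",
    upcase_non_ascii: "\u00E9".upcase == "\u00C9",
    property_match:   "\u00E9".match?(/\A\p{Lowercase_Letter}\z/),
  }
end

puts "Unicode version: #{unicode_version.inspect}"
unicode_smoke_checks.each do |name, ok|
  puts "#{name}: #{ok ? 'ok' : 'FAILED'}"
end
```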
Related to Feature #17750 : Update Unicode data to Unicode Version 13.0.0 added