From Fedora Project Wiki
 
(12 intermediate revisions by 3 users not shown)
Line 1: Line 1:
{{admon/important | Comments and Explanations | The page source contains comments providing guidance to fill out each section. They are invisible when viewing this page. To read it, choose the "edit" link.<br/> '''Copy the source to a ''new page'' before making changes!  DO NOT EDIT THIS TEMPLATE FOR YOUR CHANGE PROPOSAL.'''}}
<!-- {{admon/important | Comments and Explanations | The page source contains comments providing guidance to fill out each section. They are invisible when viewing this page. To read it, choose the "edit" link.<br/> '''Copy the source to a ''new page'' before making changes!  DO NOT EDIT THIS TEMPLATE FOR YOUR CHANGE PROPOSAL.'''}}-->


<!-- Self Contained or System Wide Change Proposal?
<!-- Self Contained or System Wide Change Proposal?
Line 25: Line 25:


== Unicode 8.0 support ==
== Unicode 8.0 support ==
We are updating Glibc Unicode data from Unicode 5.1 to Unicode 7.0 version. It took a long time, since there was not much documentation on how to update Unicode data and also there was the chance of loosing backward compatibility. Most of the issues are resolved now and patches are ready for inclusion. This update adds around 8000 supported characters in Glibc, and also corrects the Unicode data of many characters per the latest Unicode standard.  
[http://blog.unicode.org/2015/06/announcing-unicode-standard-version-80.html Unicode 8.0] got released on 17th June 2015. It includes 41 new emoji characters (including five modifiers for diversity), 5,771 new ideographs for Chinese, Japanese, and Korean, the new Georgian lari currency symbol, and 86 lowercase Cherokee syllables. It also adds letters to existing scripts to support Arwi (the Tamil language written in the Arabic script), the Ik language in Uganda, Kulango in the Côte d’Ivoire, and other languages of Africa. In total, this version adds 7,716 new characters and six new scripts.


== Owner ==
== Owner ==
Line 43: Line 43:
== Current status ==
== Current status ==
* Targeted release: [[Releases/23 | Fedora 23 ]]  
* Targeted release: [[Releases/23 | Fedora 23 ]]  
* Last updated: 22 Jun 2015
* Last updated: 24 Jul 2015
<!-- After the change proposal is accepted by FESCo, tracking bug is created in Bugzilla and linked to this page  
<!-- After the change proposal is accepted by FESCo, tracking bug is created in Bugzilla and linked to this page  
Bugzilla states meaning as usual:
Bugzilla states meaning as usual:
Line 52: Line 52:
CLOSED as NEXTRELEASE -> change is completed and verified and will be delivered in next release under development
CLOSED as NEXTRELEASE -> change is completed and verified and will be delivered in next release under development
-->
-->
* Tracker bug: <will be assigned by the Wrangler>
* Tracker bug: [https://bugzilla.redhat.com/show_bug.cgi?id=1238412 Bug 1238412]
 


== Detailed Description ==
== Detailed Description ==
 
We are upgrading core libraries in Fedora for Unicode 8.0
[http://blog.unicode.org/2015/06/announcing-unicode-standard-version-80.html Unicode 8.0] got release on 17th June 2015.  It includes 41 new emoji characters (including five modifiers for diversity), 5,771 new ideographs for Chinese, Japanese, and Korean, the new Georgian lari currency symbol, and 86 lowercase Cherokee syllables. It also adds letters to existing scripts to support Arwi (the Tamil language written in the Arabic script), the Ik language in Uganda, Kulango in the Côte d’Ivoire, and other languages of Africa. In total, this version adds 7,716 new characters and six new scripts.
* Updating Glibc localedata to Unicode 8.0 [https://sourceware.org/bugzilla/show_bug.cgi?id=18568 Bugzilla]
 
* Updating Lib ICU: [http://site.icu-project.org/download/56m1 Planned for ICU 56 release] '''''Dropping Lib ICU updates since upstream release is just before final freeze. Bit risky to update in last phase late.'''''
To get support for these characters in Fedora we will need updates to core libraries Glibc and Lib ICU. We are planning to update our core libraries Glibc and Lib ICU with the help of upstream.
* [http://www.gnu.org/software/libunistring/ libunistring] - This portable C library implements Unicode string types in three flavours: (UTF-8, UTF-16, UTF-32), together with functions for character processing (names, classifications, properties) and functions for string processing (iteration, formatted output, width, word breaks, line breaks, normalization, case folding and regular expressions). [https://www.mail-archive.com/bug-gnulib@gnu.org/msg31574.html Patch]
 
* Updating unicode-ucd to Unicode 8.0.
"Update locale data to Unicode 8.0" https://sourceware.org/bugzilla/show_bug.cgi?id=18568


== Benefit to Fedora ==
== Benefit to Fedora ==
Line 126: Line 124:


<!-- REQUIRED FOR SYSTEM WIDE CHANGES -->
<!-- REQUIRED FOR SYSTEM WIDE CHANGES -->
N/A (not a System Wide Change)
Will provide it later.


== Release Notes ==
== Release Notes ==
Line 135: Line 133:
-->
-->


[[Category:ChangePageIncomplete]]
[[Category:ChangeAcceptedF23]]
<!-- When your change proposal page is completed and ready for review and announcement -->
<!-- When your change proposal page is completed and ready for review and announcement -->
<!-- remove Category:ChangePageIncomplete and change it to Category:ChangeReadyForWrangler -->
<!-- remove Category:ChangePageIncomplete and change it to Category:ChangeReadyForWrangler -->
Line 142: Line 140:


<!-- Select proper category, default is Self Contained Change -->
<!-- Select proper category, default is Self Contained Change -->
[[Category:SelfContainedChange]]
<!--  [[Category:SelfContainedChange]] -->
<!-- [[Category:SystemWideChange]] -->
[[Category:SystemWideChange]]

Latest revision as of 08:20, 24 July 2015



Unicode 8.0 support

Unicode 8.0 got released on 17th June 2015. It includes 41 new emoji characters (including five modifiers for diversity), 5,771 new ideographs for Chinese, Japanese, and Korean, the new Georgian lari currency symbol, and 86 lowercase Cherokee syllables. It also adds letters to existing scripts to support Arwi (the Tamil language written in the Arabic script), the Ik language in Uganda, Kulango in the Côte d’Ivoire, and other languages of Africa. In total, this version adds 7,716 new characters and six new scripts.

Owner

Current status

Detailed Description

We are upgrading core libraries in Fedora for Unicode 8.0

  • Updating Glibc localedata to Unicode 8.0 Bugzilla
  • Updating Lib ICU: Planned for ICU 56 release Dropping Lib ICU updates since upstream release is just before final freeze. Bit risky to update in last phase late.
  • libunistring - This portable C library implements Unicode string types in three flavours: (UTF-8, UTF-16, UTF-32), together with functions for character processing (names, classifications, properties) and functions for string processing (iteration, formatted output, width, word breaks, line breaks, normalization, case folding and regular expressions). Patch
  • Updating unicode-ucd to Unicode 8.0.

Benefit to Fedora

With this change, users and developers of Fedora will get Unicode 8.0 support through core libraries. Users will get an updated Unicode data from locales. No new functionality is added. Fedora is the leading distribution when it comes to internationalization. By including this change, Fedora users and developers will get the Unicode 8.0 support exactly on the time.

Scope

  • Proposal owners: Work with upstream and file bugs and provide patches where required.
  • Other developers: This change will impact glibc, ICU and all applications that uses these libraries. Other Developers do not need to make any changes from their end, but they need to watch how their application behaves with improved localedata. We need proper testing to see that it does not break any application.


  • Release engineering: No work required from Release engineering.
  • Policies and guidelines: No, this change does not required any updates to Policies or packaging guideline updates.
  • Trademark approval: N/A (not needed for this Change)

Upgrade/compatibility impact

Upgrade will be smooth. Users will get exact things with updated Unicode data.

N/A (not a System Wide Change)

How To Test

  • Glibc includes extensive test-case coverage to test localedata changes.
  • This change is affecting Unicode characters, so users will notice little effect on rendering if any.
  • Glibc is used by rendering engine for determining the type of characters, so again observe rendering and report if any issue.
  • Document section provide detailed report regarding change.


User Experience

Users and Developers will get support for Unicode standard 8.0 through locales.


Dependencies

  • Upstream release schedule.
  • If our patches does not come in upstream, we will try to patch it in Fedora.


Contingency Plan

  • Contingency mechanism: (What to do? Who will do it?) Will drop patches from icu and glibc.
  • Contingency deadline: Before F23 Beta release eg. Beta freeze.
  • Blocks release? No
  • Blocks product? product No

Documentation

Will provide it later.

Release Notes