Unicode::Normalize: a Perl module that provides the Unicode Normalization Forms.
Unicode::Normalize Ranking & Summary
- License:
- Perl Artistic License
- Price:
- FREE
- Publisher Name:
- SADAHIRO Tomoyuki
- Publisher web site:
- http://search.cpan.org/~sadahiro/ShiftJIS-String-1.04/String.pod
Unicode::Normalize Description
The Unicode::Normalize Perl module provides the Unicode Normalization Forms.

SYNOPSIS

(1) using function names exported by default:

    use Unicode::Normalize;

    $NFD_string  = NFD($string);   # Normalization Form D
    $NFC_string  = NFC($string);   # Normalization Form C
    $NFKD_string = NFKD($string);  # Normalization Form KD
    $NFKC_string = NFKC($string);  # Normalization Form KC

(2) using function names exported on request:

    use Unicode::Normalize 'normalize';

    $NFD_string  = normalize('D',  $string);  # Normalization Form D
    $NFC_string  = normalize('C',  $string);  # Normalization Form C
    $NFKD_string = normalize('KD', $string);  # Normalization Form KD
    $NFKC_string = normalize('KC', $string);  # Normalization Form KC

Parameters:

$string is used as a string under character semantics (see perlunicode).

$code_point should be an unsigned integer representing a Unicode code point.

Note: XSUB and pure Perl interpret $code_point differently when it is given as a decimal number. XSUB converts $code_point to an unsigned integer, but pure Perl does not. Do not use a floating-point number or a negative sign in $code_point.

Normalization Forms

$NFD_string = NFD($string)
    Returns the Normalization Form D (formed by canonical decomposition).

$NFC_string = NFC($string)
    Returns the Normalization Form C (formed by canonical decomposition followed by canonical composition).

$NFKD_string = NFKD($string)
    Returns the Normalization Form KD (formed by compatibility decomposition).

$NFKC_string = NFKC($string)
    Returns the Normalization Form KC (formed by compatibility decomposition followed by canonical composition).

$FCD_string = FCD($string)
    If the given string is in FCD ("Fast C or D" form; cf. UTN #5), returns the string without modification; otherwise returns an FCD string.
    Note: FCD is not always unique, so several FCD forms may be equivalent to each other. FCD() will return one of these equivalent forms.

$FCC_string = FCC($string)
    Returns the FCC form ("Fast C Contiguous"; cf. UTN #5).
    Note: FCC is unique, as are the four normalization forms (NF*).

$normalized_string = normalize($form_name, $string)
    Returns the normalization form of $form_name. One of the following names must be given as $form_name:

    'C'  or 'NFC'   for Normalization Form C (UAX #15)
    'D'  or 'NFD'   for Normalization Form D (UAX #15)
    'KC' or 'NFKC'  for Normalization Form KC (UAX #15)
    'KD' or 'NFKD'  for Normalization Form KD (UAX #15)
    'FCD'           for "Fast C or D" Form (UTN #5)
    'FCC'           for "Fast C Contiguous" (UTN #5)

Requirements:
· Perl
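As an illustration of the functions described above, here is a minimal sketch of how canonical and compatibility normalization differ in practice. The sample strings (an accented "e" and the "fi" ligature) and the helper name code_points are assumptions chosen for this example and do not come from the module's documentation.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD NFC NFKD NFKC normalize);

# Hypothetical helper: list the code points of a string as "U+XXXX" tokens,
# so nothing outside ASCII has to be printed to the terminal.
sub code_points {
    my ($string) = @_;
    return join ' ', map { sprintf 'U+%04X', ord } split //, $string;
}

# "e" followed by U+0301 COMBINING ACUTE ACCENT (decomposed sequence)
my $decomposed = "e\x{0301}";
# U+00E9 LATIN SMALL LETTER E WITH ACUTE (precomposed character)
my $composed   = "\x{00E9}";

# NFC composes the pair into one code point; NFD splits it back into two.
my $nfc = NFC($decomposed);
my $nfd = NFD($composed);

# U+FB01 LATIN SMALL LIGATURE FI has only a compatibility decomposition,
# so NFD leaves it unchanged while NFKD expands it to the letters "fi".
my $ligature = "\x{FB01}";
my $nfd_lig  = NFD($ligature);
my $nfkd_lig = NFKD($ligature);

# normalize() takes the form name as its first argument and should agree
# with the corresponding named function.
my $same = normalize('C', $decomposed) eq NFC($decomposed);

print 'NFC:  ', code_points($nfc),      "\n";
print 'NFD:  ', code_points($nfd),      "\n";
print 'NFKD: ', code_points($nfkd_lig), "\n";
print 'normalize("C") matches NFC: ', ($same ? 'yes' : 'no'), "\n";
```

Comparing strings only after passing both through the same form (for example NFC) is the usual reason to reach for these functions: two strings that display identically can otherwise fail an eq test.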