Pseudo Localization: Difference between revisions

From Lingoport Wiki
Jump to navigation Jump to search
Line 18: Line 18:
==== Typical Configuration Example ====
==== Typical Configuration Example ====


        <expansion>
      <expansion>
            <lengths>
        <lengths>
                <length max="10" expand-percentage="200"/>
            <length max="10" expand-percentage="200"/>
                <length max="20" expand-percentage="100"/>
            <length max="20" expand-percentage="100"/>
                <length max="30" expand-percentage="80"/>
            <length max="30" expand-percentage="80"/>
                <length max="50" expand-percentage="60"/>
            <length max="50" expand-percentage="60"/>
                <length max="70" expand-percentage="40"/>
            <length max="70" expand-percentage="40"/>
                <length max=""  expand-percentage="30"/>
            <length max=""  expand-percentage="30"/>
            </lengths>
        </lengths>
         <expansion-char>-</expansion-char>
         <expansion-char>-</expansion-char>
          
          
Line 46: Line 46:
==== No Expansion Configuration Example ====
==== No Expansion Configuration Example ====


        <expansion>
      <expansion>
            <lengths>
        <lengths>
                <length max=""  expand-percentage="0"/>
            <length max=""  expand-percentage="0"/>
            </lengths>
        </lengths>
         <expansion-char></expansion-char>
         <expansion-char></expansion-char>
         <expansion-end-chars></expansion-end-chars>
         <expansion-end-chars></expansion-end-chars>
Line 64: Line 64:
==== End Character Symbols with no Expansion Configuration Example ====
==== End Character Symbols with no Expansion Configuration Example ====


        <expansion>
      <expansion>
            <lengths>
          <lengths>
                <length max=""  expand-percentage="0"/>
            <length max=""  expand-percentage="0"/>
            </lengths>
          </lengths>
        <expansion-char></expansion-char>
          <expansion-char></expansion-char>
        <expansion-end-chars></expansion-end-chars>
          <expansion-end-chars></expansion-end-chars>
       </expansion>
       </expansion>
       <start-char>[</start-char>
       <start-char>[</start-char>
Line 86: Line 86:
==== Spaces for expansion Configuration Example ====
==== Spaces for expansion Configuration Example ====


        <expansion>
      <expansion>
            <lengths>
        <lengths>
                <length max=""  expand-percentage="0"/>
            <length max="10" expand-percentage="200"/>
            </lengths>
            <length max="20" expand-percentage="100"/>
            <length max="30" expand-percentage="80"/>
            <length max="50" expand-percentage="60"/>
            <length max="70" expand-percentage="40"/>
            <length max=""  expand-percentage="30"/>
        </lengths>
         <expansion-char></expansion-char>
         <expansion-char></expansion-char>
         <expansion-end-chars></expansion-end-chars>
          
        <!-- П: A Cyrillic character - (for Russian or ‘R’) -->
        <!-- 國: A common Chinese character (Hanzi, Kanji or Hanja) defined in Unicode BMP or Plane 0 – (for Chinese or ‘C’) -->    
        <!-- カ: A Katakana character (used for Japanese or ‘J’) -->
        <!-- 내: A Hangul character (for Korean or ‘K’) -->
        <expansion-end-chars> П國カ내</expansion-end-chars>
      </expansion>
      <end-char>]</end-char>
      <accent-chars>1</accent-chars>
 
Based on this configuration, an example pseudo-localization would be:
 
Base Resource string: "This is an example of expansion using spaces configuration"
Pseudo-localized string: "[Ţĥîš îš åñ éẋåɱþļé öƒ éẋþåñšîöñ ûšîñĝ šþåçéš çöñƒîĝûŕåţîöñ                П國カ내]"
 
==== Accent Only Configuration Example ====
 
      <expansion>
          <lengths>
            <length max=""  expand-percentage="0"/>
          </lengths>
          <expansion-char></expansion-char>
          <expansion-end-chars></expansion-end-chars>
       </expansion>
       </expansion>
       <start-char>[</start-char>
       <start-char></start-char>
      <!-- П: A Cyrillic character - (for Russian or ‘R’) -->
       <end-char></end-char>
      <!-- 國: A common Chinese character (Hanzi, Kanji or Hanja) defined in Unicode BMP or Plane 0 – (for Chinese or ‘C’) -->    
      <!-- カ: A Katakana character (used for Japanese or ‘J’) -->
      <!-- 내: A Hangul character (for Korean or ‘K’) -->
       <end-char> П國カ내]</end-char>
       <accent-chars>1</accent-chars>
       <accent-chars>1</accent-chars>


Based on this configuration, an example pseudo-localization would be:
Based on this configuration, an example pseudo-localization would be:


  Base Resource string: "This is an example of no expansion configuration"
  Base Resource string: "This is an example of accent only configuration"
  Pseudo-localized string: "[Ţĥîš îš åñ éẋåɱþļé öƒ ñö éẋþåñšîöñ ŵîţĥ éñð çĥåŕåçţéŕš šýɱƀöļš çöñƒîĝûŕåţîöñ П國カ내]"
  Pseudo-localized string: "Ţĥîš îš åñ éẋåɱþļé öƒ åççéñţ öñļý çöñƒîĝûŕåţîöñ"

Revision as of 18:52, 31 July 2018

config_pseudo_loc.xml.xml

File config_pseudo_loc.xml is the configuration file for setting up pseudo localization instructions for LRM resource files. It is located in the global <HOME>/Lingoport_Data/L10nStreamlining/config folder. If you need different criteria for a group or project then the file can be copied and moved to <HOME>/Lingoport_Data/L10nStreamlining/<group>/config or <HOME>/Lingoport_Data/L10nStreamlining/<group>/projects/<project>config respectively.

The config_pseudo_loc.xml file is used when running the --pseudo-loc command. A pseudo-localized file will be created for each base resource file if the project has a designated pseudo-locale.

There are 2 categories of information contained in the config_pseudo_loc.xml file.

  • pseudo localization instructions
  • regex pattern for each parser type defining the parameters as well as special characters that should not be pseudo-localized.

Pseudo Localization Instructions

The configuration consists of the following xml elements:

  • expansion lengths - defines the expansion percentage based on a strings length. This is useful to test language expansions.
  • expansion character - the character that will be used to pad the string in order to simulate string expansion
  • expansion end character - characters at the end of expansion. Typically, these are symbols across a range of codepoints in order to ensure that content from the various code pages are supported
  • start and end characters - characters at the start and end of the resource string. This is useful to catch resource strings that are concatenated within the code.
  • accents - a flag indicating whether the resource string should be accented. This is needed to ensure that all visible user strings are localized.

Typical Configuration Example

      <expansion>
        <lengths>
           <length max="10" expand-percentage="200"/>
           <length max="20" expand-percentage="100"/>
           <length max="30" expand-percentage="80"/>
           <length max="50" expand-percentage="60"/>
           <length max="70" expand-percentage="40"/>
           <length max=""   expand-percentage="30"/>
        </lengths>
        <expansion-char>-</expansion-char>
        
        <!-- П: A Cyrillic character - (for Russian or ‘R’) -->
        <!-- 國: A common Chinese character (Hanzi, Kanji or Hanja) defined in Unicode BMP or Plane 0 – (for Chinese or ‘C’) -->	    
        <!-- カ: A Katakana character (used for Japanese or ‘J’) -->
        <!-- 내: A Hangul character (for Korean or ‘K’) -->
        <expansion-end-chars> П國カ내</expansion-end-chars>
      </expansion>
      <start-char>[</start-char>
      <end-char>]</end-char>
      <accent-chars>1</accent-chars>

Based on this configuration, an example pseudo-localization would be:

Base Resource string: "This is an example using the typical configuration"
Pseudo-localized string: "[Ţĥîš îš åñ éẋåɱþļé ûšîñĝ ţĥé ţýþîçåļ çöñƒîĝûŕåţîöñ------------------------П國カ내]"

No Expansion Configuration Example

      <expansion>
        <lengths>
           <length max=""   expand-percentage="0"/>
        </lengths>
        <expansion-char></expansion-char>
        <expansion-end-chars></expansion-end-chars>
      </expansion>
      <start-char>[</start-char>
      <end-char>]</end-char>
      <accent-chars>1</accent-chars>

Based on this configuration, an example pseudo-localization would be:

Base Resource string: "This is an example of no expansion configuration"
Pseudo-localized string: "[Ţĥîš îš åñ éẋåɱþļé öƒ ñö éẋþåñšîöñ çöñƒîĝûŕåţîöñ]"

End Character Symbols with no Expansion Configuration Example

      <expansion>
         <lengths>
            <length max=""   expand-percentage="0"/>
         </lengths>
         <expansion-char></expansion-char>
         <expansion-end-chars></expansion-end-chars>
      </expansion>
      <start-char>[</start-char>
      <!-- П: A Cyrillic character - (for Russian or ‘R’) -->
      <!-- 國: A common Chinese character (Hanzi, Kanji or Hanja) defined in Unicode BMP or Plane 0 – (for Chinese or ‘C’) -->	    
      <!-- カ: A Katakana character (used for Japanese or ‘J’) -->
      <!-- 내: A Hangul character (for Korean or ‘K’) -->
      <end-char> П國カ내]</end-char>
      <accent-chars>1</accent-chars>

Based on this configuration, an example pseudo-localization would be:

Base Resource string: "This is an example of no expansion configuration"
Pseudo-localized string: "[Ţĥîš îš åñ éẋåɱþļé öƒ ñö éẋþåñšîöñ ŵîţĥ éñð çĥåŕåçţéŕš šýɱƀöļš çöñƒîĝûŕåţîöñ П國カ내]"

Spaces for expansion Configuration Example

      <expansion>
        <lengths>
           <length max="10" expand-percentage="200"/>
           <length max="20" expand-percentage="100"/>
           <length max="30" expand-percentage="80"/>
           <length max="50" expand-percentage="60"/>
           <length max="70" expand-percentage="40"/>
           <length max=""   expand-percentage="30"/>
        </lengths>
        <expansion-char></expansion-char>
        
        <!-- П: A Cyrillic character - (for Russian or ‘R’) -->
        <!-- 國: A common Chinese character (Hanzi, Kanji or Hanja) defined in Unicode BMP or Plane 0 – (for Chinese or ‘C’) -->	    
        <!-- カ: A Katakana character (used for Japanese or ‘J’) -->
        <!-- 내: A Hangul character (for Korean or ‘K’) -->
        <expansion-end-chars> П國カ내</expansion-end-chars>
      </expansion>
      <end-char>]</end-char>
      <accent-chars>1</accent-chars>

Based on this configuration, an example pseudo-localization would be:

Base Resource string: "This is an example of expansion using spaces configuration"
Pseudo-localized string: "[Ţĥîš îš åñ éẋåɱþļé öƒ éẋþåñšîöñ ûšîñĝ šþåçéš çöñƒîĝûŕåţîöñ                 П國カ내]"

Accent Only Configuration Example

      <expansion>
         <lengths>
            <length max=""   expand-percentage="0"/>
         </lengths>
         <expansion-char></expansion-char>
         <expansion-end-chars></expansion-end-chars>
      </expansion>
      <start-char></start-char>
      <end-char></end-char>
      <accent-chars>1</accent-chars>

Based on this configuration, an example pseudo-localization would be:

Base Resource string: "This is an example of accent only configuration"
Pseudo-localized string: "Ţĥîš îš åñ éẋåɱþļé öƒ åççéñţ öñļý çöñƒîĝûŕåţîöñ"