Difference between revisions of "ReNamer:Rules:Translit"

From den4b Wiki
Jump to: navigation, search
(automatic case conversion section)
Line 1: Line 1:
[[Image:TranslitRule1.png|center]] This rule transliterates Non-English characters from different languages into their English/Latin representation. For example, the German character '''ü''' can be transliterated to '''ue''' (the name '''Müller '''can be also written as'''Mueller''').  
+
[[Image:TranslitRule1.png|center]]
 +
 
 +
This rule transliterates Non-English characters from different languages into their English/Latin representation. For example, the German character '''ü''' can be transliterated to '''ue''' (the name '''Müller '''can be also written as '''Mueller''').  
  
 
This rule uses ''transliteration maps'' (explained below).  
 
This rule uses ''transliteration maps'' (explained below).  
  
=== Transliteration maps<br>  ===
+
== Transliteration maps ==
  
 
To transliterate, we create a pair of equivalent characters, like this: '''ü=ue'''  
 
To transliterate, we create a pair of equivalent characters, like this: '''ü=ue'''  
Line 23: Line 25:
 
Let us see how to do this.  
 
Let us see how to do this.  
  
=== Using a built-in transliteration map ===
+
== Automatic case conversion ==
 +
 
 +
Translit rule does automatic case conversion with an algorithm adopted specifically for transliteration. Translit rule discard the case on the input, i.e. "A=B" is same as "a=b". Case is decided upon case of the input fragment. Multiple character fragments are treated as part of words, with their case decided based on the case of letters around them.
 +
 
 +
The logic for the case conversion is as follows (ReNamer Beta from 23 Aug 2009):
 +
<pre>
 +
set OUTPUT-PART to lower case
 +
if first letter in INPUT-PART is upper case then
 +
  if length of OUTPUT-PART bigger than 1 then
 +
    if next letter in original name is upper case then
 +
      convert whole OUTPUT-PART to upper case
 +
    else
 +
      convert only first letter in OUTPUT-PART to upper case
 +
  else
 +
    convert whole OUTPUT-PART to upper case
 +
</pre>
 +
 
 +
== Using a built-in transliteration map ==
  
 
To select any of the built-in maps, press the [[Image:TranslitMapsButton.png]] button. A list of available transliteration maps pops up:  
 
To select any of the built-in maps, press the [[Image:TranslitMapsButton.png]] button. A list of available transliteration maps pops up:  
Line 35: Line 54:
 
Note that such editing does not alter the saved version of the map. (The map is edited just for a one-time use. So, if you select the same Translit map again, ReNamer will load the ''original'' version, not the ''edited'' version.) We will see how to edit and save a map [[ReNamer:Rules:Translit#Saving_a_transliteration_map|later]].  
 
Note that such editing does not alter the saved version of the map. (The map is edited just for a one-time use. So, if you select the same Translit map again, ReNamer will load the ''original'' version, not the ''edited'' version.) We will see how to edit and save a map [[ReNamer:Rules:Translit#Saving_a_transliteration_map|later]].  
  
Next, select the rule's parameters as shown below::
+
Next, select the rule's parameters as shown below:  
 
 
<br>
 
  
 
{| class="prettytable"
 
{| class="prettytable"
 
|-
 
|-
| <center>'''Parameter'''</center>
+
! Parameter
| <center>'''Details'''</center>
+
! Details
 
|-
 
|-
 
| forward  
 
| forward  
Line 56: Line 73:
 
Finally, press the [[Image:AddRuleButton.png]] button to add the rule to the stack.  
 
Finally, press the [[Image:AddRuleButton.png]] button to add the rule to the stack.  
  
=== Making your own transliteration map ===
+
== Making your own transliteration map ==
  
 
Click in the '''Translit Alphabet '''window, and start entering the equivalent characters (one transliteration per line).  
 
Click in the '''Translit Alphabet '''window, and start entering the equivalent characters (one transliteration per line).  
Line 72: Line 89:
 
Note that this rule is not saved yet (it was just composed for a one-time use). The following topic shows how to save a map.  
 
Note that this rule is not saved yet (it was just composed for a one-time use). The following topic shows how to save a map.  
  
=== Saving a transliteration map ===
+
== Saving a transliteration map ==
  
 
To save a newly composed Transliteration rule,  
 
To save a newly composed Transliteration rule,  

Revision as of 12:53, 23 August 2009

TranslitRule.png

This rule transliterates Non-English characters from different languages into their English/Latin representation. For example, the German character ü can be transliterated to ue (the name Müller can be also written as Mueller).

This rule uses transliteration maps (explained below).

Transliteration maps

To transliterate, we create a pair of equivalent characters, like this: ü=ue

(Note that the right side of this equation has two characters. Any number of characters may be placed on both sides of the equation.)

We need several such equivalent character pairs to convert one language into another. An entire set is called a transliteration map. (This is really a character-level find-and-replace rule.)

ReNamer has several such built-in maps. Each map is named after a language (the second language in all maps is English).

Each map can be used in both directions (e.g. French-to-English or English-to-French.)

When you start up the Translit Rule, its window does not show any maps. You are free to do any of the following:

  1. Use any of the built-in maps (and use it in forward or reverse direction)
  2. Create your own map and use it.
  3. Edit a built-in map first, and then use it.

Let us see how to do this.

Automatic case conversion

Translit rule does automatic case conversion with an algorithm adopted specifically for transliteration. Translit rule discard the case on the input, i.e. "A=B" is same as "a=b". Case is decided upon case of the input fragment. Multiple character fragments are treated as part of words, with their case decided based on the case of letters around them.

The logic for the case conversion is as follows (ReNamer Beta from 23 Aug 2009):

set OUTPUT-PART to lower case
if first letter in INPUT-PART is upper case then
  if length of OUTPUT-PART bigger than 1 then
    if next letter in original name is upper case then
      convert whole OUTPUT-PART to upper case
    else
      convert only first letter in OUTPUT-PART to upper case
  else
    convert whole OUTPUT-PART to upper case

Using a built-in transliteration map

To select any of the built-in maps, press the TranslitMapsButton.png button. A list of available transliteration maps pops up:

TranslitMenu.png

Click on the desired transliteration map. As an example, let us click on the French (to English) transliteration map.

The Rules window changes immediately to show the French characters and their English equivalent.

TranslitRuleExample.png

You can edit any of the entry in this list, add new entries, or delete any of the entries.

Note that such editing does not alter the saved version of the map. (The map is edited just for a one-time use. So, if you select the same Translit map again, ReNamer will load the original version, not the edited version.) We will see how to edit and save a map later.

Next, select the rule's parameters as shown below:

Parameter Details
forward This is transliteration from-left-to-right direction, as defined in the map.
Backward This is transliteration from-right-to-left direction, as defined in the map.
Skip extension If this check box is unselected, the extension will be included in the rule.

Finally, press the AddRuleButton.png button to add the rule to the stack.

Making your own transliteration map

Click in the Translit Alphabet window, and start entering the equivalent characters (one transliteration per line).

For example,

ü=ue

ö=oe

ß=ss

After entering all such transliterations, press the AddRuleButton.png button to add the rule to the rule-stack.

Note that this rule is not saved yet (it was just composed for a one-time use). The following topic shows how to save a map.

Saving a transliteration map

To save a newly composed Transliteration rule,

  1. Press the TranslitMapsButton.png button.
    A menu pops up.
    TranslitMenu.png
  2. Select the last option (Save Translit...).
    A window pops up, as shown below:
    SaveTranslitMapDialog.png
  3. Enter a new name for the map and press OK. The new map is saved.

The process to save an edited Transliteration map is similar. The only difference is that the Save Translit window (see above) shows the current map's name. You can press OK to save the changes you just made, or enter a new name to create a edited version of the current map.

The new map's name is added to the map list.

From now on, the new map will also be available as "standard".