#1 2009-08-08 13:50

Stefan
Moderator
From: Germany, EU
Registered: 2007-10-23
Posts: 777

Translit Rule Case sensitive

5.50

I try to find uppercase letters only
and change them to add an leading space:

From:
zzzzTestDateiMitGroßBuchstaben.txt

To:
zzzz Test Datei Mit Groß Buchstaben.txt

Translit:
A= ABBA   (test)
B= B
C= C
D is removed for test
E= E
F= F
G= G
H= H
I= I
J= J


But lower case letters are translited also
and all replaced letters are lower case now:
z z z z t e s tD abba t e i m i t g r oß b u c h s t abba b e n.txt


Is there room for improvement, can you add code to respect the case, or to many work for the result?


For more help and how-to's  read our fine Wiki.
If ReNamer had helped you, please donate a coin to Denis or buy a Pro license for commercial use. (Lite vs Pro)

Offline

#2 2009-08-08 14:57

narayan
Senior Member
Registered: 2009-02-08
Posts: 409

Re: Translit Rule Case sensitive

Would we need to respect case in normal transliteration applications?

I think so: see http://en.wikipedia.org/wiki/German_alphabet

the umlauts Ä, Ö, Ü, ä, ö and ü should be transcribed as Ae, Oe, Ue, ae, oe and ue, respectively.

*****
BTW Denis, two major European languages are missing in the Translit maps (German and Spanish).

I mention these two specifically, because they have native users who do not use English. (In comparison, we Indians prefer to use plain English for our file management, not the plethora of languages we have).

Chinese and Japanese also come to mind. But preparing a map may be a uphill task, as it is not a simple question of removing some diacritic marks. Unless of course there is a ready map available.

Last edited by narayan (2009-08-08 15:01)

Offline

#3 2009-08-09 18:41

Stefan
Moderator
From: Germany, EU
Registered: 2007-10-23
Posts: 777

Re: Translit Rule Case sensitive

narayan wrote:

languages are missing in the Translit maps (German

Umlaute.txt wrote:

Ä=Ae
Ö=Oe
Ü=ue
ä=ae
ö=oe
ü=ue
ß=ss


Note: save "Umlaute.txt" as UTF-8 encoding (using NotePad2 for example)

Note that the backward rule ss=>ß is a little bit dangerous
because 'ss' is an valid pair in german words too, like "Messer" is not "Meßer"

------------------------------
Note: here the case is respected, Ö => Oe  and ö => oe ? ? ?, even with ReNamer 5.50 (3 May 2009).... roll 
Related to http://www.den4b.com/forum/viewtopic.php?pid=3509#p3509 i have to test more i think.

Last edited by Stefan (2009-08-10 22:26)


For more help and how-to's  read our fine Wiki.
If ReNamer had helped you, please donate a coin to Denis or buy a Pro license for commercial use. (Lite vs Pro)

Offline

#4 2009-08-09 18:56

narayan
Senior Member
Registered: 2009-02-08
Posts: 409

Re: Translit Rule Case sensitive

Yes, that's how a user can do it, but I would like ReNamer to have these two maps as default. smile

Offline

#5 2009-08-09 19:04

Stefan
Moderator
From: Germany, EU
Registered: 2007-10-23
Posts: 777

Re: Translit Rule Case sensitive

Translit Rot13 

http://en.wikipedia.org/wiki/ROT13

Wikipedia wrote:

ROT13 ("rotate by 13 places", sometimes hyphenated ROT-13) is a simple substitution cipher
used in online forums as a means of hiding spoilers, punchlines, puzzle solutions,
and offensive materials from the casual glance. ROT13 has been described as the
"Usenet equivalent of a magazine printing the answer to a quiz upside down".
ROT13 is a variation of the Caesar cipher, developed in ancient Rome.
From: http://en.wikipedia.org/wiki/ROT13

Hide your file names.
F.ex. use this translit before wiping files to make it harder to resolve what you have deleted. (and don't skip extension)


Rot13.txt wrote:

a=m
b=o
c=p
d=q
e=r
f=s
g=t
h=u
i=v
j=w
k=x
l=y
m=z
n=a
o=b
p=c
q=d
r=e
s=f
t=g
u=h
v=i
w=j
x=k
y=l
z=m
A=M
B=O
C=P
D=Q
E=R
F=S
G=T
H=U
I=V
J=W
K=X
L=Y
M=Z
N=A
O=B
P=C
Q=D
R=E
S=F
T=G
U=H
V=I
W=J
X=K
Y=L
Z=m

To make it even harder use your own pairs like A=b, d=B, e=X, C=3.

Note: save "Rot13.txt" as UTF-8 encoding (using NotePad2 for example)

Last edited by Stefan (2009-08-09 19:10)


For more help and how-to's  read our fine Wiki.
If ReNamer had helped you, please donate a coin to Denis or buy a Pro license for commercial use. (Lite vs Pro)

Offline

#6 2009-08-10 21:29

krtek
Senior Member
From: Łódź (Poland)
Registered: 2008-02-21
Posts: 262

Re: Translit Rule Case sensitive

Stefan wrote:

I try to find uppercase letters only
and change them to add an leading space

I guess now you can use CleanUp rule, Stefan big_smile

Last edited by krtek (2009-08-10 21:31)


Regular Expressions are not as hard to understand as you may think. Check ReNamer's manual or nice Regular Expressions tutorial for more info and start to use full power of applications that use them (like ReNamer, Mp3Tag and so on).

Offline

#7 2009-08-10 22:12

eR@SeR
Senior Member
From: Земун, Србија
Registered: 2008-01-23
Posts: 297

Re: Translit Rule Case sensitive

Stefan,
You are active member, from Germany and didn't create German.txt translit yet? How that happened? smile
So, actually Umlaute.txt is German translit?


TRUTH, FREEDOM, JUSTICE and FATHERLAND are the highest morale values which human is born, lives and dies for!

Offline

#8 2009-08-10 22:18

Stefan
Moderator
From: Germany, EU
Registered: 2007-10-23
Posts: 777

Re: Translit Rule Case sensitive

krtek wrote:
Stefan wrote:

I try to find uppercase letters only
and change them to add an leading space

I guess now you can use CleanUp rule, Stefan big_smile

Oh Denis is such an "killjoy" big_smile
I try to let Translit made what i want and he add sillently an new option big_smile

OK, thanks Denis tongue


For more help and how-to's  read our fine Wiki.
If ReNamer had helped you, please donate a coin to Denis or buy a Pro license for commercial use. (Lite vs Pro)

Offline

#9 2009-08-10 22:24

Stefan
Moderator
From: Germany, EU
Registered: 2007-10-23
Posts: 777

Re: Translit Rule Case sensitive

eR@SeR wrote:

Stefan,
You are active member, from Germany and didn't create German.txt translit yet? How that happened? smile

Never needed to exchange files with umlauts with english speakers big_smile
No, just never thinks about this.

So, actually Umlaute.txt is German translit?

Yes, that's all needed.
Note that the backward rule ss=>ß is a little bit dangerous
because 'ss' is an valid pair too, like "Messer" is not "Meßer"


For more help and how-to's  read our fine Wiki.
If ReNamer had helped you, please donate a coin to Denis or buy a Pro license for commercial use. (Lite vs Pro)

Offline

#10 2009-08-10 22:48

eR@SeR
Senior Member
From: Земун, Србија
Registered: 2008-01-23
Posts: 297

Re: Translit Rule Case sensitive

krtek wrote:

I guess now you can use CleanUp rule

Nice notice... Hey it works better than RegEx ([a-z])([A-Z]) --> $1 $2 and it's integrated as checkbox. FABULOUS!
ThisIsALongSentenceWithoutSpaces --> This Is A Long Sentence Without Spaces

Maybe Denis used Stefan's pattern {part}...
Whole words only is missing to be added to Replace/Remove rule and then we can say that is complete cool

Stefan wrote:

Never needed to exchange files with umlauts with english speakers...

Never mind, you can do that for your country, don't think only on yourself tongue
Ok, send it officially to Denis so he can add it to next release.

Note that the backward rule ss=>ß is a little bit dangerous

Yes I'm aware of that, but we have to use to backward issue sad... Look at Serbian translits and you'll see what's the real problem hmmroll big_smile


TRUTH, FREEDOM, JUSTICE and FATHERLAND are the highest morale values which human is born, lives and dies for!

Offline

Board footer

Powered by FluxBB