[libreoffice-l10n] translating dialogs using OmegaT

Milos Sramek <sramek.milos -AT- gmail.com>
Sat, 30 Nov 2013 07:56:36 +0100

Hi,

I've looked into the situation with the new strings for translation in dialogs.

Since the LO 4.1 release the developers have converted 285 dialogs to Glade UI files. For us this means that the corresponding strings were moved from the files in the (xxx)/source subdirectories to files in the (xxx)/uiconfig subdirectories. One can check in Pootle that, comparing 4.2 to 4.1, there really are fewer words in the source subdirs and more in the uiconfig subdirs.

The consequence for us is that the translations from the source files are missing in the uiconfig files. There are about 8000 words to translate now, so this is a lot of work. Moreover, in Pootle we see neither the old 4.1 translations nor similar strings from 4.2, so it looks like we would have to redo the translation from scratch.

I've tried using OmegaT for the translation. It can show the 4.1 translations as a reference and in general it speeds up the work significantly.

OmegaT works with translation memories (tmx files). Besides the main one, it can also use auxiliary tmx files. The idea is to convert the 4.1 translations to such auxiliary tmx files; OmegaT then offers them as suggestions.


The procedure:
1. Install OmegaT from omegat.org (I use the beta version)

2. Download the po files from Pootle. I've downloaded everything in the libo_ui-sk.zip (4.2) and libo41x_ui-sk.zip files

3. Create a new OmegaT project (OT asks you to create a directory, let's create XXX) without importing anything

4. Unzip libo_ui-sk.zip in XXX/source
5. Unzip libo41x_ui-sk.zip in XXX/tm
6. Convert the po files to tmx in XXX/tm (using the bash shell):
    # run inside XXX/tm; writes foo.po.tmx next to each foo.po
    for i in $(find . -name '*.po'); do po2tmx --lang sk -i "$i" -o "$i.tmx"; done

7. This step is optional, but can save a lot of typing: replace the ~ accelerators with _ ones:

    sed -i -e 's/~/_/' $(find . -name '*.tmx')

This converts ~ in all strings, not only the dialog ones. It should not be a problem, since we want to translate only the dialog strings.

8. Start OmegaT and open the XXX directory. It will load the po files from XXX/source. Suggestions from the auxiliary tmx files created from 4.1 are in the upper right pane. Most of the strings are already translated (OT understands po files) and in my setup are displayed in yellow. Every now and then there is an untranslated (blue) string. Pressing CTRL-U jumps to the next untranslated string. OT may directly suggest a translation (based on the tmx files), marked by [approximate] (or something similar in other languages). If you like it, press CTRL-R and [approximate] vanishes. Or edit it as you wish. Then press CTRL-U to go to the next string.

9. When done, choose Project > Create translated documents. These appear in XXX/target. One can compare the source and target files using gvimdiff: the files are exactly the same except for the translations.

10. When a subdirectory is translated (say, cui), zip it and upload it to Pootle.

11. Enjoy the '0' in the "Need translation" column
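
Steps 6 and 7 can also be wrapped in two small bash helpers, sketched below. This is only an illustration under some assumptions: the function names convert_po_to_tmx and underscore_accelerators are made up here (they are not part of OmegaT or translate-toolkit), po2tmx is assumed to be on the PATH, and sed -i assumes GNU sed. The substitution is the same as in the one-liner of step 7; the only difference is that file names containing whitespace are handled too.

```shell
#!/usr/bin/env bash
# Sketch only: the helper names below are hypothetical.

# Step 6: write foo.po.tmx next to every foo.po below directory $1.
# $2 is the target language code (e.g. sk). Needs po2tmx on the PATH.
convert_po_to_tmx() {
    local dir=$1 lang=$2
    find "$dir" -name '*.po' -print0 |
        while IFS= read -r -d '' po; do
            po2tmx --lang "$lang" -i "$po" -o "$po.tmx"
        done
}

# Step 7: in every tmx file below directory $1, replace the first ~
# on each line with _ (in-place editing with -i assumes GNU sed).
underscore_accelerators() {
    local dir=$1
    find "$dir" -name '*.tmx' -exec sed -i -e 's/~/_/' {} +
}

# Example, run from the directory that contains XXX:
#   convert_po_to_tmx XXX/tm sk
#   underscore_accelerators XXX/tm
```

Only files ending in .tmx are touched, so the po files unzipped in XXX/tm keep their ~ accelerators.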

OmegaT normally segments the text at the sentence level. This means that a PO message consisting of two or more sentences would be offered for translation in two or more parts. The suggestions based on the auxiliary tmx files, however, are not split. One can change this behaviour by unchecking all options in Settings > Segmentation > Predefined (the last but three item). Subsequently, a PO message will always appear as one item in OT.


My observations after translating about 1000 words:

- working with OT is much faster than with Pootle, where one waits a second or two after submitting a string, which becomes annoying after some time

- suggestions are both exact and fuzzy

- CTRL-U stops at untranslated strings and at translated strings with ambiguous translations in the tmx files. In the second case the original translation is kept, so one can correct it based on the suggestions

- in the suggestions one can sometimes see an incorrect translation. One can then look the incorrect string up in Pootle and correct it there, or do that directly in the XXX/source tree

- one can see a lot of context, 16 lines in my case. For me it helps, for example, to choose the correct gender of adjectives if I see that 'thick' is related to 'line'

- OT always wants to translate the headers of the po files (these are items with many lines). Just hit CTRL-SHIFT-R to take the original

There seems to be another way to use OT as well. One could perhaps merge the auxiliary tmx files into one and store it as XXX/omegat/project_save.tmx. This is the main translation memory. In this case OT would perhaps translate all untranslated messages automatically. I did not check this.

Besides OmegaT I've also tried Virtaal. Compared to Pootle it has the advantage that it looks up translations on some servers. The amaGama server seems to have translations from OpenOffice, so it helps a bit (it would not be bad to ask them to load the LO translations). The lookup is, however, too slow to be useful.


The po2tmx program is from the translate-toolkit package.

I hope that this will help
best
Milos



--
email & jabber: sramek.milos@gmail.com

