User Tools

Site Tools


Sidebar

isaac_project:en:mtpe:mtpe_full_instructions:contents:1_t_guide

TRANSLATION GUIDELINES

INCOMPLETE/TRUNCATED STRINGS: if a string is incomplete in source, do not try to complete it in target; translate the part that appears in source with what would be the equivalent part in your language. Do not try to guess what the rest of the string would be.
INCIDENTAL WORDS: when incidental words of cultural/linguistic courtesy or habit are generally used in the source language but not in the target, or generally used in the target language but not in the source, translators may add or delete such words to best convey the same meaning and tone. For example, when translating a string that discusses a future event from Arabic into English, sometimes it may be needed to delete the word “inshallah” rather than literally translating it to more closely convey the meaning and tone.
IDIOMS: in case of idioms, proverbs, and expressions, always retain the meaning of the source string in your translation. A literal word-by-word translation is not acceptable if the meaning of the source string gets lost. Example: “Break a leg” → “In bocca al lupo”.
REPEATED WORDS: if the source text has repeated words that do not seem to serve any purpose in the intended meaning, please do not include the extra words in the translation.
Example: “I went I went to the park” → “I went to the park”
FILLER WORDS: do not include filler words like “uh”, “hmm”, etc. in your translations.
Example: “My weekend was um uh not too bad” → “My weekend was not too bad”.
INTERJECTIONS: words like “oups”, “yikes”, “yay”, “gosh”, etc. should be considered part of the text and should be kept in the target language/translated.
GRAMMAR AND SPELLING: we expect grammatically correct translations without spelling errors. Spell checker is integrated in Oneforma and available directly in the webapp. To launch the check, click on this button in each string you want to check:

GENDER: if the source sentence has a specific gender, it should be translated into the same gender in the target. If the source language doesn't support a specific gender and you need to choose one in the target language, for e.g. pronouns when translating from English into German, Spanish etc., choose a random gender. Do not always choose male or female.
STYLE AND FORMALITY: if the source sentence contains a level of formality, try to replicate the same level of formality/politeness in the target unless this would be perceived as unnatural by the target audience. If the source sentence does not provide a level of formality (e.g. English), translate with an informal style by default, unless this would be perceived as unnatural by the target audience (like in Japanese, where formal is the more accepted form).
POLITE WORDS: if using ‘polite’ words (like 'please') is not common or expected in the target, please omit them (this should not be considered an Omission error during review).
SWEAR WORDS/VULGAR CONTENT: do not omit any parts of the source content and do not change the original tone of the sentence. If source contains swear words, politically incorrect expressions, vulgar comments, etc. please keep it in target as well by conveying the same meaning and tone. If you do not feel comfortable with the content of the string and do not want to translate it, feel free to skip the hit.
NUMBERS: keep numbers as is (keep the “same style”). That means written-out numbers in the source should be translated into written-out numbers in the target and Arabic numerals (123) in the source should be transferred as Arabic numerals (123) in the target. Please stick to this rule even if it might be unusual/rare in the target language. Examples:
▪ 123 → 123
▪ two plus two equals four → zwei plus zwei ist gleich vier
THOUSAND/DECIMAL SEPARATOR: any formatting of the numbers, for e.g thousand and decimal separators needs to be modified to fit the target language. For example, given English to German translation, “123,000” should be translated to “123.000”.
DATES: dates need to be formatted based on the target language but remaining close to the source format as much as possible. For example, given English to French translation, “02/10/2015” should be translated to “10/02/2015”.
TIME: time indication/hours need to be formatted based on the target language but remaining close to the source format as much as possible. For example, given English to Italian translation, “5:30 PM” should be translated to “17:30”.
CURRENCIES: currency names should be translated (not converted). For example, given English to Spanish translation, “8000 crowns” would be “8000 coronas”. “18 dollars” would be “18 dólares”.
UNITS OF MEASUREMENT: Never convert units even if the target language uses different metric systems. Translation with converted units are not acceptable. Example: 75 Fahrenheit should not change into 25 Celsius (75 F should be translated to 75 F). 6 inches should not change into 15.24 cm.
SYMBOLS: symbols can be switched if one variant is more common in the target language. For example, given English to German translation, “$125 dinner” might be translated to “125 USD Abendessen”.
FORMATTING: The format of the source text should be maintained in the target languages unless target language has a pre-defined syntax. Example: “123” should be translated as “123” and not as “one hundred and twenty-three”. Example: “I am so happy!!!!!!!!!” should be translated by maintaining the same number of “!” in the target.
PUNCTUATION AND CAPITALIZATION: preserve source punctuation/capitalization as much as possible (especially if the content is informal, like in chats, social media posts, etc.), consistent with common usages in the target language. For example, if the source sentence includes several question marks, explanation marks, dots etc. please copy this punctuation as is into the target. If the source sentence starts lowercase, do not start the target sentence uppercase. Please note that if the source has enough intentional punctuation, correct any incorrect or missing ones.

However, some punctuation/capitalization adjustments may be needed in the target language for those cases where the source contains symbols that do not exist in the target or other grammar restraints.
Note: be sure to type symbols such as quotation marks (e.g. “ ”) using the target language keyboard set. Avoid typing using other keyboard set that is not the target language one because this can lead to inconsistencies.

EXCEPTION: when the source is clearly FORMAL, you should modify and adapt the punctuation to what is expected in the target language. Although most of the times we do not have the context of the strings, you can sometimes assume the nature of the source by the content, the style or the language used in it.

In case of doubts, if the source is ambiguous and you cannot clearly tell whether it’s formal/informal, please assume it is informal and preserve source punctuation/capitalization as much as possible.
MENTIONS AND HASHTAGS: keep @mentions and #hashtags as in source. Example: I don't like @PapaJohn pizza #cardboard #evil → no me gusta la pizza de @PapaJohn #cardboard #evil
CODE & TECHNICAL TEXT: links , html tags, development code, text usually in English wrapped with <>, {}, [] or other type of coding language should not be translated. The checkbox “garbled text” should be marked in these cases. Example: “<head>” “{top:- 30px;position:absolute;right }”, “xmlHttp.open(“GET”, true)”, etc. Additionally, there are other instances of text referring to a UI, a code, or an instruction where it does not need to be translated.
Example: text STOP to stop receiving message → translate everything except the word, “STOP” because this is a text message instruction where the word needs to be texted as is for the service to work correctly.

TAGS: You might encounter tags (words surrounded by brackets) in the source text: [redacted_name], [redacted_address], [redacted_email], [redacted_id], [redacted_number], [redacted_url], [redacted_words], etc. You should treat tags as substitutes for the described word or phrase, and translate them according to the context. The tag should be present in the translation as it is; you should NOT translate tag contents. Gendered articles: Please note that some languages might require gendered articles before some tags, like the address tag. Please do not include a gendered article before the tag, you should treat the tag as if the gendered articles were included inside the tag.

EMOJIS: copy emojis directly from source into the target. Do not change them, even if they are incomplete (like :-‘). If the source sentence uses an emoji to replace a word that is essential to the sentence meaning, then translate with the inferred word (without omitting the emoji).

BUSINESS AND PRODUCT NAMES: keep business names and product names as-is with proper capitalization. Examples: Facebook → Facebook, ebay → eBay. Exception: if a company/product has been officially marketed in another country under a different name, you should use the official name for the target country. Examples: “Diet Coke” would be “Coca-Cola Light” in Italian; “Algida” would be “Frigo”, “Miko”, or “Eskimo” depending on the target country.
PROPER NAMES: proper names of people should NOT be changed unless they are historical names widely recognized in each language. Example: Columbus > Colón.
Transliteration can be used as long as the names are the same as in source. Example for Serbian: “Shakespeare” > “Šekspir”. Other names (e.g. names of places and languages) should be translated when the name is rendered differently in the source and target languages. For example, Warsaw in Polish should be Warszawa, Cologne in German should be Köln and Spain in Spanish should be España.
APP NAMES: keep well known app names as in source if there is no official translation. Example: Skype → Skype
NAMES OF LAWS, ACTS, ORGANIZATIONS: keep as in source the names of laws, acts, organizations, etc. that belong to a specific country. Example: 'Defense of Marriage Act'. For international organizations or international documents, i.e., 'Universal Declaration of Human Rights', please use the standard and approved translations for the target.
MOVIE/BOOK/SONG/SHOW TITLES: if there is no official/culturally relevant translation for the title, keep the one in the source. If there is an official translation or adaptation that is more familiar to the target locale, please use that one. Example: “Harry Potter and the Philosopher’s Stone” should be translated into French as “Harry Potter à l'école des sorciers” because that was the title of the French version of the same book. “Twinkle, twinkle, little star” might be translated into Korean using the corresponding Korean-language song “반짝 반짝 작은 별”.
2 TRANSLATIONS REQUIRED: for MTPE, we typically require one translation per source string. However, some batches/files will require two translations for each source string. if 2 translations are required:

• They will be paid individually (if you provide both Translation1 and Translation2 for the same source string, the source string will be paid twice).
• Translation1 needs to be the most common.
• It will be mandatory to provide 2 valid different translations of the same source string. Both translations need to maintain the original intent using different words, structures, and expressions, which means that Translation1 cannot be the same as or very similar to Translation2. Make every attempt to provide a good diversity in Translation2 and not just change the articles or other minor changes. Example:
o T1: I’m Ralf.
o T2 (valid): My name is Ralf
o T2 (not valid): I am Ralf
• This diversity can come from using a different level of formality, translate into two sentences with different meanings (if the source text is ambiguous), use synonyms in the second translation or rephrase the sentence (syntactic difference). Please take all these possibilities for providing diversity into account and combine them. Do not always use the same criteria (for example do not always rephrase). Don't try to add a new translation from a different variant such as Swiss German for German (this would be a cheating behavior for wrong target language). Hints and tips on how to provide valid 2nd translations are posted in the below section.
• Centific will conduct checks to detect how similar Translation1 and Translation2 are. If there is an indication that Translation1 and Translation2 are the same or too similar (only minimum variations applied), it will be considered a cheating behavior.
• There are cases where it is not possible to provide a valid Translation2, and in these cases Translation2 should be omitted. For example when the source is made just by:
o a number or a price
o a URL
o an email address
o a proper name
o etc.
• If only Translation1 has been provided, Centific will conduct checks to detect if the lack of Translation2 is justified. If there is an indication that the lack of Translation2 is not justified, the user who missed to translate it will be penalized in the QA.

• If the task requires 2 translations but it is not possible to provide a valid Translation2, click the SECOND TRANSLATION NOT POSSIBLE button. This will indicate to the reviewers that there is only 1 possible translation for the provided source.

TIPS ABOUT 2ND TRANSLATIONS: The 2nd translations should be considerably different. It’s not just a matter of changing the articles or the punctuation. You should really find a different way to say the same (without of course doing mistranslations, omissions, etc.). Please find here some tips about how to reach this diversity in the 2nd translations.
Different levels of FORMALITY: Due to their nature, some strings should have either an informal or a formal tone. For example, business reports will usually be formal, tweets will usually be informal. Vulgar and offensive contents will more likely have an informal tone too. This perception may change from language to language and the gold rule is to always avoid using a level of formality that would be unnatural in the context of the sentence for your language. However, if the source context is ambiguous enough to let you play with different levels of formality, you can use this to vary Translation1 from Translation2. For example:
o Translation1: Votre enfant va impatiemment se rendre sur le site Internet www.loremipsum.com
o Translation2: Ton gamin s’éclatera sur www.loremipsum.com
Different meaning for AMBIGUOUS sources: Sometimes the source text is ambiguous and can be interpreted in different ways. You can use Translation1 and Translation2 to provide 2 different possible meanings of the same ambiguous source string. For example:
o Source: Fred previously checked on this with the gas desk because Aquila indicated interest in buying out this contract.
Here it is not clear whether “Fred” and “Aquila” are two people or two companies, so you can:
o T1: translate as if they were both people names
o T2: translate as if they were both company names
• Different SINTAXES: Use different syntactic structures in Translation1 and Translation2. This doesn’t only mean change the word order, but literally using different grammar/syntactic structures. For example:
o T1: I wish you all the best on your birthday
o T2: Wishing you a happy birthday
o T1: Quería saber si te gustaría ir a ver “American Beauty” con nosotros un día de estos
o T2: ¿Te apetecería ir a ver con nosotros “American Beauty” algún día?
Different TERMINOLOGY: Terminology between Translation1 and Translation2 can be changed, but please make sure you don’t change just one word, otherwise they would be too similar. For example:
o Source: Manual coffee crafting competition (Atlanta)
o T1: Gara di preparazione manuale del caffè (Atlanta)
o T2: Competizione di Atlanta su come fare il caffè a mano
Different GENDERS: If the source language doesn't support a specific gender and it’s ambiguous, please use different genders for Translation1 and Translation2. For example:
o Source: You are really funny.
o T1: Sei molto simpatica. → This is feminine
o T2: Ti trovo davvero simpatico. → This is masculine
UNACCEPTABLE 2ND TRANSLATIONS: Examples of unacceptable 2nd translations, that will be considered cheating behaviors, include (but are not limited to):

AMBIGUOUS/UNCLEAR SOURCE: if the context is clear, keep the same meaning as source. But if the meaning of the source is ambiguous and the context is not clear, try searching the exact snippets in the Web to find a possible context or meaning. If there is no way to clarify the meaning, provide a translation based on your best judgment, even word by word if needed. If the project allows 2 translations for the same source, you can use them to put 2 different interpretations of the source. Do not mark the source as nonsensical if you can apply the phrase into a context where the words make sense. For example, “Rat” can be translated referring to the animal in Translation1 and with the meaning of “traitor” in Translation2.

QUERIES: If you have linguistic queries or doubts about the instructions, raise a ticket to our Query Management System Global Query (C837-Isaac projects) by selecting “MTPE” under the Isaac scope field. Do not raise tickets in Global Query for non-linguistic topics. If you have queries about workflow, rates, payments, or want to report a technical bug (not related to linguistic issues), reach out by email to the PMs. If you do not have an account or you cannot remember your username, reach out to the PMs by email. If you don’t remember the password, you can reset it from the login page. All answers provided outside this tool and by third parties/companies are neither official nor reliable for this project.

NEXT TOPIC ⇒ MOST FREQUENT ERRORS IN MT OUTPUT

BACK TO ⇒ MACHINE TRANSLATION POSTEDITING (MTPE) TRANSLATION v.3.1

isaac_project/en/mtpe/mtpe_full_instructions/contents/1_t_guide.txt · Last modified: 2023/12/18 12:56 by sergio