CrossWord Puzzle 2
Сайт: | Открытые курсы ИРНИТУ |
Курс: | Digital Humanities |
Книга: | CrossWord Puzzle 2 |
Напечатано:: | Гость |
Дата: | Суббота, 11 Октябрь 2025, 03:01 |
1. Across
Across
- The process of automatically translating text or speech from one language to another using computer algorithms.
- An MT approach that uses linguistic rules and bilingual dictionaries to translate text, focusing on syntax, morphology, and grammar.
- An MT approach that uses statistical models based on bilingual text corpora to predict the probability of a translation.
- An advanced MT approach that uses deep learning models, specifically neural networks, to translate text by analyzing large datasets and capturing context.
- An MT approach that relies on a database of previously translated examples, finding the closest matches to translate new sentences.
- A combination of different MT approaches, often integrating RBMT and SMT/NMT to leverage the strengths of each method.
- Large collections of text in two languages, used to train and evaluate MT systems by providing parallel examples of translations.
- A type of bilingual corpus where texts in two languages are aligned at the sentence level, facilitating the training of SMT and NMT systems.
- A specific type of SMT that breaks down text into phrases rather than individual words, improving the fluency of translations.
- A deep learning model used in NMT that processes sequences of text to generate translations, maintaining the order and context of words.
- In SMT, a model that predicts the most likely translation of a word or phrase based on bilingual text data.
- A model that assesses the fluency of the translated text by predicting the likelihood of word sequences in the target language.
- In MT, the process that selects the best translation hypothesis based on the probabilities generated by the translation and language models.
- A component in SMT that predicts the correct word order in the target language, addressing differences in syntax between languages.
- Computational models inspired by the human brain, used in NMT to learn patterns and relationships in language data.
- A technique in NMT that allows the model to focus on specific parts of the input sentence, improving translation accuracy, especially for long sentences.
- A framework used in NMT where the encoder processes the input text and the decoder generates the translation, often using an attention mechanism.
- A metric for evaluating the quality of machine-generated translations by comparing them to one or more reference translations.
- In NMT, models that have been trained on large datasets and can be fine-tuned for specific tasks or languages, speeding up the development process.
- The practice of applying knowledge gained from one task (e.g., translating English to French) to another related task (e.g., translating English to Spanish), commonly used in NMT.
1.1. Check Across
Across
- MACHINE TRANSLATION
- RULE-BASED MACHINE TRANSLATION
- STATISTICAL MACHINE TRANSLATION
- NEURAL MACHINE TRANSLATION
- EXAMPLE-BASED MACHINE TRANSLATION
- HYBRID MACHINE TRANSLATION
- BILINGUAL TEXT CORPORA
- PARALLEL CORPORA
- PHRASE-BASED MACHINE TRANSLATION
- SEQUENCE-TO-SEQUENCE MODEL
- TRANSLATION MODEL
- LANGUAGE MODEL
- DECODING ALGORITHM
- REORDERING MODEL
- NEURAL NETWORKS
- ATTENTION MECHANISM
- ENCODER-DECODER ARCHITECTURE
- BLEU SCORE
- PRE-TRAINED MODELS
- TRANSFER LEARNING
2. Down
Down
- A method in NMT training where target language data is translated back into the source language to create additional training data, improving translation quality.
- Smaller language components, such as prefixes or suffixes, used in NMT to handle rare or compound words more effectively.
- The process of breaking down text into smaller units, such as words or subwords, to facilitate processing in MT systems.
- The process of matching corresponding words or phrases between the source and target languages in a parallel corpus, crucial for training SMT and NMT systems.
- Dense vector representations of words used in NMT to capture semantic meanings and relationships between words in different languages.
- The process of fine-tuning an MT system to perform better in a specific domain, such as legal or medical translation.
- The ability of an MT system to apply knowledge from one language pair to another, enhancing translation quality across multiple languages.
- An NMT approach that handles multiple languages simultaneously, using a shared model that can translate between any pair of supported languages.
- Languages that have limited digital resources, such as corpora or dictionaries, posing challenges for MT development.
- Words that are not present in the training data of an MT system, often leading to translation errors.
- The process of manually correcting errors in machine-generated translations to improve accuracy and fluency.
- The process of analyzing the grammatical structure of sentences, used in RBMT to generate accurate translations.
- The study of the structure of words and their components, such as roots and affixes, used in RBMT to handle inflected languages.
- The process of determining the correct meaning of a word that has multiple possible interpretations, crucial in MT for accurate translations.
- Identifying the roles played by words in a sentence, such as agent or object, to improve the accuracy of MT systems.
- The combination of a source language and a target language in MT, such as English to Spanish.
- An intermediate language used in MT when direct translation between two languages is difficult due to lack of resources.
- Word embeddings that take into account the context in which a word appears, improving translation quality in NMT.
- The process of artificially increasing the size of a training dataset by creating variations of existing data, used to improve MT performance.
- A decoding algorithm used in NMT that considers multiple translation hypotheses simultaneously to find the most probable translation.
- A regularization technique in NMT that prevents overfitting by randomly dropping units in the neural network during training.
- The process of automatically finding and extracting parallel sentences from large bilingual corpora, used to improve the training of MT systems.
- A term referring to the distinct linguistic patterns that emerge in machine-generated translations, often detectable by statistical analysis.
- An MT approach where human translators interact with the MT system during the translation process, refining the output in real-time.
- An approach that relies heavily on large text corpora for training MT systems, typical in SMT and NMT.
- In SMT, a table that lists possible translations for phrases in the source language along with their probabilities.
- A loss function used in NMT training to measure the difference between the predicted translation and the actual translation.
- A type of language model used in NMT that predicts the next word in a sentence based on the context of previous words.
- A technique in NMT where a smaller, simpler model is trained to replicate the behavior of a larger, more complex model, improving efficiency.
- An MT approach that transfers linguistic structures from the source language to the target language, relying on syntactic and semantic transfer rules.
2.1. Check Down
Down
- BACK-TRANSLATION
- SUBWORD UNITS
- TOKENIZATION
- ALIGNMENT
- WORD EMBEDDINGS
- DOMAIN ADAPTATION
- CROSS-LINGUAL TRANSFER
- MULTILINGUAL TRANSLATION
- LOW-RESOURCE LANGUAGES
- OUT-OF-VOCABULARY WORDS
- POST-EDITING
- SYNTACTIC PARSING
- MORPHOLOGICAL ANALYSIS
- LEXICAL DISAMBIGUATION
- SEMANTIC ROLE LABELING
- LANGUAGE PAIR
- PIVOT LANGUAGE
- CONTEXTUAL EMBEDDINGS
- DATA AUGMENTATION
- BEAM SEARCH
- DROPOUT
- PARALLEL SENTENCE MINING
- TRANSLATIONESE
- INTERACTIVE MT
- CORPUS-BASED MT
- PHRASE TABLE
- CROSS-ENTROPY LOSS
- NEURAL LANGUAGE MODEL
- KNOWLEDGE DISTILLATION
- TRANSFER-BASED MT