Translation and Translation Data (2/1/2022)

Lecture: (by Graham Neubig)

  • The Practice of Translation
  • Machine Translation
  • Translation Evaluation Metrics
  • Translation Data Sources
  • Bi-text Extraction/Filtering

Language in 10: Cantonese

Slides: Multilingual Slides

Discussion:
Use Google Translate to back-translate a text through a pivot language, e.g., "English → Spanish → English" or "English → L1 → L2 → English", where L1 and L2 are typologically different from English and from each other.
Compare the original text with its English back-translation and share your observations. For example: (1) What information was lost in the process of translation? (2) Are there translation errors associated with linguistic properties of the pivot languages or with linguistic divergences across languages?
Try different pivot languages: what insights can you offer about the quality of MT for those language pairs?
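
One way to make the comparison between the original and the back-translation more systematic is a word-level diff. The sketch below is only an illustration, not part of the assignment: it assumes you have pasted the original and the back-translated English text into strings by hand, and it uses Python's standard `difflib` module rather than any translation API. The function names `token_diff` and `token_overlap` and the example sentence pair are hypothetical (the pair is invented, not real Google Translate output). Whitespace tokenization and lowercasing are simplifying assumptions that will miss punctuation and morphology effects.

```python
import difflib


def _tokens(text):
    # Simplifying assumption: lowercase and split on whitespace.
    return text.lower().split()


def token_diff(original, back_translated):
    """Return (lost, added): tokens only in the original, tokens only in the back-translation."""
    src, hyp = _tokens(original), _tokens(back_translated)
    matcher = difflib.SequenceMatcher(a=src, b=hyp, autojunk=False)
    lost, added = [], []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag in ("delete", "replace"):
            lost.extend(src[i1:i2])
        if tag in ("insert", "replace"):
            added.extend(hyp[j1:j2])
    return lost, added


def token_overlap(original, back_translated):
    """Rough similarity in [0, 1]: 2 * matched tokens / total tokens."""
    src, hyp = _tokens(original), _tokens(back_translated)
    return difflib.SequenceMatcher(a=src, b=hyp, autojunk=False).ratio()


# Hypothetical example: a pivot language without gendered pronouns
# might turn "her" into "his" on the way back to English.
original = "She gave him her book yesterday"
back_translated = "She gave him his book yesterday"

lost, added = token_diff(original, back_translated)
print("lost:", lost)
print("added:", added)
print("overlap:", round(token_overlap(original, back_translated), 3))
```

The lost and added tokens are a starting point for observation (1), and inspecting which word classes change (pronouns, articles, tense markers) can hint at the divergences asked about in (2). A high overlap score does not guarantee that the meaning was preserved, so the diff complements a careful reading rather than replacing it.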
