Facebook M2M-100
Kyle Wiggers wrote a feature for VentureBeat on Facebook’s new M2M-100 model, a machine translation model that, unlike e.g. Google Translate for many language pairs, does not use English as a go-between. Instead of translating from language A to English and then from English to language B, it translates from A directly to B, which for 100 languages means 100 × 99 = 9,900 directed combinations. The model was trained on 2,200 of these combinations and sets a new state of the art (in terms of BLEU) for many non-English language pairs. With 15 billion parameters, it continues the trend that strength really is in numbers for NLP and MT models. FAIR has open-sourced M2M-100 at pytorch/fairseq.
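
As a rough illustration of what direct many-to-many translation looks like in practice, here is a minimal sketch using the Hugging Face Transformers port of an M2M-100 checkpoint (the facebook/m2m100_418M model name and the Transformers API are assumptions here; the fairseq release linked above ships the same checkpoints with its own CLI). The source language is set on the tokenizer and the target language is forced as the first decoder token, with no English pivot in between.

    # Sketch: French -> German translation with M2M-100, no English pivot,
    # assuming the Hugging Face Transformers port of the model.
    from transformers import M2M100ForConditionalGeneration, M2M100Tokenizer

    model = M2M100ForConditionalGeneration.from_pretrained("facebook/m2m100_418M")
    tokenizer = M2M100Tokenizer.from_pretrained("facebook/m2m100_418M")

    # Tell the tokenizer the source language, then encode the input.
    tokenizer.src_lang = "fr"
    encoded = tokenizer("La vie est belle.", return_tensors="pt")

    # The target language is selected by forcing its language token as the
    # first generated token of the decoder.
    generated = model.generate(
        **encoded, forced_bos_token_id=tokenizer.get_lang_id("de")
    )
    print(tokenizer.batch_decode(generated, skip_special_tokens=True))

Swapping src_lang and the forced language token is all it takes to pick a different one of the 9,900 directions, which is exactly the point of the many-to-many setup.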