CycleTRANS: Can an AI learn language translation without direct translation data?

I built this architecture based on a scientific paper. The main idea is simple: give the model two datasets in different languages (for example, Italian and English) and let it learn how to translate between them.

So, how is this different from regular translation models like Marian or seq2seq?

The key difference is that it doesn't need direct translations (parallel sentence pairs) in the dataset. That matters a lot for languages with little available translation data.

How does it work?

The model takes sentences from one language, say Italian, and tries to translate them into another language, like English. It uses the BLEU score, a metric based on n-gram overlap, to check whether the translated sentence makes sense, and over time this pushes the model toward better translations. The English translation is then translated back into Italian, and the model is rewarded if the back-translation matches the original sentence.

For example:

Il gatto è sulla sedia → The cat is on the chair → Il gatto è sulla sedia
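To make the round trip concrete, here is a minimal sketch in Python of how a back-translation reward could be computed. It is not the code from the repo: `simple_bleu` is a simplified, smoothed BLEU-style score written for illustration, and `toy_it_to_en`, `toy_en_to_it`, and `cycle_reward` are hypothetical names, with lookup tables standing in for the real translation models.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def simple_bleu(reference, candidate, max_n=4):
    """Simplified sentence-level BLEU with add-one smoothing (illustrative only)."""
    ref = reference.lower().split()
    cand = candidate.lower().split()
    if not cand:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(cand, n)
        ref_counts = ngrams(ref, n)
        # Clipped overlap: each candidate n-gram counts at most as often as in the reference.
        overlap = sum(min(count, ref_counts[gram]) for gram, count in cand_counts.items())
        total = sum(cand_counts.values())
        # Add-one smoothing so one missing n-gram order does not zero the whole score.
        log_precisions.append(math.log((overlap + 1) / (total + 1)))
    # Brevity penalty for candidates shorter than the reference.
    brevity = 1.0 if len(cand) >= len(ref) else math.exp(1 - len(ref) / len(cand))
    return brevity * math.exp(sum(log_precisions) / max_n)


def cycle_reward(sentence, forward, backward):
    """Translate, translate back, and score the reconstruction against the original."""
    translated = forward(sentence)
    reconstructed = backward(translated)
    return translated, reconstructed, simple_bleu(sentence, reconstructed)


# Hypothetical stand-ins for the two translation models (assumption: plain lookup tables).
IT_EN = {"Il gatto è sulla sedia": "The cat is on the chair"}
EN_IT = {english: italian for italian, english in IT_EN.items()}


def toy_it_to_en(sentence):
    return IT_EN.get(sentence, "")


def toy_en_to_it(sentence):
    return EN_IT.get(sentence, "")


def bad_en_to_it(sentence):
    # A deliberately wrong back-translation, to show the reward dropping.
    return "Il cane è sul tavolo"


good = cycle_reward("Il gatto è sulla sedia", toy_it_to_en, toy_en_to_it)
bad = cycle_reward("Il gatto è sulla sedia", toy_it_to_en, bad_en_to_it)
print(good)
print(bad)
```

A perfect round trip gets the maximum reward, while the wrong back-translation scores lower. In actual training, a reward like this would have to feed into the model update somehow, and how that is done depends on the paper and the repo.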

This method currently performs worse than traditional models, but there is room to improve it. It might open up new possibilities for translation, especially for low-resource languages.

GITHUB REPO

This is pretty interesting!

AI is definitely going to have a huge impact globally.