This text has been automatically translated, it may contain errors or inaccuracies.
UPV/EHU
Favorite
Remove from my list

HiTZ has found an innovative way to create chattbots in Basque and other small languages

Until now, the ability of chattbots to have good conversations depended on the number of documents written in a particular language openly on the Internet, but the new method will greatly facilitate the creation of intelligible chattbots.
HiTZ zentroak hizkuntza txikiagoentzat txatbotak egiteko modu berria deskubritu du
New method created by the HITZ center. Image: UPV

The Centre HiTZof the University of the Basque Country/Euskal Herriko Unibertsitatea has created a new way of doingthe\u00A0 chat in small languages such as the Euskera, based on the multilingual open language model built by the Meta Research Center, Llama.

The usual way would be to feed Llama with texts and examples in Basque, but that manual work is very expensive. "Only big companies have been able to do it so far," explainsEneko Agirre, director of the research center HiTZ.


In order to avoid this work, HiTZ members have found an "innovative and efficient way" to adapt the chat to the Euskera. With the new method, it is enough to continue training with the Euskera text mass Llama , but the key to this is to be able to apply techniques to deal with the problem known as "catastrophic oblivion".

The work done opens up new avenues. On the one hand, the method itself can be applied to open models stronger than the Llama, and on the other hand, it can be done in other languages with a similar volume of text.

In fact, with regard to the number of documents on the open Internet, there are 1000 times more documents in English than in Basque, and 100 times more in Spanish. So far, the question has been whether conversations with chattbots in small languages can achieve the same good results as those in English or Spanish.

More news from technology

18:00 - 20:00
LIVE
From  min.

Trump has extended TikToki's separation period from China by 90 days

Trump has given him 90 more days to separate TikToki from the Chinese matrix for use in the US. A law under Biden forced him to cut the app with China, considering it an "enemy" of the US. But President Trump has accepted extensions for TikTok to find an American investor.

Load more