HiTZ has found an innovative way to create chattbots in Basque and other small languages
The Centre HiTZof the University of the Basque Country/Euskal Herriko Unibertsitatea has created a new way of doingthe\u00A0 chat in small languages such as the Euskera, based on the multilingual open language model built by the Meta Research Center, Llama.
The usual way would be to feed Llama with texts and examples in Basque, but that manual work is very expensive. "Only big companies have been able to do it so far," explainsEneko Agirre, director of the research center HiTZ.In order to avoid this work, HiTZ members have found an "innovative and efficient way" to adapt the chat to the Euskera. With the new method, it is enough to continue training with the Euskera text mass Llama , but the key to this is to be able to apply techniques to deal with the problem known as "catastrophic oblivion".
The work done opens up new avenues. On the one hand, the method itself can be applied to open models stronger than the Llama, and on the other hand, it can be done in other languages with a similar volume of text.
In fact, with regard to the number of documents on the open Internet, there are 1000 times more documents in English than in Basque, and 100 times more in Spanish. So far, the question has been whether conversations with chattbots in small languages can achieve the same good results as those in English or Spanish.
You might like
X will block Grok's creation of fair, sexualized images for all users
The social network has imposed further restrictions on artificial intelligence, following controversy over the creation of images of real people in intimate clothing, including minors. The measure also applies to paid subscribers.
Mundu osoko gobernuak Musken Grok estutzen ari dira, horrek sortu ditzakeen “deepfake” sexualengatik
Europan, hainbat gobernuk ikerketak abiatu dituzte horrelako edukiak legez kanpokoak diren ebaluatzeko.
ChatGPT and Copilot will stop operating on WhatsApp and only Meta AI will remain available
As a result of the new messaging platform policies, third party chatbots will be shut down and, as a result, OpenAI and Microsoft will be withdrawn from tomorrow.
Wikipedia turns 25 and the Basque edition becomes a reference for minority languages
The world's largest free and collaborative encyclopedia is celebrating its 25th anniversary with more than 60 million articles, while Wikipedia in Euskera stands out for its sustained growth, active community and support for the care and dissemination of the Basque language on the Internet.
They warn of the criminal dangers of publishing images of other people, whether true or created by Artificial Intelligence
The Spanish Data Protection Agency warns that the thoughtless use of these instruments may violate fundamental rights and give rise to legal liability, especially in cases of intimate, sexualized content or affecting minors.
"Your Year in ChatGPT", a new feature that summarizes interactions in 2025
The tool will be available to all users of both free and paid versions, provided that their storage memory and chat history are activated.
What's the point and how is the new + button from Google's search bar used?
At the left end of Google's search bar, the + icon has replaced the magnifying glass, allowing you to attach both an image and a file, and, aided by Artificial Intelligence, Google conducts an advanced search using these images and files as context.
Netflix to Warner Bros. He'll get Discovery for $82.7 billion
The two companies reached an agreement this Friday that includes film and television studios and HBO.
Cloudflare has fallen again worldwide in less than a month: numerous affected websites
This new mistake comes after another major global shutdown that took place a few weeks ago, affecting services like the X social network, the ChatGPT chatbot and the League of Legends video game.
Meta has been sentenced to pay $479 million to the digital press for unfair competition
Madrid's 15th Commercial Court has sentenced Meta to compensate 87 AMI publishers for gaining a competitive advantage through behavioral advertising on Facebook and Instagram.