Earlier this year, Silo AI, in collaboration with TurkuNLP and HPLT, released Poro 34B, a best-in-class open foundation model for Finnish with strong capabilities in English and code. Poro 34B demonstrates a distinctive approach to building performant large language models (LLMs) for low-resource languages, and marks an important milestone on the journey towards providing models for all official EU languages. While base models are versatile, they require fine-tuning or special prompting before deployment in production. A further challenge for low-resource languages is that most publicly available instruction datasets are written in English. To chat-tune a model to follow instructions in a low-resource language, we put forward an approach that uses the base model's own translation capability to generate instruction-tuning data. The result is a model that produces high-quality responses in the same language in which the prompts are written.
Foundation models are a cornerstone of the AI infrastructure needed to build AI-powered products, services and businesses. Poro 34B is a best-in-class open foundation model for Finnish, performs on par with English LLMs such as Llama 33B, and outperforms Falcon 40B and MPT 30B. In English-Finnish translation, it outperforms not only dedicated open-source translation models but also Google Translate, and scores on par with GPT-4. Poro 34B and similar base models are versatile, but require fine-tuning or special prompting when deployed into production.
Poro 34B chat can answer questions, follow instructions written in plain language, and write code. As such, the model is an optimal choice for chat-based use cases, such as customer support, assistants, co-pilots and data processing tools. Poro 34B chat is bilingual and has exceptional translation capabilities, even outperforming state-of-the-art models dedicated to translation.
Instruction tuning models for low-resource languages
In contrast to most other available open-source models, Poro 34B chat has been instruction tuned in a low-resource language, enabling it to follow instructions in Finnish.
Models are only as good as their training data. A common challenge for models designed for low-resource languages is that most of the instruction datasets that are publicly available are written in English. With datasets in English only, a model learns to provide responses in English, even if prompts are written in a different language.
A performant chat-tuned model requires datasets that demonstrate instruction following. These instructions can be commands such as "summarize this document", mathematical problems, or anything else a model may be asked to do. A model trained on such datasets learns to generalize across a variety of tasks, including many that are not explicitly represented in the training data.
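For illustration, a single training example in such a dataset might look like the following sketch. The field names are hypothetical and are not the exact schema used for Poro 34B chat.

```python
# A hypothetical instruction-following training record. The field names
# ("instruction", "input", "output") are illustrative only; they are not
# the exact schema used for Poro 34B chat.
example = {
    "instruction": "Summarize this document in three sentences.",
    "input": "<the document to be summarized>",
    "output": "<a high-quality three-sentence summary>",
}
```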
To chat-tune a model to follow instructions in a low-resource language, we put forward an approach that uses the base model's translation capability to generate instruction-tuning data. In other words, the base model itself functions as a translator that turns English instructions into instructions in the low-resource language. For chat capability in both the high- and low-resource language, instruction tuning on a combined set of instructions in both languages yields a model that produces high-quality responses in the same language in which the prompts are written.
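As a minimal sketch of this idea, the snippet below uses the Poro 34B base model to translate an English instruction into Finnish with the transformers library. The zero-shot prompt template is our assumption, not the exact format used in practice; a production pipeline would likely use few-shot examples and quality filtering.

```python
# Sketch: using the Poro 34B base model as a translator to turn English
# instructions into Finnish ones. The prompt template is an assumption.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "LumiOpen/Poro-34B"  # base model repo on HuggingFace
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

def translate_instruction(english_text: str) -> str:
    # Zero-shot translation prompt; real pipelines would add few-shot
    # examples and filter out low-quality translations.
    prompt = f"Translate into Finnish:\n{english_text}\nFinnish:"
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=256)
    # Decode only the newly generated tokens, not the prompt.
    completion = tokenizer.decode(
        outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
    return completion.strip()

print(translate_instruction("Summarize this document in three sentences."))
```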
In line with this approach, we use the Poro 34B base model to generate Finnish instructions, so that Poro 34B chat learns to follow instructions in Finnish. For capability in both English and Finnish, a combined set of instructions in both languages is then used to fine-tune the model. This ensures that users get responses in the same language in which their prompts are written.
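A minimal sketch of the combination step, assuming the English and translated Finnish instructions are stored as JSONL files (the file names are placeholders):

```python
# Sketch: merging English and machine-translated Finnish instruction data
# into one fine-tuning set. File names are placeholders.
from datasets import load_dataset, concatenate_datasets

english = load_dataset("json", data_files="instructions_en.jsonl")["train"]
finnish = load_dataset("json", data_files="instructions_fi.jsonl")["train"]

# Interleave the two languages so every training batch sees both.
combined = concatenate_datasets([english, finnish]).shuffle(seed=42)
combined.to_json("instructions_combined.jsonl")
```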
Download on HuggingFace and share your results
Now that the Poro 34B chat model is publicly available for download on HuggingFace, we would love to hear your feedback and see how you put the model to use. We invite you to download the model and tag us on social media to share your results, or contact us at [email protected].
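For a quick start, here is a minimal usage sketch with the transformers library. We assume the repo id LumiOpen/Poro-34B-chat and that the tokenizer ships a chat template; check the model card on HuggingFace for the exact usage.

```python
# Minimal sketch: loading Poro 34B chat and asking a question in Finnish.
# The repo id and chat-template usage are assumptions; see the model card.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "LumiOpen/Poro-34B-chat"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

messages = [{"role": "user", "content": "Mikä on Suomen pääkaupunki?"}]
prompt = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, tokenize=False
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(
    outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
))
```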
For more on how to build specialized sovereign LLMs, we encourage you to have a look at our SiloGen Platform.
Help develop chat models for a low-resource language
To create the best chat models, fine-tuning on examples that reflect the target language's culture and customs is essential, as translations alone often retain the nuances of the source language. If you know Finnish, you can help improve Finnish chat models by joining the Avoin Avustaja crowdsourcing effort, a Finnish version of Open Assistant managed by TurkuNLP: https://avoin-avustaja.fi/