10-billion parameter open-source large language model available for use following announcement earlier this year
Abu Dhabi, United Arab Emirates, October 30, 2024: Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) – the world’s first graduate-level artificial intelligence (AI) university dedicated to research – has released Nanda, the world’s most advanced open-source Hindi large language model (LLM).
The model, released as open source, was developed by the University’s Institute of Foundation Models (IFM) in partnership with Inception (a G42 company) and Cerebras Systems, and was announced earlier this year. The release marks a significant milestone in the ongoing development of India’s AI ecosystem and its journey to equitable AI, with more than half-a-billion Hindi speakers now able to harness the potential of generative AI in their mother tongue.
Llama-3-Nanda-10B-Chat, or Nanda for short, is a 10-billion parameter model, which demonstrates better knowledge and reasoning capabilities in Hindi than any existing open Hindi and multilingual models of similar size by a sizable margin, based on extensive evaluation. It is also very competitive in English. The model was trained on the Condor Galaxy supercomputer, built by G42 and Cerebras Systems. Named after one of India’s highest peaks, Nanda is available on https://huggingface.co/MBZUAI/Llama-3-Nanda-10B-Chat
MBZUAI President and University Professor Eric Xing said: “An accurate and efficient LLM for the Hindi language is vital for India’s ambitions for inclusive and accessible AI. With the release of Nanda, we are reinforcing our commitment to open-source LLMs and to making new technology affordable, safe, ethical and standardizable. This is aligned with our mission as an academic institution to lead the generative AI development for public good and contributing to the UAE’s knowledge-led economy.”
“Nanda is an important advancement for generative AI for Hindi, which is one of the most widely spoken languages in the world,” said the project’s lead, Preslav Nakov, Department Chair and Professor of Natural Language Processing at MBZUAI.
“We are releasing Nanda as an open model, so people can download it from HuggingFace and run it locally. It is of reasonable size, and thus has modest hardware requirements.”
The project’s co-lead, Monojit Choudhury, Professor of Natural Language processing at MBZUAI, added: “The current state of LLMs in Hindi is not up to the mark. It’s nowhere close to English or several European languages. Building LLMs especially for a language like Hindi, spoken by hundreds of millions of people, to a reasonable level is important. India is one of the world’s largest economies; any LLM that can serve Hindi will benefit communities as it opens new commercial opportunities.”
The launch of Nanda builds on the success of Jais, the world’s leading Arabic LLM and joins MBZUAI’s zoo of advanced foundation models. Jais transformed Arabic Natural Language Processing (NLP), unlocking access to native-language generative AI capabilities for over 400 million Arabic speakers globally.
Link to paper with author names: https://github.com/mbzuai-nlp/Llama-3-Nanda-10B-Chat/blob/main/Llama-3-Nanda-10B-Chat-Paper.pdf
About Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)
MBZUAI is a graduate research university focused on artificial intelligence, computer science, and digital technologies across industrial sectors. The university aims to empower students, businesses, and governments to advance artificial intelligence as a global force for positive progress. MBZUAI offers various graduate programs designed to pursue advanced, specialized knowledge and skills in artificial intelligence, including computer science, computer vision, machine learning, natural language processing, and robotics. For more information, please visit www.mbzuai.ac.ae
To apply for admission, visit mbzuai.ac.ae or contact [email protected].