* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Saturday, July 12, 2025
Earth-News
  • Home
  • Business
  • Entertainment
    Immersive sports and entertainment venue Cosm set to build its 5th location in Cleveland – WKYC

    Cosm Reveals Exciting Vision for Its 5th Immersive Sports and Entertainment Venue in Cleveland

    Monumental Sports & Entertainment’s Samantha Brady on the Power of the RSN’s Direct-to-Consumer Streaming Service Monumental+ – Sports Video Group

    Samantha Brady Reveals How Monumental+ is Transforming Sports Streaming with Direct-to-Consumer Access

    Moses Singer Welcomes Entertainment and Intellectual Property Partner Frederick Bimbler – Yahoo Finance

    Moses Singer Expands Team with New Entertainment and Intellectual Property Partner Frederick Bimbler

    Longhua District and Max-Matching Entertainments, supported by RWS Global forge strategic partnership to develop international IP-themed entertainment complex – Amusement Today

    Longhua District and Max-Matching Entertainments, supported by RWS Global forge strategic partnership to develop international IP-themed entertainment complex – Amusement Today

    Government whip to withdraw Entertainment Complex Bill on July 9 – Nation Thailand

    Government whip to withdraw Entertainment Complex Bill on July 9 – Nation Thailand

    Magicians and Battlebots light up Las Vegas entertainment scene – KSNV

    Magicians and Battlebots Take Las Vegas Entertainment by Storm

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    Stallion Uranium Provides Update on Technology Data Acquisition Agreement – GlobeNewswire

    Stallion Uranium Announces Exciting Progress in Technology Data Acquisition Agreement

    2025 WE Local Prague Recap: Inspiring Women in Engineering and Technology – Society of Women Engineers

    2025 WE Local Prague Recap: Inspiring Women in Engineering and Technology – Society of Women Engineers

    SMPTE Opens Early Bird Registration for Media Technology Summit – TVTechnology

    SMPTE Launches Early Bird Registration for Exciting Media Technology Summit

    Google Fiber puts Nokia network slicing technology to the test – Fierce Network

    Google Fiber Puts Nokia’s Network Slicing Technology to the Ultimate Test

    Kaseya Extends Community Investment with Addition of Technology Marketing Toolkit – Kaseya

    Kaseya Extends Community Investment with Addition of Technology Marketing Toolkit – Kaseya

    AI and the Trust Revolution: How Technology Is Transforming Human Connections – Foreign Affairs

    AI and the Trust Revolution: How Technology Is Transforming Human Connections – Foreign Affairs

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment
    Immersive sports and entertainment venue Cosm set to build its 5th location in Cleveland – WKYC

    Cosm Reveals Exciting Vision for Its 5th Immersive Sports and Entertainment Venue in Cleveland

    Monumental Sports & Entertainment’s Samantha Brady on the Power of the RSN’s Direct-to-Consumer Streaming Service Monumental+ – Sports Video Group

    Samantha Brady Reveals How Monumental+ is Transforming Sports Streaming with Direct-to-Consumer Access

    Moses Singer Welcomes Entertainment and Intellectual Property Partner Frederick Bimbler – Yahoo Finance

    Moses Singer Expands Team with New Entertainment and Intellectual Property Partner Frederick Bimbler

    Longhua District and Max-Matching Entertainments, supported by RWS Global forge strategic partnership to develop international IP-themed entertainment complex – Amusement Today

    Longhua District and Max-Matching Entertainments, supported by RWS Global forge strategic partnership to develop international IP-themed entertainment complex – Amusement Today

    Government whip to withdraw Entertainment Complex Bill on July 9 – Nation Thailand

    Government whip to withdraw Entertainment Complex Bill on July 9 – Nation Thailand

    Magicians and Battlebots light up Las Vegas entertainment scene – KSNV

    Magicians and Battlebots Take Las Vegas Entertainment by Storm

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    Stallion Uranium Provides Update on Technology Data Acquisition Agreement – GlobeNewswire

    Stallion Uranium Announces Exciting Progress in Technology Data Acquisition Agreement

    2025 WE Local Prague Recap: Inspiring Women in Engineering and Technology – Society of Women Engineers

    2025 WE Local Prague Recap: Inspiring Women in Engineering and Technology – Society of Women Engineers

    SMPTE Opens Early Bird Registration for Media Technology Summit – TVTechnology

    SMPTE Launches Early Bird Registration for Exciting Media Technology Summit

    Google Fiber puts Nokia network slicing technology to the test – Fierce Network

    Google Fiber Puts Nokia’s Network Slicing Technology to the Ultimate Test

    Kaseya Extends Community Investment with Addition of Technology Marketing Toolkit – Kaseya

    Kaseya Extends Community Investment with Addition of Technology Marketing Toolkit – Kaseya

    AI and the Trust Revolution: How Technology Is Transforming Human Connections – Foreign Affairs

    AI and the Trust Revolution: How Technology Is Transforming Human Connections – Foreign Affairs

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Technology

The world’s most advanced Arabic LLM is now available on open source

November 1, 2023
in Technology
The world’s most advanced Arabic LLM is now available on open source
Share on FacebookShare on Twitter

ShpilbergStudios – stock.adobe.c

On 30 August 2023, Abu Dhabi-based Inception, a subsidiary of G42, announced the release of an Arabic large language model to open source

Pat Brans

By

Pat Brans,
Pat Brans Associates/Grenoble Ecole de Management

Published: 31 Oct 2023

Inception, an Abu Dhabi-based subsidiary of G42, has released an Arabic large language model (LLM) to open source. The new model, called Jais, uses 13 billion parameters, which is a measure of its sophistication and degree of precision. Parameters can be thought of as coefficients to a series of algebraic equations. 

During the learning phase, the values of the parameters are derived from the training data and saved as part of the neural network, which is then used for the inference phase. The inference phase is when the model is deployed – taking questions and commands from users and producing answers. 

On a worldwide scale, Jais is a respectably large model, fitting between GPT-2, which has 1.5 billion parameters, and GPT-3, which has 175 billion. GPT-4 is far ahead of the rest, with 1.7 trillion parameters.  

How Jais was developed 

Named after UAE’s highest mountain Jebel Jais, the LLM was developed by Cerebras Systems, Inception, and Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) – the world’s first graduate research university dedicated to artificial intelligence (AI). Jais was trained on Condor Galaxy, the multi-exaFLOP AI supercomputer recently announced by G42 and Cerebras. 

One of the challenges in training an LLM is getting enough text for input. That’s relatively easy for English, by far the most prevalent language on the internet. According to statista, as of January 2023, 58.8% of web content was in English, with Russian running a distant second at 5.3%. Arabic language text accounts for only 0.9% of the content on the worldwide web. 

“Once we began lifting our heads up beyond English, we saw that not having enough data is also a problem for other languages,” says Andrew Feldman, CEO and co-founder of Cerebras Systems. “Even when the number of speakers of a language is very large, the amount of text on the internet may be small. This is true for Spanish, for example. There is a continent of Spanish speakers, but the amount of text on the internet is relatively small. 

“It’s also true for Hindi and Mandarin, each with hundreds of millions of speakers. Even though the Chinese government spent a huge amount of time and money to remedy this problem, there still isn’t necessarily enough Mandarin text to feed a data-hungry AI algorithm.”

“There are other challenges with Arabic. The text that is available is often a poor translation from English or it may be too formal. In Arabic, some of the writing on the internet is religious writings or poetry, which is important, but not particularly useful if you want to build a chatbot. You have to find modern versions of the language in a conversational style.” 

To bridge the gap, a 398 billion-word Arabic and English dataset was developed specifically to train Jais and other AI models. Some aspects of an LLM can be trained using data from other languages – in this case, English. For example, the model can learn to summarise by examining content and summaries of that same content, independently of the language. 

Another challenge with Arabic is the number of dialects. “No two people in the Arab world outside of the media speak to each other in formal Arabic,” says Andrew Jackson, CEO of Inception. “They use one of the dialects. We have been gathering as many conversational datasets as possible and using them to introduce the tokens to our model. Once you have a broad set of different dialects, you tweak the model on the output side so it can decide that when this chat bot is used in Lebanon, the response is given in the Lebanese dialect.” 

The significance of Jais to the Arabic speaking people

“At G42, we’ve always had bold ambitions and the drive to pursue them,” says Jackson. “We’re trying to contribute as much as possible to the global development of AI by providing meaningful input.

“We’re very firm believers that within the next decade, AGI [artificial general intelligence] will become real, and we want to contribute to that and make sure it’s done in a safe way. We want to make sure AI works for the industries that are important to the region, including the government, healthcare, energy, and financial sectors.”

The new LLM responds to one of the important needs in the region, which is sovereign control. Nobody wants to depend on outside help for such a critical technology as AI. Jais encourages a fully in-house approach, where developers download the model and integrate it into their applications. 

This inherent sovereignty reduces dependency on external resources, allowing organisations across the Middle East to run the model within their own infrastructures, maintaining complete control over usage and fine-tuning the model for their own purposes. 

Jais gives the more than 400 million Arabic-speaking people in the world more direct access to the powers of AI, and the LLM is a step forward for Abu Dhabi in its ambitions to become a world-leading hub for AI. 

Inception chose to release Jais as open source to promote the budding ecosystem around Arabic language AI and to specifically target the scientific, academic, and developer communities. The company also hopes to serve as an example for native speakers of other languages that are currently underrepresented in mainstream AI. 

Several organisations have already began using Jais. This includes the UAE Ministry of Foreign Affairs, the UAE Ministry of Industry and Advanced Technology, the Department of Health – Abu Dhabi, the Abu Dhabi National Oil Company (ADNOC), Etihad Airways, and e&. Independent software developers have also taken an interest. Within a day of its release, Jais had already been downloaded from Hugging Face thousands of times.  

“This is not the be all end all for us,” says Jackson. “We want to fine tune our foundational model for proprietary data sets so companies in different industries can take use it for their specific needs.”

Read more on Artificial intelligence, automation and robotics


UAE makes significant contribution to AI computing power

PatBrans

By: Pat Brans


UAE improves healthcare through information technology

PatBrans

By: Pat Brans


Top 10 Middle East IT stories of 2022

KarlFlinders

By: Karl Flinders


Dubai’s fledgling drone programme gets another nudge

PatBrans

By: Pat Brans

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : Computer Weekly – https://www.computerweekly.com/feature/The-worlds-most-advanced-Arabic-LLM-is-now-available-on-open-source

Tags: advancedtechnologyWorld's
Previous Post

British Library falls victim to cyber attack

Next Post

The Google Pixel 7a is cheaper than ever at $374 in this early Black Friday deal

State Department is firing more than 1,300 staff on Friday – CNN

Over 1,300 State Department Employees to Be Laid Off This Friday

July 12, 2025
Southland Conference and Spiideo Partner to Bring Cloud-Based Replay and Video Technology to Seven Sports – Sports Video Group

Southland Conference Partners with Spiideo to Transform Replay and Video Technology Across Seven Sports

July 11, 2025
Mariners’ Julio Rodriguez Breaks Silence After Unexpected Decision – Yahoo Sports

Mariners’ Julio Rodriguez Breaks Silence After Unexpected Decision – Yahoo Sports

July 11, 2025
Rice Museum: Architecture Rooted in Rural Memory and Ecology – ArchDaily

Rice Museum: Architecture Rooted in Rural Memory and Ecology – ArchDaily

July 11, 2025
Japan Shifts Space Policy from Science to Security – JAPAN Forward

Japan Shifts Space Policy from Science to Security – JAPAN Forward

July 11, 2025
Scientists Develop Glowing Tool To Reveal Cancer Cells – Newsweek

Scientists Develop Glowing Tool To Reveal Cancer Cells – Newsweek

July 11, 2025
The Real Lifestyle And WILDGO Partner To Transform Tokenized Real Estate – BlockchainReporter

The Real Lifestyle And WILDGO Partner To Transform Tokenized Real Estate – BlockchainReporter

July 11, 2025
A Lean World Health Organization for the Global Good – Center for Global Development

Transforming the World Health Organization for Greater Global Impact

July 11, 2025
I upgraded to premium economy for a 13-hour flight on a budget airline. It lacked some perks, but it was still worth the price. – Business Insider

I Upgraded to Premium Economy on a 13-Hour Budget Airline Flight – Here’s What It Was Really Like

July 11, 2025

Givēon’s Soul-Stirring Old-School R&B Heartbreak

July 11, 2025

Categories

Archives

July 2025
MTWTFSS
 123456
78910111213
14151617181920
21222324252627
28293031 
« Jun    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (716)
  • Economy (739)
  • Entertainment (21,626)
  • General (15,839)
  • Health (9,776)
  • Lifestyle (746)
  • News (22,149)
  • People (741)
  • Politics (750)
  • Science (15,957)
  • Sports (21,238)
  • Technology (15,724)
  • World (722)

Recent News

State Department is firing more than 1,300 staff on Friday – CNN

Over 1,300 State Department Employees to Be Laid Off This Friday

July 12, 2025
Southland Conference and Spiideo Partner to Bring Cloud-Based Replay and Video Technology to Seven Sports – Sports Video Group

Southland Conference Partners with Spiideo to Transform Replay and Video Technology Across Seven Sports

July 11, 2025
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version