* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Wednesday, May 14, 2025
Earth-News
  • Home
  • Business
  • Entertainment
    HG Vora Files Definitive Proxy Materials and Sends Letter to PENN Entertainment, Inc. Shareholders – Business Wire

    HG Vora Takes Action: A Bold Move to Engage PENN Entertainment Shareholders

    Downtown Frederick Partnership announces Alive@Five season lineup – The Frederick News-Post

    Get Ready for Fun: Downtown Frederick’s Exciting Alive@Five Season Lineup Revealed!

    ‘American Idol’ Top 3 revealed as 2 contestants eliminated: Who advanced to the Season 23 finale? – Yahoo

    ‘American Idol’ Top 3 revealed as 2 contestants eliminated: Who advanced to the Season 23 finale? – Yahoo

    60,000 Fans Caused a Small Earthquake Because of One Famous Rock Song – Yahoo

    How 60,000 Fans Rocked the Ground with One Iconic Song!

    Dan Spilo Out at Industry Entertainment After Incident on Set of Alan Ritchson Movie (Exclusive) – The Hollywood Reporter

    Dan Spilo Exits Industry Entertainment Following Controversial Incident on Set of Alan Ritchson Film

    John Legend Says He’s Shocked by Ye’s ‘Descent’ Into ‘Antisemitism’ and ‘Anti-Blackness’ – Yahoo

    John Legend Expresses Shock Over Ye’s Troubling Descent into Antisemitism and Anti-Blackness

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    Bridger Photonics Appoints Ryan Sullivan as Chief Technology Officer to Accelerate New Era of Data Insights – Business Wire

    Bridger Photonics Welcomes Ryan Sullivan as CTO to Propel Data Insights into a New Era!

    Michigan Public Policy Survey suggests uncertainty among local officials on AI police surveillance technology – The Michigan Daily

    Local Officials Grapple with Uncertainty Over AI Surveillance Technology in Policing

    Trump Media & Technology Group: When Politics Gets A Ticker Symbol (NASDAQ:DJT) – Seeking Alpha

    Trump Media & Technology Group: When Politics Gets A Ticker Symbol (NASDAQ:DJT) – Seeking Alpha

    GenTech offers coding, AI lessons for elementary students – KTAR.com

    GenTech offers coding, AI lessons for elementary students – KTAR.com

    Arkansas Tech Univeristy-Ozark collision repair technology program re-accredited – Northwest Arkansas Democrat-Gazette

    Arkansas Tech University-Ozark’s Collision Repair Technology Program Earns Re-Accreditation!

    Top Chief Technology Officers to Watch in 2025: SMX’s Anthony Vultaggio – WashingtonExec

    Top Chief Technology Officers to Watch in 2025: SMX’s Anthony Vultaggio – WashingtonExec

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment
    HG Vora Files Definitive Proxy Materials and Sends Letter to PENN Entertainment, Inc. Shareholders – Business Wire

    HG Vora Takes Action: A Bold Move to Engage PENN Entertainment Shareholders

    Downtown Frederick Partnership announces Alive@Five season lineup – The Frederick News-Post

    Get Ready for Fun: Downtown Frederick’s Exciting Alive@Five Season Lineup Revealed!

    ‘American Idol’ Top 3 revealed as 2 contestants eliminated: Who advanced to the Season 23 finale? – Yahoo

    ‘American Idol’ Top 3 revealed as 2 contestants eliminated: Who advanced to the Season 23 finale? – Yahoo

    60,000 Fans Caused a Small Earthquake Because of One Famous Rock Song – Yahoo

    How 60,000 Fans Rocked the Ground with One Iconic Song!

    Dan Spilo Out at Industry Entertainment After Incident on Set of Alan Ritchson Movie (Exclusive) – The Hollywood Reporter

    Dan Spilo Exits Industry Entertainment Following Controversial Incident on Set of Alan Ritchson Film

    John Legend Says He’s Shocked by Ye’s ‘Descent’ Into ‘Antisemitism’ and ‘Anti-Blackness’ – Yahoo

    John Legend Expresses Shock Over Ye’s Troubling Descent into Antisemitism and Anti-Blackness

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    Bridger Photonics Appoints Ryan Sullivan as Chief Technology Officer to Accelerate New Era of Data Insights – Business Wire

    Bridger Photonics Welcomes Ryan Sullivan as CTO to Propel Data Insights into a New Era!

    Michigan Public Policy Survey suggests uncertainty among local officials on AI police surveillance technology – The Michigan Daily

    Local Officials Grapple with Uncertainty Over AI Surveillance Technology in Policing

    Trump Media & Technology Group: When Politics Gets A Ticker Symbol (NASDAQ:DJT) – Seeking Alpha

    Trump Media & Technology Group: When Politics Gets A Ticker Symbol (NASDAQ:DJT) – Seeking Alpha

    GenTech offers coding, AI lessons for elementary students – KTAR.com

    GenTech offers coding, AI lessons for elementary students – KTAR.com

    Arkansas Tech Univeristy-Ozark collision repair technology program re-accredited – Northwest Arkansas Democrat-Gazette

    Arkansas Tech University-Ozark’s Collision Repair Technology Program Earns Re-Accreditation!

    Top Chief Technology Officers to Watch in 2025: SMX’s Anthony Vultaggio – WashingtonExec

    Top Chief Technology Officers to Watch in 2025: SMX’s Anthony Vultaggio – WashingtonExec

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Technology

The world’s most advanced Arabic LLM is now available on open source

November 1, 2023
in Technology
The world’s most advanced Arabic LLM is now available on open source
Share on FacebookShare on Twitter

ShpilbergStudios – stock.adobe.c

On 30 August 2023, Abu Dhabi-based Inception, a subsidiary of G42, announced the release of an Arabic large language model to open source

Pat Brans

By

Pat Brans,
Pat Brans Associates/Grenoble Ecole de Management

Published: 31 Oct 2023

Inception, an Abu Dhabi-based subsidiary of G42, has released an Arabic large language model (LLM) to open source. The new model, called Jais, uses 13 billion parameters, which is a measure of its sophistication and degree of precision. Parameters can be thought of as coefficients to a series of algebraic equations. 

During the learning phase, the values of the parameters are derived from the training data and saved as part of the neural network, which is then used for the inference phase. The inference phase is when the model is deployed – taking questions and commands from users and producing answers. 

On a worldwide scale, Jais is a respectably large model, fitting between GPT-2, which has 1.5 billion parameters, and GPT-3, which has 175 billion. GPT-4 is far ahead of the rest, with 1.7 trillion parameters.  

How Jais was developed 

Named after UAE’s highest mountain Jebel Jais, the LLM was developed by Cerebras Systems, Inception, and Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) – the world’s first graduate research university dedicated to artificial intelligence (AI). Jais was trained on Condor Galaxy, the multi-exaFLOP AI supercomputer recently announced by G42 and Cerebras. 

One of the challenges in training an LLM is getting enough text for input. That’s relatively easy for English, by far the most prevalent language on the internet. According to statista, as of January 2023, 58.8% of web content was in English, with Russian running a distant second at 5.3%. Arabic language text accounts for only 0.9% of the content on the worldwide web. 

“Once we began lifting our heads up beyond English, we saw that not having enough data is also a problem for other languages,” says Andrew Feldman, CEO and co-founder of Cerebras Systems. “Even when the number of speakers of a language is very large, the amount of text on the internet may be small. This is true for Spanish, for example. There is a continent of Spanish speakers, but the amount of text on the internet is relatively small. 

“It’s also true for Hindi and Mandarin, each with hundreds of millions of speakers. Even though the Chinese government spent a huge amount of time and money to remedy this problem, there still isn’t necessarily enough Mandarin text to feed a data-hungry AI algorithm.”

“There are other challenges with Arabic. The text that is available is often a poor translation from English or it may be too formal. In Arabic, some of the writing on the internet is religious writings or poetry, which is important, but not particularly useful if you want to build a chatbot. You have to find modern versions of the language in a conversational style.” 

To bridge the gap, a 398 billion-word Arabic and English dataset was developed specifically to train Jais and other AI models. Some aspects of an LLM can be trained using data from other languages – in this case, English. For example, the model can learn to summarise by examining content and summaries of that same content, independently of the language. 

Another challenge with Arabic is the number of dialects. “No two people in the Arab world outside of the media speak to each other in formal Arabic,” says Andrew Jackson, CEO of Inception. “They use one of the dialects. We have been gathering as many conversational datasets as possible and using them to introduce the tokens to our model. Once you have a broad set of different dialects, you tweak the model on the output side so it can decide that when this chat bot is used in Lebanon, the response is given in the Lebanese dialect.” 

The significance of Jais to the Arabic speaking people

“At G42, we’ve always had bold ambitions and the drive to pursue them,” says Jackson. “We’re trying to contribute as much as possible to the global development of AI by providing meaningful input.

“We’re very firm believers that within the next decade, AGI [artificial general intelligence] will become real, and we want to contribute to that and make sure it’s done in a safe way. We want to make sure AI works for the industries that are important to the region, including the government, healthcare, energy, and financial sectors.”

The new LLM responds to one of the important needs in the region, which is sovereign control. Nobody wants to depend on outside help for such a critical technology as AI. Jais encourages a fully in-house approach, where developers download the model and integrate it into their applications. 

This inherent sovereignty reduces dependency on external resources, allowing organisations across the Middle East to run the model within their own infrastructures, maintaining complete control over usage and fine-tuning the model for their own purposes. 

Jais gives the more than 400 million Arabic-speaking people in the world more direct access to the powers of AI, and the LLM is a step forward for Abu Dhabi in its ambitions to become a world-leading hub for AI. 

Inception chose to release Jais as open source to promote the budding ecosystem around Arabic language AI and to specifically target the scientific, academic, and developer communities. The company also hopes to serve as an example for native speakers of other languages that are currently underrepresented in mainstream AI. 

Several organisations have already began using Jais. This includes the UAE Ministry of Foreign Affairs, the UAE Ministry of Industry and Advanced Technology, the Department of Health – Abu Dhabi, the Abu Dhabi National Oil Company (ADNOC), Etihad Airways, and e&. Independent software developers have also taken an interest. Within a day of its release, Jais had already been downloaded from Hugging Face thousands of times.  

“This is not the be all end all for us,” says Jackson. “We want to fine tune our foundational model for proprietary data sets so companies in different industries can take use it for their specific needs.”

Read more on Artificial intelligence, automation and robotics


UAE makes significant contribution to AI computing power

PatBrans

By: Pat Brans


UAE improves healthcare through information technology

PatBrans

By: Pat Brans


Top 10 Middle East IT stories of 2022

KarlFlinders

By: Karl Flinders


Dubai’s fledgling drone programme gets another nudge

PatBrans

By: Pat Brans

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : Computer Weekly – https://www.computerweekly.com/feature/The-worlds-most-advanced-Arabic-LLM-is-now-available-on-open-source

Tags: advancedtechnologyWorld's
Previous Post

British Library falls victim to cyber attack

Next Post

The Google Pixel 7a is cheaper than ever at $374 in this early Black Friday deal

Center for Ecology-Based Economy to host climate solution event – Lewiston Sun Journal

Join Us for an Inspiring Climate Solutions Event!

May 14, 2025
Executive order jeopardizes School of Information and Library Science research funding – – The Daily Tar Heel

Executive order jeopardizes School of Information and Library Science research funding – – The Daily Tar Heel

May 14, 2025
What’s hiding under Antarctica’s ice? – Live Science

What’s hiding under Antarctica’s ice? – Live Science

May 14, 2025
“Stand Up Paddleboard” Demonstration and Kayaks Available – swiowanewssource.com

Experience the Thrill: Join Us for a Stand Up Paddleboard and Kayak Adventure!

May 14, 2025
China, Brazil agree to defend multipolar world order amid Trump tariff turmoil – South China Morning Post

China and Brazil Unite to Champion a Multipolar World Amid Trump’s Tariff Turmoil

May 14, 2025
Trump tariffs have little impact on prices so far, defying grim forecasts – Politico

Trump Tariffs: Surprisingly Minimal Impact on Prices Defies Expectations

May 14, 2025
HG Vora Files Definitive Proxy Materials and Sends Letter to PENN Entertainment, Inc. Shareholders – Business Wire

HG Vora Takes Action: A Bold Move to Engage PENN Entertainment Shareholders

May 14, 2025
Summit County health department braces for federal cuts, amount uncertain – KPCW

Summit County health department braces for federal cuts, amount uncertain – KPCW

May 14, 2025
Trump’s Middle East trip: President plans to lift Syria sanctions as he touts Saudi Arabia deals – CNN

Trump’s Middle East trip: President plans to lift Syria sanctions as he touts Saudi Arabia deals – CNN

May 13, 2025
Bridger Photonics Appoints Ryan Sullivan as Chief Technology Officer to Accelerate New Era of Data Insights – Business Wire

Bridger Photonics Welcomes Ryan Sullivan as CTO to Propel Data Insights into a New Era!

May 13, 2025

Categories

Archives

May 2025
MTWTFSS
 1234
567891011
12131415161718
19202122232425
262728293031 
« Apr    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (607)
  • Economy (618)
  • Entertainment (21,531)
  • General (15,214)
  • Health (9,661)
  • Lifestyle (624)
  • News (22,149)
  • People (621)
  • Politics (625)
  • Science (15,841)
  • Sports (21,128)
  • Technology (15,609)
  • World (609)

Recent News

Center for Ecology-Based Economy to host climate solution event – Lewiston Sun Journal

Join Us for an Inspiring Climate Solutions Event!

May 14, 2025
Executive order jeopardizes School of Information and Library Science research funding – – The Daily Tar Heel

Executive order jeopardizes School of Information and Library Science research funding – – The Daily Tar Heel

May 14, 2025
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version