Sunday, December 21, 2025
Earth-News

Move over Gemini, open-source AI has video tricks of its own

March 4, 2024
in Technology

Google Gemini website on a laptop reads, “Welcome to the Gemini era.” (Image: Maria Diaz/ZDNET)

Google dazzled the world with its demo this month of its most cutting-edge generative artificial intelligence (AI) model, Gemini 1.5, a follow-up to the first Gemini model, which was released last December. Among other feats, Gemini 1.5 excels at things such as the “needle-in-a-haystack” challenge, where the model must identify a frame of video matching a text description. 

However, Google’s model, like most AI models from the biggest commercial entities, comes with little technical detail about how the software works. The 58-page technical report that Google released about Gemini 1.5 contains only general descriptions of the model and the approach used, without detailing the architecture of which Gemini 1.5 is composed. And, of course, the code is not available.

In that sense, Gemini 1.5 continues a recent trend from Google and OpenAI and other commercial enterprises — obfuscating the technical details of AI. 

That kind of secrecy presents an opportunity for open-source software that can match some of Gemini’s abilities while opening up access to its code. 

In work published this month by Hao Liu, Wilson Yan, Matei Zaharia, and Pieter Abbeel of the University of California at Berkeley, and described on the project’s GitHub site, the scientists adapt Meta’s open-source Llama 2 large language model to create a multi-modal model that, like Gemini 1.5, can process not just text but also video and imagery, although, unlike Gemini 1.5, not audio.

Using the mainstream version of Llama 2, a not particularly large 7-billion-parameter neural net, the authors were able to handle input of up to one million “tokens”, the units of text, image, or video data fed into the model. That is a dramatic increase from the roughly 32,000 tokens handled by Gemini 1.0 and the 128,000 handled by OpenAI’s GPT-4 Turbo.

Their creation, known as Large World Model (LWM), performs tasks similarly to Gemini 1.5. It can solve a needle-in-a-haystack type of problem, such as answering the request, “What color jacket was the girl on the trampoline wearing?”, when fed a one-hour YouTube video:

U.C. Berkeley’s Large World Model can answer a “needle-in-a-haystack” question about a particular moment in video better than Google’s Gemini 1.0 or OpenAI’s GPT-4 Turbo. (Image: UC Berkeley)

Liu and team haven’t yet shown how their results compare to Gemini 1.5. Instead, the team shows comparisons with GPT-4 and Gemini 1.0.

As shown in the illustration above, LWM answers the needle-in-haystack question correctly, while the other two fail.

LWM can hold chats about what’s going on in a video clip and discuss the contents of images at length, a process the researchers call “image chat” (see both examples, below). LWM can also generate images and videos when supplied with text descriptions in the prompt.

LWM video chat and image chat examples. (Images: UC Berkeley)

Strikingly, it appears possible that Liu and team were able to achieve results equivalent to Gemini 1.0 with less computing power. The LWM was trained on one slice of a TPU Version 4 “POD”, consisting of 256 TPU chips, with two cores apiece, for 58 hours. In the case of Gemini 1.0, the technical report, just like the technical report for 1.5, contains few technical details about the infrastructure for training. All we know is that Google used some amount of TPU Version 4 and Version 5 PODs for a certain amount of time. It is quite possible they used a much larger amount of computing than Liu and team did for training LWM.  
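To put the LWM training budget in concrete terms, the figures quoted above can be multiplied out. This is only a back-of-the-envelope sketch using the three numbers in the text (256 chips, two cores apiece, 58 hours); the variable names are chosen for illustration, and no comparable figure for Gemini can be computed because Google has not published one.

```python
# Figures quoted in the article for LWM training on a TPU v4 pod slice.
tpu_chips = 256
cores_per_chip = 2
training_hours = 58

# Total budget expressed as chip-hours and core-hours.
chip_hours = tpu_chips * training_hours
core_hours = chip_hours * cores_per_chip

print(f"chip-hours: {chip_hours:,}")   # 14,848
print(f"core-hours: {core_hours:,}")   # 29,696
```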

So, how is LWM — which is based only on a relatively small, open-source model, running on less computing power — able to achieve similar results to Gemini 1.0? Well, LWM is the product of a different kind of approach to the problem of how to develop a neural network. 

Both models start from a similar kind of neural net, a Transformer. Google added “innovations in training algorithms, dataset, and infrastructure” to the Transformer.

In the case of LWM, Liu and team trained the model in multiple successive rounds, with increasingly large “context windows”, the amount of data the model works on at each pass. The team started with a context window of 32,768 tokens, which you can think of as individual pieces of data. They then worked up to one million tokens.

Training at those lengths relies on “Ring Attention”, a technique developed last year by Liu and team. The insight in Ring Attention is that a long sequence can be split into blocks spread across many chips arranged in a ring, so that the blocks are processed concurrently, rather than sequentially, while intermediate data is passed from chip to chip. Parallelizing the work this way means getting more done in less time, and utilizing the chips more efficiently.
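The idea can be illustrated with a small single-process simulation. The sketch below is not the authors' code: it assumes plain (non-causal) softmax attention, simulates the "devices" with ordinary loops, and uses hypothetical function names. Each simulated device keeps one block of queries, sees one block of keys and values per step as the blocks rotate around the ring, and keeps a running softmax so that the final result matches attention computed over the whole sequence at once.

```python
import math
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def full_attention(q, k, v):
    """Reference: softmax(q k^T / sqrt(d)) v over the whole sequence."""
    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for qi in q:
        scores = [dot(qi, kj) * scale for kj in k]
        top = max(scores)
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        out.append([sum(w * vj[c] for w, vj in zip(weights, v)) / total
                    for c in range(len(v[0]))])
    return out

def ring_attention(q, k, v, n_devices):
    """Simulated ring: device `dev` owns one query block and, at each
    step, processes the key/value block that has rotated to it."""
    n, dv = len(q), len(v[0])
    if n % n_devices != 0:
        raise ValueError("sequence length must divide evenly across devices")
    block = n // n_devices
    scale = 1.0 / math.sqrt(len(q[0]))
    kv_blocks = [(k[i * block:(i + 1) * block], v[i * block:(i + 1) * block])
                 for i in range(n_devices)]
    # Running softmax state per query: max score, normalizer, weighted sum.
    run_max = [-math.inf] * n
    run_sum = [0.0] * n
    run_out = [[0.0] * dv for _ in range(n)]
    for step in range(n_devices):
        for dev in range(n_devices):  # on real hardware these run in parallel
            k_blk, v_blk = kv_blocks[(dev + step) % n_devices]
            for i in range(dev * block, (dev + 1) * block):
                scores = [dot(q[i], kj) * scale for kj in k_blk]
                new_max = max(run_max[i], max(scores))
                rescale = math.exp(run_max[i] - new_max)  # 0.0 on first step
                weights = [math.exp(s - new_max) for s in scores]
                run_sum[i] = run_sum[i] * rescale + sum(weights)
                run_out[i] = [o * rescale
                              + sum(w * vj[c] for w, vj in zip(weights, v_blk))
                              for c, o in enumerate(run_out[i])]
                run_max[i] = new_max
    return [[o / run_sum[i] for o in run_out[i]] for i in range(n)]

random.seed(0)
n_tokens, dim = 8, 4
q = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_tokens)]
k = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_tokens)]
v = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_tokens)]

reference = full_attention(q, k, v)
ringed = ring_attention(q, k, v, n_devices=4)
max_diff = max(abs(a - b) for ra, rb in zip(reference, ringed)
               for a, b in zip(ra, rb))
print(f"max difference vs. full attention: {max_diff:.2e}")
```

No simulated device ever holds more than one key/value block at a time, which is why, in principle, the sequence length that fits grows with the number of devices.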

The architecture of LWM. (Image: UC Berkeley)

“We adopt a training approach […] where our model is trained on progressively longer sequence lengths, starting from 32K tokens and ending at 1M tokens in increasing powers of two,” write Liu and team.

“Intuitively, this allows the model to save compute by first learning shorter-range dependencies before moving onto longer sequences. By doing this, we are able to train on orders of magnitude more tokens compared to directly training on the maximum target sequence length.”
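The schedule the authors describe can be sketched in a few lines. The following is a minimal illustration, assuming the sequence length doubles at every stage and that "32K" and "1M" mean 2^15 and 2^20 tokens; the quote says only that lengths grow in increasing powers of two, so the actual schedule may skip some stages, and the function name is hypothetical.

```python
def doubling_schedule(start_tokens, end_tokens):
    """Return sequence lengths from start to end, doubling each stage."""
    if start_tokens <= 0 or end_tokens < start_tokens:
        raise ValueError("need 0 < start_tokens <= end_tokens")
    lengths = []
    n = start_tokens
    while n <= end_tokens:
        lengths.append(n)
        n *= 2
    return lengths

# Assumption: 32K = 32 * 1024 tokens and 1M = 1024 * 1024 tokens.
schedule = doubling_schedule(32 * 1024, 1024 * 1024)
for stage, n in enumerate(schedule, start=1):
    print(f"stage {stage}: {n:,} tokens (2^{n.bit_length() - 1})")
```

Under these assumptions the schedule has six stages, from 32,768 up to 1,048,576 tokens, with each stage starting from the model produced by the previous, shorter one.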

LWM is trained on sequences of data of increasing length. (Image: UC Berkeley)

The data used to train LWM includes some of the most prominent data sets that have been put into the wild, including Books3, which is at the heart of controversy over copyright infringement. The researchers also used Video Instruct-100K, a “video conversation dataset” hosted on GitHub. 

Google didn’t disclose Gemini 1.0’s training data, describing it only in general terms: “Gemini models are trained on a dataset that is both multimodal and multilingual. Our pretraining dataset uses data from web documents, books, and code, and includes image, audio, and video data.”

While Google has already moved forward with Gemini 1.5, which can handle as many as 10 million tokens in its input, Liu and team believe Ring Attention can “theoretically extend to an infinite context, bounded only by the number of devices available.”

They continue: “We believe that our released model will provide a foundation for future work on developing longer context models, as well as encourage more challenging benchmarks that contain difficult long-range tasks that require higher levels of synthesis, rather than pure fact retrieval.”

The code of LWM is posted on the research team’s GitHub site.

Copyright for syndicated content belongs to the linked Source : ZDNet – https://www.zdnet.com/article/move-over-gemini-open-source-ai-has-video-tricks-of-its-own/#ftag=RSSbaffb68


© 2023 earth-news.info
