Thursday, October 30, 2025
Earth-News

Baichuan says its new API can greatly reduce the cost of customizing large language models

January 23, 2024

ChatGPT’s emergence not only renewed interest in artificial intelligence, but arguably sparked an unprecedented wave of advancements in AI technology. One such advancement takes the form of universal large language models (LLMs), which were previously difficult for the AI community to create. That problem is now passé. Instead, the next hurdle is determining how to implement such models effectively in practical applications.

Chinese AI firm Baichuan Intelligent Technology has seemingly made a significant leap in this regard. In October last year, Baichuan unveiled Baichuan2-192K, a large model whose context window can process around 350,000 Chinese characters. That’s roughly 14 times the context length of OpenAI’s GPT-4 and approximately 4.4 times that of Anthropic’s Claude 2, which has drawn plaudits for its excellence in processing long-form text.

On December 19, Baichuan launched the search-focused Baichuan2-Turbo series API, which includes Baichuan2-Turbo-192K and Baichuan2-Turbo.

Baichuan has also upgraded its official web-based models. Enterprise users can now upload text in various formats, such as PDFs, Word documents, and URLs, into the API to experience the enhanced capabilities of the Baichuan2 large model.

A large model “plug-in” to build knowledge bases instantly

Baichuan views large models as the computers of the new era, in some ways akin to central processors. The context window is likened to the computer’s memory, storing the text to be processed. The real-time information of the internet and the knowledge base of enterprises collectively form the equivalent of a computer’s hard drive.

The company’s newly introduced API enables large models to “attach” external knowledge bases, according to CEO Wang Xiaochuan.

While LLMs have become the infrastructural foundation of the AI era, the technical exploration of these models is still in its infancy. Despite the increase in model parameters, challenges persist—such as the hallucination problem and the issue of queries being “forgotten.” These limitations significantly impede the efficiency of large models.

However, the usability of large models can be augmented by combining them with search-related enhancements, Wang said. This enables even models with fewer parameters to handle much larger volumes of text in a single query, and at faster speeds.

To demonstrate the effectiveness of this approach, Baichuan tested the Baichuan2-Turbo-192K API using the classic “needle in a haystack” test:

1. Place a random fact or statement (the “needle”) in the middle of a context window (the “haystack”).
2. Ask the model to retrieve this fact or statement.
3. Iterate this process over various document depths (where the “needle” is placed) and context lengths to determine the model’s performance.
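
The procedure above can be sketched as a small test harness. This is only an illustration under stated assumptions, not Baichuan's test code: `build_haystack`, `run_needle_test`, and `toy_model` are hypothetical names, and `toy_model` is a stand-in that can only "see" the last 50 words of its input, where a real run would call an LLM API.

```python
from typing import Callable, Dict, List, Tuple


def build_haystack(filler: List[str], needle: str, depth: float, length: int) -> str:
    """Place `needle` at relative `depth` (0.0 = start, 1.0 = end) inside a
    haystack built from the first `length` filler sentences."""
    sentences = filler[:length]
    position = round(depth * len(sentences))
    return " ".join(sentences[:position] + [needle] + sentences[position:])


def run_needle_test(
    ask_model: Callable[[str, str], str],
    filler: List[str],
    needle: str,
    question: str,
    expected: str,
    depths: List[float],
    lengths: List[int],
) -> Dict[Tuple[int, float], bool]:
    """Return {(length, depth): found} where `found` says whether the
    model's answer contains the `expected` phrase."""
    results = {}
    for length in lengths:
        for depth in depths:
            context = build_haystack(filler, needle, depth, length)
            answer = ask_model(context, question)
            results[(length, depth)] = expected.lower() in answer.lower()
    return results


def toy_model(context: str, question: str) -> str:
    """Hypothetical stand-in for an LLM call: it only 'sees' the last
    50 words of the context and echoes them back."""
    return " ".join(context.split()[-50:])


# Assumed toy data; a real test would use long documents as filler.
filler = [f"Filler sentence number {i}." for i in range(100)]
needle = "The starting point is technology-product fit."
question = "What is the starting point for product managers?"
results = run_needle_test(
    toy_model, filler, needle, question,
    "technology-product fit", depths=[0.0, 0.5, 1.0], lengths=[5, 40],
)
for (length, depth), found in sorted(results.items()):
    print(f"length={length:>3} depth={depth:.1f} found={found}")
```

With this toy model, the needle is found at every depth in the short haystack but only at depth 1.0 in the long one, which is the kind of pattern the depth-by-length grid is designed to expose.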

Diagram illustrating the performance of Baichuan2-192K-Turbo in a pressure test for fact retrieval across various context lengths. This is commonly known as the “needle in a haystack test.” Graphic source: 36Kr. Header photo source: Baichuan via Weibo.

For requests that fall within the 192K-token limit, which is equivalent to about 350,000 Chinese characters, 100% answer accuracy can be achieved. With the latest search enhancement, Baichuan2 can handle a new maximum of 50 million tokens, roughly two orders of magnitude larger than before.

Baichuan also evaluated the effectiveness of pure vector retrieval as well as a combination of vector and sparse retrieval. The results indicate that the combined approach can achieve 95% answer accuracy. With a roughly 250-fold increase in the total volume of text that can be handled, the recall accuracy has also improved to 95%.
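
To make the idea of combining vector and sparse retrieval concrete, here is a minimal sketch. The article does not say how Baichuan fuses the two signals, so everything here is an assumption: the sparse score is a simple IDF-weighted keyword match, the "embedding" is a toy character-trigram count standing in for a neural encoder, and the two rankings are merged with reciprocal rank fusion, a common general-purpose choice.

```python
import math
from collections import Counter
from typing import List


def tokenize(text: str) -> List[str]:
    return text.lower().split()


def sparse_scores(query: str, docs: List[str]) -> List[float]:
    """Keyword (sparse) score: sum of IDF weights of query terms in each doc."""
    doc_tokens = [set(tokenize(d)) for d in docs]
    n = len(docs)
    scores = []
    for tokens in doc_tokens:
        score = 0.0
        for term in set(tokenize(query)):
            if term in tokens:
                df = sum(1 for t in doc_tokens if term in t)
                score += math.log(1 + n / df)
        scores.append(score)
    return scores


def embed(text: str) -> Counter:
    """Toy 'embedding' (assumption): character-trigram counts."""
    s = text.lower()
    return Counter(s[i:i + 3] for i in range(len(s) - 2))


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[k] * b[k] for k in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def dense_scores(query: str, docs: List[str]) -> List[float]:
    q = embed(query)
    return [cosine(q, embed(d)) for d in docs]


def ranks(scores: List[float]) -> List[int]:
    """Rank position (1 = best) of each document given its score."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    rank = [0] * len(scores)
    for pos, i in enumerate(order, start=1):
        rank[i] = pos
    return rank


def hybrid_search(query: str, docs: List[str], k: int = 60, top_n: int = 3) -> List[int]:
    """Fuse sparse and dense rankings with reciprocal rank fusion."""
    sr = ranks(sparse_scores(query, docs))
    dr = ranks(dense_scores(query, docs))
    fused = [1 / (k + s) + 1 / (k + d) for s, d in zip(sr, dr)]
    return sorted(range(len(docs)), key=lambda i: -fused[i])[:top_n]


docs = [
    "Quarterly revenue rose on strong loan growth.",
    "Wang Xiaochuan said product managers should start from technology-product fit.",
    "The bank opened three new branches this year.",
]
top = hybrid_search("What did Wang Xiaochuan say about product managers?", docs)
print(top[0])
```

Rank-based fusion is used here because it avoids having to put keyword scores and cosine similarities on a common scale; a production system might instead weight or re-rank the two lists differently.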

Diagram illustrating the performance of Baichuan2-192K-Turbo in a pressure test for fact retrieval across long context lengths. Graphic source: 36Kr.

Specifically, Baichuan conducted the test by using the following configuration (in Chinese):

Haystack: 80 long-form financial documents from a dataset used for the 2023 Bojin Large Model Challenge.
Needle: On December 16, 2023, during the GeekPark Innovation Conference 2024, Wang Xiaochuan shared new insights into large models. In his view, with the advent of the large model era, the starting point for product managers should shift from considering product-market fit (PMF) to considering technology-product fit (TPF).
Query: According to Wang Xiaochuan, what is the starting point for product managers in the era of large models?

This release marks a further improvement in the operational speed and accuracy of large models. Even with extensive context, Baichuan’s test results demonstrate that LLMs can now operate effectively with updated data, faster, more accurately, and at a significantly lower cost than building industry-specific models.

Customization does not equate to verticalization

In addition to the new API, Baichuan has introduced a search enhancement knowledge base. Its utility is straightforward: companies upload privately deployed data and information to the cloud, generating a customized system that integrates with Baichuan2 in a plug-and-play fashion.
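
As a rough illustration of what such a plug-and-play setup involves, the sketch below splits uploaded documents into chunks, selects the chunks most relevant to a question, and packs them into a prompt that fits a fixed context budget. It is not Baichuan's API: the function names, the keyword-overlap relevance measure, and the character budget are all assumptions made for the example.

```python
from typing import List


def chunk(text: str, size: int = 200) -> List[str]:
    """Split text into pieces of at most `size` characters."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def overlap(question: str, passage: str) -> int:
    """Count words in the passage that also appear in the question."""
    q = set(question.lower().split())
    return sum(1 for w in passage.lower().split() if w in q)


def build_prompt(question: str, documents: List[str], budget_chars: int = 500) -> str:
    """Pick the most relevant chunks that fit the budget and wrap them in a prompt."""
    pieces = [c for doc in documents for c in chunk(doc)]
    ranked = sorted(pieces, key=lambda c: overlap(question, c), reverse=True)
    selected, used = [], 0
    for piece in ranked:
        if used + len(piece) > budget_chars:
            break  # stop at the first chunk that no longer fits
        selected.append(piece)
        used += len(piece)
    context = "\n".join(selected)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )


# Assumed sample knowledge base entries.
documents = [
    "Refund requests are handled by the compliance desk within five days.",
    "Branch opening hours are nine to five on weekdays.",
]
prompt = build_prompt("who handles refund requests", documents, budget_chars=80)
print(prompt)
```

Because the knowledge base lives outside the model, updating it means editing `documents` rather than retraining anything, which is the cost argument the article makes for this approach.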

The current Baichuan2 can be deployed in various B2B scenarios, including customer service, knowledge Q&A, compliance risk control, and marketing and consulting in industries such as finance, government, legal, and education.

During the launch event, Baichuan presented a sample scenario in the finance industry. In this example, a bank’s knowledge base was cited to comprise 6 terabytes of data, with 12,905 documents. The presentation describes Baichuan2’s ability to efficiently retrieve information from this extensive base—by inputting a document with 360,000 words into the model through the API, precise answers can be obtained.

The method of combining LLMs with search enhancement technology provides a practical path for the future implementation of large models in various industries.

Enterprise knowledge bases are currently the mainstream use case for LLMs. Previously, building such bases required large models to be pre-trained, a process that typically required highly skilled AI professionals. Any updates to the underlying data would also require retraining or fine-tuning, which can be costly and affect controllability and stability.

Another challenge lies in vector retrieval, as the overall cost of using vector databases is relatively high. Their effectiveness depends on the scale of training data, with a noticeable drop in general capability for areas not covered by training data. The difference in length between user prompts and documents in the knowledge base also poses significant challenges to vector retrieval.

In this regard, Baichuan’s combination of LLMs with search enhancement technology has solved some technical challenges. It pioneered the self-critique large model technique using general retrieval-augmented generation (RAG) technology as a foundation, allowing LLMs to evaluate their own answers before outputting them to users, selecting answers with the highest quality in the process.
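
The article gives no implementation details for the self-critique step, but the general pattern it describes (generate several answers, score them, return the best) can be sketched as follows. The `generate` and `critique` callables are hypothetical placeholders for model calls; the toy versions here simply echo a retrieved passage and score it by word overlap with the question.

```python
from typing import Callable, List, Tuple


def words(text: str) -> set:
    """Lowercased words with surrounding punctuation stripped."""
    return {w.strip(".,?!").lower() for w in text.split()}


def answer_with_self_critique(
    question: str,
    passages: List[str],
    generate: Callable[[str, List[str], int], str],
    critique: Callable[[str, str], float],
    n_candidates: int = 3,
) -> Tuple[str, float]:
    """Generate several candidate answers, score each, and return the best."""
    candidates = [generate(question, passages, seed) for seed in range(n_candidates)]
    scored = [(critique(question, c), c) for c in candidates]
    best_score, best = max(scored, key=lambda pair: pair[0])
    return best, best_score


def toy_generate(question: str, passages: List[str], seed: int) -> str:
    """Stand-in for sampling an answer: echoes one retrieved passage."""
    return passages[seed % len(passages)]


def toy_critique(question: str, answer: str) -> float:
    """Stand-in for the model grading itself: share of question words covered."""
    q = words(question)
    return len(q & words(answer)) / len(q)


passages = [
    "The bank has 12,905 documents.",
    "Product managers should start from technology-product fit.",
    "The knowledge base holds 6 terabytes of data.",
]
best, score = answer_with_self_critique(
    "Where should product managers start?", passages, toy_generate, toy_critique
)
print(best, score)
```

In a real system both callables would be LLM calls, and the critique prompt would typically ask the model to check each candidate against the retrieved passages before scoring it.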

This approach could replace the majority of custom fine-tuning techniques currently adopted by enterprises, while addressing 99% of the customization needs of enterprise knowledge bases.

While Wang admitted that customization is unavoidable in the industrial implementation of LLMs, he said the delivery capability can be continuously improved through technical iterations.

With the latest release, Baichuan highlights its rapid advance toward commercial implementation. The company has also revealed that it has entered into partnerships with leading enterprises in various industries for further development, but did not go into detail on the specifics of these collaborations.

KrASIA Connection features translated and adapted content that was originally published by 36Kr. This article was written by Yong Yi for 36Kr.

Copyright for syndicated content belongs to the linked Source : KrAsia – https://kr-asia.com/baichuan-says-its-new-api-can-greatly-reduce-the-cost-of-customizing-large-language-models

© 2023 earth-news.info
