Earth-News
Baichuan says its new API can greatly reduce the cost of customizing large language models

January 23, 2024

ChatGPT’s emergence not only renewed interest in artificial intelligence, but arguably sparked an unprecedented wave of advancements in AI technology. One such advancement takes the form of universal large language models (LLMs), which were previously difficult for the AI community to create. That problem is now passé. Instead, the next hurdle to overcome is determining how to effectively implement such models in practical applications.

Chinese AI firm Baichuan Intelligent Technology has seemingly made a significant leap in this regard. In October 2023, Baichuan unveiled Baichuan2-192K, a large model with a context window capable of processing around 350,000 Chinese characters. That’s roughly 14 times the context length of OpenAI’s GPT-4 and approximately 4.4 times that of Anthropic’s Claude 2, which has drawn plaudits for its excellence in processing long-form text.

On December 19, 2023, Baichuan launched the search-focused Baichuan2-Turbo series API, which includes Baichuan2-Turbo-192K and Baichuan2-Turbo.

Baichuan has also upgraded its official web-based models. Enterprise users can now upload text in various formats, such as PDFs, Word documents, and URLs, into the API to experience the enhanced capabilities of the Baichuan2 large model.

A large model “plug-in” to build knowledge bases instantly

Baichuan views large models as the computers of the new era, in some ways akin to central processors. The context window is likened to the computer’s memory, storing the text to be processed. The real-time information of the internet and the knowledge base of enterprises collectively form the equivalent of a computer’s hard drive.

The company’s newly introduced API enables large models to “attach” external knowledge bases, according to CEO Wang Xiaochuan.

While LLMs have become the infrastructural foundation of the AI era, the technical exploration of these models is still in its infancy. Despite the increase in model parameters, challenges persist—such as the hallucination problem and the issue of queries being “forgotten.” These limitations significantly impede the efficiency of large models.

However, the usability of large models can be augmented by combining them with search-related enhancements, Wang said. This enables even models with fewer parameters to handle much larger volumes of text in a single query, and at faster speeds.

To demonstrate the effectiveness of this approach, Baichuan tested the Baichuan-192K API using the classic “needle in a haystack” test:

1. Place a random fact or statement (the “needle”) in the middle of a context window (the “haystack”).
2. Ask the model to retrieve this fact or statement.
3. Iterate this process over various document depths (where the “needle” is placed) and context lengths to determine the model’s performance.
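
The procedure above can be sketched in a few lines of Python. This is only an illustration of the general test, not Baichuan's harness: `toy_model` is a hypothetical stand-in that "sees" only the first 2,000 characters of its input, and the filler text, needle, context lengths, and depths are made-up values.

```python
from typing import Callable, Dict, List, Tuple


def insert_needle(haystack: str, needle: str, depth: float) -> str:
    """Place the needle at a relative depth (0.0 = start, 1.0 = end)."""
    cut = int(len(haystack) * depth)
    return haystack[:cut] + " " + needle + " " + haystack[cut:]


def run_needle_test(
    ask_model: Callable[[str, str], str],
    filler: str,
    needle: str,
    question: str,
    expected: str,
    context_lengths: List[int],
    depths: List[float],
) -> Dict[Tuple[int, float], bool]:
    """Return, for each (context length, depth), whether the fact was retrieved."""
    results: Dict[Tuple[int, float], bool] = {}
    for length in context_lengths:
        # Repeat the filler until it is long enough, then trim to size.
        haystack = (filler * (length // len(filler) + 1))[:length]
        for depth in depths:
            context = insert_needle(haystack, needle, depth)
            answer = ask_model(context, question)
            results[(length, depth)] = expected in answer
    return results


def toy_model(context: str, question: str, window: int = 2000) -> str:
    """Hypothetical stand-in for a real model with a limited context window."""
    visible = context[:window]  # anything beyond the window is lost
    marker = "The secret code is "
    start = visible.find(marker)
    if start == -1:
        return "I don't know."
    return visible[start:start + len(marker) + 4]


results = run_needle_test(
    ask_model=toy_model,
    filler="The quick brown fox jumps over the lazy dog. ",
    needle="The secret code is 7421.",
    question="What is the secret code?",
    expected="7421",
    context_lengths=[1000, 4000],
    depths=[0.0, 0.5, 1.0],
)

for (length, depth), ok in sorted(results.items()):
    print(f"length={length:5d} depth={depth:.1f} retrieved={ok}")
```

With a real model, `ask_model` would wrap an API call, and the resulting grid of pass/fail values is what the heatmap-style diagrams in such tests typically visualize.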

Diagram illustrating the performance of Baichuan2-192K-Turbo in a pressure test for fact retrieval across various context lengths. This is commonly known as the “needle in a haystack test.” Graphic source: 36Kr. Header photo source: Baichuan via Weibo.

For requests that fall within the 192K-token limit, which corresponds to the roughly 350,000 Chinese characters cited above, 100% answer accuracy can be achieved. With the latest enhancement, Baichuan2 can handle a new maximum of 50 million tokens, two orders of magnitude larger than before.

Baichuan also evaluated the effectiveness of pure vector retrieval as well as a combination of vector and sparse retrieval. The results indicate that the combined approach can achieve 95% answer accuracy. With a roughly 250-fold increase in the total volume of text that can be handled, the recall accuracy has also improved to 95%.
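
To make the idea of combining vector and sparse retrieval concrete, here is a minimal sketch. The article does not say how Baichuan merges the two signals, so everything here is an assumption: the sparse side is a simple IDF-weighted keyword match, the "vector" side is a toy character-trigram cosine similarity standing in for a real embedding model, and the two rankings are merged with reciprocal rank fusion.

```python
import math
import re
from collections import Counter
from typing import List


def tokenize(text: str) -> List[str]:
    return re.findall(r"\w+", text.lower())


def sparse_scores(query: str, docs: List[str]) -> List[float]:
    """Keyword match: query-term counts in each doc, weighted by rarity."""
    doc_counts = [Counter(tokenize(d)) for d in docs]
    scores = []
    for counts in doc_counts:
        score = 0.0
        for term in set(tokenize(query)):
            df = sum(1 for c in doc_counts if term in c)
            if df:
                score += counts[term] * math.log(1 + len(docs) / df)
        scores.append(score)
    return scores


def trigram_vector(text: str) -> Counter:
    """Toy stand-in for an embedding: counts of character trigrams."""
    t = text.lower()
    return Counter(t[i:i + 3] for i in range(len(t) - 2))


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def dense_scores(query: str, docs: List[str]) -> List[float]:
    q = trigram_vector(query)
    return [cosine(q, trigram_vector(d)) for d in docs]


def rank(scores: List[float]) -> List[int]:
    """Document indices ordered from best to worst score."""
    return sorted(range(len(scores)), key=lambda i: -scores[i])


def hybrid_rank(query: str, docs: List[str], k: int = 60) -> List[int]:
    """Merge the sparse and dense rankings with reciprocal rank fusion."""
    fused = [0.0] * len(docs)
    for scores in (sparse_scores(query, docs), dense_scores(query, docs)):
        for position, doc_id in enumerate(rank(scores)):
            fused[doc_id] += 1.0 / (k + position + 1)
    return rank(fused)


docs = [
    "Quarterly revenue rose on strong loan growth.",
    "Wang Xiaochuan said product managers should start from technology-product fit.",
    "The bank opened three new branches in the spring.",
]
query = "What is the starting point for product managers?"
best = hybrid_rank(query, docs)[0]
print(docs[best])
```

Rank fusion is used here because it needs no score normalization between the two retrievers; a weighted sum of normalized scores would be an equally plausible choice.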

Diagram illustrating the performance of Baichuan2-192K-Turbo in a pressure test for fact retrieval across long context lengths. Graphic source: 36Kr.

Specifically, Baichuan conducted the test by using the following configuration (in Chinese):

Haystack: 80 long-form financial documents from a dataset used for the 2023 Bojin Large Model Challenge.
Needle: On December 16, 2023, during the GeekPark Innovation Conference 2024, Wang Xiaochuan shared new insights into large models. In his view, with the advent of the large model era, the starting point for product managers should shift from considering product-market fit (PMF) to considering technology-product fit (TPF).
Query: According to Wang Xiaochuan, what is the starting point for product managers in the era of large models?

This release marks a further improvement in the operational speed and accuracy of large models. Even with extensive context, Baichuan’s test results demonstrate that LLMs can now operate effectively with updated data, faster, more accurately, and at a significantly lower cost than building industry-specific models.

Customization does not equate to verticalization

In addition to the new API, Baichuan has introduced a search enhancement knowledge base. Its utility is straightforward: companies upload privately deployed data and information to the cloud, generating a customized system that integrates with Baichuan2 in a plug-and-play fashion.
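
As a rough illustration of what "plug-and-play" might look like from a developer's point of view, the sketch below splits uploaded documents into chunks, retrieves the most relevant chunks for a question, and builds a prompt for the model. It does not use Baichuan's actual API: the `KnowledgeBase` class, `build_prompt` function, chunk size, and keyword-overlap scoring are all hypothetical.

```python
import re
from dataclasses import dataclass, field
from typing import List


def tokens(text: str) -> set:
    return set(re.findall(r"\w+", text.lower()))


@dataclass
class KnowledgeBase:
    """Hypothetical in-memory knowledge base; not Baichuan's interface."""
    chunk_size: int = 200  # characters per chunk, an arbitrary choice
    chunks: List[str] = field(default_factory=list)

    def add_document(self, text: str) -> None:
        # Split each uploaded document into fixed-size chunks.
        for start in range(0, len(text), self.chunk_size):
            self.chunks.append(text[start:start + self.chunk_size])

    def search(self, query: str, top_k: int = 2) -> List[str]:
        # Rank chunks by how many query words they share.
        query_terms = tokens(query)
        ranked = sorted(
            self.chunks,
            key=lambda chunk: len(query_terms & tokens(chunk)),
            reverse=True,
        )
        return ranked[:top_k]


def build_prompt(kb: KnowledgeBase, question: str, top_k: int = 2) -> str:
    """Attach retrieved chunks to the question before calling a model."""
    context = "\n---\n".join(kb.search(question, top_k))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )


kb = KnowledgeBase()
kb.add_document("Refund requests must be filed within 30 days of purchase.")
kb.add_document("Branch offices are open from 9 a.m. to 5 p.m. on weekdays.")

question = "When must refund requests be filed?"
prompt = build_prompt(kb, question, top_k=1)
print(prompt)
```

Because the knowledge lives outside the model, updating it is a matter of calling `add_document` again rather than retraining.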

The current Baichuan2 can be deployed in various B2B scenarios, including customer service, knowledge Q&A, compliance risk control, and marketing and consulting in industries such as finance, government, legal, and education.

During the launch event, Baichuan presented a sample scenario in the finance industry. In this example, a bank’s knowledge base was said to comprise 6 terabytes of data across 12,905 documents. According to the presentation, Baichuan2 can retrieve information from this extensive base efficiently: when a document of 360,000 words is passed to the model through the API, precise answers can be obtained.

The method of combining LLMs with search enhancement technology provides a practical path for the future implementation of large models in various industries.

Enterprise knowledge bases are currently the mainstream use case for LLMs. Previously, building such bases required large models to be pre-trained, a process that typically required highly skilled AI professionals. Any updates to the underlying data would also require retraining or fine-tuning, which can be costly and can affect controllability and stability.

Another challenge lies in vector retrieval, as the overall cost of utilizing vector databases is relatively high. Their effectiveness depends on the scale of training data, with a noticeable drop in performance in areas the training data does not cover. The gap in length between user prompts and the documents in the knowledge base also poses significant challenges to vector retrieval.

In this regard, Baichuan’s combination of LLMs with search enhancement technology has solved some technical challenges. It pioneered the self-critique large model technique using general retrieval-augmented generation (RAG) technology as a foundation, allowing LLMs to evaluate their own answers before outputting them to users and to select the highest-quality answers in the process.
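
The general shape of a self-critique step, in which several candidate answers are drafted and the best one is kept, can be sketched as follows. The article gives no implementation details, so this is an assumption-laden illustration: `canned_generate` is a hypothetical stand-in for sampling drafts from a model, and `grounding_score` is a crude critic that measures how much of an answer is supported by the retrieved context. In a real system the model itself would likely act as the critic.

```python
import re
from typing import Callable, List


def words(text: str) -> List[str]:
    return re.findall(r"\w+", text.lower())


def grounding_score(answer: str, context: str) -> float:
    """Fraction of the answer's words that also appear in the context."""
    answer_words = words(answer)
    if not answer_words:
        return 0.0
    context_words = set(words(context))
    return sum(w in context_words for w in answer_words) / len(answer_words)


def answer_with_self_critique(
    generate: Callable[[str, str, int], str],
    critique: Callable[[str, str], float],
    question: str,
    context: str,
    n_candidates: int = 3,
) -> str:
    """Draft several answers, score each one, and return the best."""
    candidates = [generate(question, context, i) for i in range(n_candidates)]
    return max(candidates, key=lambda answer: critique(answer, context))


# Hypothetical drafts standing in for samples from a real model.
DRAFTS = [
    "Product managers should start from product-market fit alone.",
    "Product managers should start from technology-product fit.",
    "It depends on the company.",
]


def canned_generate(question: str, context: str, i: int) -> str:
    return DRAFTS[i % len(DRAFTS)]


context = (
    "Wang Xiaochuan said product managers should start from "
    "technology-product fit."
)
best_answer = answer_with_self_critique(
    canned_generate,
    grounding_score,
    "What is the starting point for product managers?",
    context,
)
print(best_answer)
```

The first draft sounds plausible but includes claims the context does not support, so the critic ranks it below the second draft.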

This approach could replace the majority of custom fine-tuning techniques currently adopted by enterprises, while addressing 99% of the customization needs of enterprise knowledge bases.

Wang admitted that customization is unavoidable in the industrial implementation of LLMs, but said the delivery capability can be continuously improved through technical iterations.

With the latest release, Baichuan highlights its rapid advance toward commercial implementation. The company has also revealed that it has entered into partnerships with leading enterprises in various industries for further development, but did not go into detail on the specifics of these collaborations.

KrASIA Connection features translated and adapted content that was originally published by 36Kr. This article was written by Yong Yi for 36Kr.

Copyright for syndicated content belongs to the linked Source : KrAsia – https://kr-asia.com/baichuan-says-its-new-api-can-greatly-reduce-the-cost-of-customizing-large-language-models

Tags: Baichuan, greatly, news
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

© 2023 earth-news.info
