Wednesday, September 17, 2025
Earth-News

Baichuan says its new API can greatly reduce the cost of customizing large language models

January 23, 2024

ChatGPT’s emergence not only renewed interest in artificial intelligence, but arguably sparked an unprecedented wave of advancements in AI technology. One such advancement takes the form of universal large language models (LLMs), which were previously difficult for the AI community to create. That problem is now passé. Instead, the next hurdle is determining how to implement such models effectively in practical applications.

Chinese AI firm Baichuan Intelligent Technology has seemingly made a significant leap in this regard. In October last year, Baichuan unveiled Baichuan2-192K, a large model with a context window capable of processing around 350,000 Chinese characters. That’s roughly 14 times the context length of OpenAI’s GPT-4 and approximately 4.4 times that of Anthropic’s Claude 2, which has drawn plaudits for its excellence in processing long-form text.

On December 19, Baichuan launched the search-focused Baichuan2-Turbo series API, which includes Baichuan2-Turbo-192K and Baichuan2-Turbo.

Baichuan has also upgraded its official web-based models. Enterprise users can now upload text in various formats, such as PDFs, Word documents, and URLs, into the API to experience the enhanced capabilities of the Baichuan2 large model.

A large model “plug-in” to build knowledge bases instantly

Baichuan views large models as the computers of the new era, in some ways akin to central processors. The context window is likened to the computer’s memory, storing the text to be processed. The real-time information of the internet and the knowledge base of enterprises collectively form the equivalent of a computer’s hard drive.

The company’s newly introduced API enables large models to “attach” external knowledge bases, according to CEO Wang Xiaochuan.

While LLMs have become the infrastructural foundation of the AI era, the technical exploration of these models is still in its infancy. Despite the increase in model parameters, challenges persist—such as the hallucination problem and the issue of queries being “forgotten.” These limitations significantly impede the efficiency of large models.

However, the usability of large models can be augmented by combining them with search-related enhancements, Wang said. This enables even models with fewer parameters to handle much larger volumes of text in a single query, and at faster speeds.
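The general idea of pairing a model with search can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Baichuan's API: the function names are hypothetical, relevance is a toy word-overlap count, and a whitespace word count stands in for tokens. The sketch ranks knowledge-base chunks against the query and keeps only those that fit a fixed context budget, so the model never has to read the whole knowledge base.

```python
def score(query: str, chunk: str) -> int:
    """Toy relevance: number of distinct query words that appear in the chunk."""
    chunk_words = set(chunk.lower().split())
    return sum(1 for w in set(query.lower().split()) if w in chunk_words)


def build_context(query: str, chunks: list[str], budget: int) -> list[str]:
    """Pick the most relevant chunks that fit a word budget (a stand-in for the context window)."""
    ranked = sorted(chunks, key=lambda c: score(query, c), reverse=True)
    picked, used = [], 0
    for chunk in ranked:
        size = len(chunk.split())
        # Skip chunks with no overlap, and chunks that would overflow the budget.
        if score(query, chunk) > 0 and used + size <= budget:
            picked.append(chunk)
            used += size
    return picked


# Hypothetical knowledge base, for illustration only.
knowledge_base = [
    "The refund policy allows returns within 30 days.",
    "Office hours are 9am to 5pm on weekdays.",
    "Refund requests need the original receipt.",
]
context = build_context("what is the refund policy", knowledge_base, budget=10)
```

With a budget of 10 words only the first chunk fits. Raising the budget to 20 also admits the third chunk, while the unrelated office-hours chunk is never selected.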

To demonstrate the effectiveness of this approach, Baichuan tested the Baichuan2-Turbo-192K API using the classic “needle in a haystack” test:

Place a random fact or statement (the “needle”) in the middle of a context window (the “haystack”).
Ask the model to retrieve this fact or statement.
Iterate this process over various document depths (where the “needle” is placed) and context lengths to determine the model’s performance.
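The three steps above can be sketched as a small Python harness. This is a minimal illustration rather than the harness Baichuan used: `ask_model` is a hypothetical callable standing in for a real model API, context length is measured in words rather than tokens, and the filler text and needle are made up for the example.

```python
from typing import Callable


def insert_needle(haystack: str, needle: str, depth: float) -> str:
    """Insert the needle at a relative depth (0.0 = start, 1.0 = end), on a word boundary."""
    words = haystack.split()
    position = int(len(words) * depth)
    return " ".join(words[:position] + [needle] + words[position:])


def run_needle_test(
    ask_model: Callable[[str, str], str],
    filler: str,
    needle: str,
    question: str,
    expected: str,
    context_lengths: list[int],
    depths: list[float],
) -> dict[tuple[int, float], bool]:
    """Return pass/fail for every (context length in words, depth) pair."""
    results = {}
    filler_words = filler.split()
    for length in context_lengths:
        # Repeat the filler until it is long enough, then trim to the target length.
        repeats = length // len(filler_words) + 1
        haystack = " ".join((filler_words * repeats)[:length])
        for depth in depths:
            context = insert_needle(haystack, needle, depth)
            answer = ask_model(context, question)
            results[(length, depth)] = expected.lower() in answer.lower()
    return results


def toy_model(context: str, question: str) -> str:
    """Hypothetical stand-in for an LLM call: returns the first sentence mentioning 'secret'."""
    for sentence in context.split("."):
        if "secret" in sentence:
            return sentence.strip()
    return "unknown"


results = run_needle_test(
    toy_model,
    filler="The quick brown fox jumps over the lazy dog.",
    needle="The secret code is 4172.",
    question="What is the secret code?",
    expected="4172",
    context_lengths=[50, 200],
    depths=[0.0, 0.5, 1.0],
)
```

Each entry in `results` corresponds to one cell of the grid that such tests usually plot, with context length on one axis and needle depth on the other.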

Diagram illustrating the performance of Baichuan2-192K-Turbo in a pressure test for fact retrieval across various context lengths. This is commonly known as the “needle in a haystack test.” Graphic source: 36Kr. Header photo source: Baichuan via Weibo.

For requests that fall within the 192K-token limit, 100% answer accuracy can be achieved. With the latest enhancement, Baichuan2 can draw on a new maximum of 50 million tokens of text, two orders of magnitude more than the 192K-token window, which is equivalent to around 350,000 Chinese characters.

Baichuan also evaluated the effectiveness of pure vector retrieval as well as a combination of vector and sparse retrieval. The results indicate that the combined approach can achieve 95% answer accuracy, even with a roughly 250-fold increase in the total volume of text that can be handled.
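How a sparse signal and a vector signal might be combined can be shown with a short Python sketch. The article does not describe Baichuan's actual fusion method, so everything here is an assumption: exact word matching stands in for sparse retrieval, character-trigram cosine similarity stands in for a neural embedding, and the two rankings are merged with reciprocal rank fusion (RRF), a commonly used technique swapped in for illustration.

```python
import math
from collections import Counter


def sparse_score(query: str, doc: str) -> float:
    """Sparse signal: count of query terms that occur in the document (exact word match)."""
    doc_terms = Counter(doc.lower().split())
    return float(sum(doc_terms[t] for t in set(query.lower().split())))


def trigram_vector(text: str) -> Counter:
    """Stand-in for an embedding: character trigram counts (a real system would use a neural encoder)."""
    t = text.lower()
    return Counter(t[i:i + 3] for i in range(len(t) - 2))


def vector_score(query: str, doc: str) -> float:
    """Cosine similarity between the two trigram vectors."""
    q, d = trigram_vector(query), trigram_vector(doc)
    dot = sum(q[k] * d[k] for k in q)
    norm = math.sqrt(sum(v * v for v in q.values())) * math.sqrt(sum(v * v for v in d.values()))
    return dot / norm if norm else 0.0


def rank(scores: list[float]) -> list[int]:
    """Document indices ordered from best to worst score."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


def hybrid_search(query: str, docs: list[str], k: int = 60) -> list[int]:
    """Fuse the sparse and vector rankings with reciprocal rank fusion (RRF)."""
    fused = [0.0] * len(docs)
    for scorer in (sparse_score, vector_score):
        ranking = rank([scorer(query, d) for d in docs])
        for position, doc_index in enumerate(ranking):
            fused[doc_index] += 1.0 / (k + position + 1)
    return rank(fused)


# Hypothetical documents, for illustration only.
docs = [
    "Quarterly revenue grew by ten percent.",
    "The starting point for product managers is technology-product fit.",
    "Weather was mild in October.",
]
order = hybrid_search("starting point for product managers", docs)
```

Fusing ranks rather than raw scores avoids having to put the two scoring scales on a common footing, which is one reason RRF is a popular default for hybrid search.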

Diagram illustrating the performance of Baichuan2-192K-Turbo in a pressure test for fact retrieval across long context lengths. Graphic source: 36Kr.

Specifically, Baichuan conducted the test by using the following configuration (in Chinese):

Haystack: 80 long-form financial documents from a dataset used for the 2023 Bojin Large Model Challenge.
Needle: On December 16, 2023, during the GeekPark Innovation Conference 2024, Wang Xiaochuan shared new insights into large models. In his view, with the advent of the large model era, the starting point for product managers should shift from considering product-market fit (PMF) to considering technology-product fit (TPF).
Query: According to Wang Xiaochuan, what is the starting point for product managers in the era of large models?

This release marks a further improvement in the operational speed and accuracy of large models. Even with extensive context, Baichuan’s test results demonstrate that LLMs can now operate effectively with updated data, faster, more accurately, and at a significantly lower cost than building industry-specific models.

Customization does not equate to verticalization

In addition to the new API, Baichuan has introduced a search enhancement knowledge base. Its utility is straightforward: companies upload privately deployed data and information to the cloud, generating a customized system that integrates with Baichuan2 in a plug-and-play fashion.

The current Baichuan2 can be deployed in various B2B scenarios, including customer service, knowledge Q&A, compliance risk control, and marketing and consulting in industries such as finance, government, legal, and education.

During the launch event, Baichuan presented a sample scenario in the finance industry. In this example, a bank’s knowledge base was said to comprise 6 terabytes of data across 12,905 documents. The presentation described Baichuan2’s ability to retrieve information efficiently from this extensive base: a document of 360,000 words can be passed to the model through the API, and precise answers obtained.

The method of combining LLMs with search enhancement technology provides a practical path for the future implementation of large models in various industries.

Enterprise knowledge bases are currently the mainstream use case for LLMs. Previously, building such bases required large models to be pre-trained, a process that typically required highly skilled AI professionals. Any updates to the underlying data would also require retraining or fine-tuning, which can be costly and affect controllability and stability.

Another challenge lies in vector retrieval, as the overall cost of utilizing vector databases is relatively high. Their effectiveness depends on the scale of training data, with a noticeable drop in generalization for areas not covered by that data. The mismatch in length between short user prompts and long knowledge-base documents also poses significant challenges to vector retrieval.

In this regard, Baichuan’s combination of LLMs with search enhancement technology has solved some technical challenges. It pioneered the self-critique large model technique using general retrieval-augmented generation (RAG) technology as a foundation, allowing LLMs to evaluate their own answers before outputting them to users and select the highest-quality answer in the process.
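The control flow of such a self-critique step can be sketched in Python. The article gives no implementation details, so this is an illustration under assumptions: `toy_generate` is a hypothetical stand-in for sampling several answers from a model, and the critic here is a simple word-overlap check against the retrieved context, whereas in the technique described the model itself would do the evaluating.

```python
from typing import Callable


def grounding_score(answer: str, context: str) -> float:
    """Toy critic: fraction of answer words that also appear in the retrieved context."""
    context_words = set(context.lower().split())
    answer_words = answer.lower().split()
    if not answer_words:
        return 0.0
    return sum(w in context_words for w in answer_words) / len(answer_words)


def answer_with_self_critique(
    generate: Callable[[str, str], list[str]],
    critic: Callable[[str, str], float],
    question: str,
    context: str,
) -> str:
    """Generate candidate answers, score each one, and return the highest-scoring candidate."""
    candidates = generate(question, context)
    return max(candidates, key=lambda a: critic(a, context))


def toy_generate(question: str, context: str) -> list[str]:
    """Hypothetical stand-in for sampling several answers from an LLM."""
    return [
        "the starting point is product-market fit",
        "the starting point is technology-product fit",
    ]


context = "in the large model era the starting point is technology-product fit"
best = answer_with_self_critique(
    toy_generate, grounding_score, "What is the starting point?", context
)
```

Here the second candidate is selected because every word in it is supported by the retrieved context, while the first contains a term the context never mentions.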

This approach could replace the majority of custom fine-tuning techniques currently adopted by enterprises, while addressing 99% of the customization needs of enterprise knowledge bases.

While Wang admitted that customization is unavoidable in the industrial implementation of LLMs, the delivery capability can be continuously improved through technical iterations.

With the latest release, Baichuan highlights its rapid advance toward commercial implementation. The company has also revealed that it has entered into partnerships with leading enterprises in various industries for further development, but did not go into detail on the specifics of these collaborations.

KrASIA Connection features translated and adapted content that was originally published by 36Kr. This article was written by Yong Yi for 36Kr.

Copyright for syndicated content belongs to the linked Source : KrAsia – https://kr-asia.com/baichuan-says-its-new-api-can-greatly-reduce-the-cost-of-customizing-large-language-models

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version