* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Thursday, May 15, 2025
Earth-News
  • Home
  • Business
  • Entertainment
    Ashwaubenon Bowling Alley upgrades with new Neoverse entertainment system – WFRV Local 5

    Revamped Ashwaubenon Bowling Alley Unveils Exciting New Neoverse Entertainment System!

    Entertainment Calendar for May 15-21 – York Dispatch

    Entertainment Calendar for May 15-21 – York Dispatch

    Reznor, Ross Celebrate Film/TV Score Favs With Future Ruins Fest – Yahoo

    Reznor, Ross Celebrate Film/TV Score Favs With Future Ruins Fest – Yahoo

    ‘Lilo & Stitch’ director unpacks key animation-to-live-action changes (exclusive) – ew.com

    Behind the Scenes: Key Changes in the Animation-to-Live-Action Transformation of ‘Lilo & Stitch

    HG Vora Files Definitive Proxy Materials and Sends Letter to PENN Entertainment, Inc. Shareholders – Business Wire

    HG Vora Takes Action: A Bold Move to Engage PENN Entertainment Shareholders

    Downtown Frederick Partnership announces Alive@Five season lineup – The Frederick News-Post

    Get Ready for Fun: Downtown Frederick’s Exciting Alive@Five Season Lineup Revealed!

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    New technology driving on-air experience – WSFA

    Revolutionizing the On-Air Experience: The Impact of Cutting-Edge Technology

    Revolutionary Technology Unlocks Hydrogen from Seawater!

    Administration issues RFI on health technology – American Hospital Association

    Unlocking Innovation: Administration Seeks Insights on Health Technology

    Bridger Photonics Appoints Ryan Sullivan as Chief Technology Officer to Accelerate New Era of Data Insights – Business Wire

    Bridger Photonics Welcomes Ryan Sullivan as CTO to Propel Data Insights into a New Era!

    Michigan Public Policy Survey suggests uncertainty among local officials on AI police surveillance technology – The Michigan Daily

    Local Officials Grapple with Uncertainty Over AI Surveillance Technology in Policing

    Trump Media & Technology Group: When Politics Gets A Ticker Symbol (NASDAQ:DJT) – Seeking Alpha

    Trump Media & Technology Group: When Politics Gets A Ticker Symbol (NASDAQ:DJT) – Seeking Alpha

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment
    Ashwaubenon Bowling Alley upgrades with new Neoverse entertainment system – WFRV Local 5

    Revamped Ashwaubenon Bowling Alley Unveils Exciting New Neoverse Entertainment System!

    Entertainment Calendar for May 15-21 – York Dispatch

    Entertainment Calendar for May 15-21 – York Dispatch

    Reznor, Ross Celebrate Film/TV Score Favs With Future Ruins Fest – Yahoo

    Reznor, Ross Celebrate Film/TV Score Favs With Future Ruins Fest – Yahoo

    ‘Lilo & Stitch’ director unpacks key animation-to-live-action changes (exclusive) – ew.com

    Behind the Scenes: Key Changes in the Animation-to-Live-Action Transformation of ‘Lilo & Stitch

    HG Vora Files Definitive Proxy Materials and Sends Letter to PENN Entertainment, Inc. Shareholders – Business Wire

    HG Vora Takes Action: A Bold Move to Engage PENN Entertainment Shareholders

    Downtown Frederick Partnership announces Alive@Five season lineup – The Frederick News-Post

    Get Ready for Fun: Downtown Frederick’s Exciting Alive@Five Season Lineup Revealed!

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    New technology driving on-air experience – WSFA

    Revolutionizing the On-Air Experience: The Impact of Cutting-Edge Technology

    Revolutionary Technology Unlocks Hydrogen from Seawater!

    Administration issues RFI on health technology – American Hospital Association

    Unlocking Innovation: Administration Seeks Insights on Health Technology

    Bridger Photonics Appoints Ryan Sullivan as Chief Technology Officer to Accelerate New Era of Data Insights – Business Wire

    Bridger Photonics Welcomes Ryan Sullivan as CTO to Propel Data Insights into a New Era!

    Michigan Public Policy Survey suggests uncertainty among local officials on AI police surveillance technology – The Michigan Daily

    Local Officials Grapple with Uncertainty Over AI Surveillance Technology in Policing

    Trump Media & Technology Group: When Politics Gets A Ticker Symbol (NASDAQ:DJT) – Seeking Alpha

    Trump Media & Technology Group: When Politics Gets A Ticker Symbol (NASDAQ:DJT) – Seeking Alpha

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Technology

The Cloud wins the AI infrastructure debate by default

June 16, 2024
in Technology
The Cloud wins the AI infrastructure debate by default
Share on FacebookShare on Twitter

It’s time to celebrate the incredible women leading the way in AI! Nominate your inspiring leaders for VentureBeat’s Women in AI Awards today before June 18. Learn More

As artificial intelligence (AI) takes the world by storm, an old debate is reigniting: should businesses self-host AI tools or rely on the cloud? For example, Sid Premkumar, founder of AI startup Lytix, recently shared his analysis self-hosting an open source AI model, suggesting it could be cheaper than using Amazon Web Services (AWS). 

Premkumar’s blog post, detailing a cost comparison between running the Llama-3 8B model on AWS and self-hosting the hardware, has sparked a lively discussion reminiscent of the early days of cloud computing, when businesses weighed the pros and cons of on-premises infrastructure versus the emerging cloud model.

Premkumar’s analysis suggested that while AWS could offer a price of $1 per million tokens, self-hosting could potentially reduce this cost to just $0.01 per million tokens, albeit with a longer break-even period of around 5.5 years. However, this cost comparison overlooks a crucial factor: the total cost of ownership (TCO). It’s a debate we’ve seen before during “The Great Cloud Wars,” where the cloud computing model emerged victorious despite initial skepticism.

The question remains: will on-premises AI infrastructure make a comeback, or will the cloud dominate once again?

VB Transform 2024 Registration is Open

Join enterprise leaders in San Francisco from July 9 to 11 for our flagship AI event. Connect with peers, explore the opportunities and challenges of Generative AI, and learn how to integrate AI applications into your industry. Register Now

A closer look at Premkumar’s analysis 

Premkumar’s blog post provides a detailed breakdown of the costs associated with self-hosting the Llama-3 8B model. He compares the cost of running the model on AWS’s g4dn.16xlarge instance, which features 4 Nvidia Tesla T4 GPUs, 192GB of memory, and 48 vCPUs, to the cost of self-hosting a similar hardware configuration.

According to Premkumar’s calculations, running the model on AWS would cost approximately $2,816.64 per month, assuming full utilization. With the model able to process around 157 million tokens per month, this translates to a cost of $17.93 per million tokens.

In contrast, Premkumar estimates that self-hosting the hardware would require an upfront investment of around $3,800 for 4 Nvidia Tesla T4 GPUs and an additional $1,000 for the rest of the system. Factoring in energy costs of approximately $100 per month, the self-hosted solution could process the same 157 million tokens at a cost of just $0.000000636637738 per token, or $0.01 per million tokens.

While this may seem like a compelling argument for self-hosting, it’s important to note that Premkumar’s analysis assumes 100% utilization of the hardware, which is rarely the case in real-world scenarios. Additionally, the self-hosted approach would require a break-even period of around 5.5 years to recoup the initial hardware investment, during which time newer, more powerful hardware may have already emerged.

A familiar debate 

In the early days of cloud computing, proponents of on-premises infrastructure made many passionate and compelling arguments. They cited the security and control of keeping data in-house, the potential cost savings of investing in their own hardware, better performance for latency-sensitive tasks, the flexibility of customization, and the desire to avoid vendor lock-in.

Today, advocates of on-premises AI infrastructure are singing a similar tune. They argue that for highly regulated industries like healthcare and finance, the compliance and control of on-premises is preferable. They believe investing in new, specialized AI hardware can be more cost-effective in the long run than ongoing cloud fees, especially for data-heavy workloads. They cite the performance benefits for latency-sensitive AI tasks, the flexibility to customize infrastructure to their exact needs, and the need to keep data in-house for residency requirements.

The cloud’s winning hand Despite these arguments, on-premises AI infrastructure simply cannot match the cloud’s advantages. 

Here’s why the cloud is still poised to win

Unbeatable cost efficiency: Cloud providers like AWS, Microsoft Azure, and Google Cloud offer unmatched economies of scale. When considering the TCO – including hardware costs, maintenance, upgrades, and staffing – the cloud’s pay-as-you-go model is undeniably more cost-effective, especially for businesses with variable or unpredictable AI workloads. The upfront capital expenditure and ongoing operational costs of on-premises infrastructure simply can’t compete with the cloud’s cost advantages.

Access to specialized skills: Building and maintaining AI infrastructure requires niche expertise that is costly and time-consuming to develop in-house. Data scientists, AI engineers, and infrastructure specialists are in high demand and command premium salaries. Cloud providers have these resources readily available, giving businesses immediate access to the skills they need without the burden of recruiting, training, and retaining an in-house team.

Agility in a fast-paced field: AI is evolving at a breakneck pace, with new models, frameworks, and techniques emerging constantly. Enterprises need to focus on creating business value, not on the cumbersome task of procuring hardware and building physical infrastructure. The cloud’s agility and flexibility allow businesses to quickly spin up resources, experiment with new approaches, and scale successful initiatives without being bogged down by infrastructure concerns.

Robust security and stability: Cloud providers have invested heavily in security and operational stability, employing teams of experts to ensure the integrity and reliability of their platforms. They offer features like data encryption, access controls, and real-time monitoring that most organizations would struggle to replicate on-premises. For businesses serious about AI, the cloud’s enterprise-grade security and stability are a necessity.

The financial reality of AI infrastructure 

Beyond these advantages, there’s a stark financial reality that further tips the scales in favor of the cloud. AI infrastructure is significantly more expensive than traditional cloud computing resources. The specialized hardware required for AI workloads, such as high-performance GPUs from Nvidia and TPUs from Google, comes with a hefty price tag.

Only the largest cloud providers have the financial resources, unit economics, and risk tolerance to purchase and deploy this infrastructure at scale. They can spread the costs across a vast customer base, making it economically viable. For most enterprises, the upfront capital expenditure and ongoing costs of building and maintaining a comparable on-premises AI infrastructure would be prohibitively expensive.

Also, the pace of innovation in AI hardware is relentless. Nvidia, for example, releases new generations of GPUs every few years, each offering significant performance improvements over the previous generation. Enterprises that invest in on-premises AI infrastructure risk immediate obsolescence as newer, more powerful hardware hits the market. They would face a brutal cycle of upgrading and discarding expensive infrastructure, sinking costs into depreciating assets. Few enterprises have the appetite for such a risky and costly approach.

Data privacy and the rise of privacy-preserving AI 

As businesses grapple with the decision between cloud and on-premises AI infrastructure, another critical factor to consider is data privacy. With AI systems relying on vast amounts of sensitive user data, ensuring the privacy and security of this information is paramount.

Traditional cloud AI services have faced criticism for their opaque privacy practices, lack of real-time visibility into data usage, and potential vulnerabilities to insider threats and privileged access abuse. These concerns have led to a growing demand for privacy-preserving AI solutions that can deliver the benefits of cloud-based AI without compromising user privacy.

Apple’s recently announced Private Compute Cloud (PCC) is a prime example of this new breed of privacy-focused AI services. PCC extends Apple’s industry-leading on-device privacy protections to the cloud, allowing businesses to leverage powerful cloud AI while maintaining the privacy and security users expect from Apple devices.

PCC achieves this through a combination of custom hardware, a hardened operating system, and unprecedented transparency measures. By using personal data exclusively to fulfill user requests and never retaining it, enforcing privacy guarantees at a technical level, eliminating privileged runtime access, and providing verifiable transparency into its operations, PCC sets a new standard for protecting user data in cloud AI services.

As privacy-preserving AI solutions like PCC gain traction, businesses will have to weigh the benefits of these services against the potential cost savings and control offered by self-hosting. While self-hosting may provide greater flexibility and potentially lower costs in some scenarios, the robust privacy guarantees and ease of use offered by services like PCC may prove more valuable in the long run, particularly for businesses operating in highly regulated industries or those with strict data privacy requirements.

The edge case

The only potential dent in the cloud’s armor is edge computing. For latency-sensitive applications like autonomous vehicles, industrial IoT, and real-time video processing, edge deployments can be critical. However, even here, public clouds are making significant inroads.

As edge computing evolves, it’s likely that we will see more utility cloud computing models emerge. Public cloud providers like AWS with Outposts, Azure with Stack Edge, and Google Cloud with Anthos are already deploying their infrastructure to the edge, bringing the power and flexibility of the cloud closer to where data is generated and consumed. This forward deployment of cloud resources will enable businesses to leverage the benefits of edge computing without the complexity of managing on-premises infrastructure.

The verdict 

While the debate over on-premises versus cloud AI infrastructure will no doubt rage on, the cloud’s advantages are still compelling. The combination of cost efficiency, access to specialized skills, agility in a fast-moving field, robust security, and the rise of privacy-preserving AI services like Apple’s PCC make the cloud the clear choice for most enterprises looking to harness the power of AI.

Just as in “The Great Cloud Wars,” the cloud is already poised to emerge victorious in the battle for AI infrastructure dominance. It’s just a matter of time. While self-hosting AI models may appear cost-effective on the surface, as Premkumar’s analysis suggests, the true costs and risks of on-premises AI infrastructure are far greater than meets the eye. The cloud’s unparalleled advantages, combined with the emergence of privacy-preserving AI services, make it the clear winner in the AI infrastructure debate. As businesses navigate the exciting but uncertain waters of the AI revolution, betting on the cloud is still the surest path to success.

VB Daily

Stay in the know! Get the latest news in your inbox daily

By subscribing, you agree to VentureBeat’s Terms of Service.

Thanks for subscribing. Check out more VB newsletters here.

An error occured.

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : VentureBeat – https://venturebeat.com/data-infrastructure/the-cloud-wins-the-ai-infrastructure-debate-by-default/

Tags: cloudInfrastructuretechnology
Previous Post

The 10,800 game layoffs in 2024 already exceeded all cuts in 2023 — here’s why | The DeanBeat

Next Post

AGI isn’t here (yet): How to make informed, strategic decisions in the meantime

All Ecology Is Queer – Orion Magazine

All Ecology Is Queer – Orion Magazine

May 15, 2025
Hubble Pinpoints Young Stars in Spiral Galaxy – NASA Science (.gov)

Discovering the Birth of Stars: Hubble’s Stunning View of a Spiral Galaxy

May 15, 2025
New Master’s of Science degree in data science and AI – The Cor Chronicle

New Master’s of Science degree in data science and AI – The Cor Chronicle

May 15, 2025
Rome: New racquet, sober lifestyle kickstarts latest Bianca Andreescu comeback – Tennis.com

Bianca Andreescu’s Inspiring Comeback: A New Racquet and a Sober Lifestyle Fuel Her Return to Tennis

May 15, 2025
WA lawmakers approve funding for 2026 World Cup matches in Seattle – Cascade PBS News

Seattle Secures Funding to Host Exciting 2026 World Cup Matches!

May 15, 2025
CFR President Froman on Trade Deals, Tariffs, US Economy – Bloomberg

Unlocking Economic Growth: CFR President Froman Discusses Trade Deals and Tariffs

May 15, 2025
Ashwaubenon Bowling Alley upgrades with new Neoverse entertainment system – WFRV Local 5

Revamped Ashwaubenon Bowling Alley Unveils Exciting New Neoverse Entertainment System!

May 15, 2025
Novartis Canada extends Health Equity Initiative effort, fueling innovation and impact for second year – BioSpace

Novartis Canada Amplifies Health Equity Initiative, Driving Innovation and Impact into Year Two!

May 15, 2025
‘Good luck getting my vote’: GOP lawmaker warns his own party about a key provision in Trump’s major bill – CNN

Watch Out, GOP: Lawmaker Issues Stark Warning Over Controversial Provision in Trump’s Major Bill!

May 15, 2025
New technology driving on-air experience – WSFA

Revolutionizing the On-Air Experience: The Impact of Cutting-Edge Technology

May 15, 2025

Categories

Archives

May 2025
MTWTFSS
 1234
567891011
12131415161718
19202122232425
262728293031 
« Apr    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (611)
  • Economy (622)
  • Entertainment (21,535)
  • General (15,221)
  • Health (9,663)
  • Lifestyle (626)
  • News (22,149)
  • People (625)
  • Politics (629)
  • Science (15,845)
  • Sports (21,131)
  • Technology (15,612)
  • World (612)

Recent News

All Ecology Is Queer – Orion Magazine

All Ecology Is Queer – Orion Magazine

May 15, 2025
Hubble Pinpoints Young Stars in Spiral Galaxy – NASA Science (.gov)

Discovering the Birth of Stars: Hubble’s Stunning View of a Spiral Galaxy

May 15, 2025
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version