
Search results for: "H100s"


25 mentions found


Mark Zuckerberg says Meta's Llama 4 AI models are training on the biggest GPU cluster in the industry. During Meta's earnings call, he said the cluster is "bigger than 100,000 H100s." A lot of computing power is going into training Meta's forthcoming Llama 4 AI models — more than anything currently offered by the competition, according to Zuckerberg. Zuckerberg added in the earnings call Wednesday that Meta's Llama 4 models will have "new modalities, capabilities, stronger reasoning" and be "much faster." Meanwhile, Musk tweeted earlier this week that xAI will soon double its cluster size in the coming months to 200,000 H100 and H200 chips.
Persons: Mark Zuckerberg, Elon Musk, , Mark Zuckerberg's, Zuckerberg, I've, Hopper, Aravind Srinivas, Srinivas, didn't, Musk, xAI Organizations: Service, Nvidia, Meta Locations: Meta
Elon Musk just hinted at how much it cost to make his AI chatbot Grok. Knowing how many H100 GPUs Musk is getting allows us to do some napkin math to figure out a rough estimate of the cost. Mark Zuckerberg said in January that Meta will have purchased about 350,000 Nvidia H100 GPUs by the end of 2024. Aravind Srinivas, founder and CEO of AI startup Perplexity, talked about getting turned down by a Meta AI researcher he was trying to poach in part because of Zuckerberg's huge collection of AI chips. "'Come back to me when you have 10,000 H100 GPUs,'" Srinivas said.
Persons: , Elon, chatbot, Hopper, Musk, Nicolai Tangen, xAI, Mark Zuckerberg, Meta, Aravind Srinivas, Srinivas Organizations: Service, Business, Nvidia, Twitter, Meta Locations: Silicon Valley
Cerebras Systems, a startup trying to challenge Nvidia on the AI chip battlefield, is reportedly headed for an IPO toward the end of 2024. This is one of the primary elements holding back even some AWS customers from using Amazon's AI chips. But most AI developers must go through gnarly work to change their software if they want to change their hardware. His startup Vimaan is building tech for a future "dark warehouse" that requires no humans to operate. "Am I gonna take a chance to switch out a whole infrastructure that we've built on the Nvidia platform?"
Persons: , Lior Susan, Susan, CUDA, Thomvest, they're, He's, Padval, Jensen Huang, Thomas Sohmers, Ganapathi, Vimaan, Pavdal Organizations: Service, Cerebras Systems, Nvidia, Eclipse Ventures, Business, CUDA, NVIDIA CUDA, AMD, Qualcomm, Google, Intel, UXL Foundation, Amazon's Industrial Innovation Fund Locations: CUDA
CNBC Daily Open: Roaring Kitty's wealth, Modi's victory
2024-06-05 | by Abid Ali | www.cnbc.com | time to read: +3 min
The S&P 500 inched up 0.15% and the Nasdaq Composite did marginally better, up 0.17%. Bath & Body Works was the worst-performing stock on the S&P 500, plunging almost 13% on disappointing guidance. With his 5 million shares of GameStop, if he were to exercise his 120,000 call options at $20 apiece, that would give him an additional 12 million shares, making him the fourth-largest shareholder in the games retailer. [PRO] June high: The S&P 500 will rally to a fresh all-time high of 5,500 by the end of this month, according to Fundstrat Global Advisors' Tom Lee. With the S&P 500 finishing Monday's trading session at 5,283.40, the forecast calls for upside of 4%.
Persons: Modi, Narendra Modi, Modi's, Keith Gill, Gill, Elon Musk, Tesla, Musk, Tom Lee, CNBC's Pia Singh, what's Organizations: CNBC, Dow Jones, Nasdaq, Body, Treasury, Bharatiya Janata Party, BJP, National Democratic Alliance, GameStop, Nvidia, Federal Reserve, Fundstrat Global Locations: Tesla
Correspondence from Nvidia staffers also indicates that Musk diverted a sizable shipment of AI processors that had been reserved for Tesla to his social media company X, formerly known as Twitter. "Elon prioritizing X H100 GPU cluster deployment at X versus Tesla by redirecting 12k of shipped H100 GPUs originally slated for Tesla to X instead," an Nvidia memo from December said. In a post on X in November, Musk wrote, "X Corp investors will own 25% of xAI." At Tesla, Musk has promised to build a $500 million "Dojo" supercomputer in Buffalo, New York, and a "super dense, water-cooled supercomputer cluster" at the company's factory in Austin, Texas.
Persons: Elon Musk, David Swanson, Reuters Elon Musk, he's, Tesla's, Musk, Tesla, Elon, Critics, OpenAI's ChatGPT, Axios Harris, Jensen Huang, Huang, David Paul Morris, xAI's Grok, xAI, he'd, He's, Leo Koguan, Gerber Kawasaki's Ross Gerber, Joel Fleming, Fleming, hasn't, Ethan Knight Organizations: SpaceX, Tesla, Reuters, Nvidia, Tesla's Texas, CNBC, X Corp, EV, Google, Meta, Microsoft, Blackwell, Nvidia Corp, Technology, Bloomberg, Getty, Twitter, Equity Litigation Locations: Beverly Hills , California, Tesla's, U.S, San Jose , California, Buffalo , New York, Austin , Texas, North Dakota, Delaware, Tesla, xAI, SolarCity, Texas, New York
No one is laughing about Jayshree's company, either, as her white box solutions are beating Cisco Systems when it comes to the internet plumbing that connects Nvidia chips to the Titans. Last year at this time, Nvidia snuck into the U.S. trillion-dollar market cap club, with that May quarter. What does Nvidia really have to do then for an encore? Now we are getting into what Jensen does to beat expectations. Tall order, but Nvidia is a company that's beaten tall orders routinely; we just didn't really know it until last May.
Persons: Jensen Huang, Jensen, Claude, let's, Blackwell, Andy Grove, Intel's, doesn't, — Jensen, Collette Kress, Mills, Eli Lilly, Jim Cramer's, Jim Cramer, Jim, Josh Edelson Organizations: Nvidia, Titans, Arista Networks, Microsoft, Meta, Cisco Systems, Google, Apple, Devices, Intel, Merck, Keytruda, Vision, AMD, Grove, Union Pacific, Club, GE, Jim Cramer's Charitable, CNBC, SAP Center, Afp, Getty Locations: San Jose , California
Nvidia is dominating earnings season, and it hasn't even reported results yet. Other mega-cap tech giants have been mentioning on earnings calls that they're boosting investment in AI infrastructure. The company is gearing up for the release of its next-generation AI chip, named Blackwell, later this year. Nvidia has competition, but it still dominates. Recent earnings results from Nvidia's rival, AMD, suggest that most of this business is going to Nvidia and not its competitors. Investors will have to wait until after the market close on May 22 to hear what Nvidia's earnings results actually are.
Persons: Blackwell, Elon Musk, We've, Musk, Tesla, Meta, Yann LeCun, John Werner, Brian Olsavsky Organizations: Nvidia, Microsoft, Tesla, Meta, UBS, Blackwell, AMD, Intel, Gaudi Locations: Meta
Google Cloud, one of the fund's cloud providers, developed new technologies that drew a full room of spectators at Google Cloud Next. When Google Cloud clients request GPUs through DWS, the platform requires clients to specify the region, the machine type and count of machines, and runtime duration. Knowing how many resources a given client needs allows Google Cloud to provision capacity more granularly, which "unlocks additional capacity," Mateo said. As a cloud provider for many financial firms, Google Cloud benefits from helping its clients run these models because many research platforms are hosted on Google's public cloud. In addition to Two Sigma, Citadel Securities has its research platform on Google Cloud.
Persons: Alex Hays, Hays, Sigma's Hays, Mateo, Dax, They'd, it's, Cook Organizations: Sigma, Google, Wall Street, Nvidia, prioritizes, Citadel Securities Locations: Las Vegas, Cook
Nvidia CEO Jensen Huang delivers a keynote address during the Nvidia GTC Artificial Intelligence Conference at SAP Center on March 18, 2024 in San Jose, California. Nvidia on Monday announced a new generation of artificial intelligence chips and software for running artificial intelligence models. The announcement, made during Nvidia's developer conference in San Jose, comes as the chipmaker seeks to solidify its position as the go-to supplier for AI companies. “Hopper is fantastic, but we need bigger GPUs,” Nvidia CEO Jensen Huang said on Monday at the company's developer conference in California. Das said Nvidia's new software will make it easier to run programs on any of Nvidia's GPUs, even older ones that might be better suited for deploying but not building AI.
Persons: Jensen Huang, Nvidia's, ChatGPT, Blackwell, Hopper, Huang, Manuvir Das, what's, Das, you've, we'll Organizations: Nvidia, Intelligence, SAP Center, Monday, Microsoft, Meta, Companies, Apple, Manuvir Locations: San Jose , California, San Jose, California
The CEO of an AI startup said he couldn't poach a Meta employee because it didn't have enough GPUs. "Amazing incentives" are needed to attract AI talent, he said on the podcast "Invest Like The Best." Recruiting AI talent appears to be a tough feat for some companies. That could make it even harder to secure AI talent in the future. Leaning into that skillset, the CEO said, will help AI companies like Perplexity stand out in a sector dominated by Big Tech.
Persons: It's, , Aravind Srinivas, Srinivas, Nvidia's, I'm, Meta didn't, OpenAI, skillset Organizations: Service, Meta, Nvidia, Netflix, Big Tech
There are "major companies and verticals that have not even considered the H100" yet, he added, referring to Nvidia's AI must-have GPU. A few weeks ago, we combed through the latest earnings reports from Google-parent Alphabet, Microsoft, Meta Platforms and Amazon to see how their AI spending plans are benefitting Nvidia. After Wednesday evening's killer quarterly release, Nvidia talked about how it is helping the businesses of fellow Club names Google, Microsoft, Meta and Amazon. Alphabet continues to heavily invest in AI applications to improve the performance of its services including Google DeepMind, Google Services, Gemini, and Google Cloud. "Nvidia DGX Cloud will expand its list of partners to include Amazon's AWS, joining Microsoft Azure, Google Cloud and Oracle Cloud.
Persons: Jim Cramer, Jim, Jensen Huang, Colette Kress, Kress, Mark Zuckerberg, we've, Gemma, Satya Nadella, Copilot, Jim Cramer's, Omar Marques Organizations: Nvidia, Google, Microsoft, Meta, Companies, Nvidia H100s, Google Services, Gemini, Management, Amazon's AWS, Oracle, NVIDIA, Amazon Web Services, CNBC, Getty
The dominant global designer and supplier of AI chips aims to capture a portion of an exploding market for custom AI chips and to protect itself from the growing number of companies interested in finding alternatives to its products. Nvidia officials have met with representatives from Amazon.com, Meta, Microsoft, Google and OpenAI to discuss making custom chips for them, according to two sources familiar with the meetings. $30 billion market: According to estimates from research firm 650 Group's Alan Weckel, the data center custom chip market will grow to as much as $10 billion this year, and double that in 2025. The broader custom chip market was worth roughly $30 billion in 2023, which amounts to roughly 5% of annual global chip sales, according to Needham analyst Charles Shi. "With Broadcom's custom silicon business touching $10 billion, and Marvell's around $2 billion, this is a real threat," said Dylan Patel, founder of the silicon research group SemiAnalysis.
Persons: OpenAI, Greg Reichow, Meta, Dina McKinney, Alan Weckel, Charles Shi, Dylan Patel Organizations: Nvidia, Microsoft, Broadcom, Marvell Technology, Eclipse Ventures, Amazon.com, Meta, Google, Reuters, Devices, Marvell, Taiwan Semiconductor Manufacturing Locations: Krakow, Poland, Santa Clara , California
Some of Amazon's spending is likely to go toward its custom AI chips, known as Trainium and Inferentia. But the Seattle-based tech giant also buys Nvidia chips. On Thursday's post-earnings call, CEO Andy Jassy said AWS offers the "most expansive collection of compute instances with Nvidia chips." Alphabet's spending on AI chips has typically been split between its custom chips designed in partnership with Broadcom, known as Tensor Processing Units (TPUs), and Nvidia's offerings. Including networking products that stitch together parts of the data center, about 15% of Broadcom's semiconductor revenue was tied to generative AI spending in fiscal 2023.
Persons: Eaton, Brian Olsavsky, FactSet, Amazon's capex, Andy Jassy, Meta's capex, Susan Li, Li, Mark Zuckerberg, Zuckerberg, Meta, it's, , Amy Hood, It's, chatbot ChatGPT, AMD's, Ruth Porat, Porat, Jim Cramer's, Jim Cramer, Jim, Florian Gaertner Organizations: Big Tech, Nvidia, Broadcom, Eaton, Microsoft, Meta, Google, Web Services, Facebook, Devices, OpenAI, supercomputing, TPUs, CNBC, YouTube, Photothek, Getty Locations: Eaton, Seattle
That's according to a report from Bloomberg, which stated that Altman had been busy pitching heavyweight investors to back a new AI chip venture that would give his company a lot more control over its chip supply. But a bunch of tech companies have started designing their own. In November, for instance, Microsoft unveiled its new Azure Maia AI chip, designed with large language model (LLM) training in mind. Despite this, Altman's plan has already won fans. "Building the best AI assistants, AIs for creators, AIs for businesses and more – that needs advances in every area of AI."
Persons: , Sam Altman doesn't, Altman, Maia, Jensen Huang, Altman's, It's, Adam Niewinski, Mark Zuckerberg Organizations: Service, Business, Microsoft, Nvidia, Bloomberg, Intel, NVIDIA, Getty, Altman, OTB Ventures, AIs Locations: AFP
Meta Platforms is planning to pay Nvidia billions of dollars this year for its cutting-edge AI technology. Nvidia's stock has been off to a blistering start this year, up more than 20%, including a 3% gain Friday. Estimates of Nvidia's share of the AI training market vary, but it is generally thought to be well above 80%. Still, Nvidia's stock more than tripled in 2023, leading the S&P 500, and has so far been a big winner in 2024. As Meta secures more AI chips, they will be placed in data centers where their computational capabilities will be utilized.
Persons: Mark Zuckerberg, Zuckerberg, Meta, they've, Jim Cramer, Wells, Wells Fargo, Aaron Rakers, Jim, Eaton, We've, Jim Cramer's, Facebook Mark Zuckerberg, Kenzo Tribouillard Organizations: Nvidia, Taiwan Semiconductor Manufacturing Company, Taiwan Semi, Meta, Apple, Facebook, Reality Labs, Eaton Corp, CNBC, European Commission, AFP, Getty Locations: Taiwan, Wells Fargo, Brussels
Nvidia's revenue triples as AI chip boom continues
2023-11-21 | by Jordan Novet | www.cnbc.com | time to read: +4 min
The company's data center revenue totaled $14.51 billion, up 279% and more than the StreetAccount consensus of $12.97 billion. With respect to guidance, Nvidia called for $20 billion in revenue for the fiscal fourth quarter. During the quarter, Nvidia announced the GH200 GPU, which has more memory than the current H100 and an additional Arm processor onboard. As recently as two years ago, sales of GPUs for playing video games on PCs were the largest source of Nvidia's revenue. Nvidia faces obstacles, including competition from AMD and lower revenue because of export restrictions that can limit sales of its GPUs in China.
Persons: Colette Kress, Kress, Raymond James, Srini Pajjuri, Jacob Silverman, Degas Wright Organizations: Nvidia, LSEG, Energy, Microsoft, AMD Locations: China, East, Australia
Wall Street Journal report says Meta is developing an AI model designed to compete with GPT-4. It's expected to be much more powerful than Llama 2, the open-source AI that Meta recently released. The Journal reported that Meta's lawyers had raised concerns about potential misuses of the company's AI model. Meta has bet on open-sourcing its AI models to cut the lead built up by its rivals.
Persons: It's, Google's Bard, Mark Zuckerberg, Zuckerberg, Meta Organizations: Meta, Service, Street Journal, Big Tech, Microsoft, Reuters, Facebook Locations: Wall, Silicon
DGX Cloud, Nvidia's supercomputer and related software accessible via a web browser, is coming to Google Cloud, too. The company also announced plans to integrate AI into its Google Workspace and Google Cloud offerings through a program called Duet AI. The firm expects "Google Cloud to remain a key driver of [the company's] top-line growth and margin improvement." Thomas Kurian, CEO of Google Cloud, speaks at a cloud computing conference held by the company in 2019.
Persons: Jim Cramer, Jim, Grace Hopper Superchip, Thomas Kurian, Jim Cramer's, Michael Short Organizations: Nvidia, Google, Club, Broadcom, Microsoft, Oracle, Wall Street, JPMorgan, CNBC, Bloomberg, Getty
About half of Nvidia's data center revenue comes from cloud providers, followed by big internet companies. The growth in Nvidia's data center business was in "compute," or AI chips, which grew 195% during the quarter, more than the overall business's growth of 171%. Some startups have even gone into debt to buy Nvidia GPUs in hopes of renting them out for a profit in the coming months. On an earnings call with analysts, Nvidia officials gave some perspective about why its data center chips are so profitable. Nvidia's AI software, called CUDA, is cited by analysts as the primary reason why customers can't easily switch to competitors like AMD.
Persons: Jen, Hsun Huang, Huang, Chaim Siegel, Elazar, OpenAI, Meta, Colette Kress, Raymond James, H100s, Jensen Huang Organizations: Consumer, Audi, Nvidia, Elazar Advisors, Microsoft, AMD, Center Locations: Las Vegas, USA
Chips as 'true differentiation': In the long run, Dekate said, Amazon's custom silicon could give it an edge in generative AI. Microsoft has yet to announce the Athena AI chip it's been working on, reportedly in partnership with AMD. So you train the machine learning models and then you run inference against those trained models," Wood said. Amazon's custom chips, from left to right, Inferentia, Trainium and Graviton are shown at Amazon's Seattle headquarters on July 13, 2023. An Amazon employee works on custom AI chips, in a jacket branded with AWS' chip Inferentia, at the AWS chip lab in Austin, Texas, on July 25, 2023.
Persons: Dekate, It's, Nitro, Stacy Rasgon, Matt Wood, Wood, Trainium, Nvidia H100s, Rasgon, Joseph Huerta, they're, Mai, Lan Tomsen Bukovec, Gartner, Swami Sivasubramanian, Sivasubramanian, Katie Tarasov Organizations: Microsoft, AWS, Amazon, CNBC, AMD, Intel, Bernstein Research, Google, Unit, Nvidia, Seattle, AI21 Labs Locations: Austin , Texas
Elon Musk has loaded up on Nvidia GPUs for X, xAI, and Tesla. Meanwhile, Chinese tech titans are reportedly scrambling to buy $5 billion worth of the chips. But there are signs emerging that there may not be enough of Nvidia's chips to go around, with multiple top executives warning that demand is massively outpacing supply. Soaring demand: The massive increase in interest in artificial intelligence has been a key factor driving demand for Nvidia's semiconductors. Perhaps the strongest sign that demand for Nvidia's chips is soaring came in May, when it released stellar second-quarter revenue forecasts that smashed Wall Street's expectations by 50%.
Persons: Elon Musk, Biden, Tesla, Adam Selipsky, Matthew Prince, there's, Barron's Organizations: Nvidia, titans, Service, Soaring, New, Research, Financial Times, Elon, Twitter, Web Locations: Wall, Silicon
July 25 (Reuters) - Artificial intelligence is expected to pay off big for tech giants including Microsoft (MSFT.O) and Alphabet (GOOGL.O) someday. Microsoft is bearing AI costs in two ways, analysts said: to power its own products such as its forthcoming $30-a-month Copilot AI assistant, and to serve companies wanting to use its Azure cloud computing services to create AI products. "They're buying a bunch of H100s," said Ben Bajarin, chief executive and principal analyst of Creative Strategies, referring to Nvidia's flagship chips for AI. Microsoft may be "aggressively buying Nvidia chips, given Microsoft does not have its own silicon as an alternative," said Atlantic Equities analyst James Cordwell. "The message on inflection point was the same," from Microsoft and Google, said Gene Munster, managing partner at Deepwater Asset Management, "but the difference was Microsoft investors wanted to see more."
Persons: Ben Bajarin, Ruth Porat, Scott Kessler, James Cordwell, Porat, Gene Munster, Stephen Nellis, Akash Sriram, Anna Tong, Max Cherney, Yuvraj Malik, Greg Bensinger, Sayantani Ghosh, Richard Chang Organizations: Microsoft, Nvidia Corp, Creative, Google, Deepwater Asset Management, Thomson Locations: Atlantic, San Francisco, Bengaluru, New York
June 29 (Reuters) - Inflection AI, a startup backed by several Silicon Valley heavyweights, said on Thursday it had raised $1.3 billion from investors including Microsoft and Nvidia, amid a boom in the artificial intelligence (AI) sector. Inflection released its chatbot Pi last month. Pi uses generative AI technology, similar to ChatGPT, to interact with users through dialogues, allowing people to ask questions and share feedback. Palo Alto, California-based Inflection AI has about 35 employees. Nvidia (NVDA.O), which has stepped up its AI investments recently, Hoffman, Bill Gates and former Google CEO Eric Schmidt also participated in the latest round, Inflection said.
Persons: Dado Ruvic, Google DeepMind, Mustafa Suleyman, Reid Hoffman, Pi, Suleyman, OpenAI, Hoffman, Bill Gates, Eric Schmidt, Niket, Krystal Hu, Vinay Dwivedi, Conor Humphries Organizations: REUTERS, Microsoft, Nvidia, Google, LinkedIn, Collision, Thomson Locations: Alto , California, Greylock, Bengaluru, Toronto
Where can a Chinese buyer purchase top-end Nvidia (NVDA.O) AI chips in the wake of U.S. sanctions? A model similar to OpenAI's GPT would require more than 30,000 Nvidia A100 cards, according to research firm TrendForce. Nvidia's more advanced H100 chips, only on the market since March, appear much harder to come by. He added the premiums currently commanded by Chinese vendors for A100 and H100 chips could collapse in the future as many of the Chinese AI startups that were driving purchases would eventually withdraw from the market. ($1 = 7.8307 Hong Kong dollars) Reporting by Josh Ye in Hong Kong, David Kirton in Shenzhen and Chen Lin in Singapore; Additional reporting by Fanny Potkin in Singapore; Editing by Brenda Goh and Edwina Gibbs.
Persons: Joe Biden's, OpenAI's, Ivan Lau, Hong, ByteDance, Vinci Chow, Charlie Chai, Josh Ye, David Kirton, Chen Lin, Fanny Potkin, Brenda Goh, Edwina Gibbs Organizations: Nvidia, SEG, Reuters, supercomputing, HK, U.S . Department of Commerce, China's, Information, Tencent Holdings, Taobao, Chinese University of Hong, Thomson Locations: HONG KONG, SHENZHEN, China, U.S, Shenzhen, Hong Kong, India, Taiwan, Singapore, Chinese University of Hong Kong, Shanghai
Nvidia's most-advanced graphics cards are selling for more than $40,000 on eBay, as demand soars for chips needed to train and deploy artificial intelligence software. The prices for Nvidia's H100 processors were noted by 3D gaming pioneer and former Meta consulting technology chief John Carmack on Twitter. The H100, announced last year, is Nvidia's latest flagship AI chip, succeeding the A100, a roughly $10,000 chip that's been called the "workhorse" for AI applications. Microsoft spent hundreds of millions of dollars on tens of thousands of Nvidia A100 chips to help build ChatGPT. Nvidia controls the vast majority of the market for AI chips.
Total: 25