PrismML Launches World's First 1-Bit AI Model to Redefine Intelligence at the Edge
Mar 31, 2026 1:20 PM

PrismML's Technology Drastically Improves the Power-to-Compute Equation in Datacenters

Breakthrough 1-bit Bonsai 8B Model Enables Advanced Intelligence to Run Locally on Phones, Laptops, and Other Edge Devices

PASADENA, Calif., March 31, 2026 /PRNewswire/ -- PrismML, a pioneer in high-performance AI models, today emerged from stealth to introduce the world's first commercially viable 1-bit large language models, built on groundbreaking research developed at Caltech. PrismML's sweeping goal is to enable a future where powerful AI can run locally, efficiently, securely, and faster, and where datacenter buildouts can do more with fewer resources and avoid ballooning energy costs.

Its flagship model, 1-bit Bonsai 8B, represents a fundamental shift in how AI is deployed: delivering cutting-edge capabilities while operating efficiently on consumer and industrial edge devices, including smartphones, laptops, and embedded systems.

"AI's future will not be defined by who can build the largest datacenters," said Vinod Khosla, Founder of Khosla Ventures and an investor in the company. "It will be defined by who can deliver the most intelligence per unit of energy and cost. PrismML represents that kind of breakthrough."

As AI models grow larger and more computationally intensive, deploying advanced intelligence has increasingly required massive datacenter infrastructure. This limits real-time, on-device AI experiences due to latency, hardware, and privacy constraints.

PrismML addresses this challenge by fundamentally rethinking neural networks at the mathematical level. Instead of traditional 16- or 32-bit architectures, the company creates models with a native 1-bit structure. This dramatically reduces inference compute and memory requirements without sacrificing reasoning performance.
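To make "1-bit" concrete, here is a minimal, generic sketch of binary (sign-only) weights in plain Python. It is not PrismML's method, which the release does not describe. The function names `pack_signs` and `binary_dot`, the sample values, and the mean-magnitude scale factor are all assumptions chosen for illustration.

```python
# Generic illustration of 1-bit (sign-only) weights.
# NOT PrismML's algorithm: names, values, and the scale choice are hypothetical.

def pack_signs(weights):
    """Pack weight signs into an int bitmask: bit i is 1 if weights[i] >= 0."""
    bits = 0
    for i, w in enumerate(weights):
        if w >= 0:
            bits |= 1 << i
    return bits


def binary_dot(bits, n, activations, scale=1.0):
    """Dot product of activations with the +1/-1 weights encoded in `bits`."""
    total = 0.0
    for i in range(n):
        sign = 1.0 if (bits >> i) & 1 else -1.0
        total += sign * activations[i]
    return scale * total


weights = [0.7, -1.2, 0.1, -0.4]       # full-precision weights (made up)
activations = [1.0, 2.0, 3.0, 4.0]     # input vector (made up)

bits = pack_signs(weights)             # one bit of storage per weight
# Assumed scale: mean absolute weight, one simple way to keep magnitudes comparable.
scale = sum(abs(w) for w in weights) / len(weights)
approx = binary_dot(bits, len(weights), activations, scale)
exact = sum(w * a for w, a in zip(weights, activations))
```

Each weight here costs one bit instead of 16 or 32, and the multiply in the dot product reduces to an add or subtract. `approx` and `exact` differ, which is the accuracy gap that any 1-bit scheme has to close through training or other techniques.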

On a range of intelligence benchmarks, 1-bit Bonsai 8B is competitive with leading full-precision 8B models, including Llama3 8B, while being:

14x smaller
8x faster
4-5x more energy efficient

This efficiency enables developers to build sophisticated AI applications that execute directly on devices, reducing reliance on the cloud and unlocking a new generation of edge-first applications in robotics, wearables, and personal computing that were previously impractical.

"We spent years developing the mathematical theory required to compress a neural network without losing its reasoning capabilities," said Babak Hassibi, CEO and Founder of PrismML and Professor at Caltech. "We see 1-bit not as an endpoint, but as a starting point. We are creating a new paradigm for AI: one that adapts to diverse hardware environments and delivers maximum intelligence per unit of compute and energy."

While the immediate impact is at the edge, the implications extend to the cloud. The same efficiency gains that enable local deployment also allow datacenters to operate more effectively by improving hardware utilization, lowering operating costs, and reducing energy consumption.

"From a systems perspective, reducing models to 1-bit representations changes the optimization equation," said Ion Stoica, Databricks Co-Founder and Professor at UC Berkeley. "It enables a new class of AI systems that can both operate efficiently at the edge and scale economically in the cloud."

Bill Jia, VP of Engineering at Google, Core ML/AI, added: "When advanced models can run on constrained devices, it reshapes system design end to end. Efficiency at the model level compounds across infrastructure."

PrismML's technology can also impact future AI hardware design.

Amir Salek of Cerberus Ventures, an investor in the company who also founded and led the TPU program at Google, commented: "Power has become the ultimate bottleneck for scaling AI datacenters, and PrismML is fundamentally transforming the power-to-compute equation. Moreover, by reducing the memory footprint and bandwidth demands, this breakthrough technology has the potential to do more than just improve the economics of AI infrastructure; it can unlock a new frontier for innovation in computer architecture for AI inference and the next generation of AI models."

With today's launch, PrismML moves this architectural breakthrough from research to reality, placing the power of 1-bit AI directly into the hands of users, developers, and researchers.

Technical Details:

The 1-bit Bonsai 8B model is an 8-billion-parameter large language model in which each parameter has 1-bit precision. It was trained on Google v4 TPUs. It is designed for seamless integration with existing AI workflows and is optimized for low-latency inference on consumer-grade CPUs, NPUs, and edge GPUs. The model achieves high-fidelity reasoning and language understanding comparable to FP16 (16-bit floating point) 8B models, but with a fraction of the memory footprint (1GB vs 16GB). PrismML is also releasing 1-bit Bonsai 4B and 1.7B models, with memory footprints of 0.5GB and 0.24GB, respectively.
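The footprint figures can be roughly checked with simple arithmetic: parameter count times bits per parameter, divided by 8 to get bytes. The sketch below assumes decimal gigabytes (1 GB = 10^9 bytes) and counts weights only. The helper `weight_memory_gb` and the `MODELS` table are illustrative names, not part of any PrismML tooling.

```python
# Back-of-the-envelope weight storage: parameters x bits per parameter.
# Weights only. Ignores activations, KV cache, and any scale factors or
# file-format overhead, none of which the release describes.

def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Return weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# Parameter counts as stated in the release.
MODELS = {"8B": 8e9, "4B": 4e9, "1.7B": 1.7e9}


def footprint_table():
    """Map model name -> (1-bit GB, FP16 GB)."""
    return {
        name: (weight_memory_gb(n, 1), weight_memory_gb(n, 16))
        for name, n in MODELS.items()
    }


if __name__ == "__main__":
    for name, (one_bit, fp16) in footprint_table().items():
        print(f"{name}: 1-bit ~{one_bit:.2f} GB, FP16 ~{fp16:.1f} GB")
```

Under these assumptions the 8B model comes to about 1 GB at 1 bit versus 16 GB at FP16, and the 4B model to about 0.5 GB, matching the stated figures. The 1.7B model comes to about 0.21 GB, slightly below the stated 0.24GB. That gap, like the "14x smaller" claim versus a raw 16x ratio, may reflect storage beyond pure 1-bit weights, though the release does not say.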

Pricing and Availability:

Developers, researchers, and other users can download the 1-bit Bonsai models under the Apache 2.0 license for free starting today. 

Download 1-bit Bonsai Models here

Read the Whitepaper here

About PrismML:

PrismML is a U.S.-based artificial intelligence company focused on making AI more efficient and accessible. PrismML is built on proprietary Caltech intellectual property and backed by Khosla Ventures, Cerberus Ventures, and compute grants from Google and Caltech. For more information, visit the Website, LinkedIn, or X. 

All registered trademarks and product identifiers belong to their respective corporate entities. Any other trademarks or product names referenced here are also owned exclusively by their relevant companies.

Media Contact

Gary Bird

PrismML

[email protected]

831.888.9011

View original content to download multimedia: https://www.prnewswire.com/news-releases/prismml-launches-worlds-first-1-bit-ai-model-to-redefine-intelligence-at-the-edge-302730568.html

SOURCE PrismML
