Skip to content
AI Usage3 min read

Optical AI Startups Target 90% Energy Reduction for Inference

UK startup Lumai launches the first optical computing system to run billion-parameter AI models in real-time, targeting 90% energy savings over traditional silicon architectures.

AB

Author

AUG Bot

Published

Optical computing architecture and laser-based data processing

Optical AI Startups Target 90% Energy Reduction for Inference

Lumai details Iris server family using photons for machine learning workloads

The focus of AI infrastructure is rapidly shifting from model training to inference, driving a new wave of specialized hardware designed to bypass the efficiency limits of traditional silicon. UK-based startup Lumai has detailed its optical computing architecture, which uses light instead of electrons to perform core mathematical operations, promising to slash AI energy consumption by up to 90%.

Key details

Lumai recently launched its Iris family of inference servers—including the Nova, Aura, and Tetra systems—marking the first time billion-parameter large language models (LLMs) have been run in real-time on an optical computing system. Unlike conventional GPUs that rely on digital silicon, Lumai’s architecture uses a hybrid electro-optical approach. While digital processing handles system control, an optical tensor core performs the massive matrix multiplications required for LLM inference.

The startup claims its next-generation Iris Tetra systems are targeting an exaOPS of AI performance within a 10kW power budget by 2029. Current evaluations with hyperscalers and "neoclouds" demonstrate that the technology can handle models like Llama 3.1 8B and 70B while dramatically reducing the "energy wall" that currently constrains data center expansion.

Why this matters

As AI adoption scales, the energy demand for inference is expected to surpass training, putting immense strain on global power grids. Traditional silicon-based architectures are hitting thermal and physical limits, where each incremental performance gain requires disproportionately more power. By moving from electrons to photons, optical compute offers a potential 10x increase in performance-per-watt, enabling AI scaling that is otherwise environmentally and economically unsustainable.

Context

The emergence of optical compute comes as the industry moves toward "disaggregated inference." Companies like NVIDIA, AWS, and Intel are increasingly pairing different types of hardware for "prefill" (compute-heavy) and "decode" (bandwidth-constrained) operations. Lumai is positioning its optical processors to excel in the prefill stage, where they can process tokens at massive scale with minimal heat waste compared to traditional high-end GPUs.

What happens next

Lumai has opened its Iris Nova servers for evaluation by hyperscalers and research institutions. The company plans to refine its 3D optical architecture to support increasingly larger models and tighter integration with existing data center cooling and power infrastructure. As utility companies and regulators begin to mandate stricter energy efficiency standards for data centers, the commercial adoption of post-silicon technologies like optical compute will be a critical trend to watch through the end of the decade.


Source: The Register Published on AI Usage Global, author: AUG Bot

Related

Read more

More posts that expand on the topics, companies, and AI trends covered in this story.

Digital representation of data center infrastructure and rising electricity cost metrics
AI Usage

AI Data Center Growth Could Raise U.S. Power Costs 29% by 2030

A new study from North Carolina State University warns that surging AI data center demand could drive a 29% increase in national electricity costs for households.

Digital representation of a data center and legal scale symbolizing DOJ intervention
AI Usage

DOJ Intervenes to Halt Air Pollution Lawsuit Against xAI Data Center

The US Department of Justice moves to dismiss a Clean Air Act lawsuit against Elon Musk's xAI, citing national security risks linked to AI military tools.

Digital representation of community opposition and massive AI data center infrastructure in Pennsylvania
AI Usage

Pennsylvania Community Rejection Halts 1.6 GW AI Data Center Project

Archbald residents successfully block the massive 1.6 GW Wildcat Ridge AI data center campus, citing concerns over grid strain and water usage.