Apertix · Edge intelligence lab

We build AI that understands — without ever phoning home to the cloud.

Apertix designs the most efficient multimodal models — vision, language, audio — purpose-built for the silicon they run on. Decisions happen where the sensors live.

Watch the iPhone demo→ Read the research↗

Param footprint500M–400B

Latency<40 ms

Connectivity needed0 kbps

Citations125k+

01See it run

Multimodal AI on iPhone, airplane-mode.

Model: Apertix-VL 1.4B
Runtime: Core ML · Neural Engine
Latency / frame: 38 ms
Network: None · airplane mode
Power draw: <1.8 W sustained

A multimodal model the size of a song file. Vision, language, and audio fused — everything happening on the device in your hand.

02Where we deploy

Built for the places where cloud dependency is a dealbreaker.

i.Augmented reality

<40 ms

Glasses-class perception, without the round-trip.

The wearer's world, understood at the speed of attention. No cloud, no perceptible lag.

HEADSET OEMs · SPATIAL OS PARTNERS

ii.Defense & autonomy

0 kbps

Intelligence at the edge of contested spectrum.

Sovereign by default. Models run where the sensors fly — denied, jammed, or alone.

PRIME CONTRACTORS · GOV LABS

iii.Clinical & health

100%

Patient data that never leaves the room.

HIPAA, GDPR, sovereignty regimes — answered by architecture, not policy.

HOSPITAL SYSTEMS · DEVICE OEMs

iv.Industrial & energy

24/7

The line keeps running when the link goes down.

Refineries, mines, factory floors. Inspection and anomaly detection on the asset itself.

PROCESS INDUSTRIES · MAINTENANCE

03The work behind the work

Backed by decades of work that other research cites.

125k+

Citations across the lab

Architectures and training recipes now standard in the open-weights ecosystem.

500+

Peer-reviewed papers

NeurIPS, CVPR, ICLR, ICML — multimodal architectures, on-device inference, vision-transformer design.

30+

Patents granted / pending

Quantization, distillation, and sensor-fused training — the unglamorous parts that make small models work.

Open-weights releases

Compact multimodal checkpoints, released to the research community before they became products.

Built the UAE's AI foundations as researchers and faculty.Foundational work at MBZUAI · G42 · Inception. National-scale models trained, evaluated, and released.

Decades of training compact models well.Long before "small models" were fashionable — quantization, knowledge distillation, and architecture search in production since 2012.

Research that became infrastructure.Now standard building blocks inside Whisper-class systems, mobile vision pipelines, and edge VLMs.