Research

The problems we're working on.

Seven interconnected threads, all aimed at the same gap: the one between AI that reasons about the world and AI that competently acts in it.

01 — Area

Manipulation

Dexterous policies for contact-rich tasks — grasping, assembly, tool use, and bimanual coordination.

Most useful work in the physical world involves contact: picking things up, putting them down, fitting one thing into another. The hard parts are the things that aren't in a typical RL benchmark — friction, deformation, occlusion, and the long tail of object geometries that haven't been seen before.

We work on policies that handle these conditions, on the representations that make them learnable, and on the data and simulation pipelines that make them transferable.

02 — Area

Locomotion

Robust whole-body control for legged and wheeled platforms across unstructured terrain.

Walking is solved on flat ground in a lab. It is not solved when the ground deforms, the lighting changes, the robot is carrying something asymmetric, or a sensor drops out. Locomotion in the real world is a controls problem, a perception problem, and a recovery problem all at once.

Our interest is in policies that degrade gracefully — that keep the agent upright and useful when conditions move outside the training distribution.

03 — Area

World Models

Predictive models of physical dynamics that agents can plan and reason against.

A useful world model lets an agent imagine what will happen before doing it. The hard problem is making models that are accurate enough to plan against, fast enough to roll out at the rate the agent needs, and general enough to be useful outside the conditions they were trained on.

We work on the representational and training questions behind that — what to predict, at what abstraction, and how to keep the model honest about its own uncertainty.
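As a concrete illustration, here is a minimal sketch of planning against a learned dynamics model by random shooting: sample candidate action sequences, roll each one out in imagination, and execute the first action of the best sequence. The `WorldModel` class and its interface are hypothetical stand-ins, not a description of any particular system.

```python
# A minimal sketch of planning against a learned world model via random shooting.
# The model and its predict() interface are hypothetical placeholders.
import numpy as np

class WorldModel:
    """Hypothetical one-step dynamics model: (state, action) -> predicted next state."""
    def predict(self, state: np.ndarray, action: np.ndarray) -> np.ndarray:
        # Placeholder dynamics; a learned model would go here.
        return state + 0.1 * action

def plan(model: WorldModel, state: np.ndarray, goal: np.ndarray,
         horizon: int = 10, num_candidates: int = 256, action_dim: int = 2) -> np.ndarray:
    """Sample candidate action sequences, imagine their outcomes, return the best first action."""
    best_action, best_cost = None, np.inf
    for _ in range(num_candidates):
        actions = np.random.uniform(-1.0, 1.0, size=(horizon, action_dim))
        s = state.copy()
        for a in actions:                 # roll the sequence out in imagination
            s = model.predict(s, a)
        cost = np.linalg.norm(s - goal)   # how far the imagined rollout ends from the goal
        if cost < best_cost:
            best_cost, best_action = cost, actions[0]
    return best_action

state, goal = np.zeros(2), np.array([1.0, 0.5])
print(plan(WorldModel(), state, goal))
```

The questions above all show up in this sketch: what the model predicts, whether it rolls out fast enough for the planning budget, and whether its errors compound over the horizon.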

04 — Area

Sim-to-Real

Closing the gap between simulators and the real world that policies actually deploy in.

Simulation is the cheapest way to get data, and the most reliable way to get policies that don't generalize. The gap between sim and real is not one thing — it's sensor noise, contact dynamics, actuator delay, lighting, and a dozen smaller things that compound.

We build simulators, randomization schemes, and adaptation methods that close that gap deliberately rather than accidentally — and we measure transfer honestly.
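One representative piece of that toolkit is domain randomization. The sketch below shows the basic pattern under illustrative assumptions; the parameter names and ranges are placeholders, not tuned values.

```python
# A minimal sketch of domain randomization: resample simulator parameters for
# every episode so a policy cannot overfit to one fixed set of physics.
# The ranges and parameter names below are illustrative assumptions.
import random
from dataclasses import dataclass

@dataclass
class SimParams:
    friction: float           # contact friction coefficient
    actuator_delay_ms: float  # command-to-motion latency
    sensor_noise_std: float   # additive observation noise
    mass_scale: float         # multiplier on nominal link masses

def sample_params() -> SimParams:
    """Draw one randomized simulator configuration."""
    return SimParams(
        friction=random.uniform(0.4, 1.2),
        actuator_delay_ms=random.uniform(0.0, 30.0),
        sensor_noise_std=random.uniform(0.0, 0.05),
        mass_scale=random.uniform(0.8, 1.2),
    )

# Each training episode runs under a different draw, so the policy has to work
# across the whole range rather than exploit any single simulator instance.
for episode in range(3):
    print(episode, sample_params())
```

The design question is which parameters to randomize and over what ranges: randomize too much and the task becomes unlearnable, too little and the gap stays open.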

05 — Area

Embodied Agents

Agents that combine perception, planning, and control over long horizons in physical environments.

A robot that picks up one object is a controller. A system that decides which object to pick up, when, in service of what goal, and how to recover when the plan fails — that's an agent. The interesting research questions sit at the seams between modules: how perception shapes planning, how plans constrain control, how failures get attributed back through the stack.

We work on the architectures that make those seams hold up under real-world conditions.
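A minimal sketch of that loop, with hypothetical module interfaces, makes the seams concrete: each handoff is an explicit place where a failure can be detected and attributed.

```python
# A minimal sketch of an embodied-agent loop: perception feeds planning, planning
# constrains control, and each seam reports enough to attribute failures upstream.
# All module names and interfaces here are hypothetical placeholders.

def perceive(observation):
    """Turn raw sensing into a scene estimate plus a confidence score."""
    scene = {"objects": observation.get("objects", [])}
    confidence = 0.9 if scene["objects"] else 0.2
    return scene, confidence

def plan(scene, goal):
    """Choose a sequence of subgoals; returns None if no plan is found."""
    if not scene["objects"]:
        return None
    return [{"subgoal": f"reach {scene['objects'][0]}"}, {"subgoal": goal}]

def act(step):
    """Execute one subgoal; returns whether the controller reported success."""
    return True  # placeholder controller

def run(observation, goal):
    scene, confidence = perceive(observation)
    if confidence < 0.5:
        return "failure: perception"      # attribute failure to the first seam
    steps = plan(scene, goal)
    if steps is None:
        return "failure: planning"        # scene was usable but no plan exists
    for step in steps:
        if not act(step):
            return f"failure: control at {step['subgoal']}"
    return "success"

print(run({"objects": ["mug"]}, "place mug on shelf"))
```

The stubs are deliberately trivial; the structure is the point. When a run fails, the return value says which module to look at.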

06 — Area

Perception for Action

Vision and multimodal representations grounded in what agents need to do, not what they see.

Most perception research optimises a proxy — classification accuracy, segmentation IoU, retrieval rank. Embodied agents care about something different: whether the representation supports the action that follows. A perfect segmentation that confuses a downstream grasp is worse than a noisy one that doesn't.

We study representations evaluated end-to-end on the tasks they enable, and the training regimes that produce them.
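In practice that means scoring a representation by the task outcomes it produces. Here is a minimal sketch, with a toy environment and a hypothetical encoder and grasp policy standing in for real ones.

```python
# A minimal sketch of evaluating a perceptual representation end-to-end: the
# metric is downstream grasp success, not a proxy like segmentation IoU.
# The encoder, policy, and environment are illustrative stand-ins.
import random

def evaluate_representation(encoder, grasp_policy, episodes, env):
    """Score an encoder by the task it enables, not by reconstruction or IoU."""
    successes = 0
    for _ in range(episodes):
        obs = env.reset()
        features = encoder(obs)            # representation under test
        action = grasp_policy(features)    # downstream consumer of the features
        successes += int(env.execute_grasp(action))
    return successes / episodes

# Toy stand-ins so the sketch runs end to end.
class ToyEnv:
    def reset(self):
        return [random.random() for _ in range(8)]
    def execute_grasp(self, action):
        return action > 0.5

encoder = lambda obs: sum(obs) / len(obs)
grasp_policy = lambda features: features
print(evaluate_representation(encoder, grasp_policy, episodes=100, env=ToyEnv()))
```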

07 — Area

AI Safety

Evaluation, alignment, and guarantees for embodied systems that act in the physical world.

Safety in embodied AI is concrete in a way that text-only safety often isn't. A bad action has a force, a velocity, and a consequence. Safety here means evaluation methods that surface failure modes before deployment, control architectures with verifiable bounds, and an honest accounting of what the system does not know.

We treat this as a research problem in its own right — not a release-checklist item bolted onto someone else's policy.
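One small but representative piece is a runtime safety filter. The sketch below clamps proposed actions to declared physical bounds before they reach the actuators; the limits and interfaces are illustrative assumptions, not recommendations for any real platform.

```python
# A minimal sketch of a runtime safety filter: the policy proposes an action and
# a simple shield clamps it to declared limits before it reaches the actuators.
# The bounds and the action format are illustrative assumptions.
import numpy as np

MAX_JOINT_VELOCITY = 1.0    # rad/s, assumed verified bound
MAX_COMMANDED_FORCE = 20.0  # N, assumed verified bound

def safety_filter(action: dict) -> dict:
    """Clamp a proposed action so it cannot exceed the declared physical bounds."""
    return {
        "joint_velocity": np.clip(action["joint_velocity"],
                                  -MAX_JOINT_VELOCITY, MAX_JOINT_VELOCITY),
        "force": np.clip(action["force"], 0.0, MAX_COMMANDED_FORCE),
    }

proposed = {"joint_velocity": np.array([2.3, -0.4]), "force": np.array([35.0])}
print(safety_filter(proposed))   # velocity and force clipped to the bounds
```

A filter like this only covers the last step. Most of the work sits in the evaluation and verification that justify the bounds in the first place.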

Working on adjacent problems? We'd like to hear about it.