Breaking the Scalability Limit of Multi-Projector Calibration with Embedded Cameras

The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026 (Oral)
Abstract
Conventional multi-projector calibration requires projecting and capturing structured light patterns for each projector sequentially, causing calibration time and effort to increase linearly with the number of projectors. This scalability bottleneck has long limited the deployment of large-scale projection mapping systems. We present a new calibration framework that breaks this limitation by embedding cameras into the surface of the calibration target. The embedded cameras directly capture the incoming projection light, enabling the separation of simultaneously projected structured light patterns from multiple projectors according to their incident directions. Our method establishes correspondences between the optical centers of the embedded cameras and the projector pixels, allowing the intrinsic and extrinsic parameters of all projectors to be simultaneously estimated. We further introduce a correction technique for small misalignments between the calibration board and camera optical centers. As a result, our system achieves calibration accuracy comparable to conventional methods while reducing the required number of projection-capture cycles from linear to nearly constant with respect to the number of projectors, dramatically improving scalability for dense multi-projector systems with overlapping projection regions, such as high-brightness stacking, super-resolution, light-field, and shadow-suppression displays.
📺 The teaser video is coming soon!
)](/en/projects/26-06-06-kawano-cvpr/kawano-cvpr26-poster_hu512b0ef0b563c93d5d610ae4bebe448a_656625_e4a5a09e9b8cbe98a2a78368f2c0e616.webp)
Principle
When projector 1 fires, its red light hits the surface and scatters diffusely in all directions. The camera records a red pixel. Add projector 2’s blue light, and the two scatters mix into purple at the same camera pixel. We can’t tell which color came from which projector — and that’s why simultaneous projection fails.
Our idea is to turn the camera into the target itself. We embed the camera into the board, with its optical center aligned with the board surface. The lens sits flush with the plane, and the image sensor goes behind it. So the camera is no longer observing the target — it is the target. Now both rays still pass through the same point on the board but they continue to different pixels on the image sensor, determined by their incident angles. This is the core idea of our method.
In practice, the optical center is never perfectly on the board surface. And critically, As the projector moves, crossing point slides along the surface. So the naive assumption that one camera equals one fixed point breaks down.
Our fix: we calibrate this offset once, offline. Place one projector at $K$ different positions, and for each position record the pair $c$ and $k$. Both clusters are related by a projective transformation — we fit it as a homography, $M_n$. This is done once per embedded camera, and only once.
When a new projector emits light during actual calibration, the embedded camera records a sensor pixel $c$. We push it through $M_n$ to recover the true board-surface coordinate $x$, and use this as world coordinate of a target.
Experimental setup

Results
Experiment with Gray-code Projection Using 25 Projectors

Experiment with Line-shift Projection Using 3 Projectors
Experiment evaluating robustness to ambient light with two projectors outdoors

Citation
Takumi Kawano, Kohei Miura, Daisuke Iwai, “Breaking the Scalability Limit of Multi-Projector Calibration with Embedded Cameras,” In Proceedings of The IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 21573-21582 (2026).