What is MapAnything? MapAnything is a single transformer model that turns any set of 1–2 000 ordinary photos into a metric-accurate 3D point-cloud and full camera calibration in one forward pass—no bundle adjustment, no hand-tuned pipelines. Why Do We Need Yet Another 3D Reconstruction Model? Because every existing pipeline is still a Rube-Goldberg machine: feature extraction, matching, relative pose, triangulation, bundle adjustment, dense stereo, scale recovery, global alignment… swap one sensor and you re-write three modules. MapAnything collapses the stack into one feed-forward network that accepts images + optional intrinsics, poses or depth outputs metric 3D geometry + cameras for …