Re-assembling the past: The RePAIR dataset and benchmark for real world 2D and 3D puzzle solving

Theodore Tsesmelis¹*, Luca Palmieri²*, Marina Khoroshiltseva², Adeela Islam¹, Gur Elkin³, Ofir Itzhak Shahar³, Gianluca Scarpellini¹, Stefano Fiorini¹, Yaniv Ohayon³, Nadav Alali³, Sinem Aslan²,⁷, Pietro Morerio¹, Sebastiano Vascon², Elena Gravina⁴, Maria Cristina Napolitano⁴, Giuseppe Scarpati⁴, Gabriel Zuchtriegel⁴, Alexandra Spühler⁵, Michel E. Fuchs⁵, Stuart James¹,⁶, Ohad Ben-Shahar³, Marcello Pelillo², Alessio Del Bue¹
*Indicates equal contribution

¹Fondazione Istituto Italiano di Tecnologia ²Ca' Foscari University of Venice ³Ben-Gurion University of the Negev ⁴Parco Archeologico di Pompei ⁵University of Lausanne ⁶Durham University ⁷University of Milan
To be presented at the NeurIPS 2024 Datasets and Benchmarks Track

Abstract

This paper presents the RePAIR dataset, a challenging benchmark for testing modern computational and data-driven methods on puzzle-solving and reassembly tasks. The dataset has properties that are uncommon in current benchmarks for 2D and 3D puzzle solving. The fragments and fractures are realistic: they were caused by the collapse of a fresco during a World War II bombing at the Pompeii archaeological park. The fragments are also eroded, irregular in shape, and of different dimensions, and some pieces are missing, which further challenges reassembly algorithms. The dataset is multi-modal, providing high-resolution images with characteristic pictorial elements, detailed 3D scans of the fragments, and metadata annotated by archaeologists. The ground truth was generated through several years of continuous fieldwork, including the excavation and cleaning of each fragment, followed by manual puzzle solving by archaeologists on a subset of 1,000 pieces among the 16,000 available. After digitizing all the fragments in 3D, we prepared a benchmark to challenge current reassembly and puzzle-solving methods, which often address simpler synthetic scenarios. The tested baselines show that a clear gap remains in solving this computationally complex problem.

Resources

Paper

Dataset

Data Processing

Puzzle Solving (Reassembly)

A preview of the 3D fragments

A preview of the 2D fragments

Acknowledgment