We present an approach to decomposing animated graphics into sprites, a set of basic elements or layers. Our approach builds on the optimization of sprite parameters to fit the raster video. For efficiency, we assume static textures for sprites to reduce the search space while preventing artifacts using a texture prior model. To further speed up the optimization, we introduce the initialization of the sprite parameters utilizing a pre-trained video object segmentation model and user input of single frame annotations. For our study, we construct the Crello Animation dataset from an online design service and define quantitative metrics to measure the quality of the extracted sprites. Experiments show that our method significantly outperforms baselines for similar decomposition tasks in terms of the quality/efficiency tradeoff.
We applied our method to Crello Animation. The leftmost is the input, the next is the decomposed background, and the rest are the decomposed sprites for each sample. [Additional samples].
Our method can decompose sprites fast with high quality. The figure shows the trade-off between the optimization time and the decomposition errors. The right two errors are more important as quality metrics (see paper, for details). LNA and DS are Layered Neural Atlases [Kasten 2021] and Deformable Sprites [Ye 2022], respectively.
Once decomposed by our method, the sprites can be easily manipulated.
@inproceedings{suzuki2024fast,
author={Suzuki, Tomoyuki and Kikuchi, Kotaro and Yamaguchi, Kota},
title={Fast Sprite Decomposition from Animated Graphics},
booktitle={ECCV},
year={2024}
}
[Kasten 2021] Kasten, Y., Ofri, D., Wang, O., Dekel, "Layered Neural
Atlases." TOG 2021.
[Ye 2022] Ye, V., Li, Z., Tucker, R., Kanazawa, A., Snavely, N.,
"Deformable Sprites." CVPR 2022.