March 14, 2025
[2402.14327] Subobject-level Image Tokenization
[Submitted on 22 Feb 2024 (v1), last revised 12 Mar 2025 (this version, v3)] View a PDF of the paper titled Subobject-level Image Tokenization, by Delong Chen and 4 other authors View PDF HTML (experimental) Abstract:Patch-based image tokenization ignores the morphology of the visual world, limiting effective and efficient learning of image understanding. Inspired by subword tokenization, we introduce subobject-level adaptive token segmentation and explore several approaches, including superpixel, SAM, and a proposed Efficient and PanOptiC (EPOC) image tokenizer. Our EPOC combines boundary detection — a simple task that can be handled well by a compact model — with watershed