CVMay 16, 2023

Ray-Patch: An Efficient Querying for Light Field Transformers

arXiv:2305.09566v2
Originality Incremental advance
AI Analysis

This addresses computational inefficiency in light field transformers for tasks like rendering, though it appears incremental as it builds on existing transformer-based methods.

The paper tackles the problem of efficiently querying transformers to decode implicit representations into target views, achieving up to an order of magnitude increase in inference speed while maintaining task metrics.

In this paper we propose the Ray-Patch querying, a novel model to efficiently query transformers to decode implicit representations into target views. Our Ray-Patch decoding reduces the computational footprint and increases inference speed up to one order of magnitude compared to previous models, without losing global attention, and hence maintaining specific task metrics. The key idea of our novel querying is to split the target image into a set of patches, then querying the transformer for each patch to extract a set of feature vectors, which are finally decoded into the target image using convolutional layers. Our experimental results, implementing Ray-Patch in 3 different architectures and evaluating it in 2 different tasks and datasets, demonstrate and quantify the effectiveness of our method, specifically a notable boost in rendering speed for the same task metrics.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes