We introduce DCV-Net, a scalable transformer-based architecture for optical flow with dynamic cost volumes. Recently, FlowFormer [Huang et al., 2022], which applies transformers on the full 4D cost vol- umes instead of the visual feature maps, has shown significant improvements in the flow estimation accuracy. The major drawback of FlowFormer is its scalability for high-resolution input images, since the the com- plexity of the attention mechanism on the 4D cost volumes scales to O(N^4 ) , with N being the number of visual feature tokens. We propose a novel architecture where we obtain the FlowFormer type enrichment of matching cost representations, but using light-weight attention on the visual feature maps with quadratic ( O(N^2 ) ) compl...
This paper introduces a new algorithm for computing multi-resolution optical flow, and compares this...
State-of-the-art solutions to optical flow fail to jointly offer high density flow estimation, low p...
Abstract. The variational TV-L1 framework has become one of the most popular and successful approach...
We introduce Optical Flow TransFormer (FlowFormer), a transformer-based neural network architecture ...
Full-motion cost volumes play a central role in current state-of-the-art optical flow methods. Howev...
State-of-the-art neural network models estimate large displacement optical flow in multi-resolution ...
Transformers based on the attention mechanism have achieved impressive success in various areas. How...
Learning-based optical flow estimation has been dominated with the pipeline of cost volume with conv...
Optical flow is used to describe the variations between adjacent images of a sequence. Although the ...
Single-scale approaches to the determination of the optical flow field from the time-varying brightn...
Optical flow is a representation of projected real-world motion of the object between two consecutiv...
We propose a new neural network module, Deformable Cost Volume, for learning large displacement opti...
How important are training details and datasets to recent optical flow models like RAFT? And do they...
We attempt to reduce the computational costs in vision transformers (ViTs), which increase quadratic...
Vision transformers have shown excellent performance in computer vision tasks. As the computation co...
This paper introduces a new algorithm for computing multi-resolution optical flow, and compares this...
State-of-the-art solutions to optical flow fail to jointly offer high density flow estimation, low p...
Abstract. The variational TV-L1 framework has become one of the most popular and successful approach...
We introduce Optical Flow TransFormer (FlowFormer), a transformer-based neural network architecture ...
Full-motion cost volumes play a central role in current state-of-the-art optical flow methods. Howev...
State-of-the-art neural network models estimate large displacement optical flow in multi-resolution ...
Transformers based on the attention mechanism have achieved impressive success in various areas. How...
Learning-based optical flow estimation has been dominated with the pipeline of cost volume with conv...
Optical flow is used to describe the variations between adjacent images of a sequence. Although the ...
Single-scale approaches to the determination of the optical flow field from the time-varying brightn...
Optical flow is a representation of projected real-world motion of the object between two consecutiv...
We propose a new neural network module, Deformable Cost Volume, for learning large displacement opti...
How important are training details and datasets to recent optical flow models like RAFT? And do they...
We attempt to reduce the computational costs in vision transformers (ViTs), which increase quadratic...
Vision transformers have shown excellent performance in computer vision tasks. As the computation co...
This paper introduces a new algorithm for computing multi-resolution optical flow, and compares this...
State-of-the-art solutions to optical flow fail to jointly offer high density flow estimation, low p...
Abstract. The variational TV-L1 framework has become one of the most popular and successful approach...