AICurious Logo

What is: MoViNet?

SourceMoViNets: Mobile Video Networks for Efficient Video Recognition
Year2000
Data SourceCC BY-SA - https://paperswithcode.com

Mobile Video Network, or MoViNet, is a type of computation and memory efficient video network that can operate on streaming video for online inference. Three techniques are used to improve efficiency while reducing the peak memory usage of 3D CNNs. First, a video network search space is designed and neural architecture search employed to generate efficient and diverse 3D CNN architectures. Second, a Stream Buffer technique is introduced that decouples memory from video clip duration, allowing 3D CNNs to embed arbitrary-length streaming video sequences for both training and inference with a small constant memory footprint. Third, a simple ensembling technique is used to improve accuracy further without sacrificing efficiency.