AICurious Logo

What is: Neural Network Compression Framework?

SourceNeural Network Compression Framework for fast model inference
Year2000
Data SourceCC BY-SA - https://paperswithcode.com

Neural Network Compression Framework, or NNCF, is a Python-based framework for neural network compression with fine-tuning. It leverages recent advances of various network compression methods and implements some of them, namely quantization, sparsity, filter pruning and binarization. These methods allow producing more hardware-friendly models that can be efficiently run on general-purpose hardware computation units (CPU, GPU) or specialized deep learning accelerators.