AICurious Logo

What is: Pointer Sentinel-LSTM?

SourcePointer Sentinel Mixture Models
Year2000
Data SourceCC BY-SA - https://paperswithcode.com

The Pointer Sentinel-LSTM mixture model is a type of recurrent neural network that combines the advantages of standard softmax classifiers with those of a pointer component for effective and efficient language modeling. Rather than relying on the RNN hidden state to decide when to use the pointer, the model allows the pointer component itself to decide when to use the softmax vocabulary through a sentinel.