Long Short-term memory (LSTM) Recurrent Neural Network (RNN) to classify movie reviews

A major characteristic of all neural networks I have used so far, such as densely connected networks and convnets (CNN) (see my previous post), is that they have no memory. Each input shown to them is processed independently, with no state kept in between inputs. In other words, they do not take into the context of the words (the words around the word). Imagine you’re reading a book, and you want to understand the story by keeping track of what’s happening in the plot.