Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling

Posted on 2017-09-29 | In Note

A new framework created by Massachusetts Institute of Technology(in Stanford University) and Richard Socher(in Salesforce Research)

Goal: A new framework
Advantage: greatly reducing the number of trainable variables.
Experiments: Their LSTM model lowers the state of the art word-level perplexity on the Penn Treebank to 68.5.

He Guoxiu

Some notes for interesting papers, tutorials for useful tools, and inspire for life.

Github