Word Mover's Distance for document similarity

Rishab Goel (~RishabGoel)




Word Mover's Distance is new metric to calculate document similarity. This beats LDA (Latent Dirichlet Allocation) and LSA (Latent Semantic Indexing) in terms of accuracy. This is based on state of the art google word2vec vectors and widely studied Earth Mover's Distance (in transportation). The use of word2vec gives it the power to detect the document similarity, even when 2 sentences have no word in common. This is implemented in Gensim (Topic Modelling Library) in Python.


Basics of Machine Learning(Neural Network), NLP and word2vec .

Content URLs:


Speaker Info:

Rishab Goel is a Master's in CS Student @ IIT Delhi with great interest in Deep Learning (RNNs specifically) and Data science.

Speaker Links:


Section: Others
Type: Open space
Target Audience: Intermediate
Last Updated:


Can you share your email id, so that we can send you further details.

Saurabh Sharma (~d3prof3t)

Hi Saurabh,

My email id is rgoel0112@gmail.com

Regards, Rishab

Rishab Goel (~RishabGoel)
The comment is marked as spam.


Login to add a new comment.