Document Clustering with Word2vec and Hierarchical Clusters

Karmanya Aggarwal (~CalmDownKarm)




The talk is about topic modelling overall, but I'd like to focus on two things in particular:

  1. Running LDA on a dataset to extract the most popular themes, then using word2vec and hierarchical (agglomerative) clustering to group those themes into a fixed number of labels. This is similar to what Google's NLP classification API attempts to do.

  2. Visualizing clusters of words, sentences and phrases using dendrograms and t-SNE.
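
As a rough illustration of the clustering step, here is a minimal sketch that assumes the themes have already been extracted and embedded. The theme words and the 3-dimensional vectors are made up for the example (real word2vec vectors would come from a trained model and have far more dimensions), and average linkage on cosine distance is just one reasonable choice, not necessarily the setup used in the talk.

```python
import numpy as np
from scipy.cluster.hierarchy import dendrogram, fcluster, linkage

# Hypothetical theme words, standing in for what LDA might surface.
themes = ["football", "cricket", "tennis", "election", "parliament", "senate"]

# Toy 3-d stand-ins for word2vec embeddings (assumption: real vectors
# would be looked up from a trained word2vec model).
vectors = np.array([
    [0.90, 0.10, 0.00],
    [0.80, 0.20, 0.10],
    [0.85, 0.15, 0.05],
    [0.10, 0.90, 0.20],
    [0.00, 0.80, 0.30],
    [0.05, 0.85, 0.25],
])

# Build the agglomerative merge tree: average linkage on cosine distance.
tree = linkage(vectors, method="average", metric="cosine")

# Cut the tree so that at most n_labels clusters remain.
n_labels = 2
labels = fcluster(tree, t=n_labels, criterion="maxclust")

# Group the theme words by their assigned cluster label.
clusters = {}
for word, label in zip(themes, labels):
    clusters.setdefault(int(label), []).append(word)
print(clusters)

# no_plot=True returns the dendrogram layout without drawing it;
# drop the flag (with matplotlib installed) to render the actual plot.
layout = dendrogram(tree, labels=themes, no_plot=True)
leaf_order = layout["ivl"]
print(leaf_order)
```

Cutting the tree with `criterion="maxclust"` is what fits the themes into a fixed number of labels. The same `tree` feeds the dendrogram, so the plot and the label assignment always agree.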

Finally, if time permits, I'd like to talk about Stitch Fix's LDA2vec approach, but I expect the first two topics to fill the 30 minutes unless the audience is already very familiar with this material.


Some familiarity with clustering (e.g. k-means) is helpful, but not required.

Content URLs: (Blog Post)

Speaker Info:

Recently graduated from BML Munjal University, Developer at Gramener.

Speaker Links:

Section: Data science
Type: Talks
Target Audience: Intermediate
Last Updated:
