NLP Applications Crash Course

Dipanjan Sarkar (~dipanjan)




Specializing in domains like computer vision and natural language processing (NLP) is no longer a luxury but a necessity expected of any data scientist in today’s fast-paced world. With a hands-on, interactive approach, we will cover essential concepts in NLP through extensive case studies and worked examples, and learn state-of-the-art tools, techniques and frameworks for applying NLP to real-world problems. We will leverage machine learning, deep learning and deep transfer learning to solve some popular NLP tasks, covering the following topics:

  • Introduction to NLP

    • Basics
    • POS Tagging
    • NER
    • Text Wrangling
  • Text Representation

    • Traditional Statistical Models – BOW, TFIDF
    • Embedding Models
  • NLP Applications

    • Text Similarity Content Recommenders
    • Topic Modeling
    • Text Summarization (Extractive & Abstractive)
    • Text Classification (ML, DNN, CNN, LSTM, BERT, DistilBERT)
    • Multi-Task NLP (Transformers)
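
To give a flavor of the Text Representation and Text Similarity items in the outline, here is a minimal sketch of TF-IDF weighting with cosine similarity, using only the Python standard library. It is an illustration and not the workshop's actual material: the helper names (`tokenize`, `tfidf_vectors`, `cosine`), the toy documents, and the smoothed IDF formula are all assumptions made for this example, and the session itself may well use library implementations instead.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive whitespace tokenizer (assumption: real text wrangling is richer).
    return text.lower().split()


def tfidf_vectors(docs):
    # Build one TF-IDF vector per document over a shared, sorted vocabulary.
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({tok for doc in tokenized for tok in doc})
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(tok for doc in tokenized for tok in set(doc))
    # Smoothed IDF (one common variant, chosen here as an assumption).
    idf = {tok: math.log((1 + n_docs) / (1 + df[tok])) + 1 for tok in vocab}
    vectors = []
    for doc in tokenized:
        counts = Counter(doc)
        vectors.append([counts[tok] * idf[tok] for tok in vocab])
    return vocab, vectors


def cosine(u, v):
    # Cosine similarity; returns 0.0 if either vector is all zeros.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Toy corpus (hypothetical): the first two documents overlap heavily.
docs = [
    "the movie was great fun",
    "the film was great fun",
    "stock prices fell sharply today",
]
vocab, vectors = tfidf_vectors(docs)

for i in range(1, len(docs)):
    print(f"similarity(doc 0, doc {i}) = {cosine(vectors[0], vectors[i]):.3f}")
```

A content recommender built on this idea would rank the other documents by their similarity to the one a user just read; the two documents that share most of their words score higher than the unrelated one.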


Prerequisites:

  • Knowledge of Python
  • Basic knowledge of ML / DL helps

Content URLs:

Slides and Code will be in my GitHub assuming this session pans out:

Speaker Info:

Dipanjan (DJ) Sarkar is a Data Science Lead at Applied Materials, leading advanced analytics efforts around computer vision, natural language processing and deep learning. He is also a Google Developer Expert in Machine Learning. He has consulted and worked with several startups as well as Fortune 500 companies like Intel and open-source organizations like Red Hat / IBM. He primarily works on leveraging data science, machine learning and deep learning to build large-scale intelligent systems. He holds a Master of Technology degree with specializations in Data Science and Software Engineering.

Dipanjan has been an analytics practitioner for several years, specializing in machine learning, natural language processing, computer vision and deep learning. With a passion for data science and education, he also acts as an AI Consultant and Mentor at various organizations like Springboard, where he helps people build their skills in areas like Data Science and Machine Learning. Dipanjan is also a published author, having written several books on R, Python, Machine Learning, Social Media Analytics, Natural Language Processing, and Deep Learning. In his spare time he loves reading, gaming, watching popular sitcoms and football, and writing interesting articles. He is also a strong supporter of open source and publishes the code and analyses from his books and articles on GitHub.

Speaker Links:

  • GitHub:
  • LinkedIn:
  • Past PyCon Talk:
  • Recent Conference Session:

Section: Data Science, Machine Learning and AI
Type: Workshop
Target Audience: Intermediate
Last Updated: