Spark DataFrames and Graph DB:- An Introduction

Piyush Gandhi (~PiyuGandhi)


34

Votes

Description:

Understanding the use of apache spark clusters in Industry to speed up computations (upto 10x) for heavy big data, use of graph database to make a vertex to vertex or node to node connection. This talk will be a hands on experience for intermediate pythoners and beginner data scientists and will include the following topics:-

  1. Introduction to spark and graph dataframes
  2. Spark DataFrames v/s SQL
  3. Basic analysis of a dataset using spark DataFrames
  4. Implementation on Databricks

Prerequisites:

Python 101 ( Basics of python)

Data Science 101 (Pandas, Numpy, data visualization)

Speaker Info:

B.Tech undergrad (senior year) at Guru Tegh Bahadur Institute Of Technology, New Delhi. I am currently interning as a Data Scientist (Financial Modelling and Fraud Detection) at RedCarpetUp. I've worked on Machine Learning projects specially in classification domain. I have also done some minor work on Computer Vision (Gender classification, Object Detection, Lane detection for Self Driving Cars in GTA Vice City), Natural Language Processing (Text similarity, Youtube comments relevancy) and currently working on my own Virtual Assistant.

Section: Data Analysis and Visualization
Type: Talks
Target Audience: Intermediate
Last Updated: