Synthetic Data Generation

S J (~s2)




In this talk, I will walk you through the basics of how to generate synthetic data from your production relational tables.

This is useful when your customers do not want you to examine their production data, but you still want to build models that simulate their usage of your product.

We will look at how you can mask values, capture variation within columns, and capture dependence between the columns of a table and across tables.
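As a rough illustration of these three ideas, here is a minimal sketch in Python using only the standard library. It is not the method presented in the talk. The table, the column names (`user`, `plan`, `monthly_calls`), the salt, and the helper names `mask`, `fit`, and `sample` are all hypothetical, and the sketch assumes a numeric column is roughly normally distributed within each category.

```python
import hashlib
import random
import statistics
from collections import defaultdict

def mask(value: str, salt: str = "demo-salt") -> str:
    """Replace a sensitive value with a stable token (salted hash, truncated)."""
    return hashlib.sha256((salt + value).encode()).hexdigest()[:12]

def fit(rows):
    """Learn plan frequencies and per-plan mean/stdev of monthly_calls.

    Fitting the numeric column separately for each plan is what captures
    the dependence between the two columns (assumed model, for illustration).
    """
    by_plan = defaultdict(list)
    for row in rows:
        by_plan[row["plan"]].append(row["monthly_calls"])
    total = len(rows)
    return {
        plan: {
            "weight": len(vals) / total,
            "mean": statistics.mean(vals),
            "stdev": statistics.stdev(vals) if len(vals) > 1 else 0.0,
        }
        for plan, vals in by_plan.items()
    }

def sample(model, n, rng):
    """Draw n synthetic rows: pick a plan first, then calls given that plan."""
    plans = list(model)
    weights = [model[p]["weight"] for p in plans]
    out = []
    for i in range(n):
        plan = rng.choices(plans, weights=weights)[0]
        stats = model[plan]
        calls = max(0, round(rng.gauss(stats["mean"], stats["stdev"])))
        out.append({"user": mask(f"synthetic-{i}"), "plan": plan, "monthly_calls": calls})
    return out

# Hypothetical stand-in for a production table.
production = [
    {"user": "alice@example.com", "plan": "free", "monthly_calls": 12},
    {"user": "bob@example.com", "plan": "free", "monthly_calls": 20},
    {"user": "carol@example.com", "plan": "pro", "monthly_calls": 480},
    {"user": "dave@example.com", "plan": "pro", "monthly_calls": 530},
]

rng = random.Random(0)  # fixed seed so runs are repeatable
synthetic = sample(fit(production), 5, rng)
```

Sampling the plan first and the call count second keeps the relationship between the two columns, which independent per-column sampling would lose. Extending this across tables would mean generating parent rows first and then child rows conditioned on them, which this sketch does not attempt.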


Prior exposure to statistical concepts such as probability distributions would help but is not required.

Video URL:

Speaker Info:

Software engineer with 18+ years of experience. Currently working at Kognitos.

Speaker Links:

Links to some previous talks:

Talk on Disambiguating users in a social media graph

Talk on Lamport's TLA+

Talk on Data Streaming Algorithms

Section: Artificial Intelligence and Machine Learning
Type: Talk
Target Audience: Intermediate
Last Updated: