Why AI Industry needs Revision Control Graph Database

Cheuk Ting Ho (~Cheukting)


Description:

Data is the driving force for ML&AI. As technology advances, we are dealing with bigger and more complicated data. Through use cases, Cheuk will show you how a graph database with revision control would revolutionize the AI industry by providing a logical way of storing data and providing data ops like branch, merge and rollback.

Cheuk will show you why graph database with revision control would revolutionize the AI industry by providing a logical way of storing data and powerful data ops. This talk mainly consists of 3 parts,

In the first part, Cheuk will explain what is a knowledge graph and the benefit of it over SQL databases. By storing data in a graph format, information can be implied in the data itself and knowledge of the information can be extracted by logical queries. Machine learning (ML) and natural language programming (NLP) can be benefit from representing data in a knowledge graph.

Then, Cheuk will demonstrate how a revision control database that is fully capable of git like operations - branch, merge, fork, clone and rollback, can benefit data ops and encourage data openness. Cheuk will demonstrate use-cases where such databases can be applied to provide a data ops pipeline for machine learning projects. Cheuk will also make some comparison of this “new method” with the “traditional way” of completing the same task.

In the last part, Cheuk will discuss why there are limited choices in graph databases that have revision control capabilities among the open-source software.

This talk is for those who are interested in Data Science and would like to look for a new solution for storing their data and have better data ops.

Prerequisites:

This talk will assume no prior knowledge from the audience.

Speaker Info:

After spending 5 years doing computational research in Physics, Cheuk has transferred her analytical and logical skills in natural science and built a career in data science. Cheuk has been a Data Scientist in various companies which demands high numerical and programmatical skills, especially in Python. To follow her passion for the tech community, now Cheuk is the Developer Relations Lead at TerminusDB - an open-source graph database. Cheuk maintains its Python client and engages with its user community daily.

Besides her work, Cheuk enjoys talking about Python in personal streaming platform and MidMeetPy podcast. Cheuk has also been a guest speaker at Universities and various conferences. On top of speaking at conferences, Cheuk also participates as organizers. Conferences that Cheuk has organized include EuroPython(which she is a board member of), PyData Global and Pyjamas Conf. Believing in gender equality, Cheuk constantly organizes workshops and mentored sprints to support Tech Diversity and Inclusion.

Speaker Links:

Website: https://cheuk.dev Twitter: https://twitter.com/cheukting_ho GitHub: https://github.com/Cheukting

Section: Data Science, Machine Learning and AI
Type: Talks
Target Audience: Beginner
Last Updated: