From scratch to ML - The machine learning library you really understand and explaining its predictions with LIME.

Mohit Rathore (~markroxor) | 24 Sep, 2018

4

Votes

Description:

The aim of this dev-sprint is to give a hands on coding experience for writing machine learning / deep learning algorithms from scratch without using external frameworks alongside visualising the model and explaining its predictions using LIME.

from-scratch-to-ml

The primary goals of this library is -
- This framework is intended to be an educational tool to learn deep Learning.
- To bridge the gap between the theoretical and coding aspects of machine learning algorithms.
- To write intuitive blogs as python notebooks so as to juxtapose theory and code. Explaining the fundamentals of the algorithm from the very basics.
- To minimise the use of external dependencies except the fundamental ones like numpy and matplotlib. - To make sure that the developed algorithms are coherent with already existing machine learning frameworks.

The library is still in a nascent stage but will take shape in a couple of months. Given that the commit frequency is huge. The audience is requested to be patient.

LIME (Local Interpretable Model-Agnostic Explanations) -

When you are writing a machine algorithm from scratch you want to make sure that your results are coherent and your model is learning the features it is meant to learn. LIME explains why your model behaved the way it did. <div style="width:100%;text-align:center;"> <a href="https://www.youtube.com/watch?v=hUnRCxnydCc" target="_blank"><img src="https://raw.githubusercontent.com/marcotcr/lime/master/doc/images/video_screenshot.png" width="450" alt="KDD promo video"/></a> </div> I will quote excerpts from their blog below -

Imagine we want to explain a classifier that predicts how likely it is for the image to contain a tree frog. We take the image on the left and divide it into interpretable components (contiguous superpixels).

As illustrated below, we then generate a data set of perturbed instances by turning some of the interpretable components “off” (in this case, making them gray). For each perturbed instance, we get the probability that a tree frog is in the image according to the model. We then learn a simple (linear) model on this data set, which is locally weighted—that is, we care more about making mistakes in perturbed instances that are more similar to the original image. In the end, we present the superpixels with highest positive weights as an explanation, graying out everything else.

Even from a human's perspective these explanations do make sense.

SOURCE

Prerequisites:

Basic python libraries like numpy and matplotlib.
High school level understanding of mathematics and calculus
and of course - python.

Content URLs:

The work in progress repository of all the associated code - fromscratchtoml.
The official website of fromscratchtoml.
The work in progress python notebooks.
The author's github profile.
Sample slides will be uploaded here.

Speaker Info:

I have graduated from IIT ISM Dhanbad in 2017.

Formerly I worked for a London based startup - ALIS labs, currently I am a research fellow at CVIT Lab IIIT Hyderabad alongside being the author of fromscratchtoml.

I am also RaRe's incubator program member - the same organization which looks after the reputed topic modelling library gensim.

I have given prep talks and mentored dev sprint on the same in Hyderabad Python Meetup group twice.

Speaker Links:

Author's open source contribution can be seen at his github profile where it all started.
Author's current blog where he discussed a 'bit' about the impact of AI.
Author's old blog archive where he talked about random developer stuff.
Author's another delusional repository which he has trouble explaining to people.
Author sometimes also blogs for RaRe technologies.
Author is omnipresent on the web by the handle markroxor.

Section:	Data science
Type:	Dev Sprint
Target Audience:	Advanced
Last Updated:	25 Sep, 2018

Comments