Wikipedia, Dead Authors, Naive Bayes and Python
|Topic||Data Analysis / Engineering with Python|
|Tags||Text Classification, Wikipedia Mining, Machine Learning|
This talk is about using one of the simplest to understand machine learning algorithm, Naive Bayes Classifier. We will use it for classifying Wikipedia pages. We will look at the available implementations in NLTK and Scikits.learn.
- Introduction of the classification task
- Brief introduction to Naive Bayes Classifier
- Using the NB classifier available in NLTK
- Using the NB classifier available in Scikits.learn
I discovered Python 2 years ago and have been digging it since. I have experience in using Python for web development, text processing, machine learning & as a handy tool for everyday automation. I consider Python to be an ideal first computer language and wish more places used it for their introduction to programming courses.