Web Scraping for Dummies

Ipsita Das (~ipsita)


41

Votes

Description:

From data sciences to app development, data extraction is the most necessary ingredient in today's world. Everybody in software, hardware and computer science research require data in some form or the other. The challenge is to extract and accumulate relevant and useful data. Hitherto, it is a must for everyone to have a basic working idea of data extraction from the web.

In this session we will discuss how web scraping is helpful in handling our data extraction issues and how easily this can be achieved using 'BeautifulSoup'. We will discuss web scraping from scratch and also run sample code for proper visualization of the techniques we will be demonstrating. People without prior knowledge of BeautifulSoup can easily grasp the demonstrated techniques. A working knowledge of HTML and Python will just be required and absolutely nothing else. We will wrap up our session with an overview of how Selenium can be used in web scraping, in order that the audience have an overall and complete overview of web scraping as a whole. For this basic knowledge of Selenium is good enough.

We plan to keep our session lucid, comprehensible and most importantly simple, so that the audience can take back a working knowledge of web scraping. We will be exhibiting sample code for better understanding at each step.

Prerequisites:

Basic knowledge of Selenium, Python and HTML

Content URLs:

This is a demo of few topics for my talk

https://github.com/dasipsita/proposals/blob/master/Web%20Scraping%20for%20Dummies.pdf

Section: Web Development
Type: Talks
Target Audience: Beginner
Last Updated:

Looking forward to this talk

Siddharth k (~siddharth2)

Looking forward to this talk.

Aardra Kannan Ambili (~aardra)

The prerequisites section could do with further detail. "Basic knowledge" could mean a variety of things and the audience should be clear about the expectations being set up by the speaker.

sankarshan mukhopadhyay (~sankarshan)

Thanks for showing interest. I have explained that basic knowledge of Selenium, Python and HTML is needed. Audience can expect a simple and helpful talk, which will help them to get data by themselves for their projects.

Ipsita Das (~ipsita)

Login to add a new comment.