PyCon Tutorial: Scraping the Web with Kimberly Fessel
By Metis • May 05, 2020
Last month's PyCon event, like so many others in 2020, was hosted virtually rather than in person. It was filled with interesting and informative content, including a tutorial from our very own Kimberly Fessel, Metis Sr. Data Scientist.
About her chosen tutorial topic, which she titled "It's Officially Legal so Let's Scrape the Web," Kimberly noted that "web scraping empowers you to write Python programs that collect data from websites automatically, and recent legal rulings support your right to do so."
During her allotted time, she covered the breadth and depth of web scraping, from HTML basics through pipeline methods to compile entire datasets. To get the most out of the tutorial, viewers should have a working knowledge of Python fundamentals but don't need to have any prior experience with scraping.
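For readers new to the idea, here is a minimal sketch of what "HTML basics through pipeline methods" can look like in practice. It is not code from the tutorial: the sample HTML, the `LinkCollector` class, and the `scrape_links` helper are all hypothetical names made up for illustration, and it uses only Python's standard-library `html.parser` on an inline string rather than a live website.

```python
from html.parser import HTMLParser

# Hypothetical page content standing in for a downloaded web page.
SAMPLE_HTML = """
<html><body>
  <h1>Example listings</h1>
  <a href="/page/1">First page</a>
  <a href="/page/2">Second page</a>
</body></html>
"""


class LinkCollector(HTMLParser):
    """Collect the href and visible text of every <a> tag that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []            # finished rows, one dict per link
        self._current_href = None  # href of the <a> tag we are inside, if any
        self._text_parts = []      # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._current_href = dict(attrs).get("href")
            self._text_parts = []

    def handle_data(self, data):
        if self._current_href is not None:
            self._text_parts.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._current_href is not None:
            text = "".join(self._text_parts).strip()
            self.links.append({"href": self._current_href, "text": text})
            self._current_href = None


def scrape_links(html):
    """Parse an HTML string and return a list of {'href', 'text'} rows."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


rows = scrape_links(SAMPLE_HTML)
for row in rows:
    print(row["href"], row["text"])
```

In a real pipeline the HTML would come from an HTTP request, and the resulting rows would typically be written to a file or loaded into a table to build up a dataset.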
Kimberly Fessel is a Sr. Data Scientist at Metis, where she teaches the bootcamp in NYC. Her professional interests include natural language processing, data visualization, and data storytelling, and her enthusiasm for teaching comes from her days as an academic. She holds a Ph.D. in applied mathematics from Rensselaer Polytechnic Institute, and she completed a postdoctoral fellowship in math biology at the Ohio State University.