Getting Started with Python Web Scraping – Learning to collect information from websites with Python
Python is a high-level, general-purpose programming language. Its design philosophy emphasizes code readability, and its syntax lets programmers express concepts in fewer lines of code than would be possible in languages such as C++ or Java.
This video course is a collection of recipes that will come in handy when you scrape a website using Python. It addresses the usual and unusual problems of scraping by diving deep into the capabilities of Python’s web scraping tools such as Selenium, BeautifulSoup, and urllib2. The video begins by showing how to use the selenium module for scraping: setting up a web driver, debugging with the console, downloading files, and streamlining with a headless browser (PhantomJS). It then moves on to parsing with BeautifulSoup, including an introduction to BeautifulSoup objects, nested selectors, regular expression basics, and UTF-8 encoding. The video ends by showing how to fetch pages with urllib2 using the developer tools Network tab.
By the end of this video, you will understand the in-depth capabilities of Python web scraping tools.
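As a rough illustration of the fetching step described above, here is a minimal sketch that builds a request carrying headers like those you might copy from the browser's Network tab. It is not taken from the course. It assumes Python 3, where the functionality of urllib2 lives in the standard-library module urllib.request. The URL, the User-Agent string, and the helper names build_request and fetch_json are hypothetical placeholders.

```python
import json
from urllib.request import Request, urlopen


def build_request(url, user_agent="example-scraper/0.1"):
    """Build a GET request with headers similar to those seen in the Network tab.

    The header values here are illustrative assumptions, not values from the course.
    """
    headers = {
        "User-Agent": user_agent,
        "Accept": "application/json",
    }
    return Request(url, headers=headers)


def fetch_json(url):
    """Send the request and decode a JSON response body (requires network access)."""
    with urlopen(build_request(url), timeout=10) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Placeholder URL: replace it with an endpoint found in the Network tab.
    req = build_request("https://example.com/api/items")
    print(req.get_method(), req.full_url)
```

Building the request separately from sending it makes it easy to inspect the headers before any traffic leaves your machine.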
Table of Contents:
– Scraping with Selenium
– Parsing with BeautifulSoup
– Fetching with urllib2 and APIs
Publisher: Packt Publishing
Language of instruction: English
Instructor: Charles Clayton
Level: Beginner
Duration: 1 hour 36 minutes
File Size: 357 MB