Python program to get the Source Code of a Webpage


Posted on 24 January 2018

In this article we are going to create a simple web scraper python program which will fetch the source code a particular webpage, we are going to use pythons urllib module which is mainly used to fetch data across the world wide web.

So let's start by importing the urllib module the module is divided into three parts in python3 theurllib.request, urllib.parse and urllib.error in our code, we are going to use urllib.request

Also Read: Run Length Encoding (RLE) Program in Python

import urllib.request
import argparse

Now let's write a python function to fetch the webpage source code let's name the function as getCode()

def getCode(url):

    raw = urllib.request.urlopen(url).read()
    code = raw.decode()

If you are wondering "that's it" then you are totally correct we need this much lines of code to fetch the HTML source code of a webpage( Python's Swagcool).

In the above code we are simply opening a connection between our machine and host URL then we are reading the webpage data in form of bytestream and after that, we decoded the data into a readable format using the decode() function.  

Now let's see the complete program for fetching the source code of a webpage in python.

import urllib.request as ul
import argparse

def getCode(url):

    raw = ul.urlopen(url).read()
    code = raw.decode()

if __name__ == '__main__':

    parser = argparse.ArgumentParser(description="Hostname")
    args = parser.parse_args()
    url = args.url

to run the above program you need to pass an extra parameter '--url' as shown below

python --url=

Output :

Python Program to fetch the source code of a webpage

If You Love this article, You Should Consider:

  • Like us on Facebook
  • Follow us on Instagram
  • Follow us on Twitter
  • Subscribe to our Newsletter.
  • Let us know your suggestions and queries in the comments below.

Thank you for your Love and Support

Share your thoughts