
TanmayThaker/Wikipedia-Scraper


Wikipedia-Scraper

About the project

This repository contains a Wikipedia scraper that scrapes a Wikipedia page and reports its 10 most frequently used words. The backend is written in Python, and the frontend uses HTML and CSS. I have also deployed it on Heroku: https://wikipedia-scraper.herokuapp.com/

Enter a valid Wikipedia URL and click Submit.

The app then shows the results, i.e. the 10 most frequently used words on that page.
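
The word-counting step can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the code in this repository: the names `top_words` and `fetch_html` are hypothetical, the tokenizer keeps only ASCII letters, and no stop-word filtering is applied.

```python
import re
from collections import Counter
from html.parser import HTMLParser
from urllib.request import Request, urlopen


class _TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def top_words(html, n=10):
    """Return the n most frequent words in an HTML string as (word, count) pairs."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks).lower()
    # Simplification: a "word" is any run of ASCII letters.
    words = re.findall(r"[a-z]+", text)
    return Counter(words).most_common(n)


def fetch_html(url):
    """Download a page and decode it as UTF-8 (hypothetical helper)."""
    request = Request(url, headers={"User-Agent": "wikipedia-scraper-example"})
    with urlopen(request) as response:
        return response.read().decode("utf-8", errors="replace")
```

For a live page, the two helpers could be combined as `top_words(fetch_html(url))`, where `url` is the Wikipedia URL entered in the form.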

Tech Stack used

The project is built with HTML, CSS, Flask, and Python.

Install and Run

  1. Clone the repo:
$ git clone https://github.com/TanmayThaker/Wikipedia-Scraper.git
$ cd Wikipedia-Scraper
  2. Initialize and activate a virtualenv (the old --no-site-packages flag is omitted because it was removed in virtualenv 20 and its behavior is now the default):
$ virtualenv env
$ source env/bin/activate
  3. Install the dependencies:
$ pip install -r requirements.txt
  4. Run the development server:
$ python app.py
  5. Navigate to http://localhost:8000
