Setup

# create a virtual environment (to isolate your libraries)
python3 -m venv venv

# activate virtual environment "venv"
source venv/bin/activate

# install libraries
pip install --upgrade pip
pip install -r requirements.txt

# you can exi
587C
t venv using the command
deactivate

To run the scrapper, create index and test the search algorithms

After setup, you need to run the scripts in the following order to check the results

# will scrap data from wikipedia, reading links from a local txt file, create local html files and export data in csv and xlsx 
python scrapper.py

#create the inverse index file
python create_index.py

# show the outputs of the functions country_search(keyword) and fuzzy_search(keyword)  
python search.py toronto

#or

python search.py toronta

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
html		html
.gitignore		.gitignore
README.md		README.md
countries.csv		countries.csv
countries.xlsx		countries.xlsx
create_index.py		create_index.py
inverted_index.json		inverted_index.json
link.txt		link.txt
requirements.txt		requirements.txt
scrapper.py		scrapper.py
search.py		search.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Setup

To run the scrapper, create index and test the search algorithms

About

Uh oh!

Releases

Packages

Uh oh!

Languages

eduardomp/countries-web-scraping

Folders and files

Latest commit

History

Repository files navigation

Setup

To run the scrapper, create index and test the search algorithms

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages