Hikes_scraper Project

Scraping Komoot Website for Customized Hike Suggestions

This project utilizes Python's Scrapy, Selenium, Requests, BeautifulSoup, regex, and lambda functions to extract and manipulate hiking route details from Komoot's Poland page.

Project Introduction

Our project focuses on gathering information about hiking routes from Komoot, a leading platform for outdoor enthusiasts. We employed a combination of web scraping tools and libraries in Python, including:

Scrapy for management and automation of the scraping process.
Selenium for interacting with dynamic web content.
BeautifulSoup for parsing and extracting data from JavaScript-enabled pages.
Requests for handling HTTP requests efficiently.
Regex and lambda functions for refining and processing extracted data.

To ensure efficient extraction and processing, Scrapy Spiders were used to automate website navigation, while Selenium facilitated interaction with dynamic content. BeautifulSoup was then employed to parse and extract relevant information from JavaScript-rendered pages.

Key Features

Automated scraping of hiking routes.
Interaction with JavaScript-rendered content.
Data extraction, cleaning, and formatting using Python.
Structured dataset creation for further analysis.

Project Files

Project Description.ipynb: Detailed explanation of project workflow and implementation.
scrapy_spiders/: Contains Scrapy spider scripts for automating the scraping process.
data/: Folder containing extracted hiking route data.
requirements.txt: List of dependencies required for running the project.

Installation & Usage

Prerequisites

Python 3.x
Install dependencies:

pip install -r requirements.txt

Running the Scraper

scrapy crawl hikes_spider

Future Improvements

Expanding to other regions on Komoot.
Implementing NLP for enhanced route recommendations.
Improving data storage and visualization.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
hikes_scraper		hikes_scraper
recommend_web		recommend_web
.DS_Store		.DS_Store
Project Description.ipynb		Project Description.ipynb
README.md		README.md
banner.gif		banner.gif
komoot_page.png		komoot_page.png
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hikes_scraper Project

Scraping Komoot Website for Customized Hike Suggestions

Project Introduction

Key Features

Project Files

Installation & Usage

Prerequisites

Running the Scraper

Future Improvements

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hikes_scraper Project

Scraping Komoot Website for Customized Hike Suggestions

Project Introduction

Key Features

Project Files

Installation & Usage

Prerequisites

Running the Scraper

Future Improvements

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages