This repository is for practicing Scrapy, a free and open-source web-crawling framework. Thanks to the people who gave me this valuable opportunity.
- Python 3.8+
- Scrapy 2.11
- Create a Python virtual environment, which helps isolate the practice environment from the main environment and reduces the possibility of package conflicts.

  ```shell
  python -m venv my_scrapy_env
  ```

- Activate the virtual environment. The path below is for Windows; on macOS/Linux, run `source my_scrapy_env/bin/activate` instead.

  ```shell
  my_scrapy_env/Scripts/activate
  ```

- Install dependencies.

  ```shell
  pip install -r requirements.txt
  ```

  This will install the packages listed in requirements.txt:

  - scrapy
  - shub
  - scrapy-crawlera
  - google-cloud-storage
  - scrapy-sessions
Please note that this project is initialized with Scrapy 2.11. Running `scrapy startproject exercises` with Scrapy 2.4 conflicts with other packages.
Please note the log level is set to INFO.
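`LOG_LEVEL` is a standard Scrapy setting, configured in the project's settings module. A minimal sketch of the relevant line (the exact location in this project's settings file is an assumption):

```python
# Sketch of the relevant line in the project's Scrapy settings module.
# "INFO" suppresses DEBUG output such as per-request crawl logs while
# keeping spider open/close messages and item scrape counts.
LOG_LEVEL = "INFO"
```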
- Tackle World

  Inside the exercises folder, run:

  ```shell
  scrapy crawl tackleworldadelaide -O tackleworldadelaide.json
  ```

  This generates a JSON file containing product data from Tackle World.
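The `-O` flag overwrites the output file with a single JSON array of scraped items. A minimal sketch for inspecting such an export afterwards (the helper name `count_items` is hypothetical, not part of this project):

```python
import json

def count_items(path):
    """Load a Scrapy JSON export (written with -O) and return the item count."""
    with open(path, encoding="utf-8") as f:
        items = json.load(f)  # a JSON feed export is one array of item dicts
    return len(items)
```

For example, `count_items("tackleworldadelaide.json")` after a crawl reports how many products were scraped.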
- Surfboard Empire

  Inside the exercises folder, run:

  ```shell
  scrapy crawl surfboardempire -O surfboardempire.json
  ```

  This generates a JSON file containing product data from Surfboard Empire.
- Regular Expressions

  Inside the root folder, run:

  ```shell
  python regex.py
  ```

  This extracts the total number of products from an HTML element.
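A sketch of this kind of extraction (the HTML fragment and pattern below are assumptions for illustration, not the actual contents of regex.py):

```python
import re

# Hypothetical HTML fragment of the kind a product-count regex might parse.
html = '<span class="results-count">Showing 1-24 of 137 products</span>'

# Capture the number immediately before the word "products".
match = re.search(r"of\s+(\d+)\s+products", html)
total = int(match.group(1)) if match else None
print(total)  # → 137
```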