Crawler (Scraper) & Laravel API

This project is a crawler designed to scrape the Magiran website and collect information about professors from Qom University. The project consists of two main components:

Crawler-Scraper (Python): Scrapes the author pages, extracts metadata such as article titles, keywords, language, and publication information, and stores the extracted data in a json.json file. The crawler uses requests and BeautifulSoup to fetch and parse HTML content.
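The extraction step described above can be sketched as follows. This is a minimal illustration, not the project's actual code: it uses the standard library's html.parser as a stand-in for BeautifulSoup so that it runs without extra packages, and it assumes the metadata lives in `<title>` and `<meta name="..." content="...">` tags. The tag names `keywords` and `language`, the sample HTML, and the names `MetaTagParser` and `extract_record` are all hypothetical; the real Magiran pages may be structured differently.

```python
import json
from html.parser import HTMLParser


class MetaTagParser(HTMLParser):
    """Collect the <title> text and <meta name=... content=...> pairs of a page."""

    def __init__(self):
        super().__init__()
        self.meta = {}          # meta name -> list of content values
        self.title = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and attrs.get("name") and attrs.get("content") is not None:
            # Repeated names (e.g. several keywords) are kept as a list.
            self.meta.setdefault(attrs["name"], []).append(attrs["content"])

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def extract_record(html):
    """Return one article record (hypothetical field names) from raw HTML."""
    parser = MetaTagParser()
    parser.feed(html)
    return {
        "title": parser.title.strip(),
        "keywords": parser.meta.get("keywords", []),
        # Assumption: at most one language per page; None when missing.
        "language": (parser.meta.get("language") or [None])[0],
    }


# Hypothetical page content, used only to exercise the parser.
SAMPLE_HTML = """
<html><head>
<title> A sample article </title>
<meta name="keywords" content="crawling">
<meta name="keywords" content="metadata">
<meta name="language" content="fa">
</head><body></body></html>
"""

record = extract_record(SAMPLE_HTML)
print(json.dumps(record, ensure_ascii=False, indent=2))
```

With BeautifulSoup the same lookup would typically be done through selectors on the parsed tree; the record shape stays the same either way.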

Laravel API: The Laravel backend manages the scraped data. The Author and Paper models define relationships and handle CRUD operations for the database. The crawler output can be sent to the Laravel API to store the scraped information in a database.

Features

- Scrapes article information (title, authors, keywords, language, etc.).
- Stores the scraped data in a json.json file.
- Laravel models (User, Paper, Author) handle data operations.

Requirements

Python (Crawler)

Python 3.x
Python packages:
- requests
- BeautifulSoup4

Laravel (API)

PHP 8.x or higher
Composer
MySQL or SQLite

How to Run

Python Crawler

- Clone the repository and navigate to the project folder.
- Install the required Python packages: pip install requests beautifulsoup4
- Edit links.txt to add the target URLs you want to scrape.
- Run the crawler: python crawler.py
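The input and output files used in these steps can be handled roughly as sketched below. The sketch assumes that links.txt holds one URL per line and that lines starting with `#` are comments; neither is confirmed by the project. The helper names `load_links` and `save_records` are hypothetical, and the demo writes to a temporary directory so it does not touch real project files.

```python
import json
import tempfile
from pathlib import Path


def load_links(path):
    """Return the non-empty, de-duplicated URLs from a text file, in file order."""
    seen = set()
    links = []
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        url = line.strip()
        if not url or url.startswith("#"):  # assumption: '#' marks a comment line
            continue
        if url not in seen:
            seen.add(url)
            links.append(url)
    return links


def save_records(records, path):
    """Write the scraped records to a JSON file, keeping non-ASCII text readable."""
    Path(path).write_text(
        json.dumps(records, ensure_ascii=False, indent=2), encoding="utf-8"
    )


# Demo with placeholder URLs in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    links_file = Path(tmp) / "links.txt"
    links_file.write_text(
        "https://example.com/a\n\nhttps://example.com/b\nhttps://example.com/a\n",
        encoding="utf-8",
    )
    links = load_links(links_file)

    output_file = Path(tmp) / "json.json"
    save_records([{"url": url} for url in links], output_file)
    saved = json.loads(output_file.read_text(encoding="utf-8"))

print(links)
```

Passing `ensure_ascii=False` keeps Persian titles and keywords human-readable in the output file instead of escaping them as `\uXXXX` sequences.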

Laravel API

- Install dependencies: composer install
- Set up your .env file for the database connection.
- Run migrations: php artisan migrate
- Run the Laravel development server: php artisan serve

The API is now ready to receive data (from the crawler) via POST requests or other CRUD operations.
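Sending one scraped record to the running API could look something like the sketch below. The route `/api/papers`, the record fields, and the helper names `build_post_request` and `send_record` are assumptions for illustration; check the project's Laravel routes for the real endpoint. The address assumes the development server is listening on its default 127.0.0.1:8000. The sketch uses urllib from the standard library; the same request can be made with requests.

```python
import json
import urllib.request

# Hypothetical route; the actual path depends on the project's Laravel routes.
API_URL = "http://127.0.0.1:8000/api/papers"


def build_post_request(record, url=API_URL):
    """Build (but do not send) a JSON POST request for one scraped record."""
    body = json.dumps(record, ensure_ascii=False).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json", "Accept": "application/json"},
        method="POST",
    )


def send_record(record, url=API_URL, timeout=10):
    """Send one record and return the HTTP status code (needs a running server)."""
    with urllib.request.urlopen(build_post_request(record, url), timeout=timeout) as response:
        return response.status


# Building the request needs no network access, so it can be inspected offline.
request = build_post_request({"title": "A sample article", "keywords": ["crawling"]})
print(request.get_method(), request.full_url)
```

Calling `send_record(record)` for each entry loaded from json.json would then push the crawler output into the database, provided the server is running and the route accepts that payload.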

zandmahsa/magiran-crawler
