Webcrawler Markdown

A simple webcrawler application built with HTML, CSS, and JavaScript.

Overview

This project allows users to crawl a website, extract content, and save it as Markdown.

Preview

Features

Crawls a website to a specified depth.
Extracts headings, links, and paragraphs from crawled pages.
Converts extracted content to Markdown format.
Allows users to save the Markdown content to a file.
Displays status updates and logs during the crawling process.

Usage

Enter the URL of the website you want to crawl in the URL input field.
Specify the crawl depth in the depth input field.
Click the "Start Crawl" button to begin crawling.
View the extracted Markdown content in the output section.
Click the "Save Markdown" button to save the content to a file.

Implementation Details

HTML: Provides the structure and user interface of the application.
CSS: Styles the application for a better user experience.
JavaScript: Handles the crawling logic, content extraction, and Markdown conversion.

Files

index.html: The main HTML file containing the application's structure, styles, and JavaScript code.

Dependencies

No external libraries or frameworks are required.

Limitations

The crawler may not work correctly on websites with complex JavaScript or dynamic content.
The crawler may be blocked by websites with anti-scraping measures.
The extracted content may not be perfectly formatted.
The crawler does not support crawling behind login pages or forms.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Webcrawler Markdown

Overview

Preview

Features

Usage

Implementation Details

Files

Dependencies

Limitations

About

Uh oh!

Releases

Packages

Languages

NerdBaba/webcrawler-md

Folders and files

Latest commit

History

Repository files navigation

Webcrawler Markdown

Overview

Preview

Features

Usage

Implementation Details

Files

Dependencies

Limitations

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages