
amafjarkasi/csv-link-extractor

CSV Link Extractor

A GUI application for extracting URLs from CSV files with master list management, deduplication, and configurable settings.

Features

  • Process multiple CSV files concurrently
  • Extract URLs from a specified column
  • Maintain a master list of previously processed URLs
  • Exclude URLs using an exclude list file
  • Auto-deduplicate URLs against master list and current batch
  • Dark mode interface with three main sections:
    • Main: Primary processing controls
    • Statistics: Processing metrics and master list cleaning
    • Settings: Application configuration
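The extraction and deduplication behavior listed above can be sketched roughly as follows. This is a minimal illustration, not the application's actual code: the function name `extract_new_urls` and its parameters are hypothetical, and splitting on commas is a simplifying assumption that does not handle quoted fields (a real CSV parser would be needed for that).

```rust
use std::collections::HashSet;

/// Hypothetical helper: pull URLs from `column` of a CSV text and keep only
/// those that are not in the master list, the exclude list, or this batch.
/// ASSUMPTION: fields are split on ',' with no support for quoted commas.
fn extract_new_urls(
    csv_text: &str,
    column: &str,
    master: &HashSet<String>,
    exclude: &HashSet<String>,
) -> Vec<String> {
    let mut lines = csv_text.lines();
    // The first row is treated as the header and used to locate the column.
    let header = match lines.next() {
        Some(h) => h,
        None => return Vec::new(),
    };
    let idx = match header.split(',').position(|h| h.trim() == column) {
        Some(i) => i,
        None => return Vec::new(),
    };

    let mut seen: HashSet<String> = HashSet::new();
    let mut out = Vec::new();
    for line in lines {
        let url = match line.split(',').nth(idx) {
            Some(field) => field.trim(),
            None => continue, // row is shorter than the header
        };
        // Only standard HTTP/HTTPS URLs are accepted.
        let is_http = url.starts_with("http://") || url.starts_with("https://");
        if !is_http || master.contains(url) || exclude.contains(url) {
            continue;
        }
        // `insert` returns false if the URL already appeared in this batch.
        if seen.insert(url.to_string()) {
            out.push(url.to_string());
        }
    }
    out
}
```

URLs returned by such a function would then be appended to both the output file and the master list, so that later runs skip them.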

Settings

  • Timestamp Output Files: Automatically add timestamps to output filenames (e.g., output_20240216_235959.txt)
  • Workers: Configure number of concurrent processing threads (1-16)
  • Skip Header: Skip the first row (the header row) in CSV files
  • Continue on Error: Keep processing if individual files fail
  • Master List: Configure path to master list file for URL tracking
  • Sample CSV: Set a sample CSV to automatically detect URL column headers
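As an illustration of how the Workers and Continue on Error settings might interact, here is a minimal sketch using only the standard library. The names `clamp_workers` and `process_all` are hypothetical and not taken from the application. For simplicity, this sketch processes every file before deciding whether to report an error, whereas a real implementation might stop early.

```rust
use std::thread;

/// Clamp the requested worker count to the documented 1-16 range.
fn clamp_workers(requested: usize) -> usize {
    requested.clamp(1, 16)
}

/// Hypothetical driver: run `process` over `files` on up to `workers` threads.
/// Returns the per-file results in input order. When `continue_on_error` is
/// false, the first failure (in input order) is returned instead.
fn process_all<F>(
    files: &[String],
    workers: usize,
    continue_on_error: bool,
    process: F,
) -> Result<Vec<usize>, String>
where
    F: Fn(&str) -> Result<usize, String> + Sync,
{
    let workers = clamp_workers(workers);
    // Ceiling division so every file lands in exactly one chunk.
    let chunk_size = ((files.len() + workers - 1) / workers).max(1);
    let process = &process;

    let results: Vec<Result<usize, String>> = thread::scope(|s| {
        // Spawn one scoped thread per chunk before joining any of them.
        let handles: Vec<_> = files
            .chunks(chunk_size)
            .map(|chunk| {
                s.spawn(move || {
                    chunk
                        .iter()
                        .map(|f| process(f.as_str()))
                        .collect::<Vec<_>>()
                })
            })
            .collect();
        handles
            .into_iter()
            .flat_map(|h| h.join().expect("worker thread panicked"))
            .collect()
    });

    let mut counts = Vec::new();
    for r in results {
        match r {
            Ok(n) => counts.push(n),
            Err(_) if continue_on_error => continue, // skip failed files
            Err(e) => return Err(e),
        }
    }
    Ok(counts)
}
```

Here `process` stands in for whatever per-file work the application does, for example returning the number of URLs extracted from one CSV file.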

Statistics

  • Track total files processed
  • Count total, unique, excluded and duplicate URLs
  • Display processing time and last run timestamp
  • Reset statistics as needed
  • Clean master list to remove any duplicates
  • Enhanced visualization features:
    • Interactive domain distribution chart
    • Top 10 domains bar chart with frequency analysis
    • Historical processing trends visualization
    • Detailed statistics report generation
    • Automatic www prefix removal for cleaner domain analysis
    • Charts and reports saved in 'statistics' directory:
      • domain_distribution.png: Visual breakdown of top domains
      • historical_trends.png: URL processing trends over time
      • statistics_report.md: Comprehensive statistics report
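The domain analysis described above (www prefix removal and a top-N ranking by frequency) could look something like the sketch below. The helpers `domain_of` and `top_domains` are hypothetical names, and the host parsing is deliberately simplified: it assumes plain `http://` or `https://` URLs and does not handle user-info segments such as `user@host`.

```rust
use std::collections::HashMap;

/// Hypothetical helper: return the host of an http(s) URL, lowercased,
/// with a leading "www." removed. Returns None for other schemes.
fn domain_of(url: &str) -> Option<String> {
    let rest = url
        .strip_prefix("https://")
        .or_else(|| url.strip_prefix("http://"))?;
    // The host ends at the first '/', '?', '#', or ':' (port separator).
    let host = rest
        .split(|c: char| c == '/' || c == '?' || c == '#' || c == ':')
        .next()?
        .to_lowercase();
    let host = host.strip_prefix("www.").unwrap_or(host.as_str()).to_string();
    if host.is_empty() {
        None
    } else {
        Some(host)
    }
}

/// Count URLs per domain and return the `n` most frequent domains.
fn top_domains(urls: &[&str], n: usize) -> Vec<(String, usize)> {
    let mut counts: HashMap<String, usize> = HashMap::new();
    for url in urls {
        if let Some(domain) = domain_of(url) {
            *counts.entry(domain).or_insert(0) += 1;
        }
    }
    let mut ranked: Vec<(String, usize)> = counts.into_iter().collect();
    // Sort by count descending, then by name so ties have a fixed order.
    ranked.sort_by(|a, b| b.1.cmp(&a.1).then_with(|| a.0.cmp(&b.0)));
    ranked.truncate(n);
    ranked
}
```

Calling `top_domains(&urls, 10)` would give the data behind a "Top 10 domains" chart. Stripping `www.` first means `www.example.com` and `example.com` are counted as one domain.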

Screenshots

Main Interface (2024-02-20)

[Main Interface]

Dark theme interface with URL processing controls

Statistics Dashboard (2024-02-20)

[Statistics Dashboard]

Real-time processing statistics and history

Settings Interface (2024-02-20)

[Settings Interface]

Configure application settings

Usage

  1. Set your processing options in Settings
  2. Select input directory containing CSV files
  3. Choose output file location
  4. Select URL column from detected headers
  5. Optional: Configure exclude file path
  6. Click Process to begin extraction

All settings are automatically saved between sessions.

Building

cargo build --release

The compiled application will be available at target/release/export_csv_links.exe.

Requirements

  • Windows operating system
  • CSV files with consistent column headers
  • URLs must be in standard HTTP/HTTPS format
