EPIC Data Lab · GitHub
@ucbepic

EPIC Data Lab

Effective Programming Interaction and Computation with Data

Popular repositories

  1. docetl Public

    A system for agentic LLM-powered data processing and ETL

    Python · 2k stars · 186 forks

  2. TWIX Public

    TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by inferring the shared underlying visual template across documents

    Python · 179 stars · 7 forks

  3. BARGAIN Public

    Low-Cost LLM-Powered Data Processing with Theoretical Guarantees

    Python · 16 stars · 2 forks

  4. pdf_parser Public

    Parse PDFs using computer vision, layout analysis, and other state-of-the-art document intelligence techniques. WebApp implemented in Flask/Jinja2 with infer and train pipelines managed by FlorDB

    JavaScript · 7 stars

  5. docetl-examples Public

    Examples of docetl pipelines

    Python · 2 stars

  6. ml_tutorial Public

    Introduction to FlorDB with PyTorch and TensorFlow

    Jupyter Notebook

Repositories

Showing 6 of 6 repositories
  • docetl Public

    A system for agentic LLM-powered data processing and ETL

    Python · 1,954 stars · MIT license · 186 forks · 31 open issues (1 issue needs help) · 8 pull requests · Updated May 15, 2025
  • TWIX Public

    TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by inferring the shared underlying visual template across documents

    Python · 179 stars · 7 forks · 3 open issues · 0 pull requests · Updated May 13, 2025
  • BARGAIN Public

    Low-Cost LLM-Powered Data Processing with Theoretical Guarantees

    Python · 16 stars · MIT license · 2 forks · 0 open issues · 0 pull requests · Updated May 1, 2025
  • docetl-examples Public

    Examples of docetl pipelines

    Python · 2 stars · 0 forks · 0 open issues · 0 pull requests · Updated Apr 22, 2025
  • pdf_parser Public

    Parse PDFs using computer vision, layout analysis, and other state-of-the-art document intelligence techniques. WebApp implemented in Flask/Jinja2 with infer and train pipelines managed by FlorDB

    JavaScript · 7 stars · Apache-2.0 license · 0 forks · 0 open issues · 0 pull requests · Updated Jul 26, 2024
  • ml_tutorial Public

    Introduction to FlorDB with PyTorch and TensorFlow

    Jupyter Notebook · 0 stars · Apache-2.0 license · 0 forks · 0 open issues · 0 pull requests · Updated Apr 9, 2024
