8000 benbogin (Ben Bogin) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View benbogin's full-sized avatar

Block or report benbogin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A simple viewer to see Yad2 listings on a graph.

Python 71 24 Updated Mar 12, 2025

Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature

Python 154 16 Updated Jul 31, 2024

Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"

Python 302 25 Updated Dec 20, 2023

DSIR large-scale data selection framework for language model training

Python 249 19 Updated Apr 7, 2024

Author implementation of the paper "Representing Schema Structure with Graph Neural Networks for Text-to-SQL Parsing"

Python 141 44 Updated Jul 25, 2024

Author implementation of Global Reasoning over Database Structures for Text-to-SQL Parsing

Python 68 14 Updated Jul 25, 2024

Efficient Scaling laws and collaborative pretraining.

Jupyter Notebook 16 1 Updated Jan 27, 2025

Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]

Python 70 8 Updated Nov 14, 2024

[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale

Python 248 17 Updated Jun 5, 2025
Jupyter Notebook 42 2 Updated Apr 4, 2025

Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"

Python 56 5 Updated Dec 9, 2024

Diverse Demonstrations Improve In-context Compositional Generalization

Python 11 1 Updated Jul 7, 2023

Modeling, training, eval, and inference code for OLMo

Python 5,650 613 Updated Jun 5, 2025

Leveraging Code to Improve In-context Learning for Semantic Parsing

Python 8 2 Updated Mar 25, 2024

Code for the paper "A high-performance speech neuroprosthesis"

Roff 157 39 Updated Feb 4, 2025

Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"

Python 95 12 Updated Jan 21, 2024

COVR dataset for evaluation of compositional generalization

Python 4 1 Updated Mar 23, 2023

Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data

Jupyter Notebook 102 16 Updated Jun 25, 2021

Code to reproduce LREC Paper Simplifying Semantic Annotations of SMCalFlow

Python 24 6 Updated Mar 28, 2024

ML Collections is a library of Python Collections designed for ML use cases.

Python 956 44 Updated Apr 30, 2025

The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".

Python 69 9 Updated Jan 12, 2024

JSQLParser as a Service

Java 2 1 Updated Jun 6, 2021

Code for the paper: Finding needles in a haystack:Sampling Structurally-diverse Training Sets from Synthetic Data forCompositional Generalization

Python 4 Updated Dec 15, 2021
Python 3 Updated Aug 16, 2021

[TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs"

Python 114 23 Updated Mar 24, 2022

linguistics tree drawing to SVG in python, aimed at Jupyter

Python 64 9 Updated Aug 20, 2024

Hierarchical-Attention-Network

Python 46 9 Updated Dec 8, 2022
Next
0