Kpetyxova (Kseniya Petukhova) / Starred · GitHub

Tools for checking ACL paper submissions

Python 764 52 Updated May 16, 2025

Code repository for AIED25 paper: Can Large Language Models Match Tutoring System Adaptivity? A Benchmarking Study

Jupyter Notebook 3 Updated Feb 18, 2025

The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web

Python 2,289 120 Updated Jun 9, 2025

An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors

13 1 Updated Jun 9, 2025

Official code for --- InSTA: Towards Internet-Scale Training For Agents

JavaScript 8 Updated May 30, 2025

Switchboard Dialog Act Corpus with Penn Treebank links

Python 144 40 Updated Dec 30, 2020

An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues

Jupyter Notebook 3 Updated Dec 16, 2024
C 2 Updated Nov 8, 2024

[arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs

Python 1,416 112 Updated Aug 19, 2024

🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023

Python 55 3 Updated Mar 6, 2025

Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api

Python 1,142 169 Updated Jun 3, 2025

BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.

Jupyter Notebook 203 21 Updated May 5, 2025

NAACL 2024. Code & Dataset for "Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes"

Python 41 2 Updated Jul 21, 2024

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

Python 1,072 77 Updated Feb 22, 2024

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 145,330 12,269 Updated Jul 2, 2025

2D Positional Embeddings for Webpage Structural Understanding 🦙👀

Python 95 2 Updated Sep 6, 2024

A key remapping daemon for linux.

C 3,912 210 Updated Jun 15, 2025

Drive a browser with GPT-3

Python 1,924 274 Updated Jun 9, 2024
Jupyter Notebook 30 3 Updated May 7, 2024

A framework to enable multimodal models to operate a computer.

Python 9,737 1,340 Updated May 13, 2025

Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.

222 14 Updated May 28, 2025

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 1,035 162 Updated Feb 7, 2025
Rich Text Format 6,655 901 Updated Jun 26, 2025

Python coherence evaluation tool using Stanford's CoreNLP.

Python 10 5 Updated Feb 2, 2020

This repo contains implementation of different architectures for emotion recognition in conversations.

Python 1,447 337 Updated Mar 10, 2024

SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection

Python 76 30 Updated Apr 22, 2024

A library for advanced large language model reasoning

Python 2,159 191 Updated Jun 10, 2025

The prime repository for state-of-the-art Multilingual Question Answering research and development.

Python 734 57 Updated Jan 8, 2025

Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents"

Python 278 23 Updated Aug 3, 2023