Kpetyxova

Kseniya Petukhova Kpetyxova

5 followers · 5 following

Abu Dhabi, UAE

Stars

acl-org / aclpubcheck

Tools for checking ACL paper submissions

Python 764 52 Updated May 16, 2025

conradborchers / llm-instruction-benchmarking

Code repository for AIED25 paper: Can Large Language Models Match Tutoring System Adaptivity? A Benchmarking Study

Jupyter Notebook 3 Updated Feb 18, 2025

lmnr-ai / index

The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web

Python 2,289 120 Updated Jun 9, 2025

kaushal0494 / UnifyingAITutorEvaluation

An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors

13 1 Updated Jun 9, 2025

data-for-agents / data-for-agents.github.io

Official code for InSTA: Towards Internet-Scale Training For Agents

JavaScript 8 Updated May 30, 2025

cgpotts / swda

Switchboard Dialog Act Corpus with Penn Treebank links

Python 144 40 Updated Dec 30, 2020

kaushal0494 / aitutor_assessmentkit

An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues

Jupyter Notebook 3 Updated Dec 16, 2024

nazya / ErgoType

C 2 Updated Nov 8, 2024

microsoft / SoM

[arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs

Python 1,416 112 Updated Aug 19, 2024

eth-nlped / mathdial

🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023

Python 55 3 Updated Mar 6, 2025

EmergenceAI / Agent-E

Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api

Python 1,142 169 Updated Jun 3, 2025

booydar / babilong

BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.

Jupyter Notebook 203 21 Updated May 5, 2025

rosewang2008 / bridge

NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes"

Python 41 2 Updated Jul 21, 2024

microsoft / ToRA

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

Python 1,072 77 Updated Feb 22, 2024

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 145,330 12,269 Updated Jul 2, 2025

Llama2D / llama2d

2D Positional Embeddings for Webpage Structural Understanding 🦙👀

Python 95 2 Updated Sep 6, 2024

anuradha1992 / EmpatheticIntents

Python 48 7 Updated Feb 23, 2021

rvaiya / keyd

A key remapping daemon for Linux.

C 3,912 210 Updated Jun 15, 2025

nat / natbot

Drive a browser with GPT-3

Python 1,924 274 Updated Jun 9, 2024

lukas / otto

Jupyter Notebook 30 3 Updated May 7, 2024

OthersideAI / self-operating-computer

A framework to enable multimodal models to operate a computer.

Python 9,737 1,340 Updated May 13, 2025

ICTMCG / Awesome-Machine-Generated-Text

Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.

222 14 Updated May 28, 2025

web-arena-x / webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 1,035 162 Updated Feb 7, 2025

langchain-ai / opengpts

Rich Text Format 6,655 901 Updated Jun 26, 2025

kigawas / coheoka

Python coherence evaluation tool using Stanford's CoreNLP.

Python 10 5 Updated Feb 2, 2020

declare-lab / conv-emotion

This repo contains implementation of different architectures for emotion recognition in conversations.

Python 1,447 337 Updated Mar 10, 2024

mbzuai-nlp / SemEval2024-task8

SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection

Python 76 30 Updated Apr 22, 2024

maitrix-org / llm-reasoners

A library for advanced large language model reasoning

Python 2,159 191 Updated Jun 10, 2025

primeqa / primeqa

The prime repository for state-of-the-art Multilingual Question Answering research and development.

Python 734 57 Updated Jan 8, 2025

CraftJarvis / MC-Planner

Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents"

Python 278 23 Updated Aug 3, 2023
