8000 nissymori (nissymori) / Repositories · GitHub

Address: [go: up one dir, main page]

Include Form Remove Scripts Accept Cookies Show Images Show Referer Rotate13 Base64 Strip Meta Strip Title Session Cookies

More Web Proxy on the site http://driver.im/

Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

Sign up

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

nissymori Follow

Overview Repositories 11 Projects 0 Packages 0 Stars 102

More

Overview
Repositories
Projects
Packages
Stars

nissymori

Follow

Soichiro Nishimori nissymori

Follow

D1 student. Interested in Offline RL, Game AI, and JAX-based RL.

24 followers · 18 following

The University of Tokyo
Tokyo, Japan
14:47 (UTC -12:00)
https://nissymori.github.io/
@nissymori1

Achievements

Achievements

Block or report nissymori

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Add an optional note:

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Overview Repositories 11 Projects 0 Packages 0 Stars 102

More

Overview
Repositories
Projects
Packages
Stars

Type All

Select type

All Sources Forks Archived Can be sponsored Mirrors Templates

Language All

Select language

All Python HTML Ruby

Sort Last updated

Select order

Last updated Name Stars

JAX-CORL Public

Clean single-file implementation of offline RL algorithms in JAX

reinforcement-learning flax cql single-file jax awac iql

Python 143 2 MIT License Updated Dec 24, 2024
rejax Public
Forked from keraJLi/rejax

Python Apache License 2.0 Updated Oct 14, 2024
direct-preference-optimization Public
Forked from eric-mitchell/direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python Apache License 2.0 Updated Aug 11, 2024
nissymori.github.io Public

HTML 1 MIT License Updated Jul 11, 2024
SRPO Public
Forked from AIDefender/SRPO

[NeurIPS 2023] The official code for paper "State Regularized Policy Optimization on Data with Dynamics Shift"

Python GNU General Public License v3.0 Updated Nov 23, 2023
td-gammon Public
Forked from dellalibera/td-gammon

TD-Gammon implementation

Python MIT License Updated Sep 25, 2023
D4RL Public
Forked from Farama-Foundation/D4RL

A collection of reference environments for offline reinforcement learning

Python Apache License 2.0 Updated Aug 29, 2023
a2c-minatar Public
Forked from sotetsuk/a2c-minatar

Python GNU General Public License v3.0 Updated May 17, 2023
reinforce Public
Forked from sotetsuk/reinforce

A simple REINFORCE algorithm implementation in PyTorch

Python MIT License Updated Nov 10, 2022
CDA Public archive
Forked from XuhuiZhou/CDA

code for our EMNLP2020 paper: Multilevel Text Alignment with Cross-Document Attention by Xuhui Zhou, Nikolaos Pappas, and Noah A. Smith

Python Updated Sep 14, 2022
mjai Public
Forked from gimite/mjai

Game server for Japanese Mahjong AI.

Ruby Updated Apr 23, 2021

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.

0