-
Preferred Networks, Inc./Keio University
- https://fun.bio.keio.ac.jp/~tokuoka/
- @_109man
Stars
CLI tool which enables you to login and retrieve AWS temporary credentials using a SAML IDP
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
The codes and models in 'Gaze Estimation using Transformer, ICPR2022'.
An Implementation of Takahashi, Nobuhara and Matsuyama "A New Mirror-based Camera Pose Estimation Using an Orthogonality Constraint" presented at CVPR 2012
Official implementation of ETH-XGaze dataset baseline
Official inference repo for FLUX.1 models
real time face swap and one-click video deepfake with only a single image
Gaze estimatin code. The Pytorch Implementation of "It’s written all over your face: Full-face appearance-based gaze estimation".
Optimization Modeling Using mip Solvers and large language models
Creating beautiful plots of data maps
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
The start page about my efforts around smart contract verification
[ICML 2023] Are Diffusion Models Vulnerable to Membership Inference Attacks?
[MICCAI 2024] Easy diffusion models (optionally with segmentation guidance) for medical images and beyond.
[CVPR 2024] Generalizable Tumor Synthesis - Realistic Synthetic Tumors in Liver, Pancreas, and Kidney
Schedule-Free Optimization in PyTorch
👀 Eye Tracking library easily implementable to your projects
Implementation of Alpha Fold 3 from the paper: "Accurate structure prediction of biomolecular interactions with AlphaFold3" in PyTorch
Japanese Language Model Financial Evaluation Harness
PointNet and PointNet++ implemented by pytorch (pure python) and on ModelNet, ShapeNet and S3DIS.
OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
Repository for StripedHyena, a state-of-the-art beyond Transformer architecture
(ARXIV24) This is the official code repository for "VM-UNet: Vision Mamba UNet for Medical Image Segmentation".