zdy023

Danyang Zhang zdy023

http://zdy023.github.io

Achievements

Stars

ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,870 843 Updated May 29, 2022

YifeiZhou02 / ArCHer

Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"

Python 199 18 Updated Apr 17, 2025

OSU-NLP-Group / SeeAct

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 811 104 Updated Feb 3, 2025

rail-berkeley / rlkit

Collection of reinforcement learning algorithms

Python 2,841 565 Updated Jun 17, 2024

michaelnny / deep_rl_zoo

A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

Python 121 12 Updated Feb 21, 2024

Timothyxxx / WorldModelPapers

Paper collections of the continuous effort start from World Models.

191 6 Updated Jul 6, 2024

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 10,713 1,248 Updated Dec 19, 2025

MobileLLM / AutoDroid

Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"

Python 431 61 Updated Mar 22, 2024

gydpku / PPTC

PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion

Python 59 9 Updated Feb 29, 2024

xlang-ai / OpenAgents

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Python 4,649 513 Updated Nov 18, 2024

remote-android / redroid-doc

redroid (Remote-Android) is a multi-arch, GPU enabled, Android in Cloud solution. Track issues / docs here

Shell 5,817 409 Updated Jun 29, 2025

THUDM / AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Python 1,471 106 Updated Oct 31, 2023

anchen1011 / FireAct

FireAct: Toward Language Agent Fine-tuning

Python 287 22 Updated Oct 22, 2023

allenai / ScienceWorld

ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.

Scala 322 33 Updated Dec 3, 2025

xlang-ai / text2reward

[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

Jupyter Notebook 192 12 Updated Dec 17, 2024

easymotion / vim-easymotion

Vim motions on speed!

Vim Script 7,707 366 Updated Feb 5, 2024

zai-org / CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,713 449 Updated May 29, 2024

guidance-ai / guidance

A guidance language for controlling large language models.

Jupyter Notebook 21,065 1,130 Updated Dec 17, 2025

py499372727 / AgentSims

AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.

Python 917 117 Updated Nov 18, 2023

HumanCompatibleAI / imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Python 1,665 293 Updated Jan 7, 2025

web-arena-x / webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 1,264 205 Updated Nov 26, 2025

ConvLab / ConvLab-3

Python 121 36 Updated Jul 10, 2025

thu-coai / ConvLab-2

ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems

Python 464 138 Updated Jun 17, 2024

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,100 4,672 Updated Dec 24, 2025

zai-org / GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,681 606 Updated Jul 25, 2023

zai-org / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 41,225 5,220 Updated Jun 27, 2024

hua1995116 / awesome-ai-painting

AI绘画资料合集（包含国内外可使用平台、使用教程、参数教程、部署教程、业界新闻等等） Stable diffusion、AnimateDiff、Stable Cascade 、Stable SDXL Turbo

11,708 950 Updated Aug 14, 2024

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,030 6,184 Updated Sep 18, 2024

rhythmcao / semantic-parsing-dual

Source code and data for ACL 2019 Long Paper ``Semantic Parsing with Dual Learning".

Python 23 7 Updated Feb 21, 2021

rhythmcao / slu-dual-learning

Source code and data for the journal ``Dual learning for semi-supervised natural language understanding" in TASLP 2020.

Python 9 Updated Apr 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Danyang Zhang zdy023

Achievements

Achievements

Block or report zdy023

Stars

ikostrikov / pytorch-a2c-ppo-acktr-gail

YifeiZhou02 / ArCHer

OSU-NLP-Group / SeeAct

rail-berkeley / rlkit

michaelnny / deep_rl_zoo

Timothyxxx / WorldModelPapers

huggingface / text-generation-inference

MobileLLM / AutoDroid

gydpku / PPTC

xlang-ai / OpenAgents

remote-android / redroid-doc

THUDM / AgentTuning

anchen1011 / FireAct

allenai / ScienceWorld

xlang-ai / text2reward

easymotion / vim-easymotion

zai-org / CogVLM

guidance-ai / guidance

py499372727 / AgentSims

HumanCompatibleAI / imitation

web-arena-x / webarena

ConvLab / ConvLab-3

thu-coai / ConvLab-2

deepspeedai / DeepSpeed

zai-org / GLM-130B

zai-org / ChatGLM-6B

hua1995116 / awesome-ai-painting

facebookresearch / segment-anything

rhythmcao / semantic-parsing-dual

rhythmcao / slu-dual-learning