Stars
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
Collection of reinforcement learning algorithms
A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
Paper collections of the continuous effort start from World Models.
Large Language Model Text Generation Inference
Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"
PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
redroid (Remote-Android) is a multi-arch, GPU enabled, Android in Cloud solution. Track issues / docs here
AgentTuning: Enabling Generalized Agent Abilities for LLMs
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
a state-of-the-art-level open visual language model | 多模态预训练模型
A guidance language for controlling large language models.
AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.
Clean PyTorch implementations of imitation and reward learning algorithms
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
AI绘画资料合集(包含国内外可使用平台、使用教程、参数教程、部署教程、业界新闻等等) Stable diffusion、AnimateDiff、Stable Cascade 、Stable SDXL Turbo
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Source code and data for ACL 2019 Long Paper ``Semantic Parsing with Dual Learning".
Source code and data for the journal ``Dual learning for semi-supervised natural language understanding" in TASLP 2020.




