[Original] Paper Reading Notes: Autoregressive Image Generation without Vector Quantization
Paper reading notes on Autoregressive Image Generation without Vector Quantization.
2025-07-03 22:14:12
925
1
[Original] Paper Reading Notes: Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
Paper reading notes on Harmonizing Visual Representations for Unified Multimodal Understanding and Generation.
2025-07-03 17:17:19
244
1
[Original] Paper Reading Notes: VGGT: Visual Geometry Grounded Transformer
Paper reading notes on VGGT: Visual Geometry Grounded Transformer.
2025-07-02 14:41:12
916
1
[Original] Paper Reading Notes: NoPoSplat
Paper reading notes on NO POSE, NO PROBLEM: SURPRISINGLY SIMPLE 3D GAUSSIAN SPLATS FROM SPARSE UNPOSED IMAGES.
2025-07-01 16:04:54
885
1
[Original] Paper Reading Notes: ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
Paper reading notes on ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback.
2025-06-09 10:58:24
707
1
[Original] Paper Reading Notes: Muffin: Testing Deep Learning Libraries via Neural Architecture Fuzzing
Paper reading notes on Muffin: Testing Deep Learning Libraries via Neural Architecture Fuzzing.
2025-06-09 10:55:29
726
1
[Original] Paper Reading Notes: D3: Differential Testing of Distributed Deep Learning With Model Generation
Paper reading notes on D3: Differential Testing of Distributed Deep Learning With Model Generation.
2025-06-07 22:37:43
944
1
[Original] Paper Reading Notes: Enhancing Differential Testing With LLMs For Testing Deep Learning Libraries
Paper reading notes on Enhancing Differential Testing With LLMs For Testing Deep Learning Libraries.
2025-06-07 22:35:00
606
1
[Original] Paper Reading Notes: Large Language Models Are Zero-Shot Fuzzers
Paper reading notes on Large Language Models Are Zero-Shot Fuzzers: Fuzzing Deep-Learning Libraries via Large Language Models.
2025-06-04 15:48:37
1252
2
[Original] Paper Reading Notes: FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space
Paper reading notes on FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space.
2025-06-02 15:33:06
1801
1
[Original] Paper Reading Notes: Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
Paper reading notes on Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset.
2025-06-01 12:54:58
766
1
[Original] Paper Reading Notes: FLOW MATCHING FOR GENERATIVE MODELING
Paper reading notes on FLOW MATCHING FOR GENERATIVE MODELING.
2025-05-30 19:26:47
1919
1
[Original] Paper Reading Notes: MetaMorph: Multimodal Understanding and Generation via Instruction Tuning
Paper reading notes on MetaMorph: Multimodal Understanding and Generation via Instruction Tuning.
2025-05-30 19:23:22
1026
1
[Original] Paper Reading Notes: In-Context Edit
Paper reading notes on In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer.
2025-05-28 15:44:52
1425
1
[Original] Paper Reading Notes: Step1X-Edit: A Practical Framework for General Image Editing
Paper reading notes on Step1X-Edit: A Practical Framework for General Image Editing.
2025-05-27 23:47:46
1387
1
[Original] Paper Reading Notes: Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing
Paper reading notes on Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing.
2025-05-27 22:42:36
1021
1
[Original] Paper Reading Notes: ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
Paper reading notes on ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision.
2025-05-26 20:51:25
264
1
[Original] Paper Reading Notes: Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
Paper reading notes on Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model.
2025-05-26 16:26:01
967
1
[Original] Paper Reading Notes: Emerging Properties in Unified Multimodal Pretraining
Paper reading notes on Emerging Properties in Unified Multimodal Pretraining.
2025-05-24 19:08:26
1251
1
[Original] Paper Reading Notes: Two-Stream Networks
Paper reading notes on Two-Stream Convolutional Networks for Action Recognition in Videos.
2025-05-14 17:50:45
696
1
[Original] Paper Reading Notes: Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions
Paper reading notes on Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions.
2025-05-07 14:06:31
954
1
[Original] Paper Reading Notes: ROBOGROUND: Robotic Manipulation with Grounded Vision-Language Priors
Paper reading notes on ROBOGROUND: Robotic Manipulation with Grounded Vision-Language Priors.
2025-05-06 23:24:24
1363
1
[Original] Paper Reading Notes: STDArm
Paper reading notes on STDArm: Transferring Visuomotor Policies From Static Data Training to Dynamic Robot Manipulation.
2025-05-04 11:26:26
1594
1
[Original] Paper Reading Notes: TesserAct: Learning 4D Embodied World Models
Paper reading notes on TesserAct: Learning 4D Embodied World Models.
2025-05-02 13:08:02
1569
1
[Original] Paper Reading Notes: Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction
Paper reading notes on Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction.
2025-04-30 10:32:22
881
1
[Original] Paper Reading Notes: ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping
Paper reading notes on ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping.
2025-04-25 16:59:39
1161
1
[Original] Paper Reading Notes: π0.5: a Vision-Language-Action Model with Open-World Generalization
Paper reading notes on π0.5: a Vision-Language-Action Model with Open-World Generalization.
2025-04-24 10:04:09
1859
1
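[Original] Paper Reading Notes: A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation
Paper reading notes on A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation. Its core innovation is decomposing the task into **high-level spatial affordance reasoning** and **low-level action execution**. A cross-platform **Embodiment-Agnostic Affordance Representation** predicts object-centric contact points and trajectories, which enables generalization across multiple robot systems.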
2025-04-21 12:00:00
1393
1
[Original] Paper Reading Notes: Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
Paper reading notes on Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity.
2025-04-20 12:00:00
1412
1
[Original] Paper Reading Notes: OPAL: Encoding Causal Understanding of Physical Systems for Robot Learning
Paper reading notes on OPAL: Encoding Causal Understanding of Physical Systems for Robot Learning.
2025-04-19 14:00:55
1072
1
[Original] Paper Reading Notes: RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete
Paper reading notes on RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete.
2025-04-17 12:14:54
637
1
[Original] Paper Reading Notes: Generating Long Sequences with Sparse Transformers
Paper reading notes on Generating Long Sequences with Sparse Transformers.
2025-04-14 19:00:00
1356
1
[Original] Paper Reading Notes: Reactive Diffusion Policy
Paper reading notes on Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation.
2025-04-13 14:57:04
1180
1
[Original] Paper Reading Notes: GPT-1, GPT-2, GPT-3, InstructGPT
Paper reading notes on GPT-1, GPT-2, GPT-3, and InstructGPT.
2025-04-09 12:00:00
1111
1
[Original] Paper Reading Notes: Deformable Radial Kernel Splatting
Paper reading notes on Deformable Radial Kernel Splatting.
2025-04-06 11:45:13
1193
1
[Original] Paper Reading Notes: RDT-1B: A DIFFUSION FOUNDATION MODEL FOR BIMANUAL MANIPULATION
Paper reading notes on RDT-1B: A DIFFUSION FOUNDATION MODEL FOR BIMANUAL MANIPULATION.
2025-04-05 16:54:44
1355
1