Eric Vyacheslav’s Post

View profile for Eric Vyacheslav

AI/ML Engineer | Ex-Google | Ex-MIT

Hundreds of billions of dollars later, the entire code needed to reproduce R1 is now available on Github. Code here: https://lnkd.in/gD8QrnGQ ↓ Are you an AI developer? Check out https://AlphaSignal.ai to get a daily summary of breakthrough models, repos and papers in AI. Read by 200,000+ devs.

  • No alternative text description for this image
Jeffrey Batista

Tech & Real Estate Entrepreneur

9mo

I had access to that code a week ago, and quite frankly, this is not the code used to train the R1 model. In this script, they are using reinforcement learning with rewards (GRPO) to fine-tune a language model on the GSM8K dataset.

Tyson Prier

Helping Hand | Founder | Link Layer

9mo

What do you think about Deep Seek Eric Vyacheslav? They aren't claiming to have developed all of the code themselves, but they are claiming they're results are on older model hardware with just changes to the architecture. People in the financial world are throwing flags on it. With your knowledge in AI, from a glance, is what they did feasible and doable on a shoestring budget? Let's be honest, AI development was supposed to be left Opensource from the beginning, that way new developments and things like this could happen.

Benedikt Backhaus

AI Educator, Consultant & Keynote Speaker | Practical Generative AI Strategies & Training for SMEs & Enterprises | 700+ Staff Trained | 35+ Video Courses | 20+ Keynotes | ChatGPT Workshops & 2 Day AI Strategy Sprints

9mo

Very interesting, Eric Vyacheslav. Just confirms that Gen AI models are becoming commodities. But the models are not the product, so let’s see how this affects the actual AI market.

Like
Reply
See more comments

To view or add a comment, sign in

Explore content categories