Demis Hassabis
Co-Founder & CEO, Google DeepMind
Greater London, England, United Kingdom
280K followers
500+ connections
View mutual connections with Demis
Demis can introduce you to 10+ people at Google DeepMind
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Demis
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
About
Co-Founder & CEO of Google DeepMind - working on AGI, responsible for AI breakthroughs such as AlphaGo, the first program to beat the world champion at the game of Go; and AlphaFold, which cracked the 50-year grand challenge of protein structure prediction and was recognised with the 2024 Nobel Prize in Chemistry. Revolutionising drug discovery at Isomorphic Labs. Ultimately trying to understand the fundamental nature of reality.
Activity
280K followers
-
Demis Hassabis shared thisI’ve always believed the No.1 application of AI should be to improve human health. That work started with AlphaFold, and continues at Isomorphic Labs with our mission to reimagine drug discovery and one day solve all disease. We are turbocharging our progress with $2.1B in new funding. Excited for what’s to come!
-
Demis Hassabis shared thisIt was wonderful to be back in Korea last week, 10 years after AlphaGo’s historic win against the legendary 9-dan champion Lee Sae Dol. That groundbreaking moment gave the world a first glimpse of what we could achieve with AI that can learn to solve hard problems. We’ve seen incredible progress in AI since then. Today, AI is capable of advanced reasoning and is beginning to have agentic capabilities that will enable it to plan and act in the world, whether in robotics or as useful assistants. We’re now at a major threshold with AGI likely to arrive in 3-4 years and bring profound change to industries and society. Korea is uniquely positioned for this transformation. It has one of the world’s fastest-growing AI adoption rates - many of Korea’s citizens rely on the Gemini app as an essential daily partner. It’s also a world leader in manufacturing memory chips and advanced semiconductors that are the bedrock of AI compute. At the same time, Korea’s leaders have been deeply thoughtful about the critical questions we as a society have to answer as we navigate this next transformative period of human history. It was a huge honour to meet President Lee Jae-myung and discuss AI safety and the importance of using AI to advance science. I’ve always believed that scientific discovery is the ultimate use case for AI, and Korea’s strengths in robotics, biotech, energy and education - along with its world-class talent - make it a natural partner for accelerating this work. We’re building on a strong foundation of collaborations with world-leading Korean companies and universities, and establishing a new partnership with The Ministry of Science and ICT of the Republic of Korea. We’ll help to accelerate the country’s K-Moonshot mission by leveraging our models in fields such as life sciences, energy, weather and climate. We’ll also be collaborating with the Korean AI Safety Institute on research and are supporting the next generation of talent by providing internship opportunities at Google DeepMind for Korean students. We have a long-standing connection with Korea as the home of the AlphaGo match that kickstarted the modern AI era. Returning to Seoul offered the chance to connect with Lee Sae Dol again and join Shin Jin-seo for a special Go match. It was incredibly interesting to hear how AlphaGo has changed the way players approach the game. I also got to visit Google Korea and spend some time with the amazing team there. The launch of our AI Campus within our Seoul office will help to drive collaborations between Korean institutions and our AI experts that are at the center of our work together. Thank you to everyone in Korea for such a warm welcome. Excited to see where this new chapter of collaboration leads us! Read more about our new partnership: https://lnkd.in/eaReHrVK
-
Demis Hassabis posted thisWe started DeepMind back in 2010 because even then we believed Artificial General Intelligence (AGI) would be the most transformative technology ever invented. It has the potential to be the ultimate tool to accelerate science and medicine, and improve productivity. The impact will be profound, but the challenges and complexities are also enormous. Thoughtfulness and foresight will be critical as we seek to steward this technology safely into the world to benefit everyone. As part of our contribution to that effort, I’m thrilled to welcome Jasjeet Sekhon to Google DeepMind as Chief Strategy Officer to partner with me on strategy cutting across research, commercialisation, policy and more. Jas is uniquely experienced for this role, having served as Chief Scientist and Head of AI at Bridgewater Associates, where he now joins the board. Super excited to be working with Jas to accelerate this important work at such a critical time for this technology.
-
Demis Hassabis shared thisTen years ago, AlphaGo’s legendary match in Seoul heralded the start of what is now recognised as the modern era in AI. In 2016, with over 200 million people watching, our AI system AlphaGo faced world champion Go player Lee Sae Dol. The match was defined by AlphaGo’s famous ‘Move 37’ in Game 2 - a play so unconventional it first appeared to be a mistake. But as the game unfolded, it became clear the play wasn’t just bold, it was decisive. One hundred or so moves later, Move 37 was in exactly the right place to decide the battle and allow AlphaGo to win the game. I knew at that moment that the AI techniques we developed with AlphaGo were ready to be applied to our real goal of using AI to accelerate scientific breakthroughs. The trajectory since then has been incredible: • 𝗔𝗹𝗽𝗵𝗮𝗭𝗲𝗿𝗼: Taught itself from scratch to master any 2-player perfect information game, including Go, chess and shogi. • 𝗔𝗹𝗽𝗵𝗮𝗙𝗼𝗹𝗱: Solved the 50-year grand challenge of protein structure prediction and is now a standard tool for millions of scientists around the world. • 𝗔𝗹𝗽𝗵𝗮𝗣𝗿𝗼𝗼𝗳 & 𝗔𝗹𝗽𝗵𝗮𝗘𝘃𝗼𝗹𝘃𝗲: Applying AlphaGo’s ‘reasoning as search’ to formal mathematics and algorithm discovery. • 𝗚𝗲𝗺𝗶𝗻𝗶: In Deep Think mode, our most capable model uses search and planning algorithms to explore lines of thought in parallel - an approach inspired by AlphaGo. Our goal is to build artificial general intelligence (AGI) that can help us make fundamental leaps in science and address some of the most pressing problems facing humanity, including energy and disease. The techniques we pioneered in AlphaGo are now paving the path towards AGI. Gemini uses some of the same search and planning approaches to reason across language, audio, video and images to build a model of how the world works. We think the combination of Gemini’s world model and AlphaGo’s techniques, as well as a system’s ability to call on specialised AI tools like AlphaFold, will prove to be critical for AGI. True creativity is a key capability that such an AGI system would need to exhibit. Move 37 was a glimpse of AI’s potential to think outside the box, but true original invention will require something more. It would need to not only come up with a novel Go strategy, as AlphaGo impressively did, but actually invent a game as deep and elegant, and as worthy of study as Go. AlphaGo has had an amazing impact over the past 10 years - look forward to seeing what it unlocks next!
-
Demis Hassabis shared thisIt was amazing to be in India this week for the AI Impact Summit. Seeing firsthand how the country is applying AI to solve real-world problems, it is clear that India is poised to become an AI powerhouse on the global stage. My thanks to Prime Minister Narendra Modi, Minister Ashwini Vaishnaw and the Indian government for convening such an impressive and productive meeting. Since the first summit in the UK at Bletchley Park in 2023, presciently initiated by Prime Minister Rishi Sunak, this gathering has become very important for continuing international dialogue and cooperation on the future of AI. Those discussions are especially urgent with AGI on the horizon, potentially within the next five years. In my view, AGI will be the most transformative technology ever invented and its impact will be unprecedented, maybe 10x that of the Industrial Revolution, unfolding 10x faster. I’ve always believed AI could be the ultimate tool to advance science, medicine, and productivity, and help tackle some of the biggest challenges facing humanity. To realise this massive potential, more scientists and entrepreneurs need to be able to use frontier AI capabilities. Building on our work with the US and UK, Google DeepMind announced new partnerships in India this week to broaden access to AI tools like AlphaGenome, WeatherNext and Gemini-powered learning assistants. India is already one of the top countries by users of the Gemini app. Our world-class team in Bengaluru is doing critical research on efficient models and multilingual capabilities that we are bringing to our products and technologies in order to broaden AI’s impact. It was incredibly impressive to see the energy and enthusiasm for AI in the country, especially among young people. While speaking at the Indian Institute of Science (IISc), I met with students and faculty who had inspiring ideas for seizing the economic and scientific opportunities AI unlocks. This is an extremely exciting time but we must approach it with humility and care, as we don’t have all the answers yet about how this technology will develop and be deployed into the world. To navigate this next period in human history, we need more forums like the international summits to bring together all parts of society - including technologists, scientists and governments, but also artists, social scientists, philosophers and citizens. These dialogues are vital to realising AI’s benefits and mitigating any potential risks. If we get these next steps right, I’m very optimistic we can usher in a new golden age of scientific discovery and progress, and improve the lives of everyone, everywhere.
-
Demis Hassabis shared thisIt’s amazing to see how the conversation around AI has evolved in the past year. In Davos last week, the discussions reflected the shift from the era of generative AI - models that write text and code - to agentic AI that can reason, plan, and take action. This shift brings immense potential to increase productivity and solve complex problems in the real world. As agent-based systems become more prevalent, the good news is I think we’ll see demand from enterprises and users that will drive the right behaviours regarding safety and security. Businesses will require guarantees that the systems they deploy are reliable and handle data securely. There will be a lot of commercial pressure on frontier AI providers to get this right, and it will be essential preparation for when bigger stakes come around with AGI. AGI will impact all of humanity. Currently, mechanisms for international coordination to realise its benefits and mitigate any potential risks are lagging behind the technology. We vitally need more dialogue between companies, governments, and civil society to ensure we get this transformative technology right. Ideally, as we approach AGI the best minds in the world would collaborate across disciplines - philosophers, social scientists and economists, as well as technologists - to figure out what we want from this technology and ensure all of humanity benefits from it. Today there is fear and reasonable concern around the impact of AI. It is incumbent on the industry to demonstrate the unequivocal good AI can do. Our work at Isomorphic Labs to design new drugs is an incredible example that builds on our pioneering breakthroughs with AlphaFold - but we need a lot more. AI has the potential to help us discover new materials, develop new clean energy sources and move us towards a post-scarcity world, all of which would dramatically improve the human condition. Our Google DeepMind Science team is leading the way on building AI tools to accelerate the pace of scientific discovery - like AlphaGenome, which was just released this week. I have spent my entire career on developing AI because I always believed it would usher in a new golden age of scientific discovery. I’ve been thinking about the technical risks for just as long, but I remain a big believer in human ingenuity and adaptability. If we approach building AI with the time and thoughtfulness it deserves, grounding our work rigorously in the scientific method, I am confident mitigating the technical risks is a tractable problem. There are profound questions to answer about the post-AGI world we want to build. It’s for us, as humanity, to write what happens next. It was great to discuss this and more when I was in Davos: https://lnkd.in/eGmag9Cs
-
Demis Hassabis shared thisAccelerating scientific discovery has always been my primary motivation for building AI - I think it could be an amazing tool to help scientists solve huge challenges like finding new sources of clean energy and curing disease. So I am excited to share that Google DeepMind is supporting the White House's Genesis Mission to use AI to power science and innovation. We are partnering with the U.S. Department of Energy (DOE) to give scientists at all 17 National Labs accelerated access to our frontier AI models, starting with AI co-scientist (to help researchers generate novel hypotheses) and expanding soon to AlphaEvolve, AlphaGenome and WeatherNext. Foundational work on the Protein Data Bank at the DOE's Brookhaven National Lab was crucial for AlphaFold, so it feels fitting now to build on this history. AI is ushering in a new golden era of discovery. Look forward to seeing the breakthroughs this partnership with the DOE unlocks! https://lnkd.in/eQ4sBtGHGoogle DeepMind & DOE Partner on Genesis: AI for ScienceGoogle DeepMind & DOE Partner on Genesis: AI for Science
-
Demis Hassabis shared thisThe UK has an incredible heritage in scientific innovation and computing, from Babbage to Turing. I’m passionate about the country’s potential to lead in the AI era and translate this opportunity into real benefit for people. To help realise this vision, we are deepening our partnership with the UK government and the Department for Science, Innovation and Technology We are giving UK scientists priority access to our frontier AI for science models and tools like AI co-scientist to accelerate discoveries. Next year, Google DeepMind will also establish our first automated laboratory in the UK. Focused on materials science and with Gemini integrated across the lab, it will help scientists analyse hundreds of potential materials every day. Discovering superconducting materials that can operate at ambient temperature and pressure is a lifelong dream of mine and I am hopeful AI systems will help us find them. We are also collaborating with the UK government to understand how AI tools like Gemini can support teachers and students, and to modernise public services and enhance national cyber resilience. The UK has real strengths in AI and this partnership will ensure it remains innovative and at the technical forefront. Learn more here: https://lnkd.in/e6_wCV7K
-
Demis Hassabis shared thisI’ve worked on AI my whole life because I’ve always believed it could unlock the ability to answer some of the biggest and most intractable problems in science. Our first big science breakthrough happened five years ago when we announced our solution to the protein structure prediction problem: AlphaFold 2. It has been incredible to see its impact since then. More than 3 million researchers across 190 countries have used this tool for disease understanding, drug discovery and more. And it was an honour of a lifetime for our work to be recognised last year with a Nobel Prize. One of our greatest ambitions is for AI to aid in accelerating drug design and help cure all diseases. This is what led me to found Isomorphic Labs, which is already making amazing progress. We’ve also expanded AlphaFold to predict the interactions of all of life’s molecules. But AlphaFold represents more than a solution to a biological puzzle. It demonstrated how AI can crack ‘root node’ problems - where a single breakthrough unlocks entire new avenues of research. It is a critical step towards a long-held dream of mine: building a virtual cell. Imagine running ‘in silico’ experiments orders of magnitude faster than in a wet lab. Scientists could rapidly test hypotheses, model complex pathways and see how a drug affects a cell. It would be an incredible boon not only for fundamental biology but also for medicine. Although for me, AlphaFold was never just about biology. It was the first major proof point for a much larger thesis: that AI could be the ultimate tool for advancing science. By processing data or helping us come up with new hypotheses, I think AI will help us tackle some of humanity’s greatest challenges and answer fundamental questions about the universe. From materials design to fusion energy to mathematics, I believe we’re on the cusp of a new golden age of discovery. We’re just getting started. Read more about AlphaFold’s impact: https://lnkd.in/eNeqxqQp
-
Demis Hassabis liked thisDemis Hassabis liked thisIt’s been a decade since DeepMind’s AlphaGo beat the world’s top Go player. Here’s my story on Google DeepMind CEO Demis Hassabis and his life in gaming and AI, all the way back to his days as a chess prodigy and the Othello AI he wrote for his Amiga as a kid. https://lnkd.in/gApA5UfpGoogle DeepMind's Demis Hassabis on the long game of AIGoogle DeepMind's Demis Hassabis on the long game of AI
-
Demis Hassabis liked thisDemis Hassabis liked thisOur AlphaProof paper is in this week’s issue of Nature Magazine! In 2024, Google DeepMind's proof agents AlphaProof & AlphaGeometry together made a substantial leap in AI by achieving the silver-medal standard in solving IMO problems. The Nature paper describes the technical innovations required—in particular, the RL loop bridging natural language & symbolic rigor—that made AlphaProof possible. Read our Nature paper: https://lnkd.in/epDQSn2i Read our IMO 2024 blog: https://lnkd.in/eBei2ZUb
Experience
Education
View Demis’ full profile
-
See who you know in common
-
Get introduced
-
Contact Demis directly
Other similar profiles
Explore more posts
-
Nishantha Ruwan
IWROBOTX Software Inc. • 2K followers
The authors present LERD, a novel Bayesian neural dynamical framework designed to improve diagnosis and monitoring of neurodegenerative diseases like Alzheimer’s using multichannel EEG data. Traditional machine learning approaches in this domain often treat classifiers as “black boxes” and fail to explicitly model the underlying neural dynamics generating the observed signals. In contrast, LERD infers latent neural events and their relational structure directly from EEG without requiring external event or interaction labels, combining a continuous-time event inference module with a stochastic event generation process. An electrophysiology-inspired dynamical prior guides the learning process, promoting physiologically meaningful representations. The paper also contributes a theoretical analysis that provides a tractable training bound and stability guarantees for the inferred latent relational dynamics. Extensive experiments on synthetic data and two real Alzheimer’s EEG cohorts show LERD consistently outperforms strong baseline methods, while yielding latent summaries that align with known physiological characteristics of the disease. These results suggest that modeling latent event interactions and dynamics offers a more interpretable and accurate approach for EEG-based neurodegenerative classification than traditional black-box classifiers. https://lnkd.in/g-nH6tyT
2
-
Nishantha Ruwan
IWROBOTX Software Inc. • 2K followers
Dynamic scene reconstruction from single-camera videos remains challenging for neural radiance field (NeRF) models, especially when generating novel views far outside the original camera’s viewpoint. Traditional dynamic NeRF approaches often fail under large viewpoint changes, producing unstable or unrealistic renderings that do not generalize well beyond the constrained input motion. To address this limitation, the authors introduce ExpanDyNeRF, a novel monocular NeRF framework that combines Gaussian splatting priors with a pseudo-ground-truth generation strategy to support reliable view synthesis under extreme rotations and large angular deviations. ExpanDyNeRF jointly optimizes density and color features to improve scene representation quality from difficult perspectives that would otherwise degrade reconstruction fidelity. In addition to the model, the paper presents SynDM, the Synthetic Dynamic Multiview dataset—the first synthetic multiview dataset for dynamic scenes with explicit side-view supervision, created using a customized rendering pipeline based on a videogame engine. Experiments on both SynDM and real-world data demonstrate that ExpanDyNeRF consistently outperforms existing dynamic NeRF baselines in rendering fidelity and stability under large viewpoint shifts, validating its effectiveness for broadening view synthesis in dynamic environments. The supplementary materials provide additional quantitative and qualitative evaluations that further support these findings. https://lnkd.in/gTe29awy
1
-
Nishantha Ruwan
IWROBOTX Software Inc. • 2K followers
This research focuses on advancing novel view synthesis (NVS) for complex, real-world scenes. Traditional NVS methods perform well when trained on curated, object-centric datasets but struggle with diverse, uncontrolled imagery common “in the wild.” The authors identify that appearance variation — such as differing illumination, occlusions, and backgrounds — significantly hampers multi-view learning. To address this, they introduce WildCAT3D, a framework that explicitly models global appearance conditions during training, enabling a diffusion-based model to learn from widely varied scene data. By disentangling appearance from geometry, WildCAT3D can synthesize consistent and realistic novel views even when appearance conditions differ across input images. WildCAT3D generalizes to novel scenes at inference time, providing state-of-the-art performance for single-view and multi-view NVS tasks while using less clean data than previous methods. A key contribution is the ability to control global appearance in the generated views, which unlocks new applications such as adjusting lighting or transient scene elements during synthesis. This work demonstrates that leveraging diverse, uncurated imagery can significantly improve real-world novel view generation by incorporating appearance awareness into multi-view diffusion models. https://lnkd.in/gXSu7WFb
-
Aditya Rajagopal PhD
nCompass Technologies Inc. • 3K followers
If you're running on Modal and would like to quickly try out profiling tools like Torch Profile and understand your AI system's performance - all within your VSCode / Cursor environment, check out the tutorial we've put together on using the nCompass VSCode/Cursor extension with remote GPUs. Comments have a link to a detailed tutorial on how to get started with our tooling on Modal. #MLSys #Profiling #AIPerformance #Modal #nCompass
24
1 Comment -
Swetabh Pathak
Elucidata Corporation • 13K followers
LLMs will not lead to generalized AGI. Yann Lecun said as much in a podcast with langchain and I couldn't agree more. LLMs are able to build an 'understanding' of a complex system whose rules are not clear. Like language. Based on the training data they can build complex functions. Functions that would be otherwise be too difficult to right down. ChatGPT has figured out rules of human language in a way that we can not explicitly right now. We just don't know what they are. But ChatGPT, somewhere in it's parameters, has figured it out. But the thing is that they have been built on a lot of data. A lot of data! Almost everything that has ever been published on the web. That is millions of documents conceivably covering all facets of writing in English. I mean, can you really say something that has not said somewhere on Reddit? The same approach doesn't extend to other problems just cause we do not have that size and diversity of data. We often hear that data in genomics is huge. Yes it is. But it's mostly cause the raw data that is captured is too noisy for it to be useful. That is why genomics data requires that amount of data processing before it can be used. It's a long winded say to that genomics data is just not as rich per unit of storage. Information content per mb is not that high. So the data might look large on the disk, but it does not have that much information. Generative models can only do so much without data from a similar space. The space of all possible molecules in all physical configrautions is just way too vast. The data available is minuscule compared to the possibilities. Secondly, just having a lot of data will not lead to generalized intelligence. That is a few fundamental breakthroughs away. No amount of compute or data thrown at DNNs are going to solve it. We need fundamental breakthroughs in our understanding of real intelligence - something that the field is still many years if not decades away from. What does this mean for leaders in life-sciences research? - Apply LLM like principles to spaces where enough data exists. - Don't bet on problems that need fundamental discoveries. Leave that to AI research labs in academia. - Maintain your data well and continue to invest in data management. Data will continue to be the moat. - Know which problems are 0 to 1 and which are 1 to n. This requires an understanding of the fundamentals of AI and the maths behind it. The road to successful AI adoption is long and not an event. It is a set of small steps that overtime will create value. There are no magic bullets out there. Link in comments to a review of new models by OpenAI for deep-research in comments. Nidhi and team have done an excellent job of summarizing we where are and what remains.
24
3 Comments -
Michael Tiller
LlamaBrain Labs • 305 followers
Building on LLMs? The hardest problem isn't getting "good" output. It's getting reproducible output every single time. LLMs are stochastic by nature. But production systems need guarantees. The solution is Stochastic Determinism: the engineering goal of constraining a probabilistic system until its variance is intentional, reproducible and acceptable. While building LlamaBrain (an open-source LLM governance framework), my architecture looked flawless on paper. I did some stress-testing on it with a real-world app and hit 96.7% consistency. Sounds high, right? It isn't. A 3.3% failure rate means your constraints break roughly 1 in 30 calls. At scale, that’s a systematic failure mode. Getting from 96.7% to 100% took ten days and required building a Black Box Audit Recorder to log, replay, and structurally compare every interaction. The lesson? Determinism isn't something you just design into an AI system. It's a property you must instrument, measure, and prove. Anything less is just a toy framework. #LLM #SoftwareEngineering #AI #OpenSource #LlamaBrain
5
2 Comments -
Nihal Kashinath
Deep Tech Stars • 10K followers
After a very public disagreement and split from Meta, Yann LeCun (who believes LLMs are a dead end in the search for AGI) has now set up a new company to build World Models. And he just raised over USD 1 Billion, one of the largest "seed" rounds in history! What are World Models and how will he build them? That's the AI concept explained in today's Deep Tech Stars Newsletter. Plus, Microsoft launches Copilot Cowork, which is basically Claude Code for all enterprise work. And some really awesome remote AI jobs. https://lnkd.in/ghnq5xez
17
-
Upendra Gariminti
leadkamp • 430 followers
ISTS Protocol: A Technical Deep Dive into Isotropic Spatiotemporal Tensor-Spline Compression Following my recent announcement, I am sharing an explanatory video that demonstrates the core mechanics of the ISTS Protocol in action. The ISTS algorithm introduces a dual-path architecture that fundamentally changes how we handle data density: Text-Based Atomization: The video highlights the real-time breakdown of raw text into structural components, which are then deduplicated via a Global Ledger. This ensures that redundant patterns occupy zero additional space beyond their initial coefficient. Vectorized Visual Storage: Watch the transition from raw image data to high-efficiency WebP Base64 strings, followed by secondary LZW bit-packing. Fibonacci Sphere Distribution: See how we use a 3D ASCII-based plotting system to visualize data clusters, providing a geometric representation of compressed information. By combining tensor-spline logic with bit-level compression, ISTS achieves significant storage optimization for local IndexedDB environments without sacrificing retrieval speed. Watch the full walkthrough below to see how these layers integrate. Technical Documentation: https://lnkd.in/ghPqEd4j
1
-
Eric Ho
Goodfire • 16K followers
We cut the hallucinations of an LLM in half using a new technique: RLFR (Reinforcement Learning from Feature Rewards). RLFR reduces hallucinations in Gemma 12B by 58% (when run with our probing harness), for ~90x cheaper per intervention than the LLM-as-judge alternative. The technique uses lightweight probes on a model's internal representations as reward signals for reinforcement learning. This is all while avoiding: - off-target effects (no degraded performance on standard benchmarks) - degrading/reward-hacking the probes (the probes still work as monitors at test time!) This is just our first public demonstration of our work building towards "intentional design": the ability to intelligently guide gradient descent, enabling a new paradigm of far more precise, robust, and effective training. Kudos to Aaditya Prasad, Connor Watts, Jack Merullo, Dhruvil Gala, Owen Lewis, Tom McGrath, and Ekdeep Singh for this groundbreaking new work!
236
9 Comments -
Michael Alexander Riegler
Simula Metropolitan Center… • 5K followers
Interesting perspectives on scaling, RL and what the current AI approaches struggle with. Back to the age of research (or AI winter 😅) https://lnkd.in/dZvZh2Ta #AI #scaling
13
4 Comments -
Ben Batman
Real Python • 815 followers
I've become interested in mechanistic interpretability recently and wanted to experiment with some of the latest innovations in the space. In a limited fashion, we can look inside a LLM and see what's going on. Using Sparse Autoencoders (SAEs) from Gemma Scope, this tool lets you peek under the hood of Google's Gemma 2 2B model and actually steer its behavior in real time. You can: - Browse and search 16K+ interpretable features by description - Visualize which features activate on any input text - Amplify or suppress specific features during generation and see how outputs change - Decompose predictions into per-feature logit contributions Built with TransformerLens, SAELens, and Gradio. Try it on Huggingface Spaces: https://lnkd.in/e88v8wym Code here: https://lnkd.in/e7dSS9WA #MechanisticInterpretability #AISafety #MachineLearning #NLP #OpenSource
6
-
Jason Milne, PhD
The University of Western… • 904 followers
Another good example of how LLMs are just pattern matching. They're great for brainstorming and 'established knowledge' topics, but when you get into more niche areas, they will happily present speculation as fact. In this case I'm preparing a design document for my ultrasonic transducer, for listening to bearing wear in conveyor belt idlers. An important design aspect is how the sound gets from a chunk of steel into my transducer, so I was going through the physics of some different options with Claude. I have my own pattern matching now- whenever an LLM says 'this is what X uses', I ask it to give examples. Often, you get this kind of response!
12
21 Comments
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore MoreOthers named Demis Hassabis
2 others named Demis Hassabis are on LinkedIn
See others named Demis Hassabis