Our research
We focus on rigorous evaluation, interpretability, robustness and model control: how AI behaves, where it fails and how it can be guided. We also explore model interoperability and AI decision-making.
Read the research. Use the code.
Browse the Labs’ projects to access the research behind each one, with links to papers, code and datasets where available. Access the methods, run the code and build on the results.
p-less sampling: A robust LLM decoding strategy
Published: December 15, 2025
The next frontiers in AI — according to industry leaders
Published: June 04, 2025
More research
Anti-slopping — An innovation for rectifying LLM writing clichés
Concept consistency score
p-less sampling: A robust hyperparameter-free approach for LLM decoding
p-less sampling: A robust LLM decoding strategy
Steering smarter
Evaluating LLM-generated summaries using the Lie algebra framework
Beyond I am sorry, I can’t: dissecting large language model refusal
Distribution-aware feature selection for SAEs
Towards transparent AI grading: Entropy as a signal for human-AI disagreement
The next frontiers in AI — according to industry leaders
Beyond linear steering: Unified multi-attribute control for language models
Calculating uncertainty in generative AI
TinySQL
Evaluating LLMs using semantic entropy
LLM benchmarks, evals and tests
Turning up the heat: Min-p samling for creative and coherent creative outputs
Decoding LLM uncertainties for better predictability
A surprisingly effective way to estimate token importance in LLM prompts
Probabilistic machine learning and weak supervision
A gentle introduction to machine teaching
Partners and collaborations
Thoughtworks AI labs sit within a wider network of organizations spanning public AI research, semiconductor innovation, cloud platforms, open source and AI engineering.
These relationships strengthen the lab’s ability to contribute to the methods, tools and technical standards shaping reliable AI.
For partnerships and collaboration inquiries
email ai-labs@thoughtworks.com