Talent Search and Recommendation
Systems at LinkedIn
Practical Challenges and Lessons Learned
Qi Guo, Sahin Cem Geyik, Bo Hu, Cagri Ozcaglar,
Ketan Thakkar, Xianren Wu, Krishnaram Kenthapadi
AI @ LinkedIn
+SIGIR 2018
+
The Team
Qi Guo Sahin Cem Geyik Bo Hu Cagri Ozcaglar
Ketan Thakkar Xianren Wu Krishnaram Kenthapadi
Contents
• Introduction
• Ranking Models for Talent Search
• Personalization
• Talent Search Architecture
• Summary
Introduction
LinkedIn Talent Solution:
~65% of LinkedIn’s Annual Revenue
A H I R I N G E C O S Y S T E M
LinkedIn Recruiter
MAJOR PRODUCT
A Talent Search and
Recommendation System
Recruiter Search
• Criteria-Based Search
• A recruiter has specific requisitions to fill
• Candidate Recommendation System
• A recruiter may want many qualified candidates, goes through pages
• Considers Both Sides of the Talent Marketplace
• Talents are limited resources
# of InMail Accepts
OPTIMIZATION OBJECTIVE:
3. Accept
2. Send
InMail
Recruiter
Candidate
1. Search
Ranking Models for Talent Search
Number of InMail Accepts Per Seat: 30% YoY
O V E R A L L I M P R O V E M E N T
Go Non-Linear with Tree Model
• Before: Linear Model optimized for NDCG with Coordinate Ascent
• After: XGBoost Tree Model
• Captures feature interactions
• XGBoost: gradient boosting tree models for richer model complexity
• Online Results:
METRIC PRECISION@5 PRECISION@25 OVERALL ACCEPT
Lift +7.5% +7.4% +5.1%
P-Value 2.1e-4 4.8e-4 0.01
Search for “Dentist”, a Software Engineer ranks high
P R O B L E M O B S E R V E D
Search for “Dentist”, a Software Engineer ranks high
P R O B L E M O B S E R V E D
• Focused too much on promoting active job-seeking candidate
• We want our ranking to be more context-aware
f( , , ) => Accept?
Reject?
Recruiter
Context
Query
Context
Context-Aware Ranking – Pairwise Training
f( , , )1
- f( , , )2Recruiter
Context
Query
Context
{
Shared Context
=>
• Pair up two candidates from the same search request:
Accept?
1
Accept?
2?>
Context-Aware Ranking
• Before: Pointwise XGBoost
• After: Pairwise XGBoost with Context-Aware Features
• Recruiter Context: Personalization features
• Query Context: Query-Candidate matching features
• Online Results:
METRIC PRECISION@5 PRECISION@25 OVERALL ACCEPT
Lift +18.2% +13.7% +8%
P-Value 1e-16 1.1e-11 9.6e-4
Search for “Machine Learning Engineer”,
desirable to include some Data Scientists
P R O B L E M O B S E R V E D
Representation Learning
• Fuzzy semantic match on title ids, skill ids, company ids etc.
• Unsupervised Graph Embedding
• Co-Occurrence Graph based on profile data
Representation Learning
• Before: XGBoost
• After: XGBoost with Title Similarity Feature
• Based on unsupervised graph embedding
• Online Results:
METRIC PRECISION@5 PRECISION@25 OVERALL ACCEPT
Lift +2% +1.8% +3%
P-Value 0.2 0.25 0.11
Deep Learning?
• Differentiable Programming with TensorFlow
• Flexible for model engineering
• Offline result does not justify the effort yet.
• Offline Results (Pairwise NN v.s. Pointwise XGBoost):
METRIC PRECISION@1 PRECISION@5 PRECISION@25
Lift +5.3% +2.8% +1.7%
Personalization for
Talent Search
Entity-Level Personalization with GLMix
• GLMix: Generalized Linear Mixed Models
• GLMix: global model + per-entity models
• We added per-recruiter model and per-contract/company model
Entity-Level Personalization with GLMix
• Model Ensemble
• Nonlinearity via tree interaction features
• Each leaf node is a feature
• Offline Results (GLMix vs. Pairwise XGBoost):
METRIC PRECISION@1 PRECISION@5 PRECISION@25
Lift +8.5% +4.7% +2.0%
Using Recruiter Search requires a lot of skills.
P R O B L E M O B S E R V E D
A Stream of
Recommended Candidates
Recommended Matches
SIMPLIFIED EXPERIENCE
In-Session Personalization
• Step 1: Segment the Space
• Query Intent Clustering
• Step 2: Evaluate each segment
• Multi-Armed Bandits
• Step 3: Modify each segment
• Term Weight Updates
In-Session Personalization: Results
Talent Search Architecture
Search and Retrieval Architecture
• LinkedIn’s Galene is built on top of Lucene.
• Three main components:
• Search index on searcher
• The fanout queries through broker, and
• Live updates to the index using live-updater.
• Query language is similar to Lucene with OR, AND, NOT.
• The search index contains two types of fields:
• Inverted Fields
• Forward Fields
Search and Retrieval Architecture
• Static Rank
• An auxiliary rank for members to help with retrieving at scale
• Based on member profile and activity
• Early termination
• Index partitioned into N-shards, each retrieves and scores candidates
• Not all members in a shard can be retrieved, so query is early terminated on the basis of
static rank.
• Galene Facet Counting:
• Galene supports facet counting (such as region, titles, etc) for any given query.
• Uses statistical counting approximation based on sample in each shard
Layered Ranking Architecture
• L1: Better to scoop into the talent pool and score/rank more candidates.
• L2: Refines the short-listed talent to apply more dynamic features using external cache.
Summary
Summary
• Talent Search
• Criteria Search, Recommendation System, Marketplace
• Talent Search Ranking
• Context-Aware Pairwise Training
• Representation Learning & Deep Learning
• GLMix Personalization
• In-Session Personalization
+
Thank You
Qi Guo Sahin Cem Geyik Bo Hu Cagri Ozcaglar
Ketan Thakkar Xianren Wu Krishnaram Kenthapadi

More Related Content

PPTX
Personalizing LinkedIn Feed
PDF
Deep Learning for Personalized Search and Recommender Systems
PDF
Déjà Vu: The Importance of Time and Causality in Recommender Systems
PDF
[QCon.ai 2019] People You May Know: Fast Recommendations Over Massive Data
PPTX
Personalized Page Generation for Browsing Recommendations
PPTX
Activity Ranking in LinkedIn Feed
PDF
Recent Trends in Personalization: A Netflix Perspective
PDF
Deep Learning for Recommender Systems
Personalizing LinkedIn Feed
Deep Learning for Personalized Search and Recommender Systems
Déjà Vu: The Importance of Time and Causality in Recommender Systems
[QCon.ai 2019] People You May Know: Fast Recommendations Over Massive Data
Personalized Page Generation for Browsing Recommendations
Activity Ranking in LinkedIn Feed
Recent Trends in Personalization: A Netflix Perspective
Deep Learning for Recommender Systems

What's hot (20)

PDF
Past, Present & Future of Recommender Systems: An Industry Perspective
PDF
Sequential Decision Making in Recommendations
PDF
Deep Learning for Recommender Systems
PDF
Approximate nearest neighbor methods and vector models – NYC ML meetup
PDF
Netflix Recommendations Feature Engineering with Time Travel
PPTX
Personalized Job Recommendation System at LinkedIn: Practical Challenges and ...
PDF
Learning to rank
PDF
A Multi-Armed Bandit Framework For Recommendations at Netflix
PDF
Knowledge Graph Embeddings for Recommender Systems
PDF
Deep Natural Language Processing for Search and Recommender Systems
PDF
How Lazada ranks products to improve customer experience and conversion
PDF
How to build a recommender system?
PDF
Machine Learning for Q&A Sites: The Quora Example
PPTX
Learning to Rank Presentation (v2) at LexisNexis Search Guild
PDF
Personalizing the listening experience
PPTX
Netflix talk at ML Platform meetup Sep 2019
PPTX
Recommending What Video to Watch Next: A Multitask Ranking System
PDF
Marketplace in motion - AdKDD keynote - 2020
PDF
Making Netflix Machine Learning Algorithms Reliable
PDF
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Past, Present & Future of Recommender Systems: An Industry Perspective
Sequential Decision Making in Recommendations
Deep Learning for Recommender Systems
Approximate nearest neighbor methods and vector models – NYC ML meetup
Netflix Recommendations Feature Engineering with Time Travel
Personalized Job Recommendation System at LinkedIn: Practical Challenges and ...
Learning to rank
A Multi-Armed Bandit Framework For Recommendations at Netflix
Knowledge Graph Embeddings for Recommender Systems
Deep Natural Language Processing for Search and Recommender Systems
How Lazada ranks products to improve customer experience and conversion
How to build a recommender system?
Machine Learning for Q&A Sites: The Quora Example
Learning to Rank Presentation (v2) at LexisNexis Search Guild
Personalizing the listening experience
Netflix talk at ML Platform meetup Sep 2019
Recommending What Video to Watch Next: A Multitask Ranking System
Marketplace in motion - AdKDD keynote - 2020
Making Netflix Machine Learning Algorithms Reliable
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Ad

Similar to Talent Search and Recommendation Systems at LinkedIn: Practical Challenges and Lessons Learned (20)

PDF
Gutmacher In-House Sourcing Model Offshore and Onshore Nov. 2016
PPTX
Staffing_Recruiter_Staffing and RPO Costs For us rpo and staffingMasterclass_...
PPTX
Building Search and Personalization at Nordstrom Rack | Hautelook
PDF
Alizeh: A Radiant Icon Among Pakistani Clothing Brands for Women’s Ethnic Fas...
PPTX
AI_Resume_Analyzer_Using_nlp_Presentation.pptx
PPTX
Timberhorn ppt
PPTX
How to Find Data Analyst Jobs in USA OPTnation
PDF
Hcm file-wp
PPTX
Recruitment_PPT_for bba college student.
PPTX
How is Salarite connecting startups with the best web developer jobs in Jaipu...
PPTX
How Google works
PPT
An analysis of employee performance evaluation and employee motivation
PDF
Accendo's TalentPulse Package
PDF
How to Leverage Marketing Analytics to Source Better Talent
PDF
SEO Training Course on Wecommerce.pk web
PDF
DTEK Talent Management System
PPTX
SEO Trends in 2024 What You Need to Know.pptx
PPTX
About Youth4work - Integrated Talent Solutions
PPTX
REsume Screening
PPTX
Times Jobs _Times Internet Limited
Gutmacher In-House Sourcing Model Offshore and Onshore Nov. 2016
Staffing_Recruiter_Staffing and RPO Costs For us rpo and staffingMasterclass_...
Building Search and Personalization at Nordstrom Rack | Hautelook
Alizeh: A Radiant Icon Among Pakistani Clothing Brands for Women’s Ethnic Fas...
AI_Resume_Analyzer_Using_nlp_Presentation.pptx
Timberhorn ppt
How to Find Data Analyst Jobs in USA OPTnation
Hcm file-wp
Recruitment_PPT_for bba college student.
How is Salarite connecting startups with the best web developer jobs in Jaipu...
How Google works
An analysis of employee performance evaluation and employee motivation
Accendo's TalentPulse Package
How to Leverage Marketing Analytics to Source Better Talent
SEO Training Course on Wecommerce.pk web
DTEK Talent Management System
SEO Trends in 2024 What You Need to Know.pptx
About Youth4work - Integrated Talent Solutions
REsume Screening
Times Jobs _Times Internet Limited
Ad

Recently uploaded (20)

PDF
The-2025-Engineering-Revolution-AI-Quality-and-DevOps-Convergence.pdf
PPTX
Build automations faster and more reliably with UiPath ScreenPlay
PDF
Ensemble model-based arrhythmia classification with local interpretable model...
PDF
Transform-Your-Factory-with-AI-Driven-Quality-Engineering.pdf
PDF
Advancing precision in air quality forecasting through machine learning integ...
PDF
Introduction to MCP and A2A Protocols: Enabling Agent Communication
PDF
IT-ITes Industry bjjbnkmkhkhknbmhkhmjhjkhj
PDF
Build Real-Time ML Apps with Python, Feast & NoSQL
PDF
AI.gov: A Trojan Horse in the Age of Artificial Intelligence
PDF
EIS-Webinar-Regulated-Industries-2025-08.pdf
PDF
Aug23rd - Mulesoft Community Workshop - Hyd, India.pdf
PDF
Planning-an-Audit-A-How-To-Guide-Checklist-WP.pdf
PDF
INTERSPEECH 2025 「Recent Advances and Future Directions in Voice Conversion」
PDF
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
PDF
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
PDF
The AI Revolution in Customer Service - 2025
PPTX
agenticai-neweraofintelligence-250529192801-1b5e6870.pptx
PPTX
AI-driven Assurance Across Your End-to-end Network With ThousandEyes
PDF
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
PDF
Co-training pseudo-labeling for text classification with support vector machi...
The-2025-Engineering-Revolution-AI-Quality-and-DevOps-Convergence.pdf
Build automations faster and more reliably with UiPath ScreenPlay
Ensemble model-based arrhythmia classification with local interpretable model...
Transform-Your-Factory-with-AI-Driven-Quality-Engineering.pdf
Advancing precision in air quality forecasting through machine learning integ...
Introduction to MCP and A2A Protocols: Enabling Agent Communication
IT-ITes Industry bjjbnkmkhkhknbmhkhmjhjkhj
Build Real-Time ML Apps with Python, Feast & NoSQL
AI.gov: A Trojan Horse in the Age of Artificial Intelligence
EIS-Webinar-Regulated-Industries-2025-08.pdf
Aug23rd - Mulesoft Community Workshop - Hyd, India.pdf
Planning-an-Audit-A-How-To-Guide-Checklist-WP.pdf
INTERSPEECH 2025 「Recent Advances and Future Directions in Voice Conversion」
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
The AI Revolution in Customer Service - 2025
agenticai-neweraofintelligence-250529192801-1b5e6870.pptx
AI-driven Assurance Across Your End-to-end Network With ThousandEyes
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
Co-training pseudo-labeling for text classification with support vector machi...

Talent Search and Recommendation Systems at LinkedIn: Practical Challenges and Lessons Learned

  • 1. Talent Search and Recommendation Systems at LinkedIn Practical Challenges and Lessons Learned Qi Guo, Sahin Cem Geyik, Bo Hu, Cagri Ozcaglar, Ketan Thakkar, Xianren Wu, Krishnaram Kenthapadi AI @ LinkedIn +SIGIR 2018
  • 2. + The Team Qi Guo Sahin Cem Geyik Bo Hu Cagri Ozcaglar Ketan Thakkar Xianren Wu Krishnaram Kenthapadi
  • 3. Contents • Introduction • Ranking Models for Talent Search • Personalization • Talent Search Architecture • Summary
  • 5. LinkedIn Talent Solution: ~65% of LinkedIn’s Annual Revenue A H I R I N G E C O S Y S T E M
  • 6. LinkedIn Recruiter MAJOR PRODUCT A Talent Search and Recommendation System
  • 7. Recruiter Search • Criteria-Based Search • A recruiter has specific requisitions to fill • Candidate Recommendation System • A recruiter may want many qualified candidates, goes through pages • Considers Both Sides of the Talent Marketplace • Talents are limited resources
  • 8. # of InMail Accepts OPTIMIZATION OBJECTIVE: 3. Accept 2. Send InMail Recruiter Candidate 1. Search
  • 9. Ranking Models for Talent Search
  • 10. Number of InMail Accepts Per Seat: 30% YoY O V E R A L L I M P R O V E M E N T
  • 11. Go Non-Linear with Tree Model • Before: Linear Model optimized for NDCG with Coordinate Ascent • After: XGBoost Tree Model • Captures feature interactions • XGBoost: gradient boosting tree models for richer model complexity • Online Results: METRIC PRECISION@5 PRECISION@25 OVERALL ACCEPT Lift +7.5% +7.4% +5.1% P-Value 2.1e-4 4.8e-4 0.01
  • 12. Search for “Dentist”, a Software Engineer ranks high P R O B L E M O B S E R V E D
  • 13. Search for “Dentist”, a Software Engineer ranks high P R O B L E M O B S E R V E D • Focused too much on promoting active job-seeking candidate • We want our ranking to be more context-aware f( , , ) => Accept? Reject? Recruiter Context Query Context
  • 14. Context-Aware Ranking – Pairwise Training f( , , )1 - f( , , )2Recruiter Context Query Context { Shared Context => • Pair up two candidates from the same search request: Accept? 1 Accept? 2?>
  • 15. Context-Aware Ranking • Before: Pointwise XGBoost • After: Pairwise XGBoost with Context-Aware Features • Recruiter Context: Personalization features • Query Context: Query-Candidate matching features • Online Results: METRIC PRECISION@5 PRECISION@25 OVERALL ACCEPT Lift +18.2% +13.7% +8% P-Value 1e-16 1.1e-11 9.6e-4
  • 16. Search for “Machine Learning Engineer”, desirable to include some Data Scientists P R O B L E M O B S E R V E D
  • 17. Representation Learning • Fuzzy semantic match on title ids, skill ids, company ids etc. • Unsupervised Graph Embedding • Co-Occurrence Graph based on profile data
  • 18. Representation Learning • Before: XGBoost • After: XGBoost with Title Similarity Feature • Based on unsupervised graph embedding • Online Results: METRIC PRECISION@5 PRECISION@25 OVERALL ACCEPT Lift +2% +1.8% +3% P-Value 0.2 0.25 0.11
  • 19. Deep Learning? • Differentiable Programming with TensorFlow • Flexible for model engineering • Offline result does not justify the effort yet. • Offline Results (Pairwise NN v.s. Pointwise XGBoost): METRIC PRECISION@1 PRECISION@5 PRECISION@25 Lift +5.3% +2.8% +1.7%
  • 21. Entity-Level Personalization with GLMix • GLMix: Generalized Linear Mixed Models • GLMix: global model + per-entity models • We added per-recruiter model and per-contract/company model
  • 22. Entity-Level Personalization with GLMix • Model Ensemble • Nonlinearity via tree interaction features • Each leaf node is a feature • Offline Results (GLMix vs. Pairwise XGBoost): METRIC PRECISION@1 PRECISION@5 PRECISION@25 Lift +8.5% +4.7% +2.0%
  • 23. Using Recruiter Search requires a lot of skills. P R O B L E M O B S E R V E D
  • 24. A Stream of Recommended Candidates Recommended Matches SIMPLIFIED EXPERIENCE
  • 25. In-Session Personalization • Step 1: Segment the Space • Query Intent Clustering • Step 2: Evaluate each segment • Multi-Armed Bandits • Step 3: Modify each segment • Term Weight Updates
  • 28. Search and Retrieval Architecture • LinkedIn’s Galene is built on top of Lucene. • Three main components: • Search index on searcher • The fanout queries through broker, and • Live updates to the index using live-updater. • Query language is similar to Lucene with OR, AND, NOT. • The search index contains two types of fields: • Inverted Fields • Forward Fields
  • 29. Search and Retrieval Architecture • Static Rank • An auxiliary rank for members to help with retrieving at scale • Based on member profile and activity • Early termination • Index partitioned into N-shards, each retrieves and scores candidates • Not all members in a shard can be retrieved, so query is early terminated on the basis of static rank. • Galene Facet Counting: • Galene supports facet counting (such as region, titles, etc) for any given query. • Uses statistical counting approximation based on sample in each shard
  • 30. Layered Ranking Architecture • L1: Better to scoop into the talent pool and score/rank more candidates. • L2: Refines the short-listed talent to apply more dynamic features using external cache.
  • 32. Summary • Talent Search • Criteria Search, Recommendation System, Marketplace • Talent Search Ranking • Context-Aware Pairwise Training • Representation Learning & Deep Learning • GLMix Personalization • In-Session Personalization
  • 33. + Thank You Qi Guo Sahin Cem Geyik Bo Hu Cagri Ozcaglar Ketan Thakkar Xianren Wu Krishnaram Kenthapadi