Prompt Injection in Automated Résumé Screening with Large Language Models: Single and Multi-Injection Settings
Summary
Large language models (LLMs) are increasingly employed for screening and ranking job applicants, creating incentives for candidates to manipulate these systems. A study investigated prompt injection in automated résumé screening, defined as subtle self-promotional text not adding new qualifications but designed to influence LLM evaluations. Controlled experiments revealed that prompt injection reliably improves applicant rankings when résumé quality is homogeneous and few candidates inject. However, its effectiveness rapidly diminishes as more candidates inject, collapsing when manipulation becomes widespread. When candidate quality is heterogeneous, prompt injection is less effective on average, but can occasionally allow lower-quality candidates to outrank higher-quality ones, raising fairness concerns. LLM-based screening is most vulnerable when manipulation is rare and candidate quality differences are small.
Key takeaway
For HR teams or ML Engineers deploying LLM-based résumé screeners, you must account for prompt injection vulnerabilities. While rare, subtle injections can significantly alter rankings, especially in homogeneous candidate pools, potentially allowing lower-quality applicants to advance. Implement robust detection mechanisms and consider diverse evaluation metrics to mitigate fairness risks as manipulation becomes more prevalent. Your systems are most vulnerable when manipulation is rare and candidate quality differences are small.
Key insights
Prompt injection can manipulate LLM-based résumé screening, but its effectiveness decreases with widespread use and varies with candidate quality.
Principles
- Injection works best with homogeneous quality.
- Widespread injection reduces individual efficacy.
- Fairness risks arise with heterogeneous quality.
Method
Controlled experiments were used to study prompt injection's impact on LLM-based résumé screening in single and multi-injection settings.
In practice
- Monitor for rare, subtle self-promotional text.
- Assess LLM vulnerability with homogeneous pools.
- Consider fairness implications for diverse candidates.
Topics
- Prompt Injection
- Large Language Models
- Résumé Screening
- Algorithmic Bias
- Hiring Systems
- AI Ethics
Code references
Best for: AI Architect, AI Engineer, NLP Engineer, AI Scientist, Machine Learning Engineer, AI Ethicist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.