On Reasoning Behind Next Occupation Recommendation
Summary
Researchers developed a novel reasoning-augmented framework to improve large language models' (LLMs) accuracy in predicting future occupations. This two-step approach involves a reason generator that uses a user's education and career history to derive a "reason" summarizing their preferences, which then serves as input for an occupation predictor. To overcome LLMs' lack of alignment with career paths, the team fine-tuned small LLMs using high-quality "oracle reasons" generated by a GPT-4.1 LLM-as-a-Judge, evaluated for factuality, coherence, and utility. Experiments demonstrated that this fine-tuning significantly enhances prediction accuracy, making LLMs comparable to fully supervised methods and superior to unsupervised approaches. A single LLM fine-tuned for both reason generation and occupation prediction (joint model) outperformed separate models, and prediction accuracy was directly linked to the quality of the generated reasons. The code for this approach is publicly available.
Key takeaway
For AI Engineers developing career guidance systems, integrating explicit reasoning into LLM-based prediction models is crucial. You should prioritize joint fine-tuning of a single LLM for both reason generation and occupation prediction, as this strategy consistently yields higher accuracy than separate models. Focus on generating high-quality, factual, and coherent reasons, potentially using an LLM-as-a-Judge for data curation, to ensure robust and interpretable career recommendations.
Key insights
Reasoning-augmented LLMs, fine-tuned with high-quality oracle reasons, significantly enhance next-occupation prediction accuracy.
Principles
- Reason quality directly impacts prediction accuracy.
- Joint training of reason generation and prediction is superior.
- LLM-as-a-Judge can create high-quality synthetic supervision.
Method
A two-step process: generate a user-specific reason from career history, then use that reason to predict the next occupation. Fine-tune LLMs using oracle reasons generated and filtered by an LLM-as-a-Judge for factuality, coherence, and utility.
In practice
- Use O*NET-SOC taxonomy for standardized occupation classification.
- Employ DPO to refine reason generation quality.
- Consider Qwen3-8B as a backbone for fine-tuning.
Topics
- Next Occupation Prediction
- Large Language Models
- Reasoning-Augmented AI
- Supervised Fine-Tuning
- Direct Preference Optimization
Code references
Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.