Together AI expands fine-tuning service with tool calling, reasoning, and vision support
Summary
Together AI has expanded its fine-tuning service, Together Fine-Tuning, with native support for tool calling, reasoning, and vision-language model (VLM) fine-tuning, effective March 18, 2026. This update addresses common reliability issues in multi-turn AI workflows, such as inconsistent tool calls and degraded reasoning. The service now supports training models with up to 1T parameters and datasets up to 100GB, delivering up to 6x higher throughput for larger models like Kimi-K2 and at least a 2x increase across all models. Enhancements include end-to-end fine-tuning for OpenAI-compatible tool schemas, specialized support for "thinking" tokens in reasoning traces, and native vision training for domain-specific visual data. Additionally, Together AI now provides job cost estimations before training and ETAs during training for better experiment planning.
Key takeaway
For AI Engineers building advanced multi-turn applications, Together AI's expanded fine-tuning capabilities directly address common reliability challenges. You can now fine-tune models for robust tool calling, complex reasoning, and domain-specific vision interpretation, reducing downstream failures. This allows your team to iterate faster, potentially cutting costs by 2-3x and improving accuracy from 77% to 87%, as demonstrated by XY.AI Labs. Leverage the new cost and time estimations to better plan your experiments and accelerate deployment.
Key insights
Together AI's fine-tuning service now natively supports complex AI workflows like tool calling, reasoning, and VLM training.
Principles
- Agentic AI requires reliable tool call execution.
- Structured reasoning traces improve model logic.
- Domain-specific visual data needs VLM alignment.
Method
The service enables fine-tuning on OpenAI-compatible tool schemas, "thinking" tokens for reasoning, and base64 encoded images for VLMs. It supports hybrid datasets and offers optional joint training of vision encoders and language layers.
In practice
- Fine-tune Qwen, Moonshot AI, Z.AI for tool calls.
- Train Qwen, Z.AI models on reasoning traces.
- Use hybrid datasets for VLM fine-tuning.
Topics
- LLM Fine-tuning
- Tool Calling
- Reasoning Traces
- Vision-Language Models
- AI Agent Development
- Model Training Optimization
Code references
Best for: AI Architect, Computer Vision Engineer, CTO, Machine Learning Engineer, AI Engineer, MLOps Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Together AI | The AI Native Cloud - Together.ai.