TeleMorpher: Toward Robust Simultaneous Motion-Location Editing
Summary
TeleMorpher is a novel one-shot framework designed for robust simultaneous motion-location editing in videos, addressing the previously underexplored challenge of transforming both subject motion and its spatial location. Published on 2026-06-18, this approach analyzes factors degrading editing quality and leverages motion priors, target motion-centric video guidance, and ground truth motion for precise control. The framework operates by first disentangling the protagonist and background using pre-trained segmentation and inpainting models. It then employs training-free pose warping to edit the protagonist's motion, guided by motion priors. The resulting warped motion video is injected into a baseline motion editor during inference, preserving source video appearance while mitigating motion differences. To enhance evaluation reliability, TeleMorpher introduces two new LPIPS-based metrics for background consistency and motion editing fidelity, validated through experiments on in-the-wild videos and the TaiChi dataset, demonstrating superior quantitative and qualitative performance.
Key takeaway
For Computer Vision Engineers developing advanced video editing tools, TeleMorpher offers a robust framework for simultaneous motion and location transformation. You should consider integrating its disentanglement and motion prior guidance techniques to achieve more controllable and precise editing outcomes. Furthermore, adopt its proposed LPIPS-based metrics to enhance the reliability and fidelity of your quantitative evaluations for motion editing performance.
Key insights
TeleMorpher enables robust simultaneous motion-location video editing by disentangling elements and leveraging motion priors for precise control.
Principles
- Disentangle protagonist and background for robust editing.
- Motion priors and target guidance improve editing control.
Method
TeleMorpher disentangles protagonist/background, applies training-free pose warping with motion priors, then injects the warped video into a baseline motion editor during inference to preserve appearance and mitigate motion differences.
In practice
- Edit character motion and position in existing videos.
- Generate new video sequences with controlled subject movement.
Topics
- Motion Editing
- Video Editing
- Diffusion Models
- Pose Warping
- LPIPS Metrics
- Computer Vision
Best for: Research Scientist, AI Scientist, Computer Vision Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.