

Maintain the exact identity from the image reference. Preserve facial structure, skin texture, eye shape, nose shape, lips, jawline, hairstyle, body proportions, clothing, accessories, and all visual details exactly. Animate the character naturally following the rhythm, timing, energy, and beat pattern of the provided audio. Use the audio only as motion guidance. Head movement synchronized to the beat. Natural shoulder movement. Natural body sway. Subtle hand gestures. Natural breathing motion. Realistic body mechanics. Realistic weight shift. Natural blinking. Natural facial expressions. Maintain perfect facial consistency throughout the entire video. No face distortion. No identity drift. No facial flickering. No anatomy distortion. No body deformation. OUTPUT SILENT VIDEO ONLY. Do not generate any voice. Do not generate any music. Do not generate any sound effects. Do not generate any ambient audio. Do not modify the provided audio. Do not recreate the provided audio. The audio reference is used only to drive motion and timing. Locked camera. Vertical 9:16. 85mm portrait lens. Natural depth of field. Photorealistic human. Real camera footage. Natural skin texture. Visible skin pores. Natural facial asymmetry. Realistic hair movement. Realistic fabric movement. Realistic lighting. Realistic shadows. Not cartoon. Not animation. Not CGI. Not 3D render. Not illustration. Not doll-like. Not plastic skin. Ultra realistic social media video. TikTok-quality realism. Professional cinematography.