Standout Feature
All-Round Multimodal Reference
Supports mixed input of text + image + video + audio
Maximum per time : 9 images + 3 videos + 3 audios = 12 reference files
Usage example : @Image1 @Video2 A girl travels through the world of famous paintings, cinematic texture, seamless transitions