Creating multimodal texts through images, audio, and video