We propose a multimodal dialogue description language that extends VoiceXML, a spoken dialogue description language
for voice user interfaces. The extension adds specifications for outputting text, images, 3D images, life-like
communication agents, and multimedia clips.