In this paper, we describe a near-complete Pocket PC implementation of a Mobile Multi-Modal Interaction (M3I) platform for
pedestrian navigation. The platform is designed to support both indoor and outdoor navigation tasks, and combines
several modalities for presentation output and user input. Whereas 2D/3D graphics and synthesized speech are used to present
useful information on routes and places, fused input from embedded speech and gesture recognition engines allows for situated
user interaction.