This paper describes an approach to expose the salient visual information from raw sensory data stream. At first, a general
framework of media application is introduced. Then, based on the implementation of an approach proposed by MIT technical report,
several improved techniques with respect to the existing drawbacks in the original extraction method are applied to establish
our new framework. Finally, the result of our experiment system and its future work are given.