It is the cache of ${baseHref}. It is a snapshot of the page. The current page could have changed in the meantime.
Tip: To quickly find your search term on this page, press Ctrl+F or ⌘-F (Mac) and use the find bar.

Image Annotation Refinement Using Dynamic Weighted Voting Based on Mutual Information | Song | Journal of Software
Journal of Software, Vol 6, No 11 (2011), 2239-2246, Nov 2011
doi:10.4304/jsw.6.11.2239-2246

Image Annotation Refinement Using Dynamic Weighted Voting Based on Mutual Information

Haiyu Song, Xiongfei Li, Pengjie Wang

Abstract


Automatic image annotation is a promising solution to narrow the semantic gap between low-level content and high-level semantic concept, which has been an active research area in the fields of image retrieval, pattern recognition, and machine learning. However, even the most dedicated annotation algorithms are often unsatisfactory. Image annotation refinement has attracted much more attention recently. In this paper, a novel refinement algorithm using dynamic voting based on mutual information is proposed. Unlike the traditional refinement algorithm, the proposed algorithm adopts dynamic weighted voting to measure the dependence between the candidate annotations, which not only permits that the annotations with higher probabilities deny the annotations with lower probabilities, but also permits that the annotations with lower probabilities deny the annotations with higher probabilities. The proposed refinement algorithm adopts progressive method instead of iterative, which can significantly decrease the time cost of refining annotations. In order to further improve efficiency without sacrificing precision, we propose the block-based normalized cut algorithm to segment image. Experiments conducted on standard Washington Ground Truth Image Database demonstrate the effectiveness and efficiency of our proposed approach for refining image annotations.



Keywords


image annotation refinement; image retrieval; mutual information; normalized cut; relevance model

References


[1]Arnold W.M. Smeulders, Marcel Worring, Simone Santini, Amarnath Gupta, Ramesh Jain, “Content-Based Image Retrieval at the End of the Early Years”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.22(12), pp.1349 – 1380, December 2000.
http://dx.doi.org/10.1109/34.895972

[2]Jing Liu, Mingjing Li, Wei-Ying Ma, Qingshan Liu, Hanqing Lu, “An adaptive graph model for automatic image annotation”, Proceedings of the 8th ACM international workshop on Multimedia information retrieval, pp.61-70, New York :ACM Press, 2006.

[3]Jia Li, James Z. Wang, “Automatic linguistic indexing of pictures by a statistical modeling approach”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.25(9), pp1075-1088, September 2003.

[4]Lim J-H, Tian Q, Mulhem P, “Home photo content modeling for personalized event-based retrieval”, IEEE Multimedia, vol.10(4), pp.28-37, 2003.
http://dx.doi.org/10.1109/MMUL.2003.1237548

[5]Park SB, Lee JW, Kim SK, “Content-based image classificatio using a neural network”, Pattern Recognition Letters, vol. 25, pp.287-300, 2004.
http://dx.doi.org/10.1016/j.patrec.2003.10.015

[6]Kim, S. Park, S., Kim, M., “Image classification into object/non-object classes” Proceedings of the International Conference on Image and Video Retrieval, Lecture Notes in Computer Science 3115 Springer 2004, Berlin; New York: Springer Press, pp.393-400, 2004.

[7]Chapelle O, Haffner P, Vapnik VN., “Support vector machines for histogram-based image classification”, IEEE Trans Neural Networks, vol.10(5), pp. 1055-1064, 1999
http://dx.doi.org/10.1109/72.788646
PMid:18252608

[8]Brank J. Using image segmentation as a basis for categorization. Informatica, 2002; 26: 4.

[9]Qbal Q, Aggarwal JK., “Retrieval by classification of images containing large manmade objects using perceptual grouping”, Pattern Recognition, vol.35(7), pp.1463-1479. 2002.
http://dx.doi.org/10.1016/S0031-3203(01)00139-X

[10]Cheng Y-C, Chen S-Y., “Image classification using color, texture and regions”, Image and Vision Computing, vol. 21(9), pp. 759-776, 2003.
http://dx.doi.org/10.1016/S0262-8856(03)00069-6

[11]Vailaya A, Figueiredo AT, Jain AK, Zhang H-J., “Image classification for content-based indexing”, IEEE Trans Image Proc, vol. 10(1), pp.117-130, 2001.
http://dx.doi.org/10.1109/83.892448
PMid:18249602

[12]Luo J., Savakis A., “Indoor vs outdoor classification of consumer photographs using low-level and semantic features”, Proceedings of the IEEE International Conference on Image Processing, IEEE Press, pp. 745-748, 2001.

[13]Jeon J., Manmatha R. “Using maximum entropy for automatic image annotation”, Proceedings of the International Conference on Image and Video Retrieval, Berlin; New York : Springer Press, pp.24-32, 2004.

[14]Wang JZ, Li J., “Learning-based linguistic indexing of pictures with 2-D MHMMs”, Proceedings of the ACM International Conference on Multimedia, New York :ACM Press, pp. 436-445, 2002.
http://dx.doi.org/10.1145/641007.641104

[15]Y. Mori, H. Takahashi, and R. Oka., “Image-to-word transformation based on dividing and vector quantizing images with words”, In First International Workshop on Multimedia Intelligent Storage and Retrieval Management, New York: ACM Press, 1999.

[16]P. Duygulu, K. Barnard, N. de Freitas, and D. Forsyth., “Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary”, In Seventh European Conference on Computer Vision, Berlin; New York : Springer Press, pp.97-112, 2002.

[17]Pan, J.Y., Yang, H.J. and Pinar, D., “Automatic multimedia cross-modal correlation discovery”, The Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York : ACM Press, pp.653-658, August 2004.

[18]J. Jeon, V. Lavrenko, and R. Manmatha, “Automatic image annotation and retrieval using cross-media relevance models”, In Proceedings of the 26th Annual international ACM SIGIR Conference on Research and Development in information Retrieval, New York : ACM Press, 2003.

[19]Y. Jin, L. Khan, L. Wang and M. Awad, “Image annotations by combining multiple evidence & wordnet”, In Proceedings of the 13th Annual ACM international Conference on Multimedia, New York: ACM Press, 2005.

[20]S. L. Feng, R. Manmatha, and V. Lavrenko, “Multiple Bernoulli relevance models for image and video annotation”, In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, vol.2, pp.1002-1009, IEEE CS Press, 2004.

[21]Yohan Jin, Latifur Khan, B.Prabhakaran, "To be Annotated or not?: the Randomized Approximation Graph Algorithm for Image Annotation Refinement Problem", Proceedings of the 24th International Conference on Data Engineering, IEEE Press, April 2008.

[22]C. Wang, F. Jing, L. Zhang, and H. J. Zhang, “Image Annotation Refinement using Random Walk with Restarts“, In Proceedings of the 14th Annual ACM international Conference on Multimedia, New York: ACM Press, 2006.

[23]Ricaro Baeza-Yates, Berthier Ribeiro-Neto, Modern Information Retrieval. New York: ACM Press,pp27-31, 1999.

[24] Changhu Wang, Feng Jing, Lei Zhang, Hong-Jiang Zhang. “Content-based Image Annotation Refinement”, IEEE Conference on In Computer Vision and Pattern Recognition, IEEE CS Press, pp.1-8, 2007.

[25]Haiyu Song, Xiongfei Li, Pengjie Wang, “Automatic Image Annotation based on Improved Relevance Model”, 2009 Asia-Pacific Conference on Information Processing, IEEE CS Press, pp.59-63, 2009.

[26]Jianbo Shi and Jitendra Malik, “Normalized Cuts and Image Segmentation”, IEEE Transactions on Pattern Analysis and Machine Intelligence(PAMI), vol22(8), 2000

[27]Haiy Song, Xiongfei Li, Pengjie Wang, Jingrun Chen. Block-based Normalized-cut Algorithm for Image Segmentation, Lecture Notes in Electrical Engineering, Springer, January 2010, in press

[28]www.cs.washington.edu/research/imagedatabase/groundtruth


Full Text: PDF


Journal of Software (JSW, ISSN 1796-217X)

Copyright @ 2006-2014 by ACADEMY PUBLISHER – All rights reserved.