 Acta Automatica Sinica  2018, Vol. 44, Issue 1: 99-105

Image Retrieval with Enhanced Visual Dictionary and Query Expansion
KE Sheng-Cai$^{1,2}$, LI Bi-Cheng$^{3}$, CHEN Gang$^{1}$, ZHAO Yong-Wei$^{1}$, WEI Han$^{1}$
1. Institute of Information System Engineering, PLA Information Engineering University, Zhengzhou 450001;
2. Unit 75830, Guangzhou 510000;
3. College of Computer Science and Technology, Huaqiao University, Xiamen 361021
Manuscript received: January 29, 2016; accepted: August 15, 2016.
Foundation Item: Supported by National Natural Science Foundation of China (60872142) and Scientific Research Funds of Huaqiao University
Corresponding author. LI Bi-Cheng  Professor at the College of Computer Science and Technology, Huaqiao University. His research interests cover text analysis and understanding, speech/image/video processing and recognition, and information fusion.
Recommended by Associate Editor LIU Yue-Hu
Abstract: The most popular approach to image retrieval is based on the bag of visual words (BoVW) model. However, several fundamental problems restrict the performance of this approach, such as low time efficiency, weak discrimination of visual words, and poor robustness. To address these problems, an image retrieval method with an enhanced visual dictionary and query expansion is proposed. Firstly, clustering by fast search and find of density peaks is used to generate a group of visual words. Secondly, uninformative words in the dictionary are eliminated by a Chi-square model to improve the discriminative ability of the visual dictionary. Finally, an efficient graph-based visual reranking method is introduced to refine the initial search results. Experimental results on the Oxford5K and Paris6K datasets indicate that the expressive ability of the visual dictionary is effectively improved and that the method outperforms state-of-the-art image retrieval methods.
Key words: Bag of visual words (BoVW); density-based clustering; Chi-square model; query expansion

1 Image Retrieval Based on Enhanced Visual Dictionary and Query Expansion

 Figure 1 Flow chart of the image retrieval method based on an enhanced visual dictionary and query expansion
1.1 Visual Dictionary Group Based on Density Clustering

 ${\rho _i} = \mathop \sum \limits_j \chi ({d_{ij}} - {d_c})$ (1)

 ${\delta _i} = \left\{ {\begin{array}{*{20}{c}} {\mathop {\min }\limits_{j:{\rho _j} > {\rho _i}} ({d_{ij}})}, &{{\rho _i} < {\rho _{\max }}}\\ {\mathop {\max }\limits_j ({d_{ij}})}, &{{\rho _i} = {\rho _{\max }}} \end{array}} \right.$ (2)
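
Equations (1) and (2) can be illustrated with a minimal Python sketch. It assumes the usual density-peaks convention that $\chi(x) = 1$ for $x < 0$ and $0$ otherwise, that a point is not counted as its own neighbor, and that $d_{ij}$ is the Euclidean distance; none of these choices is stated in the text above, and the function name `density_peaks` is hypothetical.

```python
import math


def density_peaks(points, d_c):
    """Return (rho, delta) following Eqs. (1)-(2) for a list of coordinate tuples."""
    n = len(points)
    # Pairwise distances d_ij (Euclidean distance is an assumption).
    d = [[math.dist(p, q) for q in points] for p in points]
    # Eq. (1): rho_i counts the other points closer than the cutoff d_c,
    # assuming chi(x) = 1 for x < 0 and 0 otherwise.
    rho = [sum(1 for j in range(n) if j != i and d[i][j] < d_c) for i in range(n)]
    rho_max = max(rho)
    delta = []
    for i in range(n):
        if rho[i] < rho_max:
            # Distance to the nearest point of strictly higher density.
            delta.append(min(d[i][j] for j in range(n) if rho[j] > rho[i]))
        else:
            # A point of maximal density takes its largest distance to any point.
            delta.append(max(d[i]))
    return rho, delta


rho, delta = density_peaks([(0, 0), (0, 1), (1, 0), (10, 10), (10, 11)], d_c=1.5)
```

Points with both a large $\rho_i$ and a large $\delta_i$ are the natural candidates for cluster centers, which in this setting would serve as visual words.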

1.2 Visual Word Filtering

 $x_i^2 = \sum\limits_{k = 1}^2 {\sum\limits_{j = 1}^m {\frac{{{{(N \cdot {n_{kj}} - {n_{k + }} \cdot {n_{ + j}})}^2}}}{{N \cdot {n_{k + }} \cdot {n_{ + j}}}}} }$ (3)

 $\tilde x_i^2 = \frac{x_i^2}{\mathrm{tf}(w_i)}$ (4)
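
As a hedged illustration of Eqs. (3) and (4), the sketch below computes the statistic for a $2 \times m$ contingency table. The layout of the table (row 1 for occurrences of word $w_i$, row 2 for all other words, one column per image category) and the function names are assumptions made for the example; the term frequency $tf(w_i)$ is simply passed in by the caller.

```python
def chi_square(table):
    """Eq. (3) for a 2 x m contingency table given as two lists of counts."""
    row_totals = [sum(row) for row in table]        # n_{k+}
    col_totals = [sum(col) for col in zip(*table)]  # n_{+j}
    total = sum(row_totals)                         # N
    stat = 0.0
    for k, row in enumerate(table):
        for j, n_kj in enumerate(row):
            denom = total * row_totals[k] * col_totals[j]
            if denom == 0:
                # Skipping empty rows or columns is an assumption of this sketch.
                continue
            stat += (total * n_kj - row_totals[k] * col_totals[j]) ** 2 / denom
    return stat


def normalized_chi_square(table, tf):
    """Eq. (4): divide the statistic by the term frequency tf(w_i)."""
    return chi_square(table) / tf


# Hypothetical counts: a word seen 10 and 20 times in two categories.
score = normalized_chi_square([[10, 20], [30, 40]], tf=30)
```

A word whose occurrences are spread across categories in the same proportion as all other words yields a statistic of zero, so low scores would mark candidates for removal from the dictionary.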

1.3 Graph-Based Query Expansion

 Figure 2 Flow chart of the query expansion method based on graph structure

 ${R_k}(i, i') = \{ (i, i')|i \in {N_k}(i'), i' \in {N_k}(i)\}$ (5)

 $w(i, i') = \left\{ {\begin{array}{*{20}{l}} {\dfrac{{|{N_k}(i) \cap {N_k}(i')|}}{k}}, &{\mbox{if}~(i, i') \in {R_k}(i, i')}\\ 0, &\mbox{otherwise} \end{array}} \right.$ (6)
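
The reciprocal-neighbor relation of Eq. (5) and the edge weight of Eq. (6) can be sketched as follows. The input format (a dictionary mapping each image identifier to its ranked neighbor list) and the name `reciprocal_knn_weights` are assumptions for illustration only; pairs that are absent from the result correspond to the zero case of Eq. (6).

```python
def reciprocal_knn_weights(neighbors, k):
    """Edge weights of Eq. (6) over the reciprocal k-NN pairs of Eq. (5).

    neighbors: dict mapping an image id to its ranked list of nearest
    neighbours (a hypothetical input format).
    """
    top_k = {i: set(ranked[:k]) for i, ranked in neighbors.items()}
    weights = {}
    for i, n_i in top_k.items():
        for j in n_i:
            # Eq. (5): keep the pair only if each image is in the other's k-NN set.
            if j in top_k and i in top_k[j]:
                # Eq. (6): size of the shared neighbourhood, scaled by k.
                weights[(i, j)] = len(n_i & top_k[j]) / k
    return weights


ranked_lists = {
    "a": ["b", "c", "d"],
    "b": ["a", "c", "d"],
    "c": ["a", "b", "d"],
    "d": ["e", "f", "g"],
}
weights = reciprocal_knn_weights(ranked_lists, k=2)
```

Requiring the neighbor relation to hold in both directions drops one-sided links such as the one from "a" to "d" when $k = 3$, which is what makes the graph more robust to false matches than a plain k-NN graph.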

 ${s_i} = \min \left\{ {\beta ^n}\frac{{\left\| {{f_i} - {f_n}} \right\|_2^2}}{{\sigma _n^2}}|n = 1, 2, \cdots, {N_c}\right\}$ (7)

2 Experimental Setup and Performance Evaluation
2.1 Experimental Setup

2.1.1 Experimental Performance Analysis

 Figure 3 The effect of the distance threshold $d_c$ on MAP

 Figure 4 The effect of vocabulary size on MAP

 Figure 5 The effect of the number of removed stop words on MAP

 Figure 6 MAP of different methods on the Oxford5K and Oxford5K+Paris6K databases

 Figure 7 Image retrieval results of EVD+GBQE on the Oxford5K+Paris6K database
3 Conclusion

