Spatial-Frequency Redistribution-based Saliency Region Detection for Thai Text Localisation
Main Article Content
Abstract
Saliency region detection plays an important role in computer vision applications and related areas, such as human fixation, figure-ground separation, face detection, and image compression. Nevertheless, state-of-the-art methods can moderately detect the saliency regions of a psychological pattern dataset. Therefore, this paper proposes a spatial-frequency redistribution (SFR) method to improve the efficiency of the detection of the saliency regions. The proposed SFR method consists of two main procedures: (i) adaptive cosine image construction-and-implantation and (ii) saliency region detection using complete multiple-object-implanted images. The former procedure constructs an adaptive cosine image by using a cosine function based on an object structure and then individually implants it into each object detectable. This stage provides the first significant property, spatial redistribution, to the object implanted. The adaptive cosine image redistributes the objects for controllability. Then, the latter procedure transforms a complete multiple object-implanted image into the frequency domain. At this point, the second significant property, frequency redistribution, provides the simple technique for identifying and separating the target object from the unwanted objects and background. In this paper, these properties are referred to as spatial-frequency redistribution. This method was realized as a computer program, and then such program was tested with a psychological pattern dataset and a Thai text dataset. The experimental results, when compared with state-of-the-art methods, show that the proposed SFR method can achieve the clear detection of the attention region in the psychological pattern dataset. Moreover, the proposed method can achieve a F-value of 80.28% with a recall and precision of 76.32% and 85.23%, respectively, for Thai text localisation in the Thai text dataset.
Article Details
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
References
L.Zhang, M.H. Tong, T.K. Marks, H.Shan, and G.W. Cottrell, "SUN: A Bayesian framework for saliency using natural statistics," Journal of Vision, vol.8, no.7, pp.1-20, Dec. 2008.
X.Hou, J.Harel, and C.Koch, "Image signature: Highlighting sparse salient regions," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, no.1, pp.194-201, Jan. 2012.
D.Walther and C.Koch, "Modeling attention to salient proto-objects," Neural Networks, vol.19, no.9, pp.1395-1407, Nov. 2006.
M.Cerf, J.Harel, W.Einhaeuser, and C.Koch, "Predicting human gaze using low-level saliency combined with face detection," Advances in Neural Information Processing Systems 20, pp.241-248, 2007.
• C.Guo and L.Zhang, "A novel multiresolution spatiotemporal saliency detection model and its applications in image and video compression," IEEE Transactions on Image Processing, vol.19, no.1, pp.185-198, Jan. 2010.
W.Einhauser, M.Spain, and P.Perona, "Objects predict fixations better than early saliency," Journal of Vision, vol.8, no.14, p.1-26, 2008.
L.Elazary and L.Itti, "Interesting objects are visually salient," Journal of Vision, vol.8, no.3, p.1-15, 2008.
L.Itti and C.Koch, "A saliency-based search mechanism for overt and covert shifts of visual attention," Vision Research, vol.40, no.10-12, pp.1489-1506, Jun. 2000.
J.Harel, C.Koch, and P.Perona, "Graph-based visual saliency," Proceedings of Neural Information Processing Systems, pp.545-552, 2006.
J.K. Tsotsos and N.D.B. Bruce, "Saliency based on information maximization," Advances in Neural Information Processing Systems 18, MIT Press, pp.155-162, MIT Press, 2006.
• L.Itti, C.Koch, and E.Niebur, "A model of saliency-based visual attention for rapid scene analysis," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.20, no.11, pp.1254-1259, Nov. 1998.
X.Hou and L.Zhang, "Saliency detection: A spectral residual approach," 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, Jun. 2007.
J.Li, M.D. Levine, X.An, X.Xu, and H.He, "Visual saliency based on scale-space analysis in the frequency domain," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, no.4, pp.996-1010, Apr. 2013.
A.Kimura, R.Yonetani, and T.Hirayama, "Computational models of human visual attention and their implementations: A survey," IEICE Transactions on Information and Systems, vol.E96-D, no.3, pp.562-578, Mar. 2013.
Y.Zhang, Y.Li, and S.Safavi-Naeini, "A spectrum-based saliency detection algorithm for millimeter wave insar imaging with sparse sensing," IEICE Transactions on Information and Systems, vol.E100-D, no.2, pp.388-391, Feb. 2017.
J.Jaemsiri, T.Titijaroonroj, J.Rungrattanaubol, "Modified scale-space analysis in frequency domain based on adaptive multiscale Gaussian filter for saliency detection," 2019 International Joint Conference on Computer Science and Software Engineering, pp.212-217, Jul. 2019.
T. Titijaroonroj, J. Jaemsiri and J. Rungrattanaubol, "Improved adaptive spectrum scale-space in frequency domain for saliency detection," 2020 12th International Conference on Knowledge and Smart Technology (KST), pp.97-102, 2020.
D. Kakanopas and K. Woraratpanya, "An efficient Hola filter for saliency detection," 2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE), pp. 338-343, 2020.
D. Kakanopas and K. Woraratpanya, "A unified framework for saliency segmentation," 2021 17th International Conference on Computing and Information Technology (IC2IT), pp. 191-200, 2021.
N.Otsu, "A threshold selection method from gray-level histograms," IEEE Transactions on Systems, Man, and Cybernetics, vol.9, no.1, pp.62-66, Jan. 1979.
C.Guo, Q.Ma, and L.Zhang, "Spatio-temporal saliency detection using phase spectrum of quaternion Fourier transform," 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, Jun. 2008.
A.M. Treisman and G.Gelade, "A feature-integration theory of attention," Cognitive Psychology, vol.12, no.1, pp.97-136, Jan. 1980.
K.Woraratpanya, P.Boonchukusol, Y.Kuroki, and Y.Kato, "Improved Thai text detection from natural scenes," 2013 International Conference on Information Technology and Electrical Engineering, pp.137-142, Oct. 2013.
R.C. Chang, "Intelligent text detection and extraction from natural scene images," 2011 15th North-East Asia Symposium on Nano, Information Technology and Reliability, pp.23--28, Oct. 2011.
A.Schick, M.Fischer, and R.Stiefelhagen, "Measuring and evaluating the compactness of superpixels," Proceedings of the 21st International Conference on Pattern Recognition, pp.930-934, Nov. 2012.
K.Woraratpanya, K.Pasupa, U.Suttapakti, P.Boonchukusol, T.Titijaroonroj, R.Hokking, Y.Kuroki, and Y.Kato, "Text-background decomposition for Thai text localization and recognition in natural scenes," 2014 6th International Conference on Information Technology and Electrical Engineering, pp.1-6, Oct. 2014.