In image processing, a Gabor filter, named after Dennis Gabor, who first proposed it as a 1D filter.[1]
The Gabor filter was first generalized to 2D by Gösta Granlund,[2] by adding a reference direction.
The Gabor filter is a linear filter used for texture analysis, which essentially means that it analyzes whether there is any specific frequency content in the image in specific directions in a localized region around the point or region of analysis. Frequency and orientation representations of Gabor filters are claimed by many contemporary vision scientists to be similar to those of the human visual system.[3] They have been found to be particularly appropriate for texture representation and discrimination. In the spatial domain, a 2D Gabor filter is a Gaussiankernel function modulated by a sinusoidalplane wave (see Gabor transform).
In this equation, represents the wavelength of the sinusoidal factor, represents the orientation of the normal to the parallel stripes of a Gabor function, is the phase offset, is the sigma/standard deviation of the Gaussian envelope and is the spatial aspect ratio, and specifies the ellipticity of the support of the Gabor function.
Wavelet spaceedit
Gabor filters are directly related to Gabor wavelets, since they can be designed for a number of dilations and rotations. However, in general, expansion is not applied for Gabor wavelets, since this requires computation of bi-orthogonal wavelets, which may be very time-consuming. Therefore, usually, a filter bank consisting of Gabor filters with various scales and rotations is created. The filters are convolved with the signal, resulting in a so-called Gabor space. This process is closely related to processes in the primary visual cortex.[8]
Jones and Palmer showed that the real part of the complex Gabor function is a good fit to the receptive field weight functions found in simple cells in a cat's striate cortex.[9]
Time-causal analogue of the Gabor filteredit
When processing temporal signals, data from the future cannot be accessed, which leads to problems if attempting to use Gabor functions for processing real-time signals that depend on the temporal dimension. A time-causal analogue of the Gabor filter has been developed in [10] based on replacing the Gaussian kernel in the Gabor function with a time-causal and time-recursive kernel referred to as the time-causal limit kernel. In this way, time-frequency analysis based on the resulting complex-valued extension of the time-causal limit kernel makes it possible to capture essentially similar transformations of a temporal signal as the Gabor filter can, and as can be described by the Heisenberg group, see [10] for further details.
Extraction of features from imagesedit
A set of Gabor filters with different frequencies and orientations may be helpful for extracting useful features from an image.[11] In the discrete domain, two-dimensional Gabor filters are given by,
where B and C are normalizing factors to be determined.
2D Gabor filters have rich applications in image processing, especially in feature extraction for texture analysis and segmentation.[12] defines the frequency being looked for in the texture. By varying , we can look for texture oriented in a particular direction. By varying , we change the support of the basis or the size of the image region being analyzed.
Applications of 2D Gabor filters in image processingedit
In document image processing, Gabor features are ideal for identifying the script of a word in a multilingual document.[13] Gabor filters with different frequencies and with orientations in different directions have been used to localize and extract text-only regions from complex document images (both gray and colour), since text is rich in high frequency components, whereas pictures are relatively smooth in nature.[14][15][16] It has also been applied for facial expression recognition [17]
Gabor filters have also been widely used in pattern analysis applications. For example, it has been used to study the directionality distribution inside the porous spongy trabecularbone in the spine.[18] The Gabor space is very useful in image processing applications such as optical character recognition, iris recognition and fingerprint recognition. Relations between activations for a specific spatial location are very distinctive between objects in an image. Furthermore, important activations can be extracted from the Gabor space in order to create a sparse object representation.
importnumpyasnpdefgabor(sigma,theta,Lambda,psi,gamma):"""Gabor feature extraction."""sigma_x=sigmasigma_y=float(sigma)/gamma# Bounding boxnstds=3# Number of standard deviation sigmaxmax=max(abs(nstds*sigma_x*np.cos(theta)),abs(nstds*sigma_y*np.sin(theta)))xmax=np.ceil(max(1,xmax))ymax=max(abs(nstds*sigma_x*np.sin(theta)),abs(nstds*sigma_y*np.cos(theta)))ymax=np.ceil(max(1,ymax))xmin=-xmaxymin=-ymax(y,x)=np.meshgrid(np.arange(ymin,ymax+1),np.arange(xmin,xmax+1))# Rotationx_theta=x*np.cos(theta)+y*np.sin(theta)y_theta=-x*np.sin(theta)+y*np.cos(theta)gb=np.exp(-0.5*(x_theta**2/sigma_x**2+y_theta**2/sigma_y**2))*np.cos(2*np.pi/Lambda*x_theta+psi)returngb
For an implementation on images, see [1].
MATLABedit
This is an example implementation in MATLAB/Octave:
^
Gabor, D. (1946). "Theory of communication". J. Inst. Electr. Eng. 93.
^
Granlund G. H. (1978). "In Search of a General Picture Processing Operator". Computer Graphics and Image Processing. 8 (2): 155–173. doi:10.1016/0146-664X(78)90047-3. ISSN 0146-664X.
^Olshausen, B. A. & Field, D. J. (1996). "Emergence of simple-cell receptive-field properties by learning a sparse code for natural images". Nature. 381 (6583): 607–609. Bibcode:1996Natur.381..607O. doi:10.1038/381607a0. PMID 8637596. S2CID 4358477.{{cite journal}}: CS1 maint: multiple names: authors list (link)
^Marčelja, S. (1980). "Mathematical description of the responses of simple cortical cells". Journal of the Optical Society of America. 70 (11): 1297–1300. Bibcode:1980JOSA...70.1297M. doi:10.1364/JOSA.70.001297. PMID 7463179.
^Daugman, John G. (1985-07-01). "Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters". Journal of the Optical Society of America A. 2 (7): 1160–9. Bibcode:1985JOSAA...2.1160D. CiteSeerX10.1.1.465.8506. doi:10.1364/JOSAA.2.001160. ISSN 1084-7529. PMID 4020513. S2CID 9271650.
^Jones, J.P.; Palmer, L.A. (1987). "An evaluation of the two-dimensional gabor filter model of simple receptive fields in cat striate cortex" (PDF). J. Neurophysiol. 58 (6): 1233–1258. doi:10.1152/jn.1987.58.6.1233. PMID 3437332. S2CID 16809045. Archived from the original (PDF) on 2020-02-28.
^ ab Lindeberg, T. (23 January 2023). "A time-causal and time-recursive scale-covariant scale-space representation of temporal signals and past time". Biological Cybernetics. 117 (1–2): 21–59. doi:10.1007/s00422-022-00953-6. PMC10160219. PMID 36689001.
^Haghighat, M.; Zonouz, S.; Abdel-Mottaleb, M. (2013). "Identification Using Encrypted Biometrics". Computer Analysis of Images and Patterns. Lecture Notes in Computer Science. Vol. 8048. pp. 440–448. doi:10.1007/978-3-642-40246-3_55. ISBN 978-3-642-40245-6.
^Ramakrishnan, A.G.; Kumar Raja, S.; Raghu Ram, H.V. (2002). "Neural network-based segmentation of textures using Gabor features" (PDF). Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing. Martigny, Switzerland: IEEE. pp. 365–374. doi:10.1109/NNSP.2002.1030048. ISBN 978-0-7803-7616-8. OCLC 812617471. S2CID 10994982.
^Raju S, S.; Pati, P.B.; Ramakrishnan, A.G. (2004). "Gabor filter based block energy analysis for text extraction from digital document images" (PDF). First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings. Palo Alto, CA, USA: IEEE. pp. 233–243. doi:10.1109/DIAL.2004.1263252. ISBN 978-0-7695-2088-9. LCCN 2003116308. OL 8067708M. S2CID 21856192.
^Raju, S. Sabari; Pati, P. B.; Ramakrishnan, A. G. (2005). "Text Localization and Extraction from Complex Color Images". Advances in Visual Computing. Lecture Notes in Computer Science. Vol. 3804. pp. 486–493. doi:10.1007/11595755_59. ISBN 978-3-540-30750-1. ISSN 0302-9743. LCCN 2005936803. OL 9056158M.
^S Sabari Raju, P B Pati and A G Ramakrishnan, “Text Localization and Extraction from Complex Color Images,” Proc. First International Conference on Advances in Visual Computing (ISVC05), Nevada, USA, LNCS 3804, Springer Verlag, Dec. 5-7, 2005, pp. 486-493.
^Lyons, M.; Akamatsu, S.; Kamachi, M.; Gyoba, J. (1998). "Coding facial expressions with Gabor wavelets". Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition. pp. 200–205. doi:10.1109/AFGR.1998.670949. ISBN 0-8186-8344-9. OL 11390549M. S2CID 1586662.
^Gdyczynski, C.M.; Manbachi, A.; et al. (2014). "On estimating the directionality distribution in pedicle trabecular bone from micro-CT images". Physiological Measurement. 35 (12): 2415–2428. Bibcode:2014PhyM...35.2415G. doi:10.1088/0967-3334/35/12/2415. PMID 25391037. S2CID 206078730.
External linksedit
MATLAB code for Gabor filters and Gabor feature extraction
3D Gabor demonstrated with Mathematica
python implementation of log-Gabors for still images
Gabor filter for image processing and computer vision (demonstration) Archived 2018-05-29 at the Wayback Machine
Further readingedit
Feichtinger, Hans G.; Strohmer, Thomas, eds. (1998). Gabor analysis and algorithms : theory and applications. Boston: Birkhäuser. ISBN 0-8176-3959-4. LCCN 97032252. OCLC 37761814. OL 685385M.
Gröchenig, Karlheinz (2001). Foundations of time-frequency analysis : with 15 figures. Applied and Numerical Harmonic Analysis. Boston: Birkhäuser. doi:10.1007/978-1-4612-0003-1. ISBN 0-8176-4022-3. LCCN 00044508. OCLC 44420790. OL 8074618M.
Daugman, J.G. (1988). "Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression" (PDF). IEEE Transactions on Acoustics, Speech, and Signal Processing. 36 (7): 1169–1179. CiteSeerX10.1.1.371.5847. doi:10.1109/29.1644. ISSN 0096-3518.
"Online Gabor filter demo". Archived from the original on 2009-06-15. Retrieved 2009-05-25.
Movellan, Javier R. "Tutorial on Gabor Filters" (PDF). Archived from the original (PDF) on 2009-04-19. Retrieved 2008-05-14.
Lagae, Ares; Lefebvre, Sylvain; Drettakis, George; Dutré, Philip (2009). "Procedural Noise using Sparse Gabor Convolution". ACM Transactions on Graphics. 28 (3): 1. CiteSeerX10.1.1.232.5566. doi:10.1145/1531326.1531360. Retrieved 2009-09-12.
Steerable Pyramids:
Eero Simoncelli's page on Steerable Pyramids
Manduchi, R.; Perona, P.; Shy, D. (April 1998). "Efficient deformable filter banks" (PDF). IEEE Transactions on Signal Processing. 46 (4): 1168–1173. Bibcode:1998ITSP...46.1168M. doi:10.1109/78.668570. ISSN 1053-587X. OCLC 926890247. (PDF Archived 2021-11-12 at the Wayback Machine) (Code Archived 2010-06-13 at the Wayback Machine)
Fischer, Sylvain; Šroubek, Filip; Perrinet, Laurent; Redondo, Rafael; Cristóbal, Gabriel (2007). "Self-Invertible 2D Log-Gabor Wavelets" (PDF). International Journal of Computer Vision. 75 (2): 231–246. CiteSeerX10.1.1.329.6283. doi:10.1007/s11263-006-0026-8. ISSN 0920-5691. S2CID 1452724.