Skip to main content

25.07.2019 | Special Issue Paper

Fréchet mean-based Grassmann discriminant analysis

verfasst von: Hongbin Yu, Kaijian Xia, Yizhang Jiang, Pengjiang Qian

Erschienen in: Multimedia Systems

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Representing image sets and videos with Grassmann manifold has become popular due to its powerful capability to extract discriminative information in machine learning research. However, existing techniques operations on Grassmann manifold are usually suffering from the problem of computational expensive, thus the application range of Grassmann manifold is limited. In this paper, we propose the Fréchet mean-based Grassmann discriminant analysis (FMGDA) algorithm to implement the videos (or image sets) data dimensionality reduction and clustering task. The data dimensionality reduction algorithm proposed by us can not only be used to reduce Grassmann data from high-dimensional data to a relative low-dimensional data, but also to maximize between-class distance and minimize within-class distance simultaneously. Fréchet mean is used to characterize the clustering center of Grassmann manifold space. We further show that the learning problem can be expressed as a trace ratio problem which can be efficiently solved. We designed a detailed experimental scheme to test the performance of our proposed algorithm, and the tests were assessed on several benchmark data sets. The experimental results indicate that our approach leads to a significant improvement over state-of-the-art methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Absil, P.A., Mahony, R., Sepulchre, R.: Riemannian geometry of grassmann manifolds with a view on algorithmic computation. Acta Appl. Math. 80(2), 199–220 (2004)MathSciNetCrossRef Absil, P.A., Mahony, R., Sepulchre, R.: Riemannian geometry of grassmann manifolds with a view on algorithmic computation. Acta Appl. Math. 80(2), 199–220 (2004)MathSciNetCrossRef
2.
3.
Zurück zum Zitat Centingu, H.E., Vidal, R.: Intrinsic mean shift for clustering on Stiefel and Grassmann manifolds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1896–1902. IEEE (2009) Centingu, H.E., Vidal, R.: Intrinsic mean shift for clustering on Stiefel and Grassmann manifolds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1896–1902. IEEE (2009)
4.
Zurück zum Zitat Chan, A.B., Vasconcelos, N.: Modeling, clustering, and segmenting video with mixtures of dynamic textures. IEEE Trans. Pattern Anal. Mach. Intell. 30(5), 909–926 (2008)CrossRef Chan, A.B., Vasconcelos, N.: Modeling, clustering, and segmenting video with mixtures of dynamic textures. IEEE Trans. Pattern Anal. Mach. Intell. 30(5), 909–926 (2008)CrossRef
5.
Zurück zum Zitat Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 886–893. IEEE (2005) Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 886–893. IEEE (2005)
6.
Zurück zum Zitat Edelman, A., Arias, T.A., Smith, S.T.: The geometry of algorithms with orthogonality constraints. SIAM J. Matrix Anal. Appl. 20(2), 303–353 (1998)MathSciNetCrossRef Edelman, A., Arias, T.A., Smith, S.T.: The geometry of algorithms with orthogonality constraints. SIAM J. Matrix Anal. Appl. 20(2), 303–353 (1998)MathSciNetCrossRef
7.
Zurück zum Zitat Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. Comput. Vis. Image Underst. 106(1), 59–70 (2007)CrossRef Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. Comput. Vis. Image Underst. 106(1), 59–70 (2007)CrossRef
8.
Zurück zum Zitat Fletcher, P.T., Venkatasubramanian, S., Joshi, S.: The geometric median on riemannian manifolds with application to robust atlas estimation. NeuroImage 45(1), S143–S152 (2009)CrossRef Fletcher, P.T., Venkatasubramanian, S., Joshi, S.: The geometric median on riemannian manifolds with application to robust atlas estimation. NeuroImage 45(1), S143–S152 (2009)CrossRef
9.
Zurück zum Zitat Hamm, J., Lee, D.D.: Grassmann discriminant analysis: a unifying view on subspace-based learning. In: Proceedings of the 25th International Conference on Machine Learning (ICML), pp. 376–383. ACM (2008) Hamm, J., Lee, D.D.: Grassmann discriminant analysis: a unifying view on subspace-based learning. In: Proceedings of the 25th International Conference on Machine Learning (ICML), pp. 376–383. ACM (2008)
10.
Zurück zum Zitat Hamm, J., Lee, D.D.: Extended grassmann kernels for subspace-based learning. In: Advances in neural information processing systems (NIPS), pp. 601–608. Curran Associates (2009) Hamm, J., Lee, D.D.: Extended grassmann kernels for subspace-based learning. In: Advances in neural information processing systems (NIPS), pp. 601–608. Curran Associates (2009)
11.
Zurück zum Zitat Harandi, M., Sanderson, C., Shen, C., Lovell, B.C.: Dictionary learning and sparse coding on grassmann manifolds: an extrinsic solution. In: Proceedings of the IEEE International Conference on Computer Vision (CVPR), pp. 3120–3127. IEEE (2013) Harandi, M., Sanderson, C., Shen, C., Lovell, B.C.: Dictionary learning and sparse coding on grassmann manifolds: an extrinsic solution. In: Proceedings of the IEEE International Conference on Computer Vision (CVPR), pp. 3120–3127. IEEE (2013)
12.
Zurück zum Zitat Harandi, M.T., Sanderson, C., Shirazi, S., Lovell, B.C.: Graph embedding discriminant analysis on grassmannian manifolds for improved image set matching. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2705–2712. IEEE (2011) Harandi, M.T., Sanderson, C., Shirazi, S., Lovell, B.C.: Graph embedding discriminant analysis on grassmannian manifolds for improved image set matching. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2705–2712. IEEE (2011)
13.
Zurück zum Zitat Huang, Z., Wang, R., Shan, S., Chen, X.: Projection metric learning on grassmann manifold with application to video based face recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 140–149. IEEE (2015) Huang, Z., Wang, R., Shan, S., Chen, X.: Projection metric learning on grassmann manifold with application to video based face recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 140–149. IEEE (2015)
14.
Zurück zum Zitat Karcher, H.: Riemannian center of mass and mollifier smoothing. Commun. Pure Appl. Math. 30(5), 509–541 (1977)MathSciNetCrossRef Karcher, H.: Riemannian center of mass and mollifier smoothing. Commun. Pure Appl. Math. 30(5), 509–541 (1977)MathSciNetCrossRef
15.
Zurück zum Zitat Kim, T.K., Kittler, J., Cipolla, R.: Discriminative learning and recognition of image set classes using canonical correlations. IEEE Trans. Pattern Anal. Mach. Intell. 29(6), 1005–1018 (2007)CrossRef Kim, T.K., Kittler, J., Cipolla, R.: Discriminative learning and recognition of image set classes using canonical correlations. IEEE Trans. Pattern Anal. Mach. Intell. 29(6), 1005–1018 (2007)CrossRef
17.
Zurück zum Zitat Liu, G., Lin, Z., Yan, S., Sun, J., Yu, Y., Ma, Y.: Robust recovery of subspace structures by low-rank representation. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 171–184 (2013)CrossRef Liu, G., Lin, Z., Yan, S., Sun, J., Yu, Y., Ma, Y.: Robust recovery of subspace structures by low-rank representation. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 171–184 (2013)CrossRef
18.
Zurück zum Zitat Marrinan, T., Ross Beveridge, J., Draper, B., Kirby, M., Peterson, C.: Finding the subspace mean or median to fit your need. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1082–1089 (2014) Marrinan, T., Ross Beveridge, J., Draper, B., Kirby, M., Peterson, C.: Finding the subspace mean or median to fit your need. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1082–1089 (2014)
19.
Zurück zum Zitat Ngo, T.T., Bellalij, M., Saad, Y.: The trace ratio optimization problem for dimensionality reduction. SIAM J. Matrix Anal. Appl. 31(5), 2950–2971 (2010)MathSciNetCrossRef Ngo, T.T., Bellalij, M., Saad, Y.: The trace ratio optimization problem for dimensionality reduction. SIAM J. Matrix Anal. Appl. 31(5), 2950–2971 (2010)MathSciNetCrossRef
20.
Zurück zum Zitat Nie, F., Xiang, S., Jia, Y., Zhang, C., Yan, S.: Trace ratio criterion for feature selection. AAAI 2, 671–676 (2008) Nie, F., Xiang, S., Jia, Y., Zhang, C., Yan, S.: Trace ratio criterion for feature selection. AAAI 2, 671–676 (2008)
21.
Zurück zum Zitat Nishiyama, M., Yamaguchi, O., Fukui, K.: Face recognition with the multiple constrained mutual subspace method. In: International Conference on Audio-and Video-Based Biometric Person Authentication, pp. 71–80. Springer, New York (2005) Nishiyama, M., Yamaguchi, O., Fukui, K.: Face recognition with the multiple constrained mutual subspace method. In: International Conference on Audio-and Video-Based Biometric Person Authentication, pp. 71–80. Springer, New York (2005)
22.
Zurück zum Zitat Rodriguez, M.D., Ahmed, J., Shah, M.: Action mach a spatio-temporal maximum average correlation height filter for action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8. IEEE (2008) Rodriguez, M.D., Ahmed, J., Shah, M.: Action mach a spatio-temporal maximum average correlation height filter for action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8. IEEE (2008)
23.
Zurück zum Zitat Soomro, K., Zamir, A.R.: Action recognition in realistic sports videos. In: Computer Vision in Sports, pp. 181–208. Springer, New York (2014)CrossRef Soomro, K., Zamir, A.R.: Action recognition in realistic sports videos. In: Computer Vision in Sports, pp. 181–208. Springer, New York (2014)CrossRef
24.
Zurück zum Zitat Srivastava, A., Klassen, E.: Bayesian and geometric subspace tracking. Adv. Appl. Probab. 36(1), 43–56 (2004)MathSciNetCrossRef Srivastava, A., Klassen, E.: Bayesian and geometric subspace tracking. Adv. Appl. Probab. 36(1), 43–56 (2004)MathSciNetCrossRef
25.
Zurück zum Zitat Wang, B., Hu, Y., Gao, J., Sun, Y., Chen, H., Ali, M., Yin, B.: Locality preserving projections for grassmann manifold. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 2893–2900. AAAI Press, San Francisco (2017) Wang, B., Hu, Y., Gao, J., Sun, Y., Chen, H., Ali, M., Yin, B.: Locality preserving projections for grassmann manifold. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 2893–2900. AAAI Press, San Francisco (2017)
26.
Zurück zum Zitat Wang, B., Hu, Y., Gao, J., Sun, Y., Yin, B.: Low rank representation on grassmann manifolds. In: Asian Conference on Computer Vision, pp. 81–96. Springer, New York (2014)CrossRef Wang, B., Hu, Y., Gao, J., Sun, Y., Yin, B.: Low rank representation on grassmann manifolds. In: Asian Conference on Computer Vision, pp. 81–96. Springer, New York (2014)CrossRef
27.
Zurück zum Zitat Wang, B., Hu, Y., Gao, J., Sun, Y., Yin, B.: Product grassmann manifold representation and its LRR models. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, pp. 2122–2129. AAAI Press (2016) Wang, B., Hu, Y., Gao, J., Sun, Y., Yin, B.: Product grassmann manifold representation and its LRR models. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, pp. 2122–2129. AAAI Press (2016)
28.
Zurück zum Zitat Wang, B., Hu, Y., Gao, J., Sun, Y., Yin, B.: Laplacian LRR on product grassmann manifolds for human activity clustering in multicamera video surveillance. IEEE Trans. Circ. Syst. Video Technol. 27(3), 554–566 (2017)CrossRef Wang, B., Hu, Y., Gao, J., Sun, Y., Yin, B.: Laplacian LRR on product grassmann manifolds for human activity clustering in multicamera video surveillance. IEEE Trans. Circ. Syst. Video Technol. 27(3), 554–566 (2017)CrossRef
29.
Zurück zum Zitat Wang, H., Yan, S., Xu, D., Tang, X., Huang, T.: Trace ratio vs. ratio trace for dimensionality reduction. In: IEEE Conference on Computer Vision and Pattern Recognition, IEEE, pp. 1–8 (2007) Wang, H., Yan, S., Xu, D., Tang, X., Huang, T.: Trace ratio vs. ratio trace for dimensionality reduction. In: IEEE Conference on Computer Vision and Pattern Recognition, IEEE, pp. 1–8 (2007)
30.
Zurück zum Zitat Wang, Y., Mori, G.: Human action recognition by semilatent topic models. IEEE Trans. Pattern Anal. Mach. Intell. 31(10), 1762–1774 (2009)CrossRef Wang, Y., Mori, G.: Human action recognition by semilatent topic models. IEEE Trans. Pattern Anal. Mach. Intell. 31(10), 1762–1774 (2009)CrossRef
31.
Metadaten
Titel
Fréchet mean-based Grassmann discriminant analysis
verfasst von
Hongbin Yu
Kaijian Xia
Yizhang Jiang
Pengjiang Qian
Publikationsdatum
25.07.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Multimedia Systems
Print ISSN: 0942-4962
Elektronische ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-019-00629-5