Skip to main content

2023 | OriginalPaper | Buchkapitel

Comparative Study on Transfer Learning for Object Classification and Detection

verfasst von : Jungme Park, Wenchang Yu, Pawan Aryal, Viktor Ciroski

Erschienen in: AI-enabled Technologies for Autonomous and Connected Vehicles

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The recent development of deep neural learning achieved remarkable breakthroughs in object classification and detection. Deep learning has the capability of learning features automatically from data using general-purpose learning procedures. However, because deep neural networks require large amounts of data to train the parameters in the network, it is challenging to develop any object classification or detection system with a relatively small dataset. Transfer learning is an important machine learning technique that transfers the learned features in a pre-trained Convolution Neural Network (CNN) model into a new system. In this study, current state-of-the-art CNN models are reviewed in their architectures and characteristics. For the comparative study of transfer learning, the object classification and the detection systems are implemented using transfer learning with six state-of-the-art CNN models. The object classification model has achieved an accuracy of 97.01% for the three-class classification task using transfer learning. Furthermore, six different Faster R-CNN architectures are implemented for object detection. The performances of the different transferred models are compared in terms of the accuracy and the deploying speed of the new model. Experiments show that transfer learning saves training time and achieves accurate performance by fine-tuning the pre-existing deep learning model.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
3.
Zurück zum Zitat Viola P, Jones M (2004) Robust real-time face detection. Int J Comput Vis 57–2:137–154CrossRef Viola P, Jones M (2004) Robust real-time face detection. Int J Comput Vis 57–2:137–154CrossRef
4.
Zurück zum Zitat Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Computer vision and pattern recognition. IEEE computer society conference, vol 1, pp 886–893 Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Computer vision and pattern recognition. IEEE computer society conference, vol 1, pp 886–893
9.
Zurück zum Zitat Dollar P, Wojek C, Schiele B, Perona P (2009) Pedestrian detection: a benchmark. In: IEEE conference on computer vision and pattern recognition, pp 304–311 Dollar P, Wojek C, Schiele B, Perona P (2009) Pedestrian detection: a benchmark. In: IEEE conference on computer vision and pattern recognition, pp 304–311
11.
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton G (2012) ImageNet classification with deep convolutional neural networks. In: Proceedings of the 25th international conference on neural information processing systems, vol 1, 1097–1105 Krizhevsky A, Sutskever I, Hinton G (2012) ImageNet classification with deep convolutional neural networks. In: Proceedings of the 25th international conference on neural information processing systems, vol 1, 1097–1105
13.
Zurück zum Zitat Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2014) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9 Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2014) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9
15.
Zurück zum Zitat He K, Zhang S, Ren S, Sun J (2016) Deep residual learning for image recognition. Paper presented at the IEEE conference on computer vision and pattern recognition He K, Zhang S, Ren S, Sun J (2016) Deep residual learning for image recognition. Paper presented at the IEEE conference on computer vision and pattern recognition
16.
Zurück zum Zitat Girshick R, Donahue J, Darrel T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587 Girshick R, Donahue J, Darrel T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
17.
Zurück zum Zitat Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. Adv Neural Inf Process Syst 28:91–99 Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. Adv Neural Inf Process Syst 28:91–99
18.
Zurück zum Zitat Lin T, Dollar P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125 Lin T, Dollar P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125
19.
Zurück zum Zitat Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788 Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788
21.
Zurück zum Zitat Lin T, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988 Lin T, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988
23.
Zurück zum Zitat Kulkarni R, Dhavalikar S, Bangar S (2018) Traffic light detection and recognition for self-driving cars using deep learning. In: Fourth international conference on computing communication control and automation, pp 1–4 Kulkarni R, Dhavalikar S, Bangar S (2018) Traffic light detection and recognition for self-driving cars using deep learning. In: Fourth international conference on computing communication control and automation, pp 1–4
24.
Zurück zum Zitat Thrun S, Pratt L (1998) Learning to learn. Kluwer Academic Publishers, NorwellCrossRef Thrun S, Pratt L (1998) Learning to learn. Kluwer Academic Publishers, NorwellCrossRef
25.
Zurück zum Zitat Pan S, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22:1345–1359 Pan S, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22:1345–1359
26.
Zurück zum Zitat Mengying S (2019) Deep learning for image classification on very small datasets using transfer learning. Creative Compon 345 Mengying S (2019) Deep learning for image classification on very small datasets using transfer learning. Creative Compon 345
27.
Zurück zum Zitat Tan C, Sun F, Kong T, Zhang W, Yang C, Liu C (2018) a survey on deep transfer learning. In: 27th international conference on artificial neural networks Tan C, Sun F, Kong T, Zhang W, Yang C, Liu C (2018) a survey on deep transfer learning. In: 27th international conference on artificial neural networks
29.
Zurück zum Zitat Kim J, Park C (2017) End-To-end ego lane estimation based on sequential transfer learning for self-driving cars. Paper presented at the IEEE conference on computer vision and pattern recognition workshops Kim J, Park C (2017) End-To-end ego lane estimation based on sequential transfer learning for self-driving cars. Paper presented at the IEEE conference on computer vision and pattern recognition workshops
30.
Zurück zum Zitat Augiar A, Santos F, Sousa A, Oliveira P, Santos L (2020) Visual trunk detection using transfer learning and a depp learning-based coprocessor. IEEE Access 8:77308–77320CrossRef Augiar A, Santos F, Sousa A, Oliveira P, Santos L (2020) Visual trunk detection using transfer learning and a depp learning-based coprocessor. IEEE Access 8:77308–77320CrossRef
31.
Zurück zum Zitat Girshick R (2015) Fast R-CNN. In: IEEE international conference on computer vision (ICCV), pp 1440–1448 Girshick R (2015) Fast R-CNN. In: IEEE international conference on computer vision (ICCV), pp 1440–1448
35.
Zurück zum Zitat Tian Z, Shen C, Chen H, He T (2019) FCOS: fully convolutional one-stage object detection. In: Proceedings of the IEEE ICCV, pp 9627–9636 Tian Z, Shen C, Chen H, He T (2019) FCOS: fully convolutional one-stage object detection. In: Proceedings of the IEEE ICCV, pp 9627–9636
37.
Zurück zum Zitat Law H, Deng J (2018) Cornernet: detecting objects as paired keypoints. In: Proceedings of the ECCV, pp 734–750 Law H, Deng J (2018) Cornernet: detecting objects as paired keypoints. In: Proceedings of the ECCV, pp 734–750
41.
Zurück zum Zitat Krause J, Stark M, Deng J, Li F (2013) 3D object representations for fine-grained categorization. In: 4th IEEE workshop on 3D representation and recognition. In: ICCV 2013, Sydney, Australia, 8 Dec Krause J, Stark M, Deng J, Li F (2013) 3D object representations for fine-grained categorization. In: 4th IEEE workshop on 3D representation and recognition. In: ICCV 2013, Sydney, Australia, 8 Dec
42.
Zurück zum Zitat Wang L, Shi J, Song G, Shen I (2007) Object detection combining recognition and segmentation. In: ACCV Wang L, Shi J, Song G, Shen I (2007) Object detection combining recognition and segmentation. In: ACCV
43.
Zurück zum Zitat Dalal N (2006) Finding people in images and videos. Dissertation, Institute National Polytechnique de Grenoble-INPG Dalal N (2006) Finding people in images and videos. Dissertation, Institute National Polytechnique de Grenoble-INPG
44.
Zurück zum Zitat Li X, Flohr F, Yang Y, Xiong H, Braun M, Pan S, Li K, Gavrila D (2016) A new benchmark for vision-based cyclist detection. Paper presented at the proceedings of the IEEE intelligent vehicles symposium (IV), Gothenburg, Sweden Li X, Flohr F, Yang Y, Xiong H, Braun M, Pan S, Li K, Gavrila D (2016) A new benchmark for vision-based cyclist detection. Paper presented at the proceedings of the IEEE intelligent vehicles symposium (IV), Gothenburg, Sweden
Metadaten
Titel
Comparative Study on Transfer Learning for Object Classification and Detection
verfasst von
Jungme Park
Wenchang Yu
Pawan Aryal
Viktor Ciroski
Copyright-Jahr
2023
DOI
https://doi.org/10.1007/978-3-031-06780-8_5

    Premium Partner