Detection of Bird Species Found in Bhutan Using Vision Transformer-based Transfer Learning

Karma Wangchuk; Tandin Wangchuk; Tsheten Dorji

doi:10.17102/zmv8.i1.008

Authors

Karma Wangchuk, Information Technology Department, College of Science and Technology, Royal University of Bhutan
Tandin Wangchuk, Information Technology Department, College of Science and Technology, Royal University of Bhutan
Tsheten Dorji, Information Technology Department, College of Science and Technology, Royal University of Bhutan

DOI:

https://doi.org/10.17102/zmv8.i1.008

Keywords:

Vision Transformer, Bhutanese bird recognition, Transfer learning, Fine-tuning, Deep learning

Abstract

Birdwatching is an emerging recreational activity in Bhutan, attracting both local enthusiasts and international tourists due to the country's rich avian biodiversity. This growing interest contributes to local tourism and economic development. However, accurate bird identification remains a challenge due to variations in size, shape, and coloration, compounded by inconsistencies in English and Dzongkha nomenclature. Traditional identification methods, which rely on field guides and expert observations, are often prone to errors and disagreements. To address this limitation, we developed a bird detection and recognition system using image processing and machine learning techniques. Bird images were collected from birdwatchers in Paro, Thimphu, and Trongsa, as well as from a Kaggle dataset. These images were preprocessed and augmented to construct the dataset. The study covered 23 bird species. The model was obtained by fine-tuning Google’s pre-trained Vision Transformer encoder for image recognition, which operates at a resolution of 244×244 with 16×16 patches. Fine-tuning on a dataset of 3,595 images reduced the training loss from 2.8491 to 0.0030 and the validation loss from 1.2231 to 0.0529. These results indicate that the proposed approach is effective for bird species identification and offers a useful tool for birdwatchers and conservation efforts in Bhutan.
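
The patch arithmetic behind the encoder described in the abstract can be illustrated with a short sketch. It is not the authors' code: the function name `vit_token_count` is hypothetical, and it assumes a square input, non-overlapping patches, and one extra class token, as in a standard Vision Transformer. The abstract gives the input size as 244×244; the sketch also evaluates 224×224, which is an assumption based on the input size commonly used by published 16×16-patch checkpoints, because 244 is not an exact multiple of 16.

```python
from __future__ import annotations


def vit_token_count(image_size: int, patch_size: int = 16,
                    class_token: bool = True) -> tuple[int, int]:
    """Return (sequence_length, leftover_pixels_per_side) for a square image.

    Assumes non-overlapping patches taken with a stride equal to the patch
    size, so pixels beyond the last full patch on each side are not covered.
    """
    if image_size < patch_size:
        raise ValueError("image_size must be at least patch_size")
    patches_per_side = image_size // patch_size
    leftover = image_size - patches_per_side * patch_size
    sequence_length = patches_per_side ** 2 + (1 if class_token else 0)
    return sequence_length, leftover


if __name__ == "__main__":
    # 224 is an assumed value; 244 is the size stated in the abstract.
    for size in (224, 244):
        tokens, leftover = vit_token_count(size)
        print(f"{size}x{size}: {tokens} tokens, {leftover} px uncovered per side")
```

Under these assumptions, a 224×224 image yields 14×14 = 196 patches plus the class token, while a 244×244 image leaves 4 pixels per side outside the last full patch.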
