[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3677525.3678681acmconferencesArticle/Chapter ViewAbstractPublication PagesgooditConference Proceedingsconference-collections
research-article
Open access

Bird Action Recognition in Wetlands using Deep Learning

Published: 04 September 2024 Publication History

Abstract

The current decline in bird species and protected natural areas highlights the importance of providing solutions to improve the understanding of bird biodiversity and its interaction with its environment. This study focuses on the development and application of deep learning architectures for bird action recognition in protected natural areas, specifically in wetlands. Different state-of-the-art deep learning architectures for image and video analysis are explored and applied to compare the suitability of each of the architectures for the scope of the study. An experiment with frame-level and video-level methods was conducted to observe the impact of temporal aggregation on the bird action recognition task. From the experiments, the suitability of video-level approaches, where bird motion is considered for prediction, can be observed. Furthermore, a novel dataset dedicated to the bird action recognition task is presented to train the models of this study.

References

[1]
Mark Anderson, John Kennedy, and Naomi Harte. 2021. Low resource species agnostic bird activity detection. In 2021 IEEE Workshop on Signal Processing Systems (SiPS). IEEE, 34–39.
[2]
Reza Arablouei, Liang Wang, Lachlan Currie, Jodan Yates, Flavio A.P. Alvarenga, and Greg J. Bishop-Hurley. 2023. Animal behavior classification via deep learning on embedded systems. Computers and Electronics in Agriculture 207 (2023), 107707. https://doi.org/10.1016/j.compag.2023.107707
[3]
Reza Arablouei, Ziwei Wang, Greg J. Bishop-Hurley, and Jiajun Liu. 2023. Multimodal sensor data fusion for in-situ classification of animal behavior using accelerometry and GNSS data. Smart Agricultural Technology 4 (2023), 100163. https://doi.org/10.1016/j.atech.2022.100163
[4]
Adrian Berenguer-Agullo, Javier Rodriguez-Juan, David Ortiz-Perez, and Jose Garcia-Rodriguez. 2024. Lightweight CNNs for Advanced Bird Species Recognition on the Edge. In International Work-Conference on the Interplay Between Natural and Artificial Computation. Springer, 95–104.
[5]
Alexey Bochkovskiy, Chien-Yao Wang, and Hong-Yuan Mark Liao. 2020. YOLOv4: Optimal Speed and Accuracy of Object Detection. arxiv:2004.10934 [cs.CV] https://arxiv.org/abs/2004.10934
[6]
Chen Chen, Weixing Zhu, and Tomas Norton. 2021. Behaviour recognition of pigs and cattle: Journey from computer vision to deep learning. Computers and Electronics in Agriculture 187 (2021), 106255. https://doi.org/10.1016/j.compag.2021.106255
[7]
Jenna V. Congdon, Mina Hosseini, Ezekiel F. Gading, Mahdi Masousi, Maria Franke, and Suzanne E. MacDonald. 2022. The Future of Artificial Intelligence in Monitoring Animal Identification, Health, and Behaviour. Animals 12, 13 (2022). https://doi.org/10.3390/ani12131711
[8]
Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, and Mubarak Shah. 2023. Diffusion Models in Vision: A Survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 9 (2023), 10850–10869. https://doi.org/10.1109/TPAMI.2023.3261988
[9]
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition. 248–255. https://doi.org/10.1109/CVPR.2009.5206848
[10]
Haoqi Fan, Bo Xiong, Karttikeya Mangalam, Yanghao Li, Zhicheng Yan, Jitendra Malik, and Christoph Feichtenhofer. 2021. Multiscale Vision Transformers. arxiv:2104.11227
[11]
Sara Fraixedas, Andreas Lindén, Markus Piha, Mar Cabeza, Richard Gregory, and Aleksi Lehikoinen. 2020. A state-of-the-art review on birds as indicators of biodiversity: Advances, challenges, and future directions. Ecological Indicators 118 (2020), 106728. https://doi.org/10.1016/j.ecolind.2020.106728
[12]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13]
Runang He, Xiaomin Wang, Huazhen Chen, and Chang Liu. 2023. VHR-BirdPose: Vision Transformer-Based HRNet for Bird Pose Estimation with Attention Mechanism. Electronics 12, 17 (2023), 3643.
[14]
Andrew Howard, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang, Yukun Zhu, Ruoming Pang, Vijay Vasudevan, Quoc V. Le, and Hartwig Adam. 2019. Searching for MobileNetV3. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV).
[15]
Kevin Hyekang Joo, Shiyuan Duan, Shawna L Weimer, and Mohammad Nayeem Teli. 2022. Birds’ Eye View: Measuring Behavior and Posture of Chickens as a Metric for Their Well-Being. arXiv preprint arXiv:2205.00069 (2022).
[16]
Will Kay, Joao Carreira, Karen Simonyan, Brian Zhang, Chloe Hillier, Sudheendra Vijayanarasimhan, Fabio Viola, Tim Green, Trevor Back, Paul Natsev, Mustafa Suleyman, and Andrew Zisserman. 2017. The Kinetics Human Action Video Dataset. arxiv:1705.06950https://arxiv.org/abs/1705.06950
[17]
Yu Kong and Yun Fu. 2022. Human action recognition and prediction: A survey. International Journal of Computer Vision 130, 5 (2022), 1366–1401.
[18]
Chih-Wei Lin, Zhongsheng Chen, and Mengxiang Lin. 2022. Video-based bird posture recognition using dual feature-rates deep fusion convolutional neural network. Ecological Indicators 141 (2022), 109141.
[19]
Ze Liu, Jia Ning, Yue Cao, Yixuan Wei, Zheng Zhang, Stephen Lin, and Han Hu. 2021. Video Swin Transformer. arxiv:2106.13230https://arxiv.org/abs/2106.13230
[20]
C Meijer, S Verhoeven, E Ranguelova, CJ Camphuysen, J Shamoun-Baranes, and W Bouten. 2016. Visually supported annotation of bird tracking data and subsequent classification of behaviour using machine learning.
[21]
Michael Perez and Corey Toler-Franklin. 2023. CNN-Based Action Recognition and Pose Estimation for Classifying Animal Behavior from Videos: A Survey. arxiv:2301.06187https://arxiv.org/abs/2301.06187
[22]
Frank Schindler and Volker Steinhage. 2021. Identification of animals and recognition of their actions in wildlife videos using deep learning techniques. Ecological Informatics 61 (2021), 101215. https://doi.org/10.1016/j.ecoinf.2021.101215
[23]
Mohammad Shahbazi, Kamyar Mohammadi, Sayed M. Derakhshani, and Peter W. G. Groot Koerkamp. 2023. Deep Learning for Laying Hen Activity Recognition Using Wearable Sensors. Agriculture 13, 3 (2023). https://doi.org/10.3390/agriculture13030738
[24]
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going Deeper With Convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[25]
Mingxing Tan 2019. MnasNet: Platform-Aware Neural Architecture Search for Mobile. arXiv:1807.11626 (2019).
[26]
Mingxing Tan and Quoc Le. 2019. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In Proceedings of the 36th International Conference on Machine Learning(Proceedings of Machine Learning Research, Vol. 97), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, 6105–6114. https://proceedings.mlr.press/v97/tan19a.html
[27]
Mingxing Tan and Quoc Le. 2021. EfficientNetV2: Smaller Models and Faster Training. In Proceedings of the 38th International Conference on Machine Learning(Proceedings of Machine Learning Research, Vol. 139), Marina Meila and Tong Zhang (Eds.). PMLR, 10096–10106. https://proceedings.mlr.press/v139/tan21a.html
[28]
Hansen Thambi Prem. 2020. What are the biggest threats to wildlife and why?https://www.worldanimalprotection.org.in/latest/blogs/what-are-biggest-threats-wildlife-and-why
[29]
Du Tran, Heng Wang, Lorenzo Torresani, Jamie Ray, Yann LeCun, and Manohar Paluri. 2018. A Closer Look at Spatiotemporal Convolutions for Action Recognition. arxiv:1711.11248https://arxiv.org/abs/1711.11248
[30]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2023. Attention Is All You Need. arxiv:1706.03762https://arxiv.org/abs/1706.03762
[31]
Jingdong Wang, Ke Sun, Tianheng Cheng, Borui Jiang, Chaorui Deng, Yang Zhao, Dong Liu, Yadong Mu, Mingkui Tan, Xinggang Wang, Wenyu Liu, and Bin Xiao. 2020. Deep High-Resolution Representation Learning for Visual Recognition. arxiv:1908.07919https://arxiv.org/abs/1908.07919
[32]
Zuxuan Wu, Yu-Gang Jiang, Xi Wang, Hao Ye, and Xiangyang Xue. 2016. Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification. In Proceedings of the 24th ACM International Conference on Multimedia (Amsterdam, The Netherlands) (MM ’16). Association for Computing Machinery, New York, NY, USA, 791–800. https://doi.org/10.1145/2964284.2964328
[33]
Saining Xie, Chen Sun, Jonathan Huang, Zhuowen Tu, and Kevin Murphy. 2018. Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification. arxiv:1712.04851https://arxiv.org/abs/1712.04851

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
GoodIT '24: Proceedings of the 2024 International Conference on Information Technology for Social Good
September 2024
481 pages
ISBN:9798400710940
DOI:10.1145/3677525
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 September 2024

Check for updates

Author Tags

  1. Action Recognition
  2. Birds Analysis
  3. Convolutional Neural Networks
  4. Ecological Conservation
  5. Transformers

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • MCIN/AEI/10.13039/- 501100011033
  • HORIZON MSCA- 2021-SE-0

Conference

GoodIT '24
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 220
    Total Downloads
  • Downloads (Last 12 months)220
  • Downloads (Last 6 weeks)47
Reflects downloads up to 09 Mar 2025

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media