Feng Chen
Adapting Vision Foundation Models for Plant Phenotyping
Chen, Feng; Giuffrida, Mario Valerio; Tsaftaris, Sotirios A.
Authors
Dr VALERIO GIUFFRIDA VALERIO.GIUFFRIDA@NOTTINGHAM.AC.UK
Assistant Professor in Computer Vision
Sotirios A. Tsaftaris
Abstract
Foundation models are large models pre-trained on tremendous amount of data. They can be typically adapted to diverse downstream tasks with minimal effort. However, as foundation models are usually pre-trained on images or texts sourced from the Internet, their performance in specialized domains, such as plant phenotyping, comes into question. In addition, fully fine-tuning foundation models is time-consuming and requires high computational power. This paper investigates the efficient adaptation of foundation models for plant phenotyping settings and tasks. We perform extensive experiments on fine-tuning three foundation models, MAE, DINO, and DINOv2 on three essential plant phenotyping tasks: leaf counting, instance segmentation, and disease classification. In particular, the pre-trained backbones are kept frozen, while two distinct fine-tuning methods are evaluated, namely adapter tuning (using LoRA) and decoder tuning. The experimental results show that a foundation model can be efficiently adapted to multiple plant phenotyping tasks, yielding similar performance as the state-of-the-art (SoTA) models specifically designed or trained for each task. Despite exhibiting great transferability over different tasks, the fine-tuned foundation models perform slightly worse than the SoTA task-specific models in some scenarios, which requires further investigation.
Citation
Chen, F., Giuffrida, M. V., & Tsaftaris, S. A. (2023, October). Adapting Vision Foundation Models for Plant Phenotyping. Presented at International Conference on Computer Vision (ICCV) Workshops, Paris, France
Presentation Conference Type | Edited Proceedings |
---|---|
Conference Name | International Conference on Computer Vision (ICCV) Workshops |
Start Date | Oct 2, 2023 |
End Date | Oct 6, 2023 |
Acceptance Date | Aug 15, 2023 |
Online Publication Date | Dec 25, 2023 |
Publication Date | Oct 1, 2023 |
Deposit Date | Mar 10, 2025 |
Publicly Available Date | Mar 13, 2025 |
Publisher | Institute of Electrical and Electronics Engineers |
Peer Reviewed | Peer Reviewed |
Pages | 604-613 |
Series ISSN | 2473-9944 |
Book Title | 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) |
ISBN | 979-8-3503-0745-0 |
DOI | https://doi.org/10.1109/ICCVW60793.2023.00067 |
Public URL | https://nottingham-repository.worktribe.com/output/46460320 |
Publisher URL | https://ieeexplore.ieee.org/document/10350452 |
Files
Adapting Vision Foundation Models for Plant Phenotyping
(894 Kb)
PDF
You might also like
Cloud Detection Challenge - Methods and Results
(2025)
Journal Article
Evaluating Language Model Vulnerability to Poisoning Attacks in Low-Resource Settings
(2024)
Journal Article
Synchronization Is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs
(2024)
Presentation / Conference Contribution
TADM: Temporally-Aware Diffusion Model for Neurodegenerative Progression on Brain MRI
(2024)
Presentation / Conference Contribution
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search