BirdBox is a comprehensive system for detecting and evaluating bird calls in audio recordings using deep learning. It leverages YOLO (You Only Look Once) object detection on spectrogram images to identify and localize bird vocalizations in time and frequency.
Everything you need, including installation instructions, an interactive demo, model metrics, and the CLI Reference, can be found in the BirdBox Documentation.
- Installation - set up the environment
- Data Flow - pipeline description
- Models and Metrics - list of models with corresponding metrics
- CLI Reference - command line interface
- API Reference - application programming interface
Try out the Interactive Demo or browse the Demo Documentation. If everything is working as expected, the web interface will look like this:
Feel free to use BirdBox for your acoustic analyses and research. If you do, please cite as:
@software{Schlosser_BirdBox,
author = {Schlosser, Elias and Kahl, Stefan and Eibl, Maximilian},
license = {MIT},
title = {{BirdBox}},
url = {https://github.com/birdnet-team/BirdBox}
}The source code is licensed under the MIT License. See the License for details.
Our work in the K. Lisa Yang Center for Conservation Bioacoustics is made possible by the generosity of K. Lisa Yang to advance innovative conservation technologies to inspire and inform the conservation of wildlife and habitats.
The development of BirdNET is supported by the German Federal Ministry of Research, Technology and Space (FKZ 01|S22072), the German Federal Ministry for the Environment, Climate Action, Nature Conservation and Nuclear Safety (FKZ 67KI31040E), the German Federal Ministry of Economic Affairs and Energy (FKZ 16KN095550), the Deutsche Bundesstiftung Umwelt (project 39263/01) and the European Social Fund.
BirdNET is a joint effort of partners from academia and industry. Without these partnerships, this project would not have been possible. Thank you!


