|
|
---
|
|
|
license: gpl-3.0
|
|
|
tags:
|
|
|
- ultralytics
|
|
|
- yolo
|
|
|
- object-detection
|
|
|
- ui-detection
|
|
|
- computer-vision
|
|
|
- agent
|
|
|
---
|
|
|
|
|
|
# deki-yolo: Mobile UI Element Detection Model
|
|
|
|
|
|
This is a YOLO model trained to identify common UI elements in mobile
|
|
|
screenshots. It is the core detection model for the [deki huggingface space](https://huggingface.co/spaces/orasul/deki)
|
|
|
or [deki github](https://github.com/RasulOs/deki)
|
|
|
|
|
|
## Model Description
|
|
|
|
|
|
The model is trained to detect the following four classes of UI elements:
|
|
|
* `View`: General-purpose containers.
|
|
|
* `ImageView`: Icons and images.
|
|
|
* `Text`: Text elements.
|
|
|
* `Line`: Separators and lines.
|
|
|
|
|
|
This model can be used as a foundational component for applications that need
|
|
|
to understand screen layouts, such as AI agents for mobile automation,
|
|
|
accessibility tools, and UI code generation.
|
|
|
|
|
|
---
|
|
|
|
|
|
## YOLO examples
|
|
|
|
|
|
Bounding boxes with classes for bb_1:
|
|
|
|
|
|
<img src="res/bb_1_yolo.jpeg" alt="example1" width="60%">
|
|
|
|
|
|
Bounding boxes without classes but with IDs after NMS for bb_1:
|
|
|
|
|
|
<img src="res/bb_1_yolo_updated.jpeg" alt="example2" width="60%">
|
|
|
|
|
|
Bounding boxes with classes for bb_2:
|
|
|
|
|
|
<img src="res/bb_2_yolo.jpeg" alt="example3" width="60%">
|
|
|
|
|
|
Bounding boxes without classes but with IDs after NMS for bb_2:
|
|
|
|
|
|
<img src="res/bb_2_yolo_updated.jpeg" alt="example4" width="60%">
|
|
|
|
|
|
---
|
|
|
|
|
|
## YOLO model accuracy
|
|
|
|
|
|
The model was trained on 486 images and was tested on 60 images.
|
|
|
|
|
|
Current YOLO model accuracy:
|
|
|

|
|
|
|
|
|
|