orasul commited on
Commit
1c15ba3
·
1 Parent(s): 3efd1e6

Update README

Browse files
Files changed (5) hide show
  1. .gitattributes +2 -0
  2. README.md +47 -0
  3. res/YOLO_accuracy.png +3 -0
  4. res/bb_1.jpeg +3 -0
  5. res/bb_2.jpeg +3 -0
.gitattributes CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ *.png filter=lfs diff=lfs merge=lfs -text
37
+ *.jpeg filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,50 @@
1
  ---
2
  license: gpl-3.0
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: gpl-3.0
3
+ tags:
4
+ - ultralytics
5
+ - yolo
6
+ - object-detection
7
+ - ui-detection
8
+ - computer-vision
9
+ - agent
10
  ---
11
+
12
+ # deki-yolo: Mobile UI Element Detection Model
13
+
14
+ This is a YOLO model trained to identify common UI elements in mobile
15
+ screenshots. It is the core detection model for the [deki huggingface space](https://huggingface.co/spaces/orasul/deki)
16
+ or [deki github](https://github.com/RasulOs/deki)
17
+
18
+ ## Model Description
19
+
20
+ The model is trained to detect the following four classes of UI elements:
21
+ * `View`: General-purpose containers.
22
+ * `ImageView`: Icons and images.
23
+ * `Text`: Text elements.
24
+ * `Line`: Separators and lines.
25
+
26
+ This model can be used as a foundational component for applications that need
27
+ to understand screen layouts, such as AI agents for mobile automation,
28
+ accessibility tools, and UI code generation.
29
+
30
+ ---
31
+
32
+ ## YOLO examples
33
+
34
+ Bounding boxes with classes for bb_1:
35
+
36
+ <img src="./res/bb_1_yolo.jpeg" alt="example1" width="60%">
37
+
38
+ Bounding boxes with classes for bb_2:
39
+
40
+ <img src="./res/bb_2_yolo.jpeg" alt="example2" width="60%">
41
+
42
+ ---
43
+
44
+ ## YOLO model accuracy
45
+
46
+ The model was trained on 486 images and was tested on 60 images.
47
+
48
+ Current YOLO model accuracy:
49
+ ![example3](./res/YOLO_accuracy.png)
50
+
res/YOLO_accuracy.png ADDED

Git LFS Details

  • SHA256: 1070c8072dc4a92292a60527c18b65fee396d563ac297589dfab00374aabfdc3
  • Pointer size: 131 Bytes
  • Size of remote file: 223 kB
res/bb_1.jpeg ADDED

Git LFS Details

  • SHA256: 47f8170ef239b0c8b2d3f27aee8417db55236a411acb0100b8d7828946a2e045
  • Pointer size: 131 Bytes
  • Size of remote file: 131 kB
res/bb_2.jpeg ADDED

Git LFS Details

  • SHA256: d7f6604f321e1a21c4e779c5f18624146bdcdd2a669132f4144d6cd50b22648c
  • Pointer size: 131 Bytes
  • Size of remote file: 104 kB