Mahmud ElHuseyni 🇵🇸
MElHuseyni
AI & ML interests
Computer Vision
NLP
Machine Learning
Recent Activity
liked
a model
3 days ago
lightonai/LightOnOCR-1B-1025
upvoted
an
article
4 days ago
Supercharge your OCR Pipelines with Open Models
Organizations
SmolVLM 🚐
OCR Models 👀️📃
Visual Embedding Models 🖼️
-
jinaai/jina-embeddings-v4
Visual Document Retrieval • 4B • Updated • 79k • 390 -
vidore/colqwen2.5-v0.2
Visual Document Retrieval • Updated • 48.2k • 85 -
nomic-ai/colnomic-embed-multimodal-7b
Visual Document Retrieval • Updated • 13.9k • 90 -
nvidia/llama-nemoretriever-colembed-3b-v1
Visual Document Retrieval • 4B • Updated • 1.78k • 52
Speech Models 🎧
Instance Segmentation
Image Segmentation Models 🍪
-
nvidia/segformer-b5-finetuned-cityscapes-1024-1024
Image Segmentation • Updated • 83.4k • • 34 -
nvidia/segformer-b0-finetuned-ade-512-512
Image Segmentation • 3.75M • Updated • 356k • • 167 -
facebook/maskformer-swin-base-ade
Image Segmentation • Updated • 1.74k • • 13 -
facebook/maskformer-swin-base-coco
Image Segmentation • 0.1B • Updated • 1.52k • • 26
Object Detection Models 🍉
Vision Language Leader-boards 📈
-
Running3939
OCRBenchv2 Leaderboard
🏆Display OCRBench leaderboard for text recognition models
-
Running178178
Vidore Leaderboard
🥇Explore visual document retrieval benchmark results
-
Running on CPU Upgrade920920
Open VLM Leaderboard
🌎VLMEvalKit Evaluation Results Collection
-
Running557557
Vision Arena (Testing VLMs side-by-side)
🖼Display image analysis results
LLM Inference 🚀
-
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference
Paper • 2401.08671 • Published • 15 -
NanoFlow: Towards Optimal Large Language Model Serving Throughput
Paper • 2408.12757 • Published • 19 -
richard-park/llama3-deepspeed-v1.0
Text Generation • 8B • Updated • • 1
Arabic Models (LLM, VLM, Multimodel)
Instance Segmentation
SmolVLM 🚐
Image Segmentation Models 🍪
-
nvidia/segformer-b5-finetuned-cityscapes-1024-1024
Image Segmentation • Updated • 83.4k • • 34 -
nvidia/segformer-b0-finetuned-ade-512-512
Image Segmentation • 3.75M • Updated • 356k • • 167 -
facebook/maskformer-swin-base-ade
Image Segmentation • Updated • 1.74k • • 13 -
facebook/maskformer-swin-base-coco
Image Segmentation • 0.1B • Updated • 1.52k • • 26
OCR Models 👀️📃
Object Detection Models 🍉
Visual Embedding Models 🖼️
-
jinaai/jina-embeddings-v4
Visual Document Retrieval • 4B • Updated • 79k • 390 -
vidore/colqwen2.5-v0.2
Visual Document Retrieval • Updated • 48.2k • 85 -
nomic-ai/colnomic-embed-multimodal-7b
Visual Document Retrieval • Updated • 13.9k • 90 -
nvidia/llama-nemoretriever-colembed-3b-v1
Visual Document Retrieval • 4B • Updated • 1.78k • 52
Vision Language Leader-boards 📈
-
Running3939
OCRBenchv2 Leaderboard
🏆Display OCRBench leaderboard for text recognition models
-
Running178178
Vidore Leaderboard
🥇Explore visual document retrieval benchmark results
-
Running on CPU Upgrade920920
Open VLM Leaderboard
🌎VLMEvalKit Evaluation Results Collection
-
Running557557
Vision Arena (Testing VLMs side-by-side)
🖼Display image analysis results
Speech Models 🎧
LLM Inference 🚀
-
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference
Paper • 2401.08671 • Published • 15 -
NanoFlow: Towards Optimal Large Language Model Serving Throughput
Paper • 2408.12757 • Published • 19 -
richard-park/llama3-deepspeed-v1.0
Text Generation • 8B • Updated • • 1