Extract and visualize layout from PDFs or images
Generate Gradio app code based on user requests
MOSS-TTSD: Text to Spoken Dialogue Generation
nanonets ocr / smoldocling / monkey ocr / typhoon ocr