 
				allenai/Molmo-72B-0924
			Image-Text-to-Text
			• 
		
				73B
			• 
	
				Updated
					
				
				• 
					
					2.54k
				
	
				• 
					
					294
				
 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				A unified multimodal understanding and generation model.
 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				