multimodal
Coverage, reference pages, tools, and guides connected to this topic.
-
Google debuts Gemma 4, a free multimodal model tuned for agents
Google released Gemma 4, a 31B-parameter open model with multimodal and local deployment support, positioned as a strong backbone for agentic AI systems.
-
Microsoft launches MAI Transcribe, Voice, and Image for agent stacks
Microsoft’s MAI team released three in-house AI models—Transcribe, Voice, and Image—aimed at powering end-to-end multimodal agents across its ecosystem.