Topic

multimodal

Coverage, reference pages, tools, and guides connected to this topic.

  1. Google debuts Gemma 4, a free multimodal model tuned for agents

    Google released Gemma 4, a 31B-parameter open model with multimodal and local deployment support, positioned as a strong backbone for agentic AI systems.

  2. Microsoft launches MAI Transcribe, Voice, and Image for agent stacks

    Microsoft’s MAI team released three in-house AI models—Transcribe, Voice, and Image—aimed at powering end-to-end multimodal agents across its ecosystem.