← Back to PRs

#21499: fix #20721: add video and audio to models.input type union

by neipor open 2026-02-20 01:08 View on GitHub →
agents size: XS
## Summary Model input config (`models.providers[].models[].input`) rejects `"video"` and `"audio"` values at zod validation, preventing users from declaring Gemini native multimodal capabilities. ## Changes - `src/config/zod-schema.core.ts`: Added "video" and "audio" to input type union - `src/config/types.models.ts`: Added "video" and "audio" to ModelDefinitionConfig.input type ## Testing - [x] Code compiles and passes format check <!-- greptile_comment --> <h3>Greptile Summary</h3> Added `"video"` and `"audio"` to the model input type union, allowing users to declare Gemini native multimodal capabilities in model configurations. The changes update both the TypeScript type definition and the Zod validation schema to accept these new input types alongside existing `"text"` and `"image"` types. <h3>Confidence Score: 5/5</h3> - This PR is safe to merge with minimal risk - The changes are minimal, well-scoped, and follow the existing pattern. Both the TypeScript type and Zod schema are updated consistently. The new input types align with existing `MediaUnderstandingCapabilitiesSchema` that already supports video and audio, demonstrating consistency with the codebase's architecture. - No files require special attention <sub>Last reviewed commit: 3e12dca</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->

Most Similar PRs