Google Gemini Expands to Video Queries

Google's Gemini now allows users to ask questions using videos and screen content, enhancing its AI capabilities.

Mar 3, 2025

Google Gemini

Google has taken a significant step forward in AI innovation with its Gemini platform, now enabling users to ask questions using videos and what’s on their screen. This advancement marks a pivotal moment in how users interact with AI, moving beyond traditional text-based queries to more visual and interactive methods.

Gemini allows users to upload videos that demonstrate a problem they are trying to solve. The AI then searches the internet, including user forums and tutorials, to provide solutions. For instance, if a user is having trouble with a device, they can record a video of the issue and let Gemini find relevant fixes online.

Users can also highlight specific areas of concern in the video by adding text or drawing arrows. This feature ensures that Gemini understands the exact issue, providing more accurate and relevant solutions.

Gemini can also interpret content on the user's screen, allowing users to ask questions about what they are viewing. This feature is particularly useful for troubleshooting or seeking information about items displayed on the screen.

Google is continuously expanding Gemini's capabilities. Recent leaks suggest that Gemini may soon include AI video creation features, allowing users to generate realistic video clips from text prompts. While these features are still in development, they represent a significant leap in AI-powered content creation.

The integration of video and screen content into Gemini enhances the user experience by making it easier to communicate complex issues or questions. This approach saves time and effort, as users no longer need to describe their problems in detail. Instead, they can simply show the issue, and Gemini will provide relevant solutions.