AI Can Process Live Video Now (From Your Phone!)


Summary

Google and OpenAI have given their AI models access to cameras, allowing them to see and interact with the computer desktop. Examples include Gemini 2.0 and ChatGPT's Advanced Voice Mode. The speaker, wearing a Santa hat, asks the viewer whether they see a coffee setup with a kettle in front of them. The demonstration shows how AI can perceive and interact with the world in a more human-like manner.


Introduction of AI models with access to cameras

Google and OpenAI have given their AI models access to cameras, allowing them to see and interact with the computer desktop. Gemini 2.0 and ChatGPT's Advanced Voice Mode are two examples of this capability.

Visual Interaction with AI

The speaker, wearing a Santa hat, asks whether the viewer sees a coffee setup with a kettle in front of them.
