
OpenAI launched Advanced Voice Mode with Vision in ChatGPT on Thursday. The feature, which is rolling out to all ChatGPT Plus, Team, and Pro subscribers, lets the artificial intelligence (AI) chatbot access the smartphone’s camera to capture visual information about the user’s surroundings. It draws on the capabilities of GPT-4o and provides real-time voice responses to whatever is shown in the camera feed. Vision in ChatGPT was first unveiled at the company’s Spring Update event.
ChatGPT Gets Vision Capability
The new ChatGPT feature was launched on the sixth day of OpenAI’s 12-day feature release schedule. To date, the AI firm has released the full version of the o1 model, the video-generating Sora model, and a new Canvas tool. Now, with Advanced Voice Mode with Vision, users can let the AI see their surroundings and ask questions about them.
In a demonstration, members of the OpenAI team interacted with the chatbot and introduced several people to it. Afterwards, the AI could answer questions about those people even when they were no longer on screen. This shows that the vision mode also comes with memory, although the company did not specify how long the memory lasts.
Users can use the ChatGPT vision feature to show the AI their refrigerator and ask for recipes, or show it their wardrobe and ask for outfit suggestions. They can also point the camera at a landmark outdoors and ask questions about it. The feature is paired with the chatbot’s low-latency, emotive Advanced Voice Mode, making it easier for users to interact in natural language.
Once the feature rolls out to them, users can open ChatGPT’s mobile app and tap the Advanced Voice icon. In the new interface, they will see a video option, and tapping it gives the AI access to the user’s camera feed. In addition, a screen-sharing feature can be accessed by tapping the three-dot menu.
Screen sharing lets the AI view the user’s device and any apps or screens they open. In this way, the chatbot can also help users with problems and queries related to their smartphones. Notably, OpenAI said that all Team subscribers will get access to the feature next week in the latest version of the ChatGPT mobile app.
Most Plus and Pro users will also get the feature. However, users in the European Union, Switzerland, Iceland, Norway, and Liechtenstein are not getting it for now. Enterprise and Edu users, meanwhile, will get access to ChatGPT’s Advanced Voice Mode with Vision in 2025.