ChatGPT is now an assistant that can see, hear and speak
September 25, 2023
ChatGPT, developed by OpenAI, now offers new capabilities that enable interaction through voice and images, providing a more intuitive interface and more ways to integrate ChatGPT into daily life. In a recent announcement on its website, OpenAI previewed these new functions. The new modes are presented as practical, useful features in line with the best commercial AI products.
ChatGPT: voice interaction
With the new voice functionality, users can have interactive conversations with ChatGPT. This expands the chatbot's potential, allowing the assistant to be used on the go. For example, a user can ask ChatGPT to tell a children's story while traveling, making the trip more enjoyable.
A creative story from the chatbot
Similarly, a disagreement over a particular topic may come up during a conversation among friends; in that case, users can ask the bot for accurate information and settle the discussion constructively.
ChatGPT’s voice technology uses a new text-to-speech model. Developed in collaboration with professional voice actors, this model can generate human-like audio from text and a short speech sample, making interaction with ChatGPT even more natural and intuitive. Additionally, spoken words are converted into text with great accuracy by Whisper, an open-source speech recognition system developed by OpenAI. This transcription allows the chatbot to understand user requests and respond to them effectively.
ChatGPT: visual interaction
Beyond voice, the AI model can now analyze one or more images, helping users troubleshoot problems, carry out analysis, and interpret complex graphs. For example, a user might share a photo of the contents of their refrigerator. The chatbot can then analyze the available foods, suggest recipes based on those ingredients, and provide step-by-step instructions for preparation.
Also, if the user needs to focus on a specific element in the image, the ChatGPT mobile app includes a drawing tool that allows them to highlight specific areas of the image, making communication and analysis more precise and personal.
Image understanding is powered by multimodal GPT-3.5 and GPT-4 models. These advanced models apply their language skills to a wide variety of images, such as photos, screenshots, and documents containing both text and images, allowing ChatGPT to understand and interpret the visual context accurately and in detail.
It’s worth remembering that OpenAI has also recently integrated into ChatGPT not only Canva but also DALL-E 3, its generative image model.
When and where available
Over the next two weeks, OpenAI will roll out the voice and image capabilities in ChatGPT to Plus and Enterprise users.
Voice interaction will be available on iOS and Android. Although it is not coming to the web version, most users should still be able to take advantage of it.
The visual interaction features will be available on all platforms: Android, iOS and web.
John Wilkes is a seasoned journalist and author at Div Bracket. He specializes in covering trending news across a wide range of topics, from politics to entertainment and everything in between.