© OpenAI
Artificial Intelligence

ChatGPT: voice conversations and image analysis for a better experience 🤖

Cliquez ici pour lire en français

ChatGPT just keeps getting better. Since its launch, this generative artificial intelligence from OpenAI has received regular updates aimed at making the user experience even better. The latest update promises to make interactions with users smoother than ever.

You can have real conversations with ChatGPT

Imagine being able to start a natural conversation with ChatGPT, just as you would with a friend, simply by using your voice, wherever you are. You could ask it to tell a story to your family when you’re out and about, or even resolve a heated debate at the dinner table. Well, now you can, thanks to ChatGPT.

If you’re familiar with using voice assistants, you already have an idea of how this type of interaction works. With the power of ChatGPT, the answers to your questions will be even more precise and natural.

This brand-new voice function is made possible by a brand-new text-to-speech model. It can take a simple text and a few seconds of sample speech to generate a human-like sound. OpenAI has worked in collaboration with professional actors to create each of the voices available. To transcribe your words into text, OpenAI’s open-source speech recognition system, Whisper, is used.

To start taking advantage of this feature, simply go to the mobile application’s settings. Look for the « New Features » option and select « Voice Conversations ». Then, on the home screen, click on the headset button in the top right-hand corner, and choose from five different voices the one that suits you best. It’s that easy!

ChatGPT can help you understand your environment

A few months ago, OpenAI announced that ChatGPT was now able to decrypt and understand images. This feature has been made even easier to use with the latest update to the virtual assistant.

Now you can present one or more images to ChatGPT. You can ask it to solve problems, such as why your barbecue won’t start, explore the contents of your fridge to plan a meal, or analyze complex work-related graphics. To emphasize a specific part of the image, you can use the drawing tool in our mobile application.

It probably reminds you of what Google offers with Google Lens. And once again, thanks to the power of ChatGPT, the experience is much improved.

To get started, press the photo button to capture or select an image. If you’re on iOS or Android, press the plus button first. You can also discuss multiple images or use our drawing tool to guide your assistant.

Gradual, but above all responsible deployment

These new ChatGPT features will be rolled out gradually and, above all, responsibly.

OpenAI’s goal is to build AGI that is safe and beneficial. We believe in making our tools available gradually, which allows us to make improvements and refine risk mitigations over time while also preparing everyone for more powerful systems in the future.

Aware of the potential of its technologies, particularly that of creating synthetic voices, OpenAI is proceeding cautiously. Indeed, the company is well aware of possible abuses, such as impersonation of public figures or fraud.

This is why text-to-speech technology is not open to all uses. It is reserved for the voice chat function. In addition, technical measures have been put in place to limit ChatGPT’s ability to analyze and make direct statements about individuals, while respecting their privacy.

Limitations are still present

While these new features significantly enhance the ChatGPT user experience, it’s important to note that some limitations remain.

Users might depend on ChatGPT for specialized topics, for example in fields like research. We are transparent about the model’s limitations and discourage higher risk use cases without proper verification.

So don’t put all your trust in ChatGPT – quite the opposite. Keep in mind that ChatGPT performs better in English than in other languages. It is effective for transcribing English texts, but performs poorly in other languages, especially those with non-Roman scripts. As a result, OpenAI strongly advises non-English-speaking users against using ChatGPT for this purpose.

The roll-out of voice and image functionality in ChatGPT for Plus and Enterprise users will take place over the next two weeks. The voice feature is available on Android (click here to download) and iOS (click here to download), while the image feature will be accessible on all platforms.

Source : OpenAI

Qu'en avez-vous pensé?

Excité
0
Joyeux
0
Je suis fan
0
Je me questionne
0
Bof
0

Vous pourriez aussi aimer

Laisser une réponse

Votre adresse e-mail ne sera pas publiée. Les champs obligatoires sont indiqués avec *