OpenAI:the company submittedis a new model of multimodal generative artificial intelligence (AI), GPT-4o, whose name "o" means omni. In the new model, text processing and image analysis functions have been improved, and the ability to recognize and process speech in real time has been added. The new model will be rolled out in solutions for developers and consumers in the coming weeks and will be available for free.
According to Open AI technical director Muri Murati, the new model's cognitive capabilities are much higher than GPT-4's, especially in the field of programming. In addition, the new model understands and owns50 languages, including Armenian. We will find out in the coming weeks how well it can understand a language as complex as Armenian and whether it has surpassed ChatGPT-3.5.
There is also a multilingual voice assistant that can translate speech on the fly and has human speech features; it can joke and "laugh".As he writes TechCrunch, one of the main innovations of GPT-4o is the ability to understand photos directly from the camera in real time, which makes interaction with AB more intuitive and natural. Previously, the chatbot could only see uploaded images.
The release of the new model will take place in stages. Some users have already received it, while others will in the coming weeks.
For free users, there will be some restrictions in terms of requests made to GPT-4o. That amount will depend on current usage and demand. If GPT-4o is unavailable, users of the free version will be redirected to GPT-3.5.
It's also worth noting that free users will also get limited access to advanced tools like data analysis, file downloads, and more.