08.12.2023
253

Google Introduced Gemini: A Revolutionary Multimodal AI Model

Sergej Ostrovskij
Editor in Chief at ApiX-Drive
Reading time: ~1 min

Google introduces Gemini, its newest AI breakthrough, capable of processing text, images, and videos. This innovative model is being progressively incorporated into Google's ecosystem, initially through the Bard chatbot. Gemini can be used in more than 170 regions, and the capabilities of the AI model will be available to developers through the Google Cloud API.

Future plans for Google's Gemini include its integration into Google's search engine, ads platform, and the Chrome browser. The most advanced version of the AI model is planned for launch in 2024 after a thorough safety assessment. Gemini is unique in its training across various data forms, such as texts, images, video, and audio, and is available in three distinct versions: Ultra, Nano, and Pro, each tailored for varying performance levels and efficiency.

Google is enhancing its Bard chatbot by integrating the Gemini Pro model, designed to improve its analytical and planning skills. Additionally, a special version of Gemini Pro is being incorporated into AlphaCode, a coding tool from Google DeepMind. In 2024, Bard will be equipped with the most sophisticated version, Gemini Ultra, which will also be available via a cloud API.