16.05.2024

Google I/O 2024: Key Highlights

Sergej Ostrovskij
Editor in Chief at ApiX-Drive

The Google I/O 2024 conference showcased several exciting advancements and announcements. Here's a brief overview of the main updates:

Gemini 1.5 Pro

The generative AI model Gemini has received a significant upgrade and can now analyze longer documents, codebases, videos, and audio recordings. The latest version, Gemini 1.5 Pro, available in a private preview, can process up to 2 million tokens, double the previous limit. That is the largest input capacity of any commercially available model.
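
To give a feel for what a 2-million-token limit means in practice, here is a minimal sketch that estimates whether a piece of text would fit. It assumes roughly 4 characters per token, a crude rule of thumb for English text and not Gemini's actual tokenizer; the function names and constants are hypothetical and chosen only for illustration.

```python
import math

# Limit quoted in the article for the Gemini 1.5 Pro private preview.
CONTEXT_WINDOW_TOKENS = 2_000_000

# ASSUMPTION: about 4 characters per token. This is a rough heuristic,
# not the real tokenizer, so treat results as ballpark figures only.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str, chars_per_token: int = CHARS_PER_TOKEN) -> int:
    """Return a rough token estimate, rounded up."""
    return math.ceil(len(text) / chars_per_token)


def fits_in_window(text: str, window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Return True if the estimated token count is within the window."""
    return estimate_tokens(text) <= window
```

Under this assumption, the limit corresponds to about 8 million characters of plain text. For a real application, an exact count from the model's own tokenizer would be more reliable than this estimate.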

Gemini Live

Google also introduced Gemini Live, which lets users hold "in-depth" voice chats with Gemini on their smartphones. Users can interrupt to ask clarifying questions, and Gemini adapts to their speech patterns in real time. Gemini can also see and respond to the user's surroundings through photos or videos captured by the smartphone camera.

Gemini on Android

Gemini will soon replace Google Assistant on Android, integrating deeply with the mobile OS and Google apps. Users can drag and drop AI-generated images into Gmail, Google Messages, and other apps. YouTube users can use the "Ask this video" feature to find specific information from videos.

Gemini Nano

Google is integrating Gemini Nano, its smallest AI model, directly into the Chrome desktop client, starting with Chrome 126. This will let developers use the on-device model to power their own AI features.

Veo

Veo is an AI model that creates 1080p video clips up to one minute long from a text prompt. It captures various visual and cinematic styles, including landscapes and time lapses, and can edit existing footage.

Firebase Genkit

The new Firebase Genkit aims to simplify the development of AI-powered applications in JavaScript/TypeScript, with Go support coming soon. This open-source framework under the Apache 2.0 license enables developers to quickly integrate AI into new and existing applications.

Ask Photos

Launching later this summer, the "Ask Photos" feature in Google Photos, powered by Gemini AI, will allow users to search their photo collections using natural language queries.

Stay updated with these groundbreaking developments from Google I/O 2024!