This video provides a comprehensive rundown of approximately 15 major Google AI updates released in a single month, spanning NotebookLM, Gemini, image generation, music generation, creative tools, and reasoning models. The presenter demonstrates each feature with hands-on examples, comparing results where relevant.
NotebookLM now offers an agentic video generation system that analyzes source materials and produces 5-minute cinematic video overviews. The system uses multiple specialized models: Gemini 3 Pro writes code for programmatic animations (maps, algorithms), Imagen generates consistent-style visuals, and VO3 handles standard video generation. An automated self-critique loop reviews and edits the output. Currently available only on the Ultra plan.
New visual style presets for infographics include professional styles (bento grid, instructional, scientific) and creative styles (sketchnote, editorial, clay, kawaii, bricks, anime). Slide decks can now be edited by describing changes in natural language. Infographics can also be generated directly from the chat panel, targeted to specific conversation topics rather than broad source material.
Google released Lyria 3 for music generation, available both in Gemini (limited to 30-second clips) and through Producer AI (formerly Riffusion, acquired by Google). Producer allows full song generation with post-generation editing through natural language prompts. It also offers interactive "spaces" like synths and drum machines.
A significant upgrade especially for free-plan users, who now get 20 images per day (up from 2-3 with the previous Pro model). Key improvements include faster generation (10-15 seconds vs. ~30 for Pro), better text rendering in complex infographics, crisper detail, and strong consistent character generation. Thinking mode is recommended for complex prompts. Paid users can toggle between Nano Banana 2 and Pro using "redo with Pro."
A Google Labs experiment that creates entire product marketing campaigns from a single image. It generates professional photo shoots with accurate product details, supports editing (e.g., removing objects), and builds branded campaigns with customizable colors, fonts, and text. It can also animate the final output and extract brand DNA from existing websites.
The Chrome sidebar now includes Imagen for in-context image generation. More significantly, Auto Browse allows Gemini to navigate websites autonomously — searching, filtering, filling forms, and finding information based on natural language instructions.
A new feature under Tools > Audio that instantly generates a spoken summary of any Google Doc.
Updates to Google's Flow platform add the ability to draw on images for targeted edits and use "ingredients" (separate image elements) to compose video scenes. The platform aims to be a full creative studio for image-to-video workflows.
Enables automation of routine tasks across Gmail, Sheets, and Drive using Gemini. Opal, the no-code visual workflow builder, received an agent step that autonomously selects and orchestrates the right tools (VO for video, web search for research, etc.).
Gemini 3 DeepThink is Google's most advanced reasoning model, competing with O3 Pro and Claude Opus 4.6 (Ultra plan only). Gemini 3.1 Pro is slightly less advanced but much faster, recommended for complex reasoning, website building, and interactive tool creation in Google AI Studio.