Google Unveils Veo 2 AI Video Generation Model for Gemini Advanced Users

Google has rolled out its Veo 2 artificial intelligence (AI) video generation model to paid subscribers of Gemini. The tool lets users create eight-second video clips from text prompts written in natural language. Veo 2, first introduced in December 2024 as the successor to the original Veo model, is also integrated into Google’s Vertex AI platform and powers YouTube’s Dream Screen feature. The launch is another step in Google’s push to expand AI capabilities within the Gemini ecosystem.

Currently, the Veo 2 model is available only to subscribers of Gemini Advanced, Gemini’s paid plan; free-tier users cannot access it. The feature is rolling out globally and will be offered in all languages supported by Gemini. Because the rollout is gradual, it may take some time to reach every eligible subscriber.

The Veo 2 model allows users to generate high-quality videos in 720p resolution, maintaining a 16:9 aspect ratio. The video clips are produced in response to detailed text prompts and can be downloaded in MP4 format. Users can also share these clips directly on popular social media platforms like TikTok and YouTube. Google has set a monthly limit on the number of videos each user can generate, and notifications will alert users when they are nearing their quota.

The Veo 2 AI model also brings significant advancements in terms of realism and cinematic detail. It can interpret technical film terms, such as camera lenses, movements, and cinematic effects, allowing users to be highly specific in their prompts. This enhanced understanding enables the AI to produce more tailored and professional-looking video content, making it a valuable tool for creators who want to experiment with video production in a more intuitive and accessible way.

Gemini 2.5 Pro Enters Public Preview as Google Boosts AI Studio Rate Limits

Google Expands Access to Gemini 2.5 Pro with Public Preview and New Pricing

Google has officially transitioned its Gemini 2.5 Pro AI model from experimental preview to public preview, allowing broader access for developers. Initially launched last month with limited rate caps, the advanced language model is now available with increased usage limits via the Gemini API and Google AI Studio. This shift opens the door for more robust experimentation and development, especially for those looking to integrate high-performance AI into their workflows.

According to Google, early interest in Gemini 2.5 Pro exceeded expectations, prompting the company to expand availability. While the model is now accessible through the Gemini API in AI Studio, it is still pending rollout on Vertex AI. Developers can take advantage of the new access tier immediately, giving them greater flexibility and speed in deploying AI-driven applications.

With expanded access comes clarified pricing. Google has introduced a two-tier pricing structure for Gemini 2.5 Pro. The standard tier applies to prompts of up to 200,000 tokens and is priced at $1.25 per million input tokens and $10 per million output tokens. Input tokens cover all forms of content, including text, images, and audio, while output tokens count both the model’s reasoning and its final response.

For prompts that exceed the 200,000-token threshold, the higher tier applies: $2.50 per million input tokens and $15 per million output tokens. Meanwhile, Google continues to offer the experimental version of Gemini 2.5 Pro at no cost, with limited access. Emphasizing affordability, Google claims its rates are highly competitive, especially compared with rivals such as Anthropic’s Claude 3.7 Sonnet, which costs $3 per million input tokens and $15 per million output tokens.
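
As a rough illustration of the two-tier arithmetic, the Python sketch below estimates the cost of a single request from the rates quoted in this article. It assumes that the tier is selected by the size of the prompt (input tokens) and that the whole request is then billed at that tier's rates; the names `Rates` and `estimate_cost_usd` are hypothetical and not part of any Google SDK. Actual billing may differ, so treat the output as an estimate only.

```python
from dataclasses import dataclass

# Threshold and rates as quoted in the article (USD per million tokens).
TIER_THRESHOLD_TOKENS = 200_000


@dataclass(frozen=True)
class Rates:
    input_per_million: float
    output_per_million: float


STANDARD = Rates(input_per_million=1.25, output_per_million=10.00)
LONG_PROMPT = Rates(input_per_million=2.50, output_per_million=15.00)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request.

    Assumption: the prompt size alone decides the tier, and both input and
    output tokens are billed at that tier's rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    rates = STANDARD if input_tokens <= TIER_THRESHOLD_TOKENS else LONG_PROMPT
    total = (
        input_tokens * rates.input_per_million
        + output_tokens * rates.output_per_million
    )
    return total / 1_000_000


if __name__ == "__main__":
    # A 100,000-token prompt with a 2,000-token answer (standard tier).
    print(f"{estimate_cost_usd(100_000, 2_000):.4f}")
    # A 300,000-token prompt with the same answer length (higher tier).
    print(f"{estimate_cost_usd(300_000, 2_000):.4f}")
```

Under these assumptions, doubling the input rate above the threshold means a long prompt costs more than twice as much as the same number of tokens split across several shorter prompts, which can matter when deciding how much context to send per request.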

Gemini to Receive Enhancements with New Audio Overview and Canvas Features

Google has announced the rollout of two exciting new artificial intelligence (AI) features for Gemini, enhancing the platform’s capabilities for both free and Gemini Advanced subscribers. The first new feature, called Canvas, offers an interactive space where users can collaborate directly with AI on a variety of tasks, including document creation and coding. This feature aims to bridge the gap between human creativity and AI efficiency, allowing users to generate drafts, make edits, and refine their work through AI assistance. The second new addition, Audio Overview, is a feature that was previously exclusive to Google’s NotebookLM but is now making its way to Gemini. This tool lets users transform documents, slides, and Deep Research reports into an engaging, podcast-style audio discussion, making it easier to digest complex content.

Both features are part of Gemini’s ongoing evolution, following the introduction of Deep Research, a tool designed to generate detailed reports on complex topics, and lock screen widgets exclusive to iOS users. Canvas and Audio Overview belong to a broader strategy to enrich the user experience with new, intuitive ways to interact with AI. Both will be available on the web and mobile versions of Gemini, so users can access them across devices.

Canvas allows users to add documents or lines of code into a dedicated workspace within the Gemini interface. By clicking on the newly introduced Canvas button next to the Deep Research option, users can start working on a project where the AI generates a first draft based on the user’s prompt. From there, users can collaborate with the AI, editing the draft and refining the output to their liking. This feature is designed to facilitate a more hands-on, creative process where human expertise and AI capabilities complement each other, making it ideal for projects that require a mix of creativity and technical input.

Audio Overview, for its part, offers a different way to engage with written content. Users upload a document, presentation, or report, and Gemini generates a narrated, podcast-style summary of it. The feature is especially useful for people on the go who prefer listening to reading. With these additions, Gemini is further positioning itself as an AI tool for both personal and professional use.