Yazılar

Gemini to Receive Enhancements with New Audio Overview and Canvas Features

Google has announced the rollout of two exciting new artificial intelligence (AI) features for Gemini, enhancing the platform’s capabilities for both free and Gemini Advanced subscribers. The first new feature, called Canvas, offers an interactive space where users can collaborate directly with AI on a variety of tasks, including document creation and coding. This feature aims to bridge the gap between human creativity and AI efficiency, allowing users to generate drafts, make edits, and refine their work through AI assistance. The second new addition, Audio Overview, is a feature that was previously exclusive to Google’s NotebookLM but is now making its way to Gemini. This tool lets users transform documents, slides, and Deep Research reports into an engaging, podcast-style audio discussion, making it easier to digest complex content.

Both features are being introduced as part of Gemini’s ongoing evolution, following the introduction of Deep Research—a tool designed to generate detailed reports on complex topics—and exclusive lockscreen widgets for iOS users. The addition of Canvas and Audio Overview comes as part of a broader strategy to enrich user experience by offering new, intuitive ways to interact with AI. These new functionalities will be available across both the web and mobile versions of Gemini, allowing users to access them seamlessly across devices.

Canvas allows users to add documents or lines of code into a dedicated workspace within the Gemini interface. By clicking on the newly introduced Canvas button next to the Deep Research option, users can start working on a project where the AI generates a first draft based on the user’s prompt. From there, users can collaborate with the AI, editing the draft and refining the output to their liking. This feature is designed to facilitate a more hands-on, creative process where human expertise and AI capabilities complement each other, making it ideal for projects that require a mix of creativity and technical input.

On the other hand, Audio Overview offers an innovative way to engage with written content. This feature takes documents, presentations, and reports and transforms them into a podcast-like audio experience. Users can simply input a document or presentation, and Gemini will generate an engaging, narrated summary, making it easier for people to absorb the content in an auditory format. This feature is especially useful for users on the go who prefer listening to content instead of reading, offering a more flexible and interactive way to consume information. With these additions, Gemini is further positioning itself as a powerful AI tool for both personal and professional use.

Former Meta Executives Raise $15 Million for AI Assistant Startup Yutori

Two former Meta artificial intelligence executives, Devi Parikh and Dhruv Batra, have secured $15 million in funding for their AI assistant startup, Yutori. The funding round was led by Radical Ventures’ Rob Toews, with contributions from Felicis, AI pioneer Fei-Fei Li, and Google DeepMind’s Jeff Dean.

San Francisco-based Yutori is developing autonomous AI agents capable of executing tasks independently, a growing trend in AI innovation. Unlike current chatbots that primarily facilitate conversations, Yutori aims to create AI assistants that actively perform tasks, such as managing travel logistics and automating online transactions.

The company is leveraging post-training techniques to enhance AI models’ ability to navigate the web and execute complex reasoning tasks. Yutori’s team includes experts who played key roles in Meta’s AI advancements, including researchers behind Llama 3 and Llama 4, as well as embodied AI projects designed for robotics.

OpenAI Set to Test ChatGPT Integrations for Slack and Google Drive

OpenAI is reportedly preparing to launch a new feature for ChatGPT that will allow the AI to connect with external platforms like Google Drive and Slack. This feature, known as ChatGPT Connectors, will be available exclusively to Teams subscribers and is designed to improve enterprise users’ access to information stored within these platforms. By syncing with internal data from services like Google Drive for Workspace and Slack, ChatGPT will be able to answer queries based on the specific knowledge base of the connected platforms, making it a powerful tool for business users.

ChatGPT Connectors: A New Tool for Enterprises

According to a report from TechCrunch, OpenAI is set to begin beta testing for the ChatGPT Connectors feature. This feature will enable users to connect ChatGPT with third-party databases and communication tools, streamlining information retrieval across platforms. It’s expected that, during its initial phase, ChatGPT Connectors will focus on integrations with Google Drive and Slack, allowing the AI model to extract relevant data from files, presentations, spreadsheets, and conversations within these platforms. Later on, OpenAI may expand the feature to work with other platforms like Microsoft SharePoint and Box.

GPT-4o and Privacy Concerns

The new feature will be powered by a version of OpenAI’s GPT-4o AI model, which will be tailored to the specific internal knowledge of each connected platform. By integrating with platforms like Google Drive and Slack, the model will be able to search for and provide answers based on encrypted copies of files and conversations stored on OpenAI’s servers. This raises questions about privacy, particularly concerning how long the data will be stored and who will have access to it. While the data is encrypted, it remains unclear how OpenAI will manage these files and whether any third parties could potentially access them.

Enhancing User Experience with Contextual Responses

In addition to retrieving information from internal platforms, ChatGPT Connectors will also include a feature that displays sources for related information not directly used in the response. This feature will appear as a button at the bottom of each response, giving users further insight into the data used to generate answers. Furthermore, the AI will be able to access external information from the internet and its training data, ensuring that it can provide comprehensive and up-to-date answers.

As the beta testing phase for ChatGPT Connectors begins, it will be interesting to see how the feature performs in real-world business environments. If successful, this integration could transform how enterprises leverage AI tools to access and utilize data, streamlining workflows and improving productivity. However, OpenAI will need to address privacy concerns to ensure that businesses can trust the system with sensitive internal information.