CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
0
0
0
0
0
Subscribe
CW Pakistan
CW Pakistan CW Pakistan
  • Legacy
    • Legacy Editorial
    • Editor’s Note
  • Academy
  • Wired
  • Cellcos
  • PayTech
  • Business
  • Ignite
  • Digital Pakistan
  • PSEB
    • DFDI
    • Indus AI Week
  • PASHA
  • TechAdvisor
  • GamePro
  • Partnerships
  • Wired

Google Enhances Gemini AI with Audio Overview and Canvas for Smarter Collaboration

  • March 24, 2025
Total
0
Shares
0
0
0
Share
Tweet
Share
Share
Share
Share

Google has expanded the capabilities of its AI-powered Gemini platform with the introduction of two new features: Audio Overview and Canvas. These updates are designed to enhance document accessibility, collaboration, and content refinement, making Gemini an even more powerful AI assistant for a broad range of users. The features bring new ways for individuals and teams to interact with information, offering improved productivity tools for both content creation and software development.

One of the key additions to Gemini is Audio Overview, a feature that allows users to transform documents, presentations, and reports into AI-generated spoken discussions. This tool provides a dynamic approach to content summarization, turning complex information into engaging, podcast-style insights. The AI hosts within Gemini perform real-time analysis, extracting key points and presenting them in an accessible audio format. The feature is especially useful for individuals who prefer listening over reading, making it an ideal solution for reviewing research papers, summarizing notes, or organizing lengthy email threads.

Originally developed as part of Google’s NotebookLM, the Audio Overview function has now been integrated into Gemini and is available to all users at no cost. Currently, the feature supports only English, but Google has confirmed plans to introduce support for multiple languages in the near future. Users can access Audio Overview simply by uploading a document to Gemini, after which a suggestion chip appears above the prompt bar to guide them through the process. Within minutes, AI-generated discussions are available for listening, sharing, or downloading across both web and mobile platforms.

Alongside Audio Overview, Google has also launched Canvas, an interactive platform designed for content collaboration. Canvas provides users with an AI-powered space where they can draft, refine, and share their work in real-time. Writers, researchers, and professionals can use Canvas to fine-tune their documents, adjusting elements such as tone, length, and formatting with AI assistance. This feature is particularly beneficial for those working on essays, blog posts, and reports, as it allows them to receive instant feedback and implement changes seamlessly.

For software developers, Canvas offers an advanced environment where they can bring ideas to life by building interactive projects, Python scripts, and web application prototypes. The feature enables users to preview their code and make necessary refinements directly within the platform, streamlining the development process. By providing an integrated workspace, Canvas enhances efficiency, allowing developers to quickly iterate on their projects before transferring content to Google Docs for further collaboration.

These new capabilities mark a significant advancement for Gemini, transforming it into a more sophisticated AI assistant that bridges the gap between content development and information processing. By integrating AI-driven interactivity into workflows, Google is positioning Gemini as a direct competitor to leading AI tools such as OpenAI’s ChatGPT and Anthropic’s AI models.

Starting today, Canvas is available in all supported languages, while Audio Overview is rolling out in English, with plans to expand language support in the coming months. Users can explore these new features through the Gemini web and mobile apps, further enhancing their ability to work efficiently in an AI-assisted environment. With these updates, Google continues to push the boundaries of AI-driven productivity, solidifying its place as a leader in the evolving digital landscape.

Share
Tweet
Share
Share
Share
Previous Article
  • Ignite

Pakistani Startups Shine on Meet The Drapers, Compete for $1 Million

  • March 24, 2025
Read More
Next Article
  • Digital Pakistan

Sindh Launches Digital Attendance System for Teachers to Enhance Transparency

  • March 24, 2025
Read More
You May Also Like
Read More
  • Wired

Meta Deploys Advanced Artificial Intelligence And New Tools To Combat Scams Across Facebook, WhatsApp And Messenger

  • Press Desk
  • March 20, 2026
Read More
  • Wired

YouTube Introduces Reimagine AI Tool For Shorts Allowing Users To Transform Scenes With Google Veo Video Generation

  • Press Desk
  • March 20, 2026
Read More
  • Wired

Meta Launches Creator Fast Track Programme Offering Up To USD 3,000 Monthly Bonuses To Lure TikTok And YouTube Creators To Facebook

  • Press Desk
  • March 19, 2026
Read More
  • Wired

OpenAI Launches GPT-5.4 Mini For Free ChatGPT Users And GPT-5.4 Nano For Developers

  • Press Desk
  • March 19, 2026
Read More
  • Wired

Khyber Pakhtunkhwa Government Orders Full Work-From-Home On Fridays For Two Months Amid Fuel Crisis

  • Press Desk
  • March 18, 2026
Read More
  • Wired

Pakistan Inaugurates First Solar Panel Testing Laboratory Established With South Korean Support

  • Press Desk
  • March 18, 2026
Read More
  • Wired

Yango Ride Becomes First Ride-Hailing Platform To Receive Transport Network Company Operating License From Punjab Transport Authority

  • Press Desk
  • March 18, 2026
Read More
  • Wired

Instagram Tests Clickable Links In Post Captions For Meta Verified Creators With A Monthly Cap

  • Press Desk
  • March 16, 2026
Trending Posts
  • Government Promotes Secure Communication Platform Beep For Digital Governance
    • March 21, 2026
  • LUMS Secures Gates Foundation Grant To Establish Pakistan’s First National Artificial Intelligence Health Hub
    • March 21, 2026
  • Micron Technology Warns Of Capital Spending Exceeding USD 25 Billion This Fiscal Year Despite Strong Memory Chip Sales
    • March 21, 2026
  • Pakistan’s 5G Spectrum Auction: Ufone Enters 5G Era With Largest Share Of 3500 MHz Spectrum As MergeCo Eyes Biggest Portfolio In Pakistan
    • March 21, 2026
  • PITB Conducts Two-Day IT Training Programme For Balochistan Police On Smart Policing And AI Tools
    • March 21, 2026
about
CWPK Legacy
Launched in 1967 internationally, ComputerWorld is the oldest tech magazine/media property in the world. In Pakistan, ComputerWorld was launched in 1995. Initially providing news to IT executives only, once CIO Pakistan, its sister brand from the same family, was launched and took over the enterprise reporting domain in Pakistan, CWPK has emerged as a holistic technology media platform reporting everything tech in the country. It remains the oldest continuous IT publishing brand in the country and in 2025 is set to turn 30 years old, which will be its biggest benchmark and a legacy it hopes to continue for years to come. CWPK is part of the SPIN/IDG Wakhan media umbrella.
Read more
Explore Computerworld Sites Globally
  • computerworld.es
  • computerworld.com.pt
  • computerworld.com
  • cw.no
  • computerworldmexico.com.mx
  • computerwoche.de
  • computersweden.idg.se
  • computerworld.hu
Content from other IDG brands
  • PCWorld
  • Macworld
  • Infoworld
  • TechHive
  • TechAdvisor
CW Pakistan CW Pakistan
  • CWPK
  • CXO
  • DEMO
  • WALLET

CW Media & all its sub-brands are copyrighted to SPIN-IDG Wakhan Media Inc., the publishing arm of NCC-RP Group. This site is designed by Crunch Collective. ©️1995-2026. Read Privacy Policy.

Input your search keywords and press Enter.