In early January 2025, tech giant Google will roll out a significant update for its Google Drive mobile app users, introducing an automatic editing feature for the app’s built-in scanner. This update promises to streamline the process of capturing and enhancing digital copies of important documents, including bills, identification cards, and more.
At present, Google Drive users can scan documents using their mobile devices, but editing these scans requires manual adjustments, such as tweaking filters and adjusting image levels. With the new update, however, Google Drive will automatically optimize scanned images, eliminating the need for users to make manual enhancements. The new auto-filter feature will automatically improve scans, delivering sharper, brighter, and more readable document versions with minimal user input.
Using the feature is simple. Users just need to tap the “+ New” button located at the bottom-right of the screen, select “Scan,” and grant the app access to their camera. After scanning a document, a sparkle icon will appear in the preview mode, indicating that the auto-enhancer tool is ready. This tool will then adjust the white balance, eliminate shadows, boost contrast, sharpen details, and optimize lighting for a more polished final result.
Google has confirmed that this update will be available to all Google Drive users, including those with free personal accounts. The feature will first be available on Android devices starting January 6, 2025.
The goal of this update is to simplify document scanning while enhancing the overall user experience, making it easier to store clear, high-quality digital versions of important documents directly within Google Drive.
In other news, Google has also introduced a groundbreaking generative AI experiment called Whisk, designed to revolutionize creative workflows. Unlike traditional image generation tools that rely on text-based prompts, Whisk allows users to drag and drop images representing the subject, scene, and style they wish to create. By remixing these images, users can produce entirely original visuals.
Powered by Google’s Gemini model, Whisk automatically generates detailed captions based on the uploaded images. These captions are then processed by Google’s Imagen 3, the company’s latest image generation model. The focus of Whisk is on capturing the essence of the subject, rather than attempting to replicate it exactly, offering a fresh approach to creative expression.