Google has recently introduced Whisk, an artificial intelligence tool that lets users upload photographs and receive an AI-generated image that merges them, without typing a single word. The tool is designed to give users creative inspiration quickly, without the complexity of traditional image editing software.
Users provide photographs of subjects, settings, and styles, and Whisk combines them into a single new image. Unlike traditional image editing tools, Whisk is meant to be fun and creative rather than professional editing software. Google describes Whisk as a “creative tool” that lets users explore visual concepts quickly, rather than make pixel-perfect edits.
Companies like Google and OpenAI are racing to develop consumer products that showcase the latest AI technology. Despite the exciting possibilities that AI presents, there are concerns about the dangers of unchecked AI growth for humanity. The introduction of tools like Whisk and OpenAI’s Dall-E, a text-to-image generator, has led to an influx of AI-generated artwork across social media platforms and consumer products.
Whisk builds upon the success of text-to-image generators like Dall-E by allowing users to combine different categories of input to create unique images. Users can also steer the details of the image with words, but text input is not necessary. Whisk is aimed at rapid visual exploration, enabling users to remix subjects, scenes, and styles in new ways.
Google acquired DeepMind in 2014 and has used its generative AI technology to develop Whisk. The tool relies on Google’s primary AI service, Gemini, which was introduced in December 2023, and on DeepMind’s latest text-to-image generator, Imagen 3. When users upload photographs, Gemini writes captions for them and Imagen 3 receives those captions, so the result captures the essence of the subject rather than reproducing it exactly.
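For readers who want to picture that hand-off, the flow can be sketched in a few lines of Python. This is an illustration only, not Google's implementation: the names `WhiskInputs`, `caption_image`, `build_prompt`, and `generate_image` are hypothetical, and the two model steps are replaced by placeholder functions that call no real Gemini or Imagen service.

```python
from dataclasses import dataclass


@dataclass
class WhiskInputs:
    """Hypothetical container for the three photo inputs and optional text."""
    subject_photo: str
    scene_photo: str
    style_photo: str
    extra_text: str = ""  # optional; the article notes text is not required


def caption_image(photo_path: str, role: str) -> str:
    # Placeholder for the captioning step (Gemini, per the article).
    # A real system would describe the photo; here we return a labeled stub.
    return f"[caption of {role} in {photo_path}]"


def build_prompt(inputs: WhiskInputs) -> str:
    # Turn each photo into a caption, then join the captions into one prompt.
    parts = [
        "Subject: " + caption_image(inputs.subject_photo, "subject"),
        "Scene: " + caption_image(inputs.scene_photo, "scene"),
        "Style: " + caption_image(inputs.style_photo, "style"),
    ]
    if inputs.extra_text:
        parts.append("Details: " + inputs.extra_text)
    return "\n".join(parts)


def generate_image(prompt: str) -> dict:
    # Placeholder for the image-generation step (Imagen 3, per the article).
    # It only echoes the prompt it would have been given.
    return {"model": "image-generator (placeholder)", "prompt": prompt}


if __name__ == "__main__":
    inputs = WhiskInputs("dog.jpg", "beach.jpg", "watercolor.jpg")
    print(generate_image(build_prompt(inputs))["prompt"])
```

Because the generator in this sketch sees only captions and never the photographs themselves, it illustrates why the output can capture the gist of a subject while differing in specifics.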
The final image created by Whisk may differ from the uploaded photographs in certain respects, such as height, haircut, and skin tone. Google has faced criticism in the past for historically inaccurate images generated by its AI technology. Whisk is currently available as a US-only website developed by Google Labs and is still in the early stages of development.
In a move to stay competitive in the consumer product market, OpenAI recently unveiled Sora, a text-to-video generator. This demonstrates the growing focus on AI technology in the industry. According to Dan Ives, managing director and senior equities analyst at Wedbush Securities, Whisk is a significant milestone for Google in the field of AI and technology. He views AI products as a key part of Google’s future product lineup, alongside a new Android operating system developed in collaboration with Samsung and Qualcomm.
Whisk offers users a new way to explore visual concepts and create unique images. By pairing its latest AI models with a simple interface, Google aims to provide a platform for rapid visual exploration and creativity. As AI continues to evolve, tools like Whisk are opening new possibilities in digital art and design.