As Google releases its new generative AI model called Gemini, Adarsh takes you through everything we know about it so far.
Ever since ChatGPT made its debut, Google has been in a race to catchup. It’s initial response, Bard, was no match in comparison so they had to pool resources to put together an upgrades generative AI model that could take on ChatGPT.
To make this happen and power their AI research, the company combined their AI divisions Google Brain and Deepmind. And that is how Gemini was born.
According to an article published by Google to announce the launch, Gemini can generalize and seamlessly understand, operate across and combine different types of information, including text, images, audio, video and code. The model is being promoted as a significant advancement in natural language processing, with Google calling it “our largest science and engineering project ever.”
Available in 3 Models
Google Gemini is available in 3 models:
1. Gemini Ultra
2. Gemini Pro
3. Gemini Nano
Ultra is the largest and most powerful of the three and has been designed for highly complex tasks while Pro is more equipped to handle a wide variety of tasks. Nano is for Android users who want to build Gemini-powered apps. For instance, with Gemini Nano, people can now summarize recordings made using the Recorder app on the Pixel 8 Pro phone (but only in English).
In a recent press conference, Sissie Hsiao, vice president of Google’s AI chatbot, Bard, said Gemini Pro outperformed GPT-3.5 in six out of the eight industry benchmarks. Google’s most advanced model, Gemini Ultra, beat the newer GPT-4 in seven of the eight benchmarks.
What the Future holds
Google has been really bigging up the launch and has repeatedly hailed Gemini as the most capable and general-purpose AI it has developed so far. They also plan to expand the advanced version of this large language model (LLM) next year. The LLM is multimodal, which means that it can understand different types of information, including text, audio, images, and video.
The LLM runs on Google-made tensor processing units, or TPUs, specialized hardware designed for training AI models. But in the future, Gemini will be trained on both TPUs and graphics processing units (GPUs), said Amin Vahdat, vice president of Google’s Cloud AI, in a briefing. Nvidia makes the H100 GPU, a popular chip for powering generative AI products.
Bard gets an Update
Bard has got a long impending makeover too. It has now been upgraded with Gemini Pro, which gives the chatbot more advanced reasoning and understanding, among other capabilities.
According to Google, Bard will also be integrated with Gemini Ultra in 2024. In the coming months, the company will add Gemini across its other apps, including search, Google Ads, and the Chrome browser.
Inspired by AlphaGo
Gemini owes a debt to AlphaGo, which was developed by Google’s DeepMind and became the first computer program to defeat a professional human Go player. AI history was made back in 2016 when AlphaGo beat Lee Sedol, one of the world’s greatest Go players, at his own game.
The techniques used in AlphaGo has been combined with the technology that powers ChatGPT to create Gemini. In an interview in mid-2023, Demis Hassabis, the boss of DeepMind had explained this: “At a high level, you can think of Gemini as combining some of the strengths of AlphaGo-type systems with the amazing language capabilities of the large models.”
The Last Word
Considering the effort that has gone into Gemini and the way Google has been hyping it up, this is their next big bet and now that it has been officially launched, you can expect several additions and upgrades in the months to come.
Have you tried Gemini yet? What about it has impressed you the most? Let us know in the comments section.
In case you missed:
- Presenting Gemini Live, Google’s Response to ChatGPT Voice
- TalkBack, Circle To Search & 3 More Google Features added on Android
- Presenting Claude, ChatGPT’s Competitor which is apparently Better!
- Talk to ChatGPT with new AI Glasses
- Meta AI launches on WhatsApp & Insta in India
- Introducing Universal AI Employee being built by Ema AI
- How Accurate are AI Web Searches?
- Are you ready for the Apple Flip Phone? Get Ready for the Foldable iPhone!
- iPhone 16: Price, Release Date, Specs & More
- OpenAI finally unveils its Advanced Voice Assistant!