Google Launches Gemma 2: Transforming AI Capabilities for Global Development

Google DeepMind introduces Gemma 2, a new generation of open AI models designed to enhance performance and efficiency across diverse hardware platforms. Available in 9 billion and 27 billion parameter sizes, Gemma 2 aims to revolutionise AI deployment with its cost-effective, high-performance capabilities.

Google Launches Gemma 2: Transforming AI Capabilities for Global Development

Google DeepMind has launched Gemma 2, a new generation of open AI models that provide improved performance metrics and a collection of lightweight. This is an advanced open model derived from the same research and technology used to create the Gemini models. The Gemma family has expanded to include CodeGemma, RecurrentGemma, and PaliGemma, each designed for specific AI tasks and accessible through integrations with partners like Hugging Face, NVIDIA, and Ollama.

Gemma 2 is available in 9 billion (9B) and 27 billion (27B) parameter sizes, surpassing the first generation in both performance and efficiency during inference, while also incorporating significant safety enhancements. The 27B model provides competitive alternatives to models more than twice its size. This advanced capability is now possible on a single NVIDIA H100 Tensor Core GPU or TPU host, greatly reducing deployment costs.

Google launches Gemma 2 for redefining AI performance and efficiency

Gemma 2 represents AI architecture, crafted for exceptional performance and efficiency in inference. At 27 billion parameters, Gemma 2, offers performance that competes with models more than twice its size. Even the 9 billion parameter version of Gemma 2 surpasses other open models in its category, including Llama 3 8B. For detailed performance metrics, refer to the technical report.

The 27B Gemma 2 model is designed to execute inference tasks efficiently at full precision on a single Google Cloud TPU host, NVIDIA A100 80GB Tensor Core GPU, or NVIDIA H100 Tensor Core GPU. This capability significantly reduces deployment costs while maintaining high performance, making AI deployments more accessible and budget-friendly.

Gemma 2 is optimised for rapid inference across diverse hardware environments, from advanced gaming laptops and high-performance desktops to cloud-based setups. Users can experience Gemma 2 in full precision on Google AI Studio, and unlock local performance using the quantised version with Gemma.cpp on their CPU, or deploy it on personal devices such as NVIDIA RTX or GeForce RTX via Hugging Face Transformers.

Gemma 2 by Google enhances accessibility: Tailored for developers and researchers

Gemma 2 isn’t just more powerful; it’s designed to seamlessly integrate into user workflows:

Open and Accessible: Gemma 2 is available under the commercially friendly Gemma license to empower developers and researchers to share and commercialise their innovations with ease.

Wide Framework Compatibility: Gemma 2 is compatibile with major AI frameworks such as Hugging Face Transformers, JAX, PyTorch, and TensorFlow by native Keras 3.0, vLLM, Gemma.cpp, Llama.cpp, and Ollama. This facilitates an effortless integration with the preferred tools and workflows of users.

Gemma is optimised with NVIDIA TensorRT-LLM for running on NVIDIA-accelerated infrastructure or as an NVIDIA NIM inference microservice, with plans for optimisation for NVIDIA’s NeMo in the pipeline.

Effortless Deployment: Beginning next month, Google Cloud customers can effortlessly deploy and manage Gemma 2 on Vertex AI, streamlining the deployment process.

Gemma 2 models can also be fine tuned for specific tasks, including practical examples and recipes in the new Gemma Cookbook, designed to guide users through building applications.

Advancing responsible AI with Gemma 2

The Responsible Generative AI Toolkit by Google now includes the newly open-sourced LLM Comparator. This tool assists developers and researchers in conducting thorough evaluations of language models.

Users can utilise the companion Python library to perform comparative assessments using their models and data, visualising results within the application. Preparations are also underway to open source SynthID, a text watermarking technology tailored for Gemma models.

During the development of Gemma 2, Google committed to internal safety protocols. This included filtering pre-training data and subjecting the model to extensive testing and evaluation against a comprehensive range of metrics to detect and mitigate potential biases and risks. Google’s findings are openly shared across a wide array of public benchmarks concerning safety and representational harms.

Gemma 2 by Google empowering future innovation

The original Gemma release sparked over 10 million downloads and inspired numerous groundbreaking projects. Navarasa used Gemma to create a model that celebrates India’s rich diversity.

Google continues to explore new architectures and develop specialised variants of Gemma to address an expanded spectrum of AI challenges. This includes the upcoming 2.6 billion parameter Gemma 2 model, designed to further bridge the gap between lightweight accessibility and high-performance capabilities.

Gemma 2 is accessible on Google AI Studio, enabling users to test its full capabilities at 27 billion parameters without specific hardware requirements. Model weights for Gemma 2 can be downloaded from platforms like Kaggle and Hugging Face Models, with availability soon in the Vertex AI Model Garden.

To facilitate research and development access, Gemma 2 is also offered free of charge through Kaggle and by a free tier for Colab notebooks. Additionally, first-time Google Cloud customers may qualify for $300 in credits. Academic researchers can apply for the Gemma 2 Academic Research Program, offering Google Cloud credits to accelerate research efforts.

Pallavi Singal

Pallavi Singal is the Vice President of Content at ztudium, where she leads innovative content strategies and oversees the development of high-impact editorial initiatives. With a strong background in digital media and a passion for storytelling, Pallavi plays a pivotal role in scaling the content operations for ztudium’s platforms, including Businessabc, Citiesabc, and IntelligentHQ, Wisdomia.ai, MStores, and many others. Her expertise spans content creation, SEO, and digital marketing, driving engagement and growth across multiple channels. Pallavi’s work is characterised by a keen insight into emerging trends in business, technologies like AI, blockchain, metaverse and others, and society, making her a trusted voice in the industry.

Solana’s Expanding Reach in Consumer Apps

How Nigeria’s Property Market Could Become the Most Advanced in the…

Blockchain, AI, And Web3 Shaping The Future Of Art And Creativity:…

6 Ways Blockchain Technology is Making Real Estate Investment More Accessible…

How Decentralized Technology Is Enhancing Data Protection

Best AI Design Tools for Efficient and Creative Work

Learn about the top career opportunities in coding in 2025

How AI-Powered Life Insurance Quoting Tools Are Transforming the Industry

8 Game-Changing Innovations Reshaping Translation and Localization

Mistral AI Introduces High-Performance OCR API For Accurate And Scalable Document…

What Are the Accreditation Standards for Pediatric Life Support Courses?

Best AI Design Tools for Efficient and Creative Work

How AI-Powered Life Insurance Quoting Tools Are Transforming the Industry

Leveraging Cutting-Edge IoT Solutions to Enhance Building Efficiency

Saving with FieldBee PowerGuide: how precision guidance reduces costs

What Are the Accreditation Standards for Pediatric Life Support Courses?

The Futuristic view of Sociology as a Social Science

Exploring the Benefits of Joining First Technology Federal Credit Union

Exploring Ohio Family Jobs and Services: Opportunities for Every Household

Transforming Patient Care: The Impact of Blockchain in Healthcare

Google Launches Gemma 2: Transforming AI Capabilities for Global Development

Google launches Gemma 2 for redefining AI performance and efficiency

Gemma 2 by Google enhances accessibility: Tailored for developers and researchers

Advancing responsible AI with Gemma 2

Gemma 2 by Google empowering future innovation

MOST POPULAR

How To Think Like A Master Content Strategist

How to play online casino games with Bitcoin?

Amrita Sen And Dinis Guarda Launch New Youtube Podcast Series “Music...

The Ultimate Guide to SBA Loans is Here

OUR NETWORK