
Google releases Gemma, a new AI model designed with AI researchers in mind

Google is building on the success of its Gemini launch with the release of a new family of lightweight AI models called Gemma. The Gemma models are open and are designed to be used by researchers and developers to innovate safely with AI. 

“We believe the responsible release of LLMs is critical for improving the safety of frontier models, for ensuring equitable access to this breakthrough technology, for enabling rigorous evaluation and analysis of current techniques, and for enabling the development of the next wave of innovations,” the researchers behind Gemma wrote in a technical report.  

Along with Gemma, Google is also releasing a new Responsible Generative AI Toolkit that includes capabilities for safety classification and debugging, as well as Google’s best practices for developing large language models.

Gemma comes in two model sizes: 2B and 7B. They share many of the same technical and infrastructure components as Gemini, which Google says enables Gemma models to “achieve best-in-class performance for their sizes compared to other open models.”

Gemma also provides integration with JAX, TensorFlow, and PyTorch, allowing developers to switch between frameworks as needed. 

The models can be run on a variety of device types, including laptops, desktops, IoT, mobile, and cloud. Google also partnered with NVIDIA to optimize Gemma for use on NVIDIA’s GPUs. 

Gemma has also been optimized for Google Cloud, which offers benefits such as one-click deployment and built-in inference optimizations. It is accessible through Google Cloud’s Vertex AI Model Garden, which now contains over 130 AI models, and through Google Kubernetes Engine (GKE).

According to Google Cloud, Gemma on Vertex AI could be used to support real-time generative AI tasks that require low latency, or to build apps that complete lightweight AI tasks like text generation, summarization, and Q&A.

“With Vertex AI, builders can reduce operational overhead and focus on creating bespoke versions of Gemma that are optimized for their use case,” Burak Gokturk, VP and GM of Cloud AI at Google Cloud, wrote in a blog post.

On GKE, the potential use cases include deploying custom models in containers alongside applications, customizing model serving and infrastructure configuration without needing to provision nodes, and integrating AI infrastructure quickly and in a scalable way. 

Gemma was designed to align with Google’s Responsible AI Principles. Its development used automatic filtering techniques to remove personal data from training sets, reinforcement learning from human feedback (RLHF) to align models with responsible behaviors, and manual evaluations that included red teaming, adversarial testing, and assessments of model capabilities for potentially bad outcomes.

Because the models were designed to promote AI research, Google is offering free credits to developers and researchers who want to use Gemma. The models can be accessed for free through Kaggle or Colab, and first-time Google Cloud users can get a $300 credit. Researchers can also apply for up to $500,000 in credits for their projects.

“Beyond state-of-the-art performance measures on benchmark tasks, we are excited to see what new use-cases arise from the community, and what new capabilities emerge as we advance the field together. We hope that researchers use Gemma to accelerate a broad array of research, and we hope that developers create beneficial new applications, user experiences, and other functionality,” the researchers wrote.

The post Google releases Gemma, a new AI model designed with AI researchers in mind appeared first on SD Times.



from SD Times https://ift.tt/UoTAyLI
