What should I do next in practice?

Pienin E2B malli voidaan nyt ajaa mobiililaitteilla jopa noin 1 Gt:n muistilla, mikä avaa oven tehokkaalle tekoälylle älypuhelimissa suoraan laitteella [5][12][13].

← Back to Trending

AnswersPublished4 days agoLast edited 2 days ago26 sources

Googlen uudet Gemma 4 QAT -mallit tuovat huipputekoälyn älypuhelimeesi – muistinkulutus romahti 72 %

Google julkaisi viralliset QAT tarkistuspisteet Gemma 4 malliperheelle, joka koostuu viidestä eri koosta: E2B, E4B, 12B, 26B A4B ja 31B [1][4][5]. QAT tekniikan ansiosta 4 bittiset mallit käyttävät noin 72 % vähemmän muistia säilyttäen silti lähes alkuperäisen suorituskyvyn [5].

Search & fact-check with Studio Global AI Browse more Trending pages

281K0

Google Gemma 4 QAT model compression unlocking mobile and consumer GPU deployment illustrated as a large neural network being compressed efficiently into a smartphone. — What are the key details of Google's June 4 release of Gemma 4 QAT models, including their quantization approach, supported model sizes andGoogle's QAT checkpoints compress Gemma 4 models by roughly 72%, enabling deployment on hardware from smartphones to consumer GPUs.
AI Prompt
Create a landscape editorial hero image for this Studio Global article: What are the key details of Google's June 4 release of Gemma 4 QAT models, including their quantization approach, supported model sizes and. Article summary: Google provides official Quantization-Aware Training (QAT) checkpoints for Gemma 4, and the Gemma 4 lineup includes E2B, E4B, 12B, 26B A4B, and 31B sizes [1][4][5]. Here are the key details.. Topic tags: general, documentation, general web, user generated. Reference image context from search candidates: Reference image 1: visual subject "# What Is Google Gemma 4? Google Gemma 4 is the most capable open model family from DeepMind yet, shipping four sizes under Apache 2.0 with multimodal input, native reasoning, and" source context "What Is Google Gemma 4? Architecture, Benchmarks, and Why It ..." Reference image 2: visual subject "# What Is Google Gemma 4? Google
openai.com

Google on ottanut merkittävän askeleen tekoälyn demokratisoinnissa julkaisemalla 5. kesäkuuta 2026 viralliset Quantization-Aware Training (QAT) -tarkistuspisteet koko Gemma 4 -malliperheelle . Käytännössä tämä tarkoittaa, että massiiviset, aiemmin tehokkaita palvelimia vaatineet kielimallit voidaan nyt kutistaa toimimaan sujuvasti omalla läppärilläsi tai jopa älypuhelimellasi – ja mikä parasta, laatu säilyy lähes ennallaan.

Mikä on QAT ja miksi se on pelin muuttaja?

Perinteinen kvantisointi (Post-Training Quantization eli PTQ) toimii vähän kuin pakkaisi valmiin valokuvan pienempään tiedostokokoon – tiedosto pienenee, mutta kuvanlaatu usein kärsii . QAT puolestaan on kuin opettaisi valokuvaajaa ottamaan kuvan suoraan optimaalisessa, pakatussa muodossa. Quantization-Aware Training simuloi kvantisointiprosessia jo mallin koulutusvaiheessa, jolloin malli oppii kompensoimaan tarkkuuden menetyksen .

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Malli	Tyyppi	4-bittinen muistinkulutus	Säästö vs. BF16
E2B	Tiheä, 2.3B efektiivistä parametria	~3.2 Gt	~72 % QAT-tyylisellä 4-bittisellä
E4B	Tiheä, 4.5B efektiivistä parametria	~5 Gt	~72 % QAT-tyylisellä 4-bittisellä
12B	Tiheä, yhtenäinen teksti/kuva/ääni -malli	~7 Gt	~72 % QAT-tyylisellä 4-bittisellä
26B A4B	Asiantuntijasekoitus (MoE), ~3.8B aktiivista parametria	~15 Gt	~72 % QAT-tyylisellä 4-bittisellä
31B	Tiheä, 30.7B parametria	~17–20 Gt	~72 % QAT-tyylisellä 4-bittisellä

Googlen uudet Gemma 4 QAT -mallit tuovat huipputekoälyn älypuhelimeesi – muistinkulutus romahti 72 %

Mikä on QAT ja miksi se on pelin muuttaja?

Search, cite, and publish your own answer

People also ask

What is the short answer to "Googlen uudet Gemma 4 QAT -mallit tuovat huipputekoälyn älypuhelimeesi – muistinkulutus romahti 72 %"?

What are the key points to validate first?

What should I do next in practice?

Sources

Comments

Gemma 4 -mallisto ja muistivaatimukset

Saatavilla olevat formaatit: Valitse oikea työkalu

Mitä tämä tarkoittaa käytännössä eri laitteilla?

Tärkein varoituksen sana