उत्तरप्रकाशित2 माह पहलेLast edited पिछला माह19 स्रोत

Fractile कैसे हल करना चाहता है AI का बढ़ता हुआ Inference Bottleneck

यूके की AI चिप स्टार्टअप Fractile ने AI इन्फरेंस को तेज़ करने के लिए $220 मिलियन की Series B फंडिंग जुटाई है। कंपनी का डिज़ाइन कंप्यूटेशन को सीधे मेमोरी के भीतर करने पर आधारित है, जिससे डेटा मूवमेंट कम होकर latency और लागत घट सकती है। अगर यह तकनीक सफल होती है, तो बड़े reasoning मॉडल, रियल‑टाइम AI असिस्टेंट और agenti...

Studio Global AI के साथ खोजें और तथ्यों की जांच करें और ट्रेंडिंग पेज देखें

Concept illustration of AI inference hardware integrating memory and compute — How is UK AI chip startup Fractile addressing the growing AI inference bottleneck, what did its $220M Series B funding involve, why does theFractile is developing AI chips designed to perform computation directly within memory to reduce inference latency and cost.
AI संकेत
Create a landscape editorial hero image for this Studio Global article: How is UK AI chip startup Fractile addressing the growing AI inference bottleneck, what did its $220M Series B funding involve, why does the. Article summary: Fractile is attacking the inference bottleneck with specialized AI inference hardware that moves compute much closer to memory, rather than relying on conventional GPU designs that shuttle model data between separate com. Topic tags: general, general web, user generated. Reference image context from search candidates: Reference image 1: visual subject "# Fractile United Kingdom ## Why Fractile matters #### Summary Fractile has raised $220 million in a Series B funding round led by Accel, Factorial Funds, and Founders Fund, wi" source context "Fractile raised $200M | AI Chips | MapCo" Reference image 2: visual subject "Founded in 2022, Fractile aims to address t
openai.com

पिछले कुछ सालों में AI कंपनियाँ लगातार बड़े‑से‑बड़े मॉडल ट्रेन करने की दौड़ में लगी रही हैं। लेकिन अब उद्योग एक नए और अलग तरह के संकट का सामना कर रहा है—इन मॉडलों को वास्तविक उपयोग में तेज़, सस्ता और बड़े पैमाने पर चलाना।

लंदन स्थित स्टार्टअप Fractile इसी समस्या पर दांव लगा रही है। कंपनी ने हाल ही में $220 मिलियन की Series B फंडिंग जुटाई है ताकि ऐसे विशेष AI चिप्स विकसित किए जा सकें जो खास तौर पर इन्फरेंस (inference) के लिए बनाए गए हों—यानी वह चरण जब प्रशिक्षित AI मॉडल उपयोगकर्ताओं को जवाब देना शुरू करते हैं।

Fractile का मानना है कि आने वाले समय में AI की प्रगति केवल बेहतर मॉडल बनाने पर निर्भर नहीं होगी। असली चुनौती यह होगी कि वे मॉडल वास्तविक दुनिया में कितनी तेजी और कम लागत पर काम कर सकते हैं।

क्यों AI Inference बन रहा है असली बॉटलनेक

आज का अधिकतर AI हार्डवेयर—खासकर GPU—ट्रेनिंग के लिए अनुकूलित है। ट्रेनिंग में बड़े पैमाने पर गणितीय गणनाएँ होती हैं, जिन्हें GPU बेहद तेज़ी से कर सकते हैं। लेकिन जब मॉडल को उपयोग में लाया जाता है, तब वह इन्फरेंस मोड में काम करता है—जहाँ हर उपयोगकर्ता के प्रश्न पर लगातार नए टोकन (tokens) जनरेट होते हैं।

यह प्रक्रिया सिर्फ कंप्यूट पावर पर निर्भर नहीं करती। असली चुनौती बन जाती है:

मेमोरी बैंडविड्थ (डेटा कितनी तेज़ी से पढ़ा जा सकता है)
लेटेंसी (डेटा आने‑जाने में लगने वाला समय)

बड़े AI मॉडल हर टोकन बनाते समय अपने लाखों‑करोड़ों पैरामीटर और मध्यवर्ती डेटा को बार‑बार पढ़ते हैं। अगर हार्डवेयर मेमोरी से डेटा तेजी से नहीं ला पाता, तो तेज़ प्रोसेसर भी काम नहीं आते।

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Studio Global AI के साथ खोजें और तथ्यों की जांच करें

लोग पूछते भी हैं