Microsoft launches Phi-4-mini-flash-reasoning for fast, on-device AI

By Kaanchi Chawla - 10 July 2025

Microsoft has launched Phi-4-mini-flash-reasoning, a compact 3.8B-parameter AI model built for rapid, on-device logical reasoning. Optimized for low-latency use cases such as mobile apps and edge deployments, it delivers up to 10x higher throughput and 2–3x lower latency than its predecessor, Phi-4-mini-reasoning.

Key to its performance is the new “SambaY” architecture, a hybrid that combines Mamba state-space models, sliding-window attention, and Gated Memory Units (GMUs) that share representations across layers. The design keeps prefill time linear in sequence length while replacing some of the heavier attention reads with lightweight element-wise memory operations, boosting efficiency on long-context tasks (up to 64k tokens).
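To make the gating idea concrete, the sketch below shows one way an element-wise gated memory read can stand in for a full attention read. This is a hypothetical PyTorch illustration; the projections, activation, and dimensions are assumptions for exposition, not Microsoft's SambaY implementation.

```python
import torch
import torch.nn as nn

class GatedMemoryUnit(nn.Module):
    """Illustrative gated memory read (an assumption, not the SambaY code):
    the current layer's hidden state gates a memory state shared from an
    earlier layer, element-wise, instead of attending over it."""

    def __init__(self, d_model: int):
        super().__init__()
        self.gate_proj = nn.Linear(d_model, d_model)
        self.out_proj = nn.Linear(d_model, d_model)

    def forward(self, hidden: torch.Tensor, memory: torch.Tensor) -> torch.Tensor:
        # Element-wise gating costs O(seq_len * d_model), versus the
        # O(seq_len^2 * d_model) of a full attention read over the memory.
        gate = torch.sigmoid(self.gate_proj(hidden))
        return self.out_proj(gate * memory)

# Toy usage: batch of 2 sequences, 16 tokens, hidden size 64.
gmu = GatedMemoryUnit(64)
hidden = torch.randn(2, 16, 64)   # current layer's representation
memory = torch.randn(2, 16, 64)   # representation shared from an earlier layer
print(gmu(hidden, memory).shape)  # torch.Size([2, 16, 64])
```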


Benchmarks show Phi-4-mini-flash-reasoning matching or outperforming larger models on structured math-reasoning tasks such as AIME24/25 and Math500, while delivering faster responses when served through inference frameworks such as vLLM.
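For readers who want to try it, a minimal vLLM serving script might look like the following. The repo ID matches the model's Hugging Face listing, but the sampling settings are illustrative assumptions, so check the model card before running.

```python
from vllm import LLM, SamplingParams

# Model ID as listed on Hugging Face; verify against the model card.
llm = LLM(model="microsoft/Phi-4-mini-flash-reasoning", trust_remote_code=True)

params = SamplingParams(temperature=0.6, max_tokens=512)
outputs = llm.generate(["Solve step by step: if 3x + 5 = 20, what is x?"], params)
print(outputs[0].outputs[0].text)
```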

As part of Microsoft’s responsible AI commitments, the model’s post-training incorporates safety techniques such as supervised fine-tuning (SFT), direct preference optimization (DPO), and reinforcement learning from human feedback (RLHF). It is available via Azure AI Foundry, Hugging Face, and the NVIDIA API Catalog.
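Loading the checkpoint locally from Hugging Face can be done with the standard transformers APIs. The snippet below is a sketch assuming the published checkpoint follows the usual chat-template conventions; treat the dtype and device settings as provisional choices for constrained hardware.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/Phi-4-mini-flash-reasoning"  # verify on Hugging Face
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,   # half precision for tighter memory budgets
    device_map="auto",
    trust_remote_code=True,
)

# Build a chat-formatted prompt and generate a short answer.
messages = [{"role": "user", "content": "What is the derivative of x**3 + 2*x?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```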

Meanwhile, Hugging Face released SmolLM3, a 3B-parameter model with a 128k-token context length, multilingual support, and strong reasoning benchmark results, underscoring the growing momentum behind high-performance small models suited to on-device use.

 
