Microsoft makes Phi-3 generally available, previews its Phi-3-vision multimodal small language model
Microsoft’s Phi-3 models are now generally available ahead of the AI PC era. The company also revealed its Phi-3-vision multimodal variant.
Microsoft is making its Phi-3 lightweight model family generally available to developers, nearly a month after first announcing it. Phi-3-medium, Phi-3-small, and Phi-3-mini are now available, with Phi-3-mini offered as part of Azure AI. The company is also showing off Phi-3-vision, a multimodal variant of the small model with 4.2 billion parameters.
Phi-3 for all
Developed by Microsoft Research, Phi-3 is a family of small language models designed to pack as much reasoning punch as far larger models at a significantly lower cost. It is the fourth generation of compact language models Microsoft has worked on: Phi-1 arrived a year ago, followed by Phi-1.5 and Phi-2.
Not every use case calls for a large language model. The push to run AI locally or on-device is leading developers to seek out smaller yet still capable options, and the field is growing: besides Phi-3, it includes Google’s Gemma 2 and Hugging Face’s Zephyr. And Microsoft didn’t build just one small model. Phi-3 comes in three sizes: Phi-3-mini with 3.8 billion parameters, Phi-3-small with 7 billion, and Phi-3-medium with 14 billion. The company has said the family performs on par with OpenAI’s GPT-3.5 in a far more lightweight form.
The timing of Phi-3’s general release is no coincidence, with the dawn of the AI PC coming soon. Developers can now use the different variants to bring their AI features to laptops, mobile devices, and wearables.
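For developers who want to experiment before wiring anything into Azure AI, a minimal sketch of running Phi-3-mini locally might look like the following. This assumes the checkpoint is published to Hugging Face under the ID microsoft/Phi-3-mini-4k-instruct and that the chat-template flow shown here matches it; adjust for your own deployment.

```python
# Sketch: run Phi-3-mini locally via Hugging Face transformers.
# The model ID below is an assumption; substitute your own checkpoint.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/Phi-3-mini-4k-instruct"  # assumed Hugging Face ID
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,   # 3.8B params fits on a single consumer GPU
    device_map="auto",
    trust_remote_code=True,
)

# Build a chat-formatted prompt and generate a short completion.
messages = [{"role": "user", "content": "In one sentence, why do small language models matter?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
# Decode only the newly generated tokens, not the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```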
What we know about Phi-3-vision
Besides releasing Phi-3, Microsoft is introducing a new model variant that supports general visual reasoning as well as chart, graph, and table reasoning. Called Phi-3-vision, it has 4.2 billion parameters. When implemented, users can ask questions about a chart or pose open-ended questions about a specific image.
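In practice, querying the model about an image could look like the sketch below. Everything here is an assumption about the preview: the Hugging Face ID microsoft/Phi-3-vision-128k-instruct, the <|image_1|> placeholder convention, and the example image URL are illustrative, not confirmed details of the release.

```python
# Sketch: ask Phi-3-vision an open-ended question about an image.
# Model ID, prompt format, and URL are assumptions for illustration.
import requests
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "microsoft/Phi-3-vision-128k-instruct"  # assumed Hugging Face ID
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto", trust_remote_code=True
)

# Hypothetical chart image; replace with your own file or URL.
image = Image.open(requests.get("https://example.com/chart.png", stream=True).raw)

# The <|image_1|> token marks where the image attaches in the prompt.
messages = [{"role": "user", "content": "<|image_1|>\nWhat trend does this chart show?"}]
prompt = processor.tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
inputs = processor(text=prompt, images=[image], return_tensors="pt").to(model.device)

outputs = model.generate(**inputs, max_new_tokens=100)
# Strip the prompt tokens and decode only the model's answer.
answer = processor.tokenizer.decode(
    outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True
)
print(answer)
```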
Incidentally, Google also debuted its own lightweight multimodal model last week at its developer conference. PaliGemma offers similar capabilities but, at 3 billion parameters, is slightly smaller than Microsoft’s version.
Having AI that can interpret multiple forms of input is valuable to developers, and if a model can deliver LLM-like performance at a fraction of the cost, it could drive broader adoption.
Though Phi-3-vision has been announced as a preview, Microsoft has not said when it will become generally available.