Microsoft has introduced OpenAI’s gpt-oss open-weight language models, gpt-oss-120b and gpt-oss-20b, to its Azure AI Foundry and Windows AI Foundry platforms this week. This expansion aims to provide developers with greater flexibility and control over AI implementation.
The gpt-oss-120b model is designed for high-performance reasoning tasks, while the more compact gpt-oss-20b model functions on PCs equipped with GPUs featuring at least 16GB of memory. Developers are granted full access to the models’ weights, enabling fine-tuning for specific applications, offline use, or industry-specific assistants. This open-weight access also facilitates model review, partial retraining, and export for deployment on Microsoft’s Azure Kubernetes Service (AKS) or local machines.
Azure AI Foundry supports these models with tools for evaluation, fine-tuning, and deployment, boasting a catalog of over 11,000 models. Additionally, Foundry Local provides on-device support for local inference, catering to needs requiring enhanced security or offline capabilities.
The gpt-oss-20b model is currently available on Windows, with macOS compatibility planned for the near future. Both models will be integrated with the common responses API. Microsoft emphasizes that this initiative offers businesses and developers more transparency and options for managing AI across diverse environments, including cloud, on-device, and edge computing.








