MiniMax has released MMX-CLI, a Node.js-based command-line interface for its AI platform, enabling developers and AI agents to access generative capabilities directly.
MMX-CLI addresses challenges faced by large language model (LLM)-based agents, which struggle to generate media without additional integration layers. The new interface allows users to invoke various media generation functions directly as shell commands, eliminating the need for model context protocols.
The interface integrates MiniMax’s capabilities into seven command groups: mmx text, mmx image, mmx video, mmx speech, mmx music, mmx vision, and mmx search. Each command provides specific functionalities tailored for developers and AI applications.
The mmx text command supports multi-turn chat and targets specific MiniMax model variants. It defaults to using MiniMax-M2.7, allowing for advanced text generation tasks. The mmx image command enables image generation from text prompts, providing options for aspect ratio and subject consistency across multiple outputs. This feature aids workflows requiring visual continuity. The mmx video command utilizes MiniMax-Hailuo-2.3 for video generation, with options for synchronous polling or asynchronous task tracking. Users can specify an opening frame for video generation using the command’s flags.
MMX-CLI’s mmx speech command features over 30 text-to-speech voices. It allows users to adjust parameters like speed, volume, and pitch while supporting subtitle timing. The input is limited to 10,000 characters.
With the mmx music command, users can generate music from text inputs and define various compositional parameters like genre and instrumentation. Options for embedding watermarks are also included. The mmx vision command interprets images using a vision-language model, thus allowing users to inquire about the content of images through prompts.
The mmx search command facilitates web searches, returning results in either text or JSON format. Both the command structure and underlying technical architecture are optimized for usability across various environments. MMX-CLI is primarily developed in TypeScript and is compatible with Node.js 18+. Its deployment is straightforward, featuring structured error handling and support for dual-region API routing.
MiniMax stated that MMX-CLI significantly reduces integration barriers for AI development by providing a unified tool for diverse media generation capabilities. This, combined with easy setup for AI agents, streamlines the process of building intelligent applications.








