Technology giant Apple has broken its silence on artificial intelligence by introducing MM1, its new family of multimodal large language models (LLMs).
MM1, which handles complex tasks such as image captioning, visual question answering, and natural language inference, is seen as an important development in the world of artificial intelligence.
What is MM1?
As I mentioned above, MM1 is a multimodal large language model designed to caption images, answer visual questions, and perform natural language inference. It aims to handle complex tasks by combining text and visual data. Apple researchers report that MM1 delivers markedly better results than other published pre-training results.
Technical specifications of MM1
Scaling up to 30 billion parameters, MM1 stands out as a model family that can process image and text data together. Trained on a mixture of data types, including image-caption pairs, interleaved image-text documents, and text-only data, MM1 has a more comprehensive information-processing capability.
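Apple has not released MM1’s code or weights, so the following is only a minimal sketch of the general pattern such multimodal models follow: a vision encoder turns an image into patch embeddings, which are interleaved with text token embeddings and fed through a single transformer backbone. Every module name and size below is an illustrative assumption, not Apple’s actual design.

```python
import torch
import torch.nn as nn

class ToyMultimodalLM(nn.Module):
    """Illustrative interleaved image-text model (NOT Apple's MM1)."""

    def __init__(self, vocab_size=32000, d_model=512):
        super().__init__()
        self.token_emb = nn.Embedding(vocab_size, d_model)
        # Stand-in vision encoder: projects flattened 14x14 RGB patches
        # into the same embedding space as the text tokens.
        self.vision_proj = nn.Linear(3 * 14 * 14, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True)
        # Transformer stack standing in for the LLM backbone
        # (no causal mask here, for brevity).
        self.backbone = nn.TransformerEncoder(layer, num_layers=2)
        self.lm_head = nn.Linear(d_model, vocab_size)

    def forward(self, image_patches, token_ids):
        # image_patches: (batch, n_patches, 3*14*14); token_ids: (batch, seq)
        img_emb = self.vision_proj(image_patches)   # (batch, n_patches, d_model)
        txt_emb = self.token_emb(token_ids)         # (batch, seq, d_model)
        # "Interleaving" here is simply image-then-text; real interleaved
        # corpora mix multiple images between spans of text.
        seq = torch.cat([img_emb, txt_emb], dim=1)
        return self.lm_head(self.backbone(seq))

model = ToyMultimodalLM()
patches = torch.randn(1, 16, 3 * 14 * 14)          # fake image patches
tokens = torch.randint(0, 32000, (1, 12))          # fake caption tokens
print(model(patches, tokens).shape)                # torch.Size([1, 28, 32000])
```

The point of training on mixed data, as the paper describes, is that the same backbone sees captions, interleaved documents, and plain text, which is what lets one model cover captioning, visual question answering, and text-only inference.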
The development of MM1 also signals the importance Apple attaches to artificial intelligence. Apple, which is working on an LLM framework codenamed “Ajax” and has made moves such as acquiring the startup DarwinAI, sees artificial intelligence and machine learning as core technologies. The company plans to share the details of its work in this area in 2024 and to make an AI-focused presentation at the WWDC developer conference in June.
Apple’s MM1 is considered an important step forward in the field of multimodal LLMs and a clear sign that the company is ending its silence on AI. MM1’s development will contribute to further advances in areas such as visual data processing and natural language understanding.
Featured image credit: Sumudu Mohottige / Unsplash