Meta Platforms Inc is planning to introduce enhanced voice features in its upcoming Llama 4 artificial intelligence model, aiming to pivot towards conversational AI-powered agents in the coming weeks, according to a Financial Times report.
Meta enhances voice features in Llama 4 AI model
The new features in Llama 4 reflect Meta’s strategy to create a more interactive dialogue system, allowing users to interrupt and engage in a natural conversation rather than adhering to a strict question-and-answer format. This initiative aligns with CEO Mark Zuckerberg’s vision to establish the company as a leader in artificial intelligence technology.
Meta has announced plans to trial premium subscriptions for its AI assistant, Meta AI, which will facilitate agentic tasks such as booking reservations and video creation. The company is also contemplating the introduction of paid advertising or sponsored posts within the search results of its AI assistant.
Meta plans to invest up to $65 billion by 2025, primarily focusing on data center infrastructure and enhancing its AI capabilities. This commitment follows Zuckerberg’s remarks during the fourth-quarter earnings call, where he highlighted the significant market potential of the AI engineering agent designed to emulate the coding and problem-solving skills of a mid-level engineer.
Industry competition and product features
Competing with other major players like OpenAI, Microsoft, and Google, Meta is intensifying its efforts in AI as it prepares for what Zuckerberg has termed a “make-or-break” year in 2025. The discussion surrounding Llama 4 also encompasses the implementation of guardrails for the model’s outputs, a topic ongoing amid concerns about political bias and ethical AI usage.
Chris Cox, Meta’s chief product officer, explained that Llama 4 would operate as an “omni model,” facilitating a process where speech is native rather than requiring conversion between voice and text formats. He emphasized the transformative nature of directly conversing with the internet.
OpenAI released its voice functionality last year, focusing on creating distinct personalities for their models. Meanwhile, xAI’s Grok 3 has introduced its voice features with fewer restrictions. Last year, Meta launched a less restricted version of its Llama model following criticism regarding the overly cautious responses from Llama 2.
Voice interaction plays a significant role in Meta’s Ray-Ban smart glasses, which have gained popularity among consumers. The company is advancing its plans to create lightweight headsets that could replace smartphones as the primary computing device for users.
Featured image credit: Meta








