Microsoft Introduces LAM, AI That Performs Complex Tasks Independently By Elets News Network - 06 January 2025

Microsoft AI research

Microsoft researchers have introduced a Large Action Model (LAM), an AI model designed to operate Windows programs independently. As observed, large language models (LLMs) are driving rapid developments in AI by enabling functions like chatbots, text generation, and code writing. While LLMs excel at understanding and generating text, they face limitations in performing tasks in real-world environments.

Large Action Models (LAMs) represent a step forward in AI by allowing systems to carry out complex tasks based on human instructions. They signify a transition from AI systems that process and generate text to those capable of executing real-world actions.


What are LAMs?

Traditional AI models mainly handle text-related tasks, but LAMs go beyond this by translating user requests into actionable steps. These tasks can include operating software or controlling devices. While the concept of AI performing actions is not new, LAM is the first model specifically trained to work with Microsoft Office applications. The concept gained attention in early 2024, particularly after Rabbit’s AI device demonstrated the ability to interact with mobile applications without user input.

LAMs can process inputs like text, voice, or images and convert them into detailed, actionable plans. They can also adapt their approach in real time based on feedback. In simple terms, LAMs are AI systems designed not just to interpret requests but to perform them.

According to the research paper Large Action Models: From Inception to Implementation, these models interact with both digital and physical environments. For instance, instead of asking AI for instructions to create a PowerPoint presentation, a user could direct the AI to open the application, create slides, and format them as needed. LAMs combine three key functions: interpreting user commands accurately, planning actionable steps, and adjusting actions dynamically based on environmental feedback.

Also Read :- Kevan Parekh Assumes Office as Apple’s CFO

How are LAMs built?

Creating a LAM involves a more complex process than LLMs, with five stages of development. These models require two types of data. The first is task-plan data, which includes high-level steps for tasks like opening a document or formatting text. The second is task-action data, which specifies detailed, executable actions. Training involves supervised fine-tuning, reinforcement learning, and imitation learning. Before deployment, LAMs are tested in controlled settings and integrated with systems like Windows GUI agents to ensure compatibility with different environments. Final testing in live scenarios evaluates their adaptability and performance.

LAMs represent a shift in AI, from text-based applications to action-driven systems. They have potential uses ranging from automating workflows to assisting individuals with disabilities. As the technology progresses, LAMs could become standard AI solutions across industries.

Be a part of Elets Collaborative Initiatives. Join Us for Upcoming Events and explore business opportunities. Like us on Facebook , connect with us on LinkedIn and follow us on Twitter.

"Exciting news! Elets technomedia is now on WhatsApp Channels Subscribe today by clicking the link and stay updated with the latest insights!" Click here!

Related News