Perform complex tasks with a simple text command. That is the promise of AI-driven GUI automation according to new research from Microsoft and academic partners. This technology enables AI to operate software as if it were a human, from filling out forms to navigating applications.
Major tech companies embrace AI automation
Companies like Microsoft, Google, and Anthropic are actively developing AI assistants that control software through text commands. Microsoft's Copilot and Power Automate are examples of systems that convert natural language into actions. Google is working on 'Project Jarvis', which performs web-based tasks such as online shopping and travel planning.
Analysts predict that the market for GUI automation will grow from $8.3 billion in 2022 to $68.9 billion in 2028. This presents opportunities for companies looking to automate repetitive tasks and make software more accessible to non-technical users.
Opportunities and obstacles for businesses
Despite the advantages, there are challenges. Privacy risks, performance, and reliability have not yet been fully addressed. The research highlights the need for stricter security measures, more efficient models, and standardized evaluation processes. This way, companies can implement AI safely and effectively.
A new way of working
The rise of AI-driven GUI automation is a milestone in software usage. By 2025, it is expected that 60% of large companies will be experimenting with this technology. Although there is still work to be done, this development marks an important step towards more efficient and user-friendly software interaction.

