Gemini 2.5 Computer Use
Introducing the Gemini 2.5 Computer Use model, a specialized agent model built on top of Gemini 2.5 Pro’s visual reasoning capabilities, designed to interact directly with user interfaces (UIs). It is exposed via a new computer-use tool in the Gemini API, with inputs that include the user’s request, a screenshot of the UI environment, and a history of recent actions. The model generates function calls corresponding to UI actions like clicking, typing, or selecting, and may request user confirmation for higher-risk tasks. After each action is executed, a new screenshot and URL are fed back into the model to continue the loop until the task completes or is halted. It is optimized primarily for web browser control and shows promise for mobile UI interaction, though it is not yet suited for desktop OS-level control. In benchmarks across web and mobile control tasks, Gemini 2.5 Computer Use outperforms leading alternatives, delivering high accuracy at lower latency.
Learn more
Cisco AI Canvas
The Agentic Era marks a transformative shift from traditional application-centric computing to a new frontier defined by agentic AI, autonomous, context-aware systems capable of acting, learning, and collaborating within complex, dynamic environments. These intelligent agents don’t just respond to commands; they perform complete tasks, retain memory and context via large language models tailored for specific domains, and can scale across industries into the tens of millions. This evolution brings the need for a new operational mindset, AgenticOps, and a reimagined management interface built around three guiding principles, keeping humans thoughtfully in the loop to provide creativity and judgment, enabling agents to operate across siloed systems with cross-domain context, and deploying purpose-built models fine-tuned for their distinct tasks. Cisco brings this to life through AI Canvas, the industry’s first generative, shared workspace driven by a multi-data, multi-agent architecture.
Learn more
Naptha
Naptha is a modular AI platform for autonomous agents that empowers developers and researchers to build, deploy, and scale cooperative multi‑agent systems on the agentic web. Its core innovations include Agent Diversity, which continuously upgrades performance by orchestrating diverse models, tools, and architectures; Horizontal Scaling, which supports collaborative networks of millions of AI agents; Self‑Evolved AI, where agents learn and optimize themselves beyond human‑designed capabilities; and AI Agent Economies, which enable autonomous agents to generate useful goods and services. Naptha integrates seamlessly with popular frameworks and infrastructure, LangChain, AgentOps, CrewAI, IPFS, NVIDIA stacks, and more, via a Python SDK that upgrades existing agent frameworks with next‑generation enhancements. Developers can extend or publish reusable components on the Naptha Hub, run full agent stacks anywhere a container can execute on Naptha Nodes.
Learn more
wave
wave is a next-generation AI agent designed to handle complex tasks with human-like understanding and reasoning. Our mission is to save you time and enhance your productivity. Built with advanced language models and specialized tools, wave can perform research, create content, and assist with a wide range of tasks. wave is a powerful modular AI agent system that brings tasks to life. Users report saving up to 87% of their research time by leveraging wave's autonomous research capabilities. Access a comprehensive ecosystem of over 30 specialized AI agents working together to solve complex problems. Get answers and actionable insights 5 times faster than using traditional research methods. wave's specialized modules work together seamlessly to tackle complex tasks that would overwhelm a single model approach. wave remembers your preferences and previous interactions, creating a personalized experience that gets better over time.
Learn more