Corporate Sovereignty in the Age of AI
Washington demanded Anthropic switch off its best model for non-US users. But the AI stack offers surprising ways to maintain sovereignty away from AI models
Written on
Berkeley's second Agentic AI Summit drew Google, OpenAI, NVIDIA, IBM and frontier researchers for talks on where agents are heading, from Chi Wang's MassGen to the Linux Foundation's 'Internet of Agents'. So what does the near future of agentic AI actually hold for enterprises?
UC Berkeley (San Francisco) held their second Agentic AI summit in August. The bay area's most connected players (Google, OpenAI etc) and frontier researchers (Princeton, Berkeley etc) gave two days of presentations on where they see the near future of agentic AI.
So what can we learn from them? What is the near future of agentic AI?
There are 18hrs of video to catch up with, below are my highlights. My favourite being Chi Wang's MassGen for agents to achieve state of the art intelligence in a democratic team, and the Linux Foundation's AGNTCY,standards for the 'Internet of Agents'.
OpenAI's priorities for agents via API...
You.com moving into the 'Context as a Service' space. Important that he mentions 'simulations', evaluation of agent performance can often be done well in simulation, which Google demonstrated earleir this year with their AI CoScientist.

Agents are increasingly able to complete tasks over long time horizons, think of how DeepResearch works over 10minutes as opposed to ChatGPT 3.5 struggling to retain a conversation. This trend will continue and agents will work over tasks requiring hours of focus, but at multiples of the speed that humans would execute them.
However, to ensure their success we must provide the right information for the long task, i.e. 'context management'

Businesses drowning in options for implementing AI, but a strategy for success is harder to come by. I see this everywhere, although it is not surprising for such a new technology.

Org charts become dynamic when we have people and agents collaborate on tasks.
Important when rethinking how organisations will function when they are collaborations between people and AI.
In many ways the orchestration graph is already a truth in companies, its how people talk with each other. Reporting lines in org charts never captured the real conversation, but agentic workflows must explicitly replicate that flexibility.

Talked about training agents for specific jobs by using a technique common in Machine Learning called 'reinforcement learning'. However, it requires lots of training examples and we rarely have such clean data to teach agents how to do an office job. We usually have only the final result, a document or product, not a record of the process that got us there. But we do have proxy data; conversations, emails, etc. Sergey then gives a mechanism for the agents to learn and outperform people by using such proxy data, despite it being relatively disorganised. The race is on to collect that data for professional jobs.

Proposed the NLIP schema for all multi modal LLM’s to use a single message standard. This would make it much easier for agents on different LLM's to collaborate.

Chi launched MassGen, a multi agent collaboration tool which takes inspiration from Google’s DeepThink, which recently won IMO Gold. “ MassGen is a cutting-edge multi-agent system that leverages the power of collaborative AI to solve complex tasks.”
Takes inspiration from Chi’s earlier article Myth of Reasoning, where he discusses that reasoning is a messy process which is only formalised and structured in hindsight.
Essentially its a framework for a mixture of LLM's to discuss their common objective and conduct a series of votes to decide the best solution. Getting opinions from multiple unrelated LLM's achieves state of the art intelligence.

Outshift and a sister organisation, AGNTCY (a Linux foundation project), are working towards standards for the ‘Internet of Agents’, encompassing existing agent collaboration standards such as MCP and A2A.
Those are only the beginning for ai agents to collaborate across the entire internet: “The AGNTCY project provides the complete infrastructure stack for agent collaboration—discovery, identity, messaging, and observability that works across any vendor or framework. It is the foundational layer that lets specialized agents find each other, verify capabilities, and work together on complex problems.”

Jay continued the 'internet of agents' / multi agent collaboration theme with tools for managing and observing teams of agents. This is especially tricky where each agent was created with a different agent framework, such as LangChain or LlamaIndex. Good to see Nvidia liberating agents from their framework, so they can easily collaborate with any other agent,

Much of the conference was about evaluations of agents. How do they perform when given ’Standard Operating Procedures’ (SOP) to execute? Afterall, this is how most agents in an enterprise environment which be required to work, yet they SOPs are written for humans.
A simple example is a shipping company classifying whether a product is ‘dangerous goods’ or not. Sounds like their are straight forward rules for such a task, but turns out that people are using fairly nuanced logic to accomplish this.
Nandi presented a method for companies to create Benchmarks and test their agents against real industrial SOPs

Xifeng argues it is too hard for non technical users to create their own agents. So he presented a simple standard called ADL (agent declarative language) as part of the MICA project.
It uses a declarative language, looking like a list of bullets, whereas software developers currently use a coding language, python, which is much less accessible.
We all like simple! Exactly the kind of standard that could get traction

A new a more reliable form of agentic deep research (like ChatGPT's research tool) which is based on a recent scientific paper called PaperQA, which was open sourced. Sam demonstrated how it was used to create a knowledge base on gene function by reviewing a million papers and was more accurate than people at this task.
Claimed there is money lying around to be picked up by such research agents simply because the vast research in many subjects has not been synthesised into digestible form. Only the recent invention of agents allows such volumes of knowledge to be organised, and we have only just begun the process.

Can LLMs become faster at reasoning through recurrent exposure to relevant tasks, and if so, how can it be achieved? This belongs to a wider theme at the summit, to develop agents that can learn, but do so simply.
Emory's Paper shows that agents which have their relatively simple mechanism for remembering learnings, ‘in context learning’, experience greater improvements in both efficiency and accuracy than supervised fine-tuning. They call their framework ‘SpeedUpLLM’.

"From Hype to Impact”, Aashutosh discussed Dell’s learnings scaling up agentic solutions. Good summary of common challenges to adoption:

Salesforce nicely summarised concerns that agentic projects which are intended to save time are tackling poorly defined business problems, where the process is unclear or data is difficult to find or trust. So where projects are failing to achieve returns it is common to find the real human effort to achieve the task is also not well understood.

Full videos of Berkeley's Agentic AI Summit - August 2nd 2025
Watch recordings: https://rdi.berkeley.edu/events/agentic-ai-summit
Washington demanded Anthropic switch off its best model for non-US users. But the AI stack offers surprising ways to maintain sovereignty away from AI models
Gartner says a third of enterprise software purchases will involve an AI agent by 2028, and machines are assumed immune to persuasion. We ran 8,000 trials across five frontier models. Which techniques work, which backfire, and what does that mean for selling to agents?
The gap between top models has collapsed to 5%, and GPT-3.5-level inference cost fell 280-fold in under two years. If AI is now a commodity, competitors can buy the same agent tomorrow. So where does durable advantage live, and why can't it be bought?
Agentico Assistant
Grounded in our knowledge base
This is an AI assistant for general information. Responses may be inaccurate, and it cannot act for Agentico or enter into any agreement — see our Terms & Conditions.
Conversations are recorded and stored — including your IP address — for quality, security, and record-keeping. Please don't share confidential or sensitive personal information. See our Privacy Policy.
By continuing you agree to our Terms & Conditions and acknowledge our Privacy Policy.
The assistant has reached today's capacity. Please try again tomorrow, or use the contact page — we'd be glad to help.