Aws’ New Policy Layer In Bedrock Agentcore Makes Sure Ai Agents Can’t Give Away The Store

Sedang Trending 1 bulan yang lalu

LAS VEGAS — At its re:Invent convention successful Las Vegas, AWS coming announced a number of updates to Amazon Bedrock AgentCore that will make it easier for businesses to build and deploy AI agents connected nan AWS platform.

Amazon Bedrock AgentCore is AWS’ developer-oriented level for building, deploying and managing AI agents. When it launched this summer, it featured galore of nan halfway devices needed to build production-ready agents, including nan expertise to usage immoderate celebrated supplier model and ample connection exemplary (LLM) to build those agents, authentication services, features to supply representation to these agents and more.

Now, astatine re:Invent, AWS is introducing 3 halfway caller features that reside immoderate of nan issues that still clasp immoderate companies backmost from adopting agents.

“In nan accustomed AWS style, we person a fistful of primitives,” David Richardson, nan VP of Amazon Bedrock AgentCore astatine AWS, told me. “The intent is that they tin activity good together. Actually, we’re trying moreover harder to person that beryllium existent than is ever nan lawsuit pinch AWS. I cognize sometimes we show our seams, but we’re really trying to person it beryllium a full group of things that are apt going to beryllium needed crossed a wide scope of agents.”

Policy

The first of these caller services is simply a information and argumentation enforcement capacity that addresses nan request for further guardrails that galore companies person recovered erstwhile putting agents into production. For galore companies, aft all, it’s nan nondeterministic quality of these models that makes them hesitant to spell beyond nan aviator shape — and that’s particularly existent erstwhile it comes to customer-facing products. Prompt injection attacks are, aft all, very difficult to protect against.

With this caller feature, called Policy, Richardson explained that what’s absorbing present is that it sits extracurricular of nan agentic loop and is rules-based. A institution tin usage this to create guardrails (in earthy language), to, for example, guarantee that if nan AI supplier wants to connection a personification a in installments that is worthy much than $100, a quality supplier has to measurement successful and validate this request.

“The measurement I deliberation astir it … is it controls what nan supplier is allowed to inquire nan instrumentality to do. At nan debased level, you’ve sewage [identity entree management], which says these are nan devices that tin beryllium used. With Policy, you’ve sewage what you tin inquire nan instrumentality to do — and past pinch our existing Bedrock Guardrails, you tin power what nan LLM will opportunity backmost to nan extremity user,” Richardson explained.

He besides noted that nan intent present is for Policy to beryllium portion of a layered information communicative that starts pinch nan supplier runtime sitting connected apical of micro virtual machines (VMs) that supply session-level isolation, connected apical of each of nan accustomed information features that AWS offers.

In nan end, Richardson argues, businesses request to beryllium capable to spot their agents if they want to get immoderate existent worth retired of them. And ideally, this further information nett will get them location and let them to trust connected nan agent’s reasoning capabilities much because it will beryllium location erstwhile thing doesn’t rather activity arsenic planned.

Evals

The 2nd caller characteristic summation is civilization Evaluations. Bedrock AgentCore already supported accepted observability devices — some those wrong of nan AWS ecosystem, for illustration CloudWatch and nan distributed X-Ray tracing system, aliases manufacture standards for illustration OpenTelemetry.

The caller information devices will travel pinch 13 prebuilt evaluations that screen a batch of nan basics (correctness, faithfulness, helpfulness, consequence relevance, conciseness, coherence, instruction following, refusal, extremity occurrence rate, instrumentality action accuracy, instrumentality parameter accuracy, discourse relevance, harmfulness, stereotyping).

Developers tin besides create their ain civilization evaluations, too. These will usage an LLM arsenic a judge.

“I’m benignant of envisioning location will upwind up — astatine slightest early connected — being 2 ways it gets used. One will astir apt beryllium by nan supplier developer astatine improvement and refinement time, while they’re going done and possibly testing it connected their ain aliases testing it pinch immoderate precanned traces, and looking astatine nan evals,” Richardson said. “And past nan different will beryllium much semipermanent by an Ops team, benignant of analogous to really they negociate a nonagentic application, where, successful that world, you mightiness person a latency- and correction metrics-related group of metrics, now you mightiness person a personification sentiment aliases an accuracy metric that is calculated by nan evaluators.”

Memory

The 3rd announcement isn’t truthful overmuch a caller characteristic arsenic an summation to Bedrock AgentCore’s existing representation tool. The representation instrumentality already provides short- and semipermanent representation features, but it now besides offers episodic memory.

“The thought location is to person it beryllium aligned pinch an individual user, truthful that you tin grounds things for illustration their preferences — for illustration they for illustration a model spot versus an aisle seat, aliases they for illustration hotels that are sub-$500 aliases things for illustration that,” Richardson said. “We deliberation different representation disciplines, representation regimes, are going to beryllium 1 of nan keys to creating effective agents. And truthful we want to commencement to connection a fewer different representation type capabilities that customers tin use.”

YOUTUBE.COM/THENEWSTACK

Tech moves fast, don't miss an episode. Subscribe to our YouTube channel to watercourse each our podcasts, interviews, demos, and more.

Group Created pinch Sketch.

Selengkapnya