The biggest expansion of models to date, new
inference optimization tools, and additional data capabilities give
customers even greater flexibility and control to build and deploy
production-ready generative AI faster
At AWS re:Invent, Amazon Web Services, Inc. (AWS), an
Amazon.com, Inc. company (NASDAQ: AMZN), today announced new
innovations for Amazon Bedrock, a fully managed service for
building and scaling generative artificial intelligence (AI)
applications with high-performing foundation models. Today’s
announcements reinforce AWS’s commitment to model choice, optimize
how inference is done at scale, and help customers get more from
their data.
This press release features multimedia. View
the full release here:
https://www.businesswire.com/news/home/20241204780931/en/
Discover Amazon Bedrock Marketplace
models and Amazon Bedrock fully managed models in the new model
catalog (Graphic: Business Wire)
- AWS will soon be the first cloud provider to offer models from
Luma AI and poolside. AWS will also add the latest Stability AI
model in Amazon Bedrock and, through the new Amazon Bedrock
Marketplace capability, is giving access to more than 100 popular,
emerging, and specialized models, so customers can find the right
set of models for their use case.
- New prompt caching and Amazon Bedrock Intelligent Prompt
Routing help customers more easily and cost effectively scale
inference.
- Support for structured data and GraphRAG in Amazon Bedrock
Knowledge Bases further expands how customers can leverage their
data to deliver customized generative AI experiences.
- Amazon Bedrock Data Automation automatically transforms
unstructured, multimodal data into structured data with no coding
required—helping customers use more of their data for generative AI
and analytics.
- Tens of thousands of customers trust Amazon Bedrock to run
their generative AI applications as the service has grown its
customer base by 4.7x in the last year. Adobe, Argo Labs, BMW
Group, Octus, Symbeo, Tenovos, and Zendesk are already adopting the
latest Amazon Bedrock advancements.
“Amazon Bedrock continues to see rapid growth as customers flock
to the service for its broad selection of leading models, tools to
easily customize with their data, built-in responsible AI features,
and capabilities for developing sophisticated agents,” said Dr.
Swami Sivasubramanian, vice president of AI and Data at AWS.
“Amazon Bedrock is helping to tackle the biggest roadblocks
developers face today, so customers can realize the full potential
of generative AI. With this new set of capabilities, we are
empowering customers to develop more intelligent AI applications
that will deliver greater value to their end users.”
The broadest selection of models from leading AI
companies
Amazon Bedrock provides customers with the broadest selection of
fully managed models from leading AI companies, including AI21
Labs, Anthropic, Cohere, Meta, Mistral AI, and Stability AI.
Additionally, Amazon Bedrock is the only place customers can access
the newly announced Amazon Nova models, a new generation of
foundation models that deliver state-of-the-art intelligence across
a wide range of tasks and industry-leading price performance. With
today’s announcements, AWS is further expanding model choice in
Amazon Bedrock with the addition of more industry-leading
models.
- Luma AI’s Ray 2: Luma AI’s multimodal models and
software products advance video content creation with generative
AI. AWS will be the first cloud provider to make Luma AI’s
state-of-the-art Luma Ray 2 model, the second generation of its
renowned video model, available to customers. Ray 2 marks a
significant advancement in generative AI-assisted video creation,
generating high-quality, realistic videos from text and images with
efficiency and cinematographic quality. Customers can rapidly
experiment with different camera angles and styles, create videos
with consistent characters and accurate physics, and deliver
creative outputs for architecture, fashion, film, graphic design,
and music.
- poolside’s malibu and point: poolside addresses the
challenges of modern software engineering for large enterprises.
AWS will be the first cloud provider to offer access to poolside’s
malibu and point models, which excel at code generation, testing,
documentation, and real-time code completion. This empowers
engineering teams to improve productivity, write better code
faster, and accelerate product development cycles. Both models can
also be fine-tuned securely and privately on customers’ codebases,
practices, and documentation, enabling them to adapt to specific
projects and helping customers tackle daily software engineering
tasks with increased accuracy and efficiency. Additionally, AWS
will be the first cloud provider to offer access to poolside’s
Assistant, which puts the power of poolside’s malibu and point
models inside of developers' preferred integrated development
environments (IDEs).
- Stability AI’s Stable Diffusion 3.5 Large: Stability AI
is a leading generative AI model developer in visual media, with
cutting-edge models in image, video, 3D, and audio. Amazon Bedrock
will soon add Stable Diffusion 3.5 Large, Stability AI’s most
advanced text-to-image model. This new model generates high-quality
images from text descriptions in a wide range of styles to
accelerate the creation of concept art, visual effects, and
detailed product imagery for customers in media, gaming,
advertising, and retail.
Access more than 100 popular, emerging, and specialized
models with Amazon Bedrock Marketplace
While the models in Amazon Bedrock can support a wide range of
tasks, many customers want to incorporate emerging and specialized
models into their applications to power unique use cases, like
analyzing a financial document or generating novel proteins. With
Amazon Bedrock Marketplace, customers can now easily find and
choose from more than 100 models that can be deployed on AWS and
accessed through a unified experience in Amazon Bedrock. This
includes popular models such as Mistral AI’s Mistral NeMo Instruct
2407, Technology Innovation Institute’s Falcon RW 1B, and NVIDIA
NIM microservices, along with a wide array of specialized models,
including Writer’s Palmyra-Fin for the financial industry,
Upstage’s Solar Pro for translation, Camb.ai’s text-to-audio MARS6,
and EvolutionaryScale’s ESM3 generative model for biology.
Once a customer finds a model they want, they select the
appropriate infrastructure for their scaling needs and easily
deploy on AWS through fully managed endpoints. Customers can then
securely integrate the model with Amazon Bedrock’s unified
application programming interfaces (APIs), leverage tools like
Guardrails and Agents, and benefit from built-in security and
privacy features.
Zendesk is a global service software company with a diverse and
multicultural customer base of 100,000 brands around the world. The
company can use specialized models, like Widn.AI for translation,
in Amazon Bedrock to personalize and localize customer service
requests across email, chat, phone, and social media. This will
provide agents with the data they need, such as sentiment or intent
in the customer’s native language, to ultimately enhance the
customer service experience.
Prompt caching and Intelligent Prompt Routing help customers
tackle inference at scale
When selecting a model, developers need to balance multiple
considerations, like accuracy, cost, and latency. Optimizing for
any one of these factors can mean compromising on the others. To
balance these considerations when deploying an application into
production, customers employ a variety of techniques, like caching
frequently used prompts or routing simple questions to smaller
models. However, using these techniques is complex and
time-consuming, requiring specialized expertise to iteratively test
different approaches to ensure a good experience for end users.
That is why AWS is adding two new capabilities to help customers
more effectively manage prompts at scale.
- Lower response latency and costs by caching prompts:
Amazon Bedrock can now securely cache prompts to reduce repeated
processing, without compromising on accuracy. This can reduce costs
by up to 90% and latency by up to 85% for supported models. For
example, a law firm could create a generative AI chat application
that can answer lawyers’ questions about documents. When multiple
lawyers ask questions about the same part of a document in their
prompts, Amazon Bedrock could cache that section, so the section
only gets processed once and can be reused each time someone wants
to ask a question about it. This reduces the cost by shrinking the
amount of information the model needs to process each time. Adobe’s
Acrobat AI Assistant enhances user productivity by enabling quick
document summarization and question answering. With prompt caching
on Amazon Bedrock, Adobe observed a 72% reduction in response time,
based on preliminary testing.
- Intelligent Prompt Routing helps optimize response quality
and cost: With Intelligent Prompt Routing, customers can
configure Amazon Bedrock to automatically route prompts to
different foundation models within a model family, helping them
optimize for response quality and cost. Using advanced prompt
matching and model understanding techniques, Intelligent Prompt
Routing predicts the performance of each model for each request and
dynamically routes requests to the model most likely to give the
desired response at the lowest cost. Intelligent Prompt Routing can
reduce costs by up to 30% without compromising on accuracy. Argo
Labs, which delivers innovative voice agent solutions for
restaurants, handles diverse customer inquiries and reservations
with Intelligent Prompt Routing. As customers submit questions,
place orders, and book tables, Argo Labs’ voice chatbot dynamically
routes the query to the most suitable model, optimizing for both
cost and quality of responses. For example, a simple yes-no
question, like “Do you have an open table at this restaurant
tonight?” could be handled by a smaller model, while a larger model
could answer more complex questions such as, “What kind of vegan
options does this restaurant provide?” With Intelligent Prompt
Routing, Argo Labs can use their voice agents to seamlessly handle
customer interactions, while achieving the right balance of
accuracy and cost.
Two new capabilities for Amazon Bedrock Knowledge Bases help
customers maximize the value of their data
Customers want to leverage their data, no matter where, or in
what format, it resides to build unique generative AI-powered
experiences for end users. Knowledge Bases is a fully managed
capability that makes it easy for customers to customize foundation
model responses with contextual and relevant data using retrieval
augmented generation (RAG). While Knowledge Bases already makes it
easy to connect to data sources like Amazon OpenSearch Serverless
and Amazon Aurora, many customers have other data sources and data
types they would like to incorporate into their generative AI
applications. That is why AWS is adding two new capabilities to
Knowledge Bases.
- Support for structured data retrieval accelerates generative
AI app development: Knowledge Bases provides one of the first
managed, out-of-the-box RAG solutions that enables customers to
natively query their structured data where it resides for their
generative AI applications. This capability helps break data silos
across data sources, accelerating generative AI development from
over a month to just days. Customers can build applications that
use natural language queries to explore structured data stored in
sources like Amazon SageMaker Lakehouse, Amazon S3 data lakes, and
Amazon Redshift. With this new capability, prompts are translated
into SQL queries to retrieve data results. Knowledge Bases
automatically adjusts to a customer’s schema and data, learns from
query patterns, and provides a range of customization options to
further enhance the accuracy of their chosen use case. Octus, a
credit intelligence company, will use the new structured data
retrieval capability in Knowledge Bases to allow end users to query
structured data using natural language. By connecting Knowledge
Bases to Octus’ existing master data management system, end-user
prompts can be translated into SQL queries that Amazon Bedrock uses
to retrieve the relevant information and return it to the user as
part of the application’s response. This will help Octus’ chatbots
deliver precise, data-driven insights to its users and enhance the
users’ interactions with the company’s array of data products.
- Support for GraphRAG generates more relevant responses:
Knowledge graphs allow customers to model and store relationships
between data by mapping different pieces of relevant information
like a web. These knowledge graphs can be particularly useful when
incorporated into RAG, allowing a system to easily traverse and
retrieve relevant parts of information by following the graph. Now,
with support for GraphRAG, Knowledge Bases can enable customers to
automatically generate graphs using Amazon Neptune, AWS’s managed
graph database, and link relationships between entities across
data, without requiring any graph expertise. Knowledge Bases makes
it easier to generate more accurate and relevant responses,
identify related connections using the knowledge graph, and view
the source information to understand how a model arrived at a
specific response. BMW Group will implement GraphRAG for My AI
Assistant (MAIA), an AI-powered virtual assistant that helps users
find, understand, and integrate the company’s internal data assets
hosted on AWS. With GraphRAG’s automated graph modeling powered by
Amazon Neptune, BMW will be able to continuously update the
knowledge graph powering MAIA based on data usage to provide more
relevant and comprehensive insights from its data assets to
continue creating premium experiences for millions of drivers.
Amazon Bedrock Data Automation transforms unstructured
multimodal data into structured data for generative AI and
analytics
Today, most enterprise data is unstructured and is contained in
content like documents, videos, images, and audio files. Many
customers want to take advantage of this data to discover insights
or build new experiences for customers, but it is often a
difficult, manual process to convert it into a format that can be
easily used for analytics or RAG. For example, a bank may take in
multiple PDF documents for loan processing and need to extract
details from each one, normalize features like name or date of
birth for consistency, and transform the results into a text-based
format before entering them into a data warehouse to perform any
analyses. With Amazon Bedrock Data Automation, customers can
automatically extract, transform, and generate data from
unstructured content at scale using a single API.
Amazon Bedrock Data Automation can quickly and cost effectively
extract information from documents, images, audio, and videos and
transform it into structured formats for use cases like intelligent
document processing, video analysis, and RAG. Amazon Bedrock Data
Automation can generate content using predefined defaults, like
scene-by-scene descriptions of video stills or audio transcripts,
or customers can create an output based on their own data schema
that they can then easily load into an existing database or data
warehouse. Through an integration with Knowledge Bases, Amazon
Bedrock Data Automation can also be used to parse content for RAG
applications, improving the accuracy and relevancy of results by
including information embedded in both images and text. Amazon
Bedrock Data Automation provides customers with a confidence score
and grounds its responses in the original content, helping to
mitigate the risk of hallucinations and increasing
transparency.
Symbeo is a CorVel company that offers automated accounts
payable solutions. The company will use Amazon Bedrock Data
Automation to automate the extraction of data from complex
documents, such as insurance claims, medical bills, and more. This
will help Symbeo’s teams process claims faster and accelerate the
turnaround time to get back to their customers. Tenovos, a digital
asset management platform, is using Amazon Bedrock Data Automation
to enable semantic search at scale to increase content reuse by 50%
or more, saving millions in marketing expenses.
Amazon Bedrock Marketplace is available today. Inference
management capabilities, structured data retrieval and GraphRAG in
Amazon Bedrock Knowledge Bases, and Amazon Bedrock Data Automation
are all available in preview. Models from Luma AI, poolside, and
Stability AI are coming soon.
To learn more, visit:
- The AWS News Blog for details on today’s announcements: Amazon
Bedrock Marketplace, prompt caching and Intelligent Prompt Routing,
and data processing and retrieval capabilities.
- The Amazon Bedrock page to learn more about the
capabilities.
- The Amazon Bedrock customer page to learn how companies are
using Amazon Bedrock.
- The AWS re:Invent page for more details on everything happening
at AWS re:Invent.
About Amazon Web Services
Since 2006, Amazon Web Services has been the world’s most
comprehensive and broadly adopted cloud. AWS has been continually
expanding its services to support virtually any workload, and it
now has more than 240 fully featured services for compute, storage,
databases, networking, analytics, machine learning and artificial
intelligence (AI), Internet of Things (IoT), mobile, security,
hybrid, media, and application development, deployment, and
management from 108 Availability Zones within 34 geographic
regions, with announced plans for 18 more Availability Zones and
six more AWS Regions in Mexico, New Zealand, the Kingdom of Saudi
Arabia, Taiwan, Thailand, and the AWS European Sovereign Cloud.
Millions of customers—including the fastest-growing startups,
largest enterprises, and leading government agencies—trust AWS to
power their infrastructure, become more agile, and lower costs. To
learn more about AWS, visit aws.amazon.com.
About Amazon
Amazon is guided by four principles: customer obsession rather
than competitor focus, passion for invention, commitment to
operational excellence, and long-term thinking. Amazon strives to
be Earth’s Most Customer-Centric Company, Earth’s Best Employer,
and Earth’s Safest Place to Work. Customer reviews, 1-Click
shopping, personalized recommendations, Prime, Fulfillment by
Amazon, AWS, Kindle Direct Publishing, Kindle, Career Choice, Fire
tablets, Fire TV, Amazon Echo, Alexa, Just Walk Out technology,
Amazon Studios, and The Climate Pledge are some of the things
pioneered by Amazon. For more information, visit amazon.com/about
and follow @AmazonNews.
View source
version on businesswire.com: https://www.businesswire.com/news/home/20241204780931/en/
Amazon.com, Inc. Media Hotline Amazon-pr@amazon.com
www.amazon.com/pr
Amazon.com (NASDAQ:AMZN)
Graphique Historique de l'Action
De Déc 2024 à Jan 2025
Amazon.com (NASDAQ:AMZN)
Graphique Historique de l'Action
De Jan 2024 à Jan 2025