Shelly Palmer - Amazon announces Nova

Shelly Palmer has been named LinkedIn’s “Top Voice in Technology,” and writes a popular daily business blog.
A new family of frontier models unveiled.

Amazon Web Services (AWS) introduced Nova, a new family of multimodal AI models, at its re:Invent conference. The Nova lineup includes four text-generating models—Micro, Lite, Pro, and Premier—and two media-focused tools: Nova Canvas for image generation and Nova Reel for video creation. Here's a brief overview.

Text Models: Micro, Lite, Pro, Premier

The Nova text models offer a range of capabilities tailored to different needs. Micro is the smallest, designed for rapid text-to-text generation with minimal latency. Lite expands functionality to include image and video inputs while maintaining speed. Pro provides a balance of accuracy, speed, and cost, suitable for complex workflows. Premier, available in early 2025, is intended for advanced tasks such as creating customized models, positioning it as a tool for developers and enterprises seeking scalability.

The context windows are large. Micro supports up to 128,000 tokens (about 100,000 words), while Lite and Pro can handle 300,000 tokens (roughly 225,000 words, 15,000 lines of code, or about 30 minutes of video). Amazon says that Premier and future models will expand to over 2 million tokens in 2025.
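To make those numbers concrete, here is a rough back-of-the-envelope sketch in Python for checking whether a document of a given word count would fit in each model's window. The token limits come from the figures above. The 0.75 words-per-token ratio is an assumption taken from the "300,000 tokens, roughly 225,000 words" figure, and the function names are hypothetical, not part of any AWS SDK. Real token counts depend on the tokenizer and the text, so treat this as an estimate only.

```python
import math

# Context window sizes in tokens, as quoted in the article.
CONTEXT_WINDOW_TOKENS = {
    "micro": 128_000,
    "lite": 300_000,
    "pro": 300_000,
}

# Assumption: about 0.75 words per token, the ratio implied by
# "300,000 tokens (roughly 225,000 words)". Actual tokenizers vary.
WORDS_PER_TOKEN = 0.75


def estimate_tokens(word_count: int) -> int:
    """Estimate the token count for a text with the given number of words."""
    return math.ceil(word_count / WORDS_PER_TOKEN)


def models_that_fit(word_count: int) -> list[str]:
    """Return the models whose quoted context window covers the estimate."""
    needed = estimate_tokens(word_count)
    return [name for name, limit in CONTEXT_WINDOW_TOKENS.items() if needed <= limit]


if __name__ == "__main__":
    for words in (90_000, 225_000, 300_000):
        print(words, "words ->", estimate_tokens(words), "tokens:", models_that_fit(words))
```

Note that at 0.75 words per token, 100,000 words comes out slightly above Micro's 128,000-token limit. The "about 100,000 words" figure implies a ratio closer to 0.78, which is a reminder that these conversions are rounded.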

Media Tools: Canvas and Reel

AWS also launched Nova Canvas and Nova Reel to enhance its generative media capabilities. Canvas focuses on image creation and editing, including tools for background removal and color customization. Reel generates six-second video clips with options for camera motion like pans, zooms, and 360-degree rotations. Longer video generation, up to two minutes, is expected soon.

Both tools integrate content moderation features, including watermarking, to promote responsible use.

Looking Ahead: Speech-to-Speech and Any-to-Any Models

AWS plans to roll out a speech-to-speech model in Q1 2025, capable of interpreting tone and cadence for natural-sounding transformations. Later in 2025, an any-to-any model will support inputs and outputs across text, speech, images, and video, enabling a broad range of AI applications.

Integration and Accessibility

The Nova models and tools are available on Amazon Bedrock, where customers can fine-tune them for specific tasks. AWS emphasized the models' speed and cost-effectiveness, with Amazon CEO Andy Jassy highlighting Nova’s utility in orchestrating agent-based workflows through proprietary APIs.

AWS has not disclosed details about the data used to train the Nova models, citing competitive and legal considerations. However, customers are protected by an indemnification policy that addresses potential copyright issues stemming from the use of generative AI outputs.

All of the US-based hyperscalers are now in a foundational model arms race. This is my favorite kind of competition… everyone wins!

As always, your thoughts and feedback are welcome. Just reply to this email. -s

P.S. CES® is just around the corner (Las Vegas, January 7-10, 2025). Are you going? If you are, our executive briefings and floor tours are the best way to experience the show.

About Shelly Palmer

Shelly Palmer is the Professor of Advanced Media in Residence at Syracuse University’s S.I. Newhouse School of Public Communications and CEO of The Palmer Group, a consulting practice that helps Fortune 500 companies with technology, media and marketing. Named LinkedIn’s “Top Voice in Technology,” he covers tech and business for Good Day New York, is a regular commentator on CNN and writes a popular daily business blog. He's a bestselling author and the creator of the popular, free online course Generative AI for Execs. Follow @shellypalmer or visit shellypalmer.com.