Amazon Web Services (AWS) introduced Nova, a new family of multimodal AI models, at its re:Invent conference. The Nova lineup includes four text-generating models—Micro, Lite, Pro, and Premier—and two media-focused tools: Nova Canvas for image generation and Nova Reel for video creation. Here's a brief overview.
Text Models: Micro, Lite, Pro, Premier
The Nova text models offer a range of capabilities tailored to different needs. Micro is the smallest, designed for rapid text-to-text generation with minimal latency. Lite expands functionality to include image and video inputs while maintaining speed. Pro provides a balance of accuracy, speed, and cost, suitable for complex workflows. Premier, available in early 2025, is intended for advanced tasks such as creating customized models, positioning it as a tool for developers and enterprises seeking scalability.
The context windows are well-sized. Micro supports up to 128,000 tokens (about 100,000 words), while Lite and Pro can handle 300,000 tokens (roughly 225,000 words, 15,000 lines of code, or about 30 minutes of video). Amazon says that Premier and future models will expand to over 2 million tokens next year.
Media Tools: Canvas and Reel
AWS also launched Nova Canvas and Nova Reel to enhance its generative media capabilities. Canvas focuses on image creation and editing, including tools for background removal and color customization. Reel generates six-second video clips with options for camera motion like pans, zooms, and 360-degree rotations. Longer video generation, up to two minutes, is expected soon.
Both tools integrate content moderation features, including watermarking, to promote responsible use.
Looking Ahead: Speech-to-Speech and Any-to-Any Models
AWS plans to roll out a speech-to-speech model in Q1 2025, capable of interpreting tone and cadence for natural-sounding transformations. Later in 2025, an any-to-any model will support inputs and outputs across text, speech, images, and video, enabling a broad range of AI applications.
Integration and Accessibility
The Nova models and tools are available on AWS Bedrock, where customers can fine-tune them for specific tasks. AWS emphasized their speed and cost-effectiveness, with CEO Andy Jassy highlighting Nova’s utility in orchestrating agent-based workflows through proprietary APIs.
AWS has not disclosed details about the data used to train the Nova models, citing competitive and legal considerations. However, customers are protected by an indemnification policy that addresses potential copyright issues stemming from the use of generative AI outputs.
All of the US-based hyperscalers are now in a foundational model arms race. This is my favorite kind of competition… everyone wins!
As always, your thoughts and feedback are welcome. Just reply to this email. -s
P.S. CES© is just around the corner (Las Vegas, January 7-10, 2025). Are you going? If you are, our executive briefings and floor tours are the best way to experience the show. Learn more.
About Shelly Palmer
ABOUT SHELLY PALMER
Shelly Palmer is the Professor of Advanced Media in Residence at Syracuse University’s S.I. Newhouse School of Public Communications and CEO of The Palmer Group, a consulting practice that helps Fortune 500 companies with technology, media and marketing. Named LinkedIn’s “Top Voice in Technology,” he covers tech and business for Good Day New York, is a regular commentator on CNN and writes a popular daily business blog. He's a bestselling author, and the creator of the popular, free online course, Generative AI for Execs. Follow @shellypalmer or visit shellypalmer.com.