According to a press release, this round brought total capital raised to over USD 100m and valued the company at USD 500m.
Now-CEO Gaurav Misra and COO Dwight Churchill started working on a social network in 2021, but decided to shift their focus to video creation and editing — and Captions was born.
Misra, an ex-Microsoft engineer, told Axios that Captions has only spent about USD 10m so far, thanks to revenue from its app, which has been downloaded more than 10 million times with more than 100,000 daily active users.
Misra said that the goal is to invest USD 100m into training Captions’ foundation model, a process that is expected to continue through 2025.
Text + Visuals
Alpaca, founded in 2022, markets its platform as being built specifically for “digital artists and creatives,” capable of “instantly” stylizing and rendering early concept sketches.
Users can choose from a tiered subscription plan, which includes a free level that provides up to 100 generations per day, with the caveat that user data may be used for internal improvements. Prior to the acquisition, Alpaca raised USD 4.2m in seed funding, also led by Index Ventures.
“The acquisition of AlpacaML will allow us to continue to push the boundaries of what’s possible with generative video,” Misra explained in a November 13, 2024 post on X.
Alpaca CEO William Buchwalter will join Captions as a research engineer; Captions also extended job offers to all six of Alpaca’s employees.
Slator Pro Guide: Audiovisual Translation
The Slator Pro Guide: Audiovisual Translation is a concise guide to audiovisual translation, including dubbing, subtitling, access services, AI dubbing, AI captions, and more.
Drew Jaegle, a former DeepMind research scientist, has joined Captions as Head of AI. The company also has nearly 30 openings on its website, half of which fall under the engineering department, including data scientists. (Interestingly, Churchill noted on X, “more than 10% of the Captions team are former founders.”)
Captions’ most recent launch is Lipdub Playground, released November 20, 2024, which adds voice to AI videos.
“Just type and watch your character speak the provided script aloud, complete with synced lip movement and body language,” read a blog post introducing the tool.
Axios coverage noted that Captions is considering more acquisitions, with a special interest in machine learning companies well-versed in model training, infrastructure, and inference.
In addition to the vote of confidence from investors in its Series C, Captions has garnered some notable good press, including a mention on Time Magazine’s list of “Best Inventions for 2024” and a New York Times article celebrating the platform’s role in a multilingual love story.