Twelve Labs
Twelve Labs sells multimodal video foundation models as an API, enabling enterprises and developers to perform semantic search, summarization, Q&A, and content analysis inside video archives without manual tagging.
Evidence notes
Who uses it: Enterprise buyers with large video libraries — primarily media & entertainment (professional sports leagues, studios, large content creators), advertising, government/security, and automotive — plus developers who build video-native applications on top of the...[8]+4Company-authored
Financing: Total raised is $107,120,000.[8]Not verified
Product: Videos are indexed via an API call; the models extract multimodal embeddings that capture visual action, speech, on-screen text, and their temporal relationships.[8]