Written by
Firassa AI Team
The future of video understanding isn't just about search, it's about having meaningful conversations with your content.
When Maroune Lamharzi Alaoui and Rachid Hakmi first looked at the video intelligence landscape, they noticed a fundamental problem: existing solutions treated video as a passive medium to be indexed rather than understood.
"Traditional video search has always been limited to tags, titles, and basic transcriptions," explains Lamharzi Alaoui. "But the richness of video, visual cues, emotional context, cultural references, remained locked away, inaccessible through conventional search methods."
This insight led to the creation of Firassa, a revolutionary platform that enables users to have natural conversations with their video content. Unlike systems that merely tag and catalog videos, Firassa understands videos holistically, comprehending the complex interplay between visuals, speech, and implied meaning.
Firassa's approach is fundamentally different from competitors in the video AI space. While companies like Twelve Labs focus on search capabilities, Firassa has pioneered true conversational interaction with video content.
"We're not just helping users find moments in videos, we're enabling them to interact with their entire video library as if each video were an intelligent entity capable of discussing its own content," says Hakmi. "You can ask a Firassa-powered video about visual elements that were never mentioned in speech, or inquire about cultural nuances that traditional systems would miss entirely."
This conversational paradigm transforms how organizations access their video knowledge. Rather than constructing complex search queries, users can simply ask questions in natural language: "Show me all instances where our packaging design appears but isn't explicitly discussed," or "Find moments across our customer interviews where people express confusion about our pricing model."
A particularly powerful aspect of Firassa's technology is its exceptional multilingual capability, with special strength in Arabic and its regional dialects.
"Video content doesn't exist in a single-language world," notes Lamharzi Alaoui. "Organizations operate globally, creating content in multiple languages. Firassa not only understands content in any language but can bridge language barriers by allowing users to query Arabic content in English, or discuss French videos in Arabic."
This multilingual excellence opens up vast libraries of previously siloed content, making knowledge accessible regardless of language barriers. A marketing team in London can seamlessly extract insights from customer testimonials recorded in Moroccan Arabic, while executives in Dubai can analyze product demonstrations created by their German engineering team.
Firassa's conversational approach to video intelligence is finding applications across industries:
"What's particularly exciting is seeing how conversational video intelligence transforms workflows that previously required intensive manual review," says Hakmi. "Tasks that once took days now happen in minutes, with greater depth of understanding."
As Firassa continues to develop its technology, the team is exploring new frontiers in conversational video intelligence.
"We're not just building better video search, we're creating a new paradigm where video content itself becomes an interactive, queryable knowledge base," explains Lamharzi Alaoui. "The potential applications extend beyond anything currently available in the market."
The bootstrapped startup is currently focused on refining its core technology and establishing strategic partnerships with key clients across industries. While operating lean, the company remains committed to expanding its engineering capabilities and research initiatives to meet the growing demand for conversational video intelligence.
"Video is humanity's richest medium for communication, yet until now, we've interacted with it in primitive ways," concludes Lamharzi Alaoui. "Firassa is changing that fundamentally, making video content truly conversational for the first time."