Edit Trouble Ticket
Update the ticket information below
Date Initiated
*
Status
*
Open
Pending
Completed
Terminated
Creator
*
Marcus
Priority
*
1
Urgent
Select urgency...
Urgent
Not Urgent
Important
Select importance...
Important
Not Important
Project Name
*
zfrika
Problem
*
Build a semantic text-matching engine. Ignore the raw video; extract the transcript and look for deep semantic overlap. Upgrade to 6GB. Support multiple Whisper requests, or create a way to sequence the work rather than run it in parallel. Upgrade from whisper-tiny to whisper-base.
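One way to sequence the Whisper work instead of running it in parallel is a single-worker queue. The sketch below is only an illustration under assumptions: `SequentialTranscriber` and `fake_transcribe` are hypothetical names, and `fake_transcribe` stands in for whatever function actually calls Whisper in this project.

```python
from concurrent.futures import Future, ThreadPoolExecutor
from typing import Callable


class SequentialTranscriber:
    """Queue transcription jobs and run them one at a time, in submission order."""

    def __init__(self, transcribe: Callable[[str], str]) -> None:
        # `transcribe` is a placeholder for the real Whisper call (assumption).
        self._transcribe = transcribe
        # A single worker thread means at most one job runs at any moment;
        # additional submissions wait in the executor's internal FIFO queue.
        self._pool = ThreadPoolExecutor(max_workers=1)

    def submit(self, audio_path: str) -> Future:
        """Enqueue a job and return a Future that resolves to the transcript."""
        return self._pool.submit(self._transcribe, audio_path)

    def close(self) -> None:
        """Wait for queued jobs to finish, then stop the worker."""
        self._pool.shutdown(wait=True)


def fake_transcribe(audio_path: str) -> str:
    # Hypothetical stand-in so the sketch runs without a model installed.
    return f"transcript of {audio_path}"


if __name__ == "__main__":
    transcriber = SequentialTranscriber(fake_transcribe)
    futures = [transcriber.submit(p) for p in ["a.wav", "b.wav"]]
    for future in futures:
        print(future.result())
    transcriber.close()
```

Callers can submit several requests at once, but only one transcription occupies memory at a time, which may matter more than throughput when moving to a larger model.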
Question
Root Cause
Notes
Strategy
2-step matching workflow
=====================
1a. Audio extraction and transcription (the words).
1b. Use an ASR (automatic speech recognition) model, such as OpenAI Whisper.
1c. Process the video's audio track and convert the spoken words into a clean text string.
2a. Text embedding (the meaning).
2b. Pass the generated text string through a dedicated text embedding model to output a vector representation of the conversation. Use one of these models: Xenova/all-MiniLM-L6-v2 or Xenova/bge-small-en-v1.5. These models embed sentences and short paragraphs so that semantically similar text maps to nearby vectors.
CLIP (Contrastive Language-Image Pre-training)
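Once step 2b has produced a vector for each transcript, the matching itself can be sketched as a cosine-similarity ranking. This is a minimal illustration, not the project's implementation: `cosine_similarity` and `rank_matches` are hypothetical helper names, and the vectors are assumed to come from whichever embedding model is chosen.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two embedding vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # A zero vector has no direction; treat it as matching nothing.
        return 0.0
    return dot / (norm_a * norm_b)


def rank_matches(
    query: Sequence[float],
    candidates: Dict[str, Sequence[float]],
) -> List[Tuple[str, float]]:
    """Score every candidate transcript against the query, best match first."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Toy 3-dimensional vectors; real embeddings have hundreds of dimensions.
    query_vec = [1.0, 0.0, 0.0]
    transcripts = {
        "video_a": [0.9, 0.1, 0.0],
        "video_b": [0.0, 1.0, 0.0],
    }
    for name, score in rank_matches(query_vec, transcripts):
        print(name, round(score, 3))
```

Because only the transcript embeddings are compared, the raw video never enters this step.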
Helpful People
Helpful Links
Diagram
Previous Steps
Next Steps
Create a duplicate environment (done). 1. Move this project to an independent website.
Solution
Insight
Date Resolved