Problem Statement
Currently Gemini and other providers can't process audio, image and video
Proposed Solution
Check OpenAI adapter and replicate logic to transcribe audio and images to text
Alternatives Considered
No response
Use Case
Send audio, video and image data to cognee
Implementation Ideas
Copy implementation from OpenAI adapter and improve if possible
Additional Context
No response
Pre-submission Checklist
Problem Statement
Currently Gemini and other providers can't process audio, image and video
Proposed Solution
Check OpenAI adapter and replicate logic to transcribe audio and images to text
Alternatives Considered
No response
Use Case
Send audio, video and image data to cognee
Implementation Ideas
Copy implementation from OpenAI adapter and improve if possible
Additional Context
No response
Pre-submission Checklist