[Feature]: Extend the APIs of other model providers to support image and video

### Problem Statement

Currently Gemini and other providers can't process audio, image and video

### Proposed Solution

Check OpenAI adapter and replicate logic to transcribe audio and images to text

### Alternatives Considered

_No response_

### Use Case

Send audio, video and image data to cognee

### Implementation Ideas

Copy implementation from OpenAI adapter and improve if possible

### Additional Context

_No response_

### Pre-submission Checklist

- [x] I have searched existing issues to ensure this feature hasn't been requested already
- [x] I have provided a clear problem statement and proposed solution
- [x] I have described my specific use case

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: Extend the APIs of other model providers to support image and video #1767

Problem Statement

Proposed Solution

Alternatives Considered

Use Case

Implementation Ideas

Additional Context

Pre-submission Checklist

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature]: Extend the APIs of other model providers to support image and video #1767

Description

Problem Statement

Proposed Solution

Alternatives Considered

Use Case

Implementation Ideas

Additional Context

Pre-submission Checklist

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions