Any explanation on how RAG works? #6

@bilogic

Hi!

I'm trying to figure out how this RAG setup works and can't connect a few things together. Let me explain.

The 3 key pieces I know of are:

  • embeddings provider, seems you can use embeddinggemma locally
  • vector database, I think it is LanceDB
  • LLM, seems like you can pick any from opencode
  1. It seems weird to me that any LLM from the third point can work with embeddinggemma. I was under the impression that if an LLM doesn't have a matching embeddings provider, it cannot be used for RAG. How does this actually work? Or did I misunderstand?
  2. How does the remote LLM actually search the local vector database?

Thanks!
