KnowerSmyf/rag-chunking-research

RAG Chunking Experiments

Exploration of structurally and semantically aware chunking strategies for Retrieval-Augmented Generation (RAG) systems.

Implemented Methods

  • Naive sentence chunking
  • Semantic valley chunking
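
The two implemented methods can be illustrated with a minimal sketch. This is not the repository's code: the function names, the regex-based sentence splitter, the fixed chunk size, and the `threshold` parameter are all assumptions made for illustration. In particular, "semantic valley" is interpreted here as placing a chunk boundary wherever the cosine similarity between adjacent sentence embeddings hits a local minimum below a threshold; the actual method may differ.

```python
import re
import numpy as np


def split_sentences(text):
    """Naive splitter (assumption): break after '.', '!' or '?' plus whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def naive_sentence_chunks(sentences, size=3):
    """Group consecutive sentences into chunks of at most `size` sentences."""
    return [" ".join(sentences[i:i + size]) for i in range(0, len(sentences), size)]


def valley_boundaries(embeddings, threshold=0.5):
    """Return the sentence indices at which a new chunk starts.

    Assumed rule: a boundary goes before sentence j+1 when the cosine
    similarity between sentences j and j+1 is a local minimum (a "valley")
    and lies below `threshold`.
    """
    emb = np.asarray(embeddings, dtype=float)
    if len(emb) < 2:
        return []
    emb = emb / np.linalg.norm(emb, axis=1, keepdims=True)
    sims = np.sum(emb[:-1] * emb[1:], axis=1)  # sims[j] = cos(s_j, s_{j+1})
    boundaries = []
    for j, s in enumerate(sims):
        left = sims[j - 1] if j > 0 else np.inf
        right = sims[j + 1] if j + 1 < len(sims) else np.inf
        # Strict on the left so a flat valley yields a single boundary.
        if s < threshold and s < left and s <= right:
            boundaries.append(j + 1)
    return boundaries


def semantic_valley_chunks(sentences, embeddings, threshold=0.5):
    """Join sentences between consecutive valley boundaries into chunks."""
    starts = [0] + valley_boundaries(embeddings, threshold) + [len(sentences)]
    return [" ".join(sentences[a:b]) for a, b in zip(starts[:-1], starts[1:]) if a < b]
```

The embeddings are passed in as a plain array so the sketch runs with NumPy alone. With Sentence Transformers they could be produced by something like `SentenceTransformer("all-MiniLM-L6-v2").encode(sentences)`; the model choice is an assumption, as the README does not name one.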

Planned Methods

  • Spectral graph segmentation
  • Heat kernel segmentation
  • Information bottleneck chunking

Goals

Evaluate how chunking strategies affect retrieval quality in RAG pipelines.
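
One way such an evaluation might be set up is sketched below. The metric (hit rate at k over cosine-similarity retrieval), the function names, and the assumption that each query has exactly one known relevant chunk are all illustrative; the README does not state which metrics or datasets the project uses.

```python
import numpy as np


def top_k_indices(query_emb, chunk_embs, k=3):
    """Return indices of the k chunks most cosine-similar to the query."""
    q = np.asarray(query_emb, dtype=float)
    c = np.asarray(chunk_embs, dtype=float)
    q = q / np.linalg.norm(q)
    c = c / np.linalg.norm(c, axis=1, keepdims=True)
    scores = c @ q  # one cosine similarity per chunk
    return np.argsort(-scores)[:k].tolist()


def hit_rate_at_k(query_embs, chunk_embs, relevant_chunk_ids, k=3):
    """Fraction of queries whose relevant chunk appears in the top k results."""
    hits = sum(
        rel in top_k_indices(q, chunk_embs, k)
        for q, rel in zip(query_embs, relevant_chunk_ids)
    )
    return hits / len(relevant_chunk_ids)
```

Running the same queries against the chunk sets produced by different strategies and comparing the resulting scores would give a first, rough comparison of the strategies.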

Stack

  • Python
  • Sentence Transformers
  • NumPy
  • Hugging Face Datasets

About

Experimental framework for evaluating semantic and structural chunking strategies in Retrieval-Augmented Generation (RAG) systems.
