Prisma uses a 1-5 star rating system to ensure high-quality academic content and prioritize reliable academic databases.
The rating system evaluates sources based on:
- Content Quality: Academic rigor and peer review processes
- Metadata Richness: Availability of structured bibliographic data
- API Reliability: Consistent access and data quality
- Coverage Scope: Breadth and depth of academic content
- Academic Validation: Built-in quality controls and filtering
Semantic Scholar
- AI-powered search with 214M+ papers
- Advanced semantic understanding and paper relationships
- High-quality metadata and citation graphs
- Research influence metrics and author disambiguation
arXiv
- High-quality preprint server with full metadata
- Rigorous submission standards in STEM fields
- Complete PDF access and structured abstracts
- Immediate access to cutting-edge research
Open Library
- Internet Archive's millions of academic books
- Comprehensive book metadata and full-text access
- Historical academic publications and rare texts
- Structured bibliographic information
Google Books
- Comprehensive book catalog with rich metadata
- Academic publisher partnerships
- Preview access and citation information
- Cross-reference capabilities
PubMed
- Authoritative biomedical literature database
- MEDLINE indexing with controlled vocabularies
- High-quality abstracts and full bibliographic records
- Integration with clinical and research databases
Zotero
- User's personal research library
- Community-curated bibliographic data
- Flexible metadata schema
- Integration with academic workflows
CrossRef
- DOI resolution and metadata services
- Publisher-provided bibliographic information
- Citation linking and reference validation
- Academic content verification
Prisma automatically applies quality filters regardless of source rating:
- Authors: Must have identifiable authors or creators
- Publication Venue: Journal, conference, or institutional affiliation
- Academic Indicators: Proper citations, abstracts, or academic formatting
- Bibliographic Metadata: Title, date, and publication information
- Blog Posts: Personal or commercial blog content
- News Articles: Journalistic reporting without academic rigor
- Social Media: Twitter, LinkedIn, or social platform posts
- Marketing Content: Commercial or promotional materials
- Unverified Sources: Content without proper attribution
Prisma combines multiple sources to maximize content discovery:
- Start with 5-star sources for the highest quality baseline
- Supplement with 4-star sources for comprehensive coverage
- Include 3-star sources for specialized or personal collections
- Apply consistent filtering across all sources
Each discovered document receives a quality score based on:
- Source rating (1-5 stars)
- Academic validation (pass/fail filters)
- Metadata completeness (bibliographic richness)
- Content confidence (LLM assessment)
When multiple sources provide the same document:
- Higher-rated source takes precedence
- More complete metadata is preferred
- PDF availability increases priority
- Recent publication dates are favored
sources:
priority_order:
- semanticscholar # 5-star
- arxiv # 5-star
- pubmed # 4-star
- openlibrary # 4-star
- zotero # 3-star
quality_filters:
require_authors: true
require_venue: true
exclude_blogs: true
exclude_news: true
min_confidence: 0.7Each source integration implements quality-aware features:
- Metadata validation during content ingestion
- Quality scoring for search results
- Filtering pipelines to exclude low-quality content
- Confidence metrics for LLM assessment
The CLI reflects quality information:
📊 Search Results Quality Summary:
⭐⭐⭐⭐⭐ 15 papers from Semantic Scholar
⭐⭐⭐⭐⭐ 8 papers from arXiv
⭐⭐⭐⭐ 12 papers from PubMed
⭐⭐⭐ 5 papers from Zotero
🛡️ Quality Filters Applied:
✅ Academic validation: 40/45 papers passed
✅ Metadata completeness: 38/40 papers complete
✅ Deduplication: 35/38 unique papers- Performance monitoring of source reliability
- User feedback integration on content quality
- Automatic rating adjustments based on success metrics
- Research domain-specific rating adjustments
- Institution-specific source preferences
- User-customizable quality thresholds
- Source performance dashboards in reports
- Quality trend analysis over time
- Recommendation engines for source optimization
- Configuration Guide - Source configuration and settings
- Architecture Overview - System design and integration patterns
- Source Configuration - Detailed API setup and management