Tag: RAG

18Apr

Compression-Aware Prompting: Getting the Best from Small LLMs

Posted by JAMIUL ISLAM 0 Comments

Learn how compression-aware prompting helps small LLMs perform like giants by distilling prompts, reducing token costs, and improving RAG efficiency.

10Mar

Hybrid Search for RAG: Why Combining Keyword and Semantic Retrieval Boosts LLM Accuracy

Posted by JAMIUL ISLAM 5 Comments

Hybrid search for RAG combines semantic and keyword retrieval to fix the blind spots of each method alone. It boosts accuracy for technical, legal, and medical queries by ensuring exact terms aren’t missed - and is now the standard for enterprise LLM systems.