Tag: dense retrieval

Explore how Large Language Models transform traditional keyword search into semantic understanding using vector embeddings, dense retrieval, and re-ranking pipelines.

Recent-posts

Transformer Efficiency Tricks: KV Caching and Continuous Batching in LLM Serving

Transformer Efficiency Tricks: KV Caching and Continuous Batching in LLM Serving

Sep, 5 2025

Stopping AI Hallucinations: Practical Strategies for Reliable Generative AI

Stopping AI Hallucinations: Practical Strategies for Reliable Generative AI

Apr, 12 2026

How to Optimize Your Contact Center with Generative AI: Summaries, Sentiment, and Routing

How to Optimize Your Contact Center with Generative AI: Summaries, Sentiment, and Routing

Apr, 19 2026

Speculative Decoding and MoE: How These Techniques Slash LLM Serving Costs

Speculative Decoding and MoE: How These Techniques Slash LLM Serving Costs

Dec, 20 2025

How Training Duration and Token Counts Affect LLM Generalization

How Training Duration and Token Counts Affect LLM Generalization

Dec, 17 2025