Tag: small LLMs

Compression-Aware Prompting: Getting the Best from Small LLMs

by Phillip Ramos

Learn how compression-aware prompting gets more out of small LLMs by distilling prompts down to what the model needs. Explore techniques like TPC and LLMLingua to cut costs, boost speed, and improve RAG accuracy.

Recent posts

Performance Budgets for Frontend Development: Set, Measure, Enforce

Jan 4, 2026

Education and Generative AI: Curriculum Design, Assessment, and Tutoring

May 19, 2026

Preventing Catastrophic Forgetting During LLM Fine-Tuning: Techniques That Work

Apr 1, 2026

Domain Adaptation in NLP: Fine-Tuning Large Language Models for Specialized Fields

Feb 24, 2026

When Vibe Coding Works Best: Project Types That Benefit from AI-Generated Code

Mar 23, 2026