writings

i write about what i learn

fixing rag retrieval with semantic double-pass chunking
intelligent context management: system design for ai agents
building a context relevance classifier with llama 3.2
training llama 3.2-3b to think better: grpo-lora-rl
colqwenrag: vision-based rag for legal-ai papers
fine-tuning embedding models for legal-rag
building custom reasoning models with grpo and sft
how to make models "think": the deepseek approach
fine-tuned a llm to draft indian legal contracts