Blog
Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.
Category: aiClear filters
••February 12, 2024
Fine-tuning Large Language Models: A Practical Guide
Learn how to fine-tune LLMs like Llama 2, Mistral, and GPT models for your specific use case. Includes LoRA, QLoRA, and full fine-tuning techniques.
KU
Kiril Urbonas••February 3, 2024
Building Production-Ready AI Applications with LangChain and Docker
Learn how to containerize and deploy LangChain applications in production. Best practices for scaling, monitoring, and maintaining AI-powered services.
KU
Kiril Urbonas••January 15, 2024
Orchestrating AI Agents on Kubernetes
A deep dive into managing stateful LLM workloads, scaling inference endpoints, and optimizing GPU utilization in a cloud-native environment.
KU
Kiril Urbonas••January 1, 2024
Fine-tuning Llama 3 on Consumer Hardware
Optimization techniques like LoRA and 4-bit quantization to run state-of-the-art models locally.
KU
Kiril Urbonas