Writing & Research

Articles & Engineering Notes

This is the hub for my long-form writing on data science, machine learning, and the engineering patterns that support them in production.

Each article aims to balance clarity with rigor: you will find walkthroughs of algorithms, postmortems from experiments, and the practical guardrails that emerge from shipping systems.

Latest articles

Use the filters to surface the topics, stacks, and case studies that match your current problem.

Mermaid diagram showing three pillars of LLM evaluation: What to Evaluate (Faithfulness vs Helpfulness), How to Evaluate (Methods and Metrics), and Making it Systematic (Process and Monitoring), connected in a circular feedback loop

Beyond the Vibe Check: A Systematic Approach to LLM Evaluation

Stop relying on gut feelings to evaluate LLM outputs. Learn systematic approaches to build trustworthy evaluation pipelines with measurable metrics, proven methods, and production-ready practices. A practical guide covering faithfulness vs helpfulness, LLM-as-judge techniques, bias mitigation, and continuous monitoring.

58 min read

Read article

Writing approach

The blog is a living lab notebook. Some essays are polished deep dives, others capture lessons while they are still fresh — both have a place in the learning loop.

Perfection slows the feedback cycle, so I share drafts, return with new data, and document the missteps alongside the breakthroughs.

If something sparks a question or disagreement, please reach out. Dialogue keeps the writing honest and ensures the next revision is better informed.