Case Study: Reference-free vs. Reference-based Evaluation of a RAG Pipeline