# Evaluating LLMs in Production
Moving LLMs from prototype to production requires a robust evaluation framework. Standard machine learning metrics often fail to capture the nuances of generative AI.
Sarah Kim
Principal AI Engineer
# Evaluating LLMs in Production
Moving LLMs from prototype to production requires a robust evaluation framework. Standard machine learning metrics often fail to capture the nuances of generative AI.