Murnitur AI evaluates datasets in two ways: function evaluations and custom AI evaluations. This documentation guides you through setting up and running both.
Function evaluations let you evaluate your dataset without querying an LLM. They are predefined functions that validate specific aspects of your data.
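To make the idea concrete, here is a minimal sketch of what an LLM-free check can look like in plain Python. It does not use the Murnitur SDK: the row shape (`output` field) and the names `is_json`, `contains`, and `run_function_evals` are hypothetical and exist only for illustration.

```python
import json
from typing import Callable, Dict, List

# Hypothetical row shape: each dataset row carries an "output" string.
Row = Dict[str, str]
Check = Callable[[Row], bool]


def is_json(row: Row) -> bool:
    """Pass when the row's output parses as JSON."""
    try:
        json.loads(row["output"])
        return True
    except (json.JSONDecodeError, TypeError):
        return False


def contains(substring: str) -> Check:
    """Build a check that passes when the output contains `substring`."""
    def check(row: Row) -> bool:
        return substring in row["output"]
    return check


def run_function_evals(dataset: List[Row], checks: Dict[str, Check]) -> List[Dict[str, bool]]:
    """Run every named check against every row; no LLM is involved."""
    return [{name: check(row) for name, check in checks.items()} for row in dataset]


dataset = [{"output": '{"a": 1}'}, {"output": "hello"}]
checks = {"is_json": is_json, "contains_hello": contains("hello")}
results = run_function_evals(dataset, checks)
```

Because each check is a deterministic function of the row, results are reproducible and cost nothing to rerun, which is the main reason to prefer this style when the property can be tested mechanically.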
For more advanced evaluations, you can use custom AI evaluations. These run evaluations based on your own templates, using any LLM you choose.
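The template-plus-model pattern can be sketched as follows. This is an illustration under stated assumptions, not the Murnitur API: the prompt wording, the `input`/`output` row fields, and the names `PROMPT`, `evaluate_with_llm`, and `fake_llm` are all hypothetical. The model is passed in as a plain callable so that any LLM client could be substituted.

```python
from string import Template
from typing import Callable, Dict

# Hypothetical evaluation template; $input and $output are filled per row.
PROMPT = Template(
    "You are grading an answer.\n"
    "Question: $input\n"
    "Answer: $output\n"
    "Reply with PASS or FAIL."
)


def evaluate_with_llm(row: Dict[str, str], llm: Callable[[str], str]) -> bool:
    """Render the template for one row and interpret the model's verdict."""
    prompt = PROMPT.substitute(input=row["input"], output=row["output"])
    verdict = llm(prompt).strip().upper()
    return verdict.startswith("PASS")


def fake_llm(prompt: str) -> str:
    """Stand-in for a real model call, used here so the sketch runs offline."""
    return "PASS" if "Answer: Paris" in prompt else "FAIL"


passed = evaluate_with_llm({"input": "Capital of France?", "output": "Paris"}, fake_llm)
failed = evaluate_with_llm({"input": "Capital of France?", "output": "Lyon"}, fake_llm)
```

Keeping the model behind a `Callable[[str], str]` is a design choice for the sketch: swapping providers means replacing `fake_llm` with a function that calls the provider of your choice, while the template and verdict parsing stay unchanged.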