Evaluation Logs
The evaluation logs store the evaluation results from the EvaluationAgent
. The evaluation log contains the following information:
Field | Description | Type |
---|---|---|
Reason | The detailed reason for your judgment, by observing the screenshot differences and the |
String |
Sub-score | The sub-score of the evaluation in decomposing the evaluation into multiple sub-goals. | List of Dictionaries |
Complete | The completion status of the evaluation, can be yes , no , or unsure . |
String |
level | The level of the evaluation. | String |
request | The request sent to the EvaluationAgent . |
Dictionary |
id | The ID of the evaluation. | Integer |