Evaluation
The evaluation (eval) submodule includes functionalities for computing metrics for evaluating NLP model performance. There are general evaluation metrics like accuracy, precision, recall, and f1 scores for classification scenarios. In addition, we also include evaluation utilities for specialized tasks like question answering and sentence embedding.