reward-bench

RewardBench: the first evaluation tool for reward models.

Last updated: 11 minutes ago

nltk_hf

Last updated: 11 minutes ago

s1

s1: Simple test-time scaling

Last updated: 11 minutes ago

ChineseMenuCSI

Last updated: 11 minutes ago

trigrams

Last updated: 11 minutes ago

TranslationTemplateTransformer

A method that incorporates translation templates into Transformer-based neural machine translation

Last updated: 11 minutes ago

VideoGameDialogueCorpusPublic

Last updated: 11 minutes ago

MCTS-DPO

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Last updated: 11 minutes ago

compare-agent-frameworks

Last updated: 11 minutes ago

alignment-scripts

Scripts to preprocess training and test data and to run fast_align and giza

Last updated: 11 minutes ago

mtme-zh-mwe

The dataset of WMT22 Metrics Shared Task extended with annotations of Chinese Multiword Expressions and MQM-based translation errors associated wit...

Last updated: 11 minutes ago

evaluate

A library for easily evaluating machine learning models and datasets.

Last updated: 11 minutes ago

skformer

Last updated: 11 minutes ago

Pantone-color-libraries

Pantone color libraries as .acb files for Photoshop, etc.

Last updated: 11 minutes ago

colorless

Colorless green ideas sleep furiously

Last updated: 11 minutes ago

COMET

A Neural Framework for MT Evaluation

Last updated: 11 minutes ago

prism

MT Evaluation in Many Languages via Zero-Shot Paraphrasing

Last updated: 11 minutes ago

terminology_evaluation

Last updated: 11 minutes ago

indic_nlp_resources

Resources to go with the Indic NLP Library

Last updated: 11 minutes ago

obtuse

Last updated: 11 minutes ago