MiniPile Challenge

Task Description

Use the MiniPile dataset to pre-train the best possible language model. The model will be tested on SuperGLUE.

Dataset

Dataset: AlgorithmicResearchGroup/minipile

Dataset structure:

DatasetDict({
    train: Dataset({
        features: ['text'],
        num_rows: 1000000
    })
    validation: Dataset({
        features: ['text'],
        num_rows: 500
    })
    test: Dataset({
        features: ['text'],
        num_rows: 10000
    })
})

Evaluation Process

Evaluation will be done on SuperGLUE.

You may run the following command to evaluate your model:

lm_eval --model hf \
        --model_args pretrained=<path_to_your_model> \
        --tasks super_glue \
        --device cuda:0 \
        --batch_size 8

Hardware Constraints

One A100 40GB GPU

Time Constraints

24 Hour Time Limit

Recommended Libraries

Huggingface Transformers