This workspace includes a tiny utility to validate a result-only submission JSONL and create a clean ZIP ready to upload to Codabench.
- Calculate the F1 for your submission locally
- Create the ZIP file for automatic scoring on the Codabench leaderboard
An example of the submission file participants must create is presented at your_submission_data/example_submission_italian.jsonl.
The submission file must have the following structure:
- One JSON object per line (JSONL)
- Each record has:
id: non-empty string (e.g., patient ID)predictions: non-empty list of objects, each with:item: non-empty stringprediction: string
Example (single line, truncated):
{"id":"1234","predictions":[{"item":"Exam: haemoglobin ","prediction":" 8g/dl. "}, ... ]}Use scoring.py to evaluate your submission locally.
- Input predictions:
your_submission_data/example_submission_italian.jsonl - Language:
enorit
python3 scoring.py \
--submission_path your_submission_data/example_submission_italian.jsonl \
--language it- Writes a JSON file with the metrics to
your_sumbmission_scores/scores.json(created automatically). - Prints a short summary to the terminal.
- Make sure your
example_submission_italian.jsonlfollows the expected submission schema for this task. - The reference file for development runs is fixed to
development_data/dev_gt.jsonlinside the script. - Use
--language itfor Italian submissions;enfor English.
check_submission_format.py validates the structure and writes a ZIP with the file at the archive root as mock_data_dev_codabench.jsonl.
python3 check_submission_format.py your_submission_data/dev_submission_all_correct.jsonl --out your_submission_data/submission_validated.zip- If validation passes, it prints the output path and exits with code 0.
- On failure, it prints a clear error and exits non‑zero.
- If your input filename differs, the script warns and still writes the ZIP with the required name.
The produced ZIP contains exactly one file at the root:
submission_validated.zip
└── mock_data_dev_codabench.jsonl
- Upload the resulting ZIP to Codabench