SoFair ModernBERT base filter

Fine-tuned ModernBERT to identify candidate documents for software mention extraction.

It was trained on SoFairOA/sofair_softcite_somesci (sofair_softcite_somesci_documents) to classify whether the given document contains at least one annotation.

Usage

We created https://github.com/SoFairOA/filter, a simple command-line tool to use this model for processing a collection of documents.

Evaluation

We evaluated this model on the test set of SoFairOA/sofair_softcite_somesci (sofair_softcite_somesci_documents) dataset:

precision 0.8625730994152047
recall 0.9104938271604939
f1 0.8858858858858859
accuracy 0.9268527430221367
Downloads last month
26
Safetensors
Model size
150M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for SoFairOA/sofair-modernBERT-base-filter

Finetuned
(646)
this model

Dataset used to train SoFairOA/sofair-modernBERT-base-filter