disi-unibo-nlp
/

zeroner-base

Token Classification

Model card Files Files and versions Community

alecocc commited on 8 days ago

Commit

036fc15

·

verified ·

1 Parent(s): 6fdb5de

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -71,7 +71,7 @@ displacy.serve(doc, style="ent")
 We have created a free [Google Colab notebook](https://colab.research.google.com/drive/1IVrTIqIlsARraI6pM-mVdYHIzNAo4Ap1?usp=sharing) to help you explore the library and customize it for your specific use case with ease.
-## 📥 Training Data (Unfiltered)
 The model is trained on synthetic annotations generated by LLaMA-3.1-8B-instruct over the [Pile Uncopyrighted](https://huggingface.co/datasets/monology/pile-uncopyrighted) dataset.
 The resulting automatically annotated dataset, [PileUncopyrighted-NER-BIO](https://huggingface.co/datasets/disi-unibo-nlp/PileUncopyrighted-NER-BIO), follows the BIO format and was used as the training source for this model.

 We have created a free [Google Colab notebook](https://colab.research.google.com/drive/1IVrTIqIlsARraI6pM-mVdYHIzNAo4Ap1?usp=sharing) to help you explore the library and customize it for your specific use case with ease.
+## 📥 Training Data
 The model is trained on synthetic annotations generated by LLaMA-3.1-8B-instruct over the [Pile Uncopyrighted](https://huggingface.co/datasets/monology/pile-uncopyrighted) dataset.
 The resulting automatically annotated dataset, [PileUncopyrighted-NER-BIO](https://huggingface.co/datasets/disi-unibo-nlp/PileUncopyrighted-NER-BIO), follows the BIO format and was used as the training source for this model.