YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Vocabulary Trimmed intfloat/multilingual-e5-large: nicolaebanari/me5-large-trimmed-nl-test
This model is a trimmed version of intfloat/multilingual-e5-large by vocabtrimmer
, a tool for trimming vocabulary of language models to compress the model size.
Following table shows a summary of the trimming process.
intfloat/multilingual-e5-large | nicolaebanari/me5-large-trimmed-nl-test | |
---|---|---|
parameter_size_full | 559,890,432 | 355,090,432 |
parameter_size_embedding | 256,002,048 | 51,202,048 |
vocab_size | 250,002 | 50,002 |
compression_rate_full | 100.0 | 63.42 |
compression_rate_embedding | 100.0 | 20.0 |
Following table shows the parameter used to trim vocabulary.
language | dataset | dataset_column | dataset_name | dataset_split | target_vocab_size | min_frequency |
---|---|---|---|---|---|---|
nl | allenai/c4 | text | nl | validation | 50000 | 2 |
- Downloads last month
- 16
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support