arctic-embed-l-tech_and_fiction

This is a finetuned version of: snowflake-arctic-embed-l

It is finetuned on a dataset of ~110K high quality synthetic examples, manually curated and edited.

The examples are primarily tech oriented, with some terms from fantasy and sci-fi fiction for good measure.

Training Code

Code for training this model is here: train.py

Note, this code is for Model A shown below.

For this model (Model B below) I trained at rank 128, alpha 128, with a learning rate of around 6e-6 I think, for around 10 epochs.

Benchmark Results

I've included the results for MTEB benchmark "MTEB(eng, v2)" in the results folder.

Here is a screenshot of the results summary (this is Model B):

benchmark

License

arctic-embed-l-tech_and_fiction is licensed under the Apache-2. The released models can be used for commercial purposes free of charge.

Acknowledgement

Thank you to the Snowflake team for making some excellent models!

Downloads last month
71
Safetensors
Model size
0.3B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support