What is the instruction format used for training?

#11
by ArneDeutsch - opened

If I like to add another fine tune on top of this, what instruction format am I supposed to use? I am slightly confused because I thought mistral 7B is using in general the one with [INST], but here (https://huggingface.co/cognitivecomputations/dolphin-2.8-mistral-7b-v02/blob/main/added_tokens.json) I see added tokens <|im_start|> and <|im_end|>

Is it:

<s>[INST] %input% [/INST]
%output%</s>

Or:

<|im_start|>system
%system%
<|im_end|>
<|im_start|>user
%input%
<|im_end|>
<|im_start|>assistant
%output%
<|im_end|>

deleted
This comment has been hidden (marked as Resolved)

Sign up or log in to comment