Any plans to release the training recipe?
#21, opened by nskwal
Are there any plans to release the training recipe and configuration used with Megatron-LM?
@okuchaiev Are there any detailed scripts showing how to generate the training data and how to run the phased pre-training and SFT, so that the metrics and conclusions of the paper can be reproduced?