tmp save3
Changed files:
- model/base_model.py 2 additions, 1 deletion
- model/cached_autoregressive_model.py 2 additions, 2 deletions
- model/cuda2d_model.py 153 additions, 0 deletions
- model/mixins.py 2 additions, 2 deletions
- mpu/local_attention_function.py 0 additions, 79 deletions
- mpu/transformer.py 3 additions, 30 deletions
- pretrain_gpt2.py 27 additions, 748 deletions
- training/deepspeed_training.py 583 additions, 0 deletions
- training/learning_rates.py 81 additions, 0 deletions
- training/model_io.py 162 additions, 0 deletions