Adding Flash Attention and xFormers memory-efficient attention through PyTorch SDPA
Showing 9 changed files:
- README.md: 9 additions, 1 deletion
- configs/training.py: 1 addition, 0 deletions
- docs/inference.md: 12 additions, 0 deletions
- docs/mutli_gpu.md: 8 additions, 1 deletion
- inference/chat_completion.py: 14 additions, 0 deletions
- inference/inference.py: 13 additions, 0 deletions
- llama_finetuning.py: 12 additions, 1 deletion
- requirements.txt: 1 addition, 1 deletion
- utils/train_utils.py: 17 additions, 6 deletions
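Based on the commit title, the change presumably routes attention through PyTorch's scaled dot product attention (SDPA) kernel selection, which can dispatch to the Flash Attention and xFormers-style memory-efficient kernels. The sketch below is a minimal illustration of that mechanism, assuming PyTorch >= 2.0 and a CUDA device; the tensor shapes, dtype, and kernel flags are illustrative and not taken from this commit's diff.

```python
import torch
import torch.nn.functional as F

# Hypothetical attention inputs in (batch, heads, seq_len, head_dim) layout,
# the shape SDPA expects. Values here are placeholders.
query = torch.rand(1, 8, 128, 64, device="cuda", dtype=torch.float16)
key = torch.rand(1, 8, 128, 64, device="cuda", dtype=torch.float16)
value = torch.rand(1, 8, 128, 64, device="cuda", dtype=torch.float16)

# Restrict SDPA to the fused Flash / memory-efficient kernels and
# disable the slower unfused math fallback for this region.
with torch.backends.cuda.sdp_kernel(
    enable_flash=True,
    enable_math=False,
    enable_mem_efficient=True,
):
    out = F.scaled_dot_product_attention(query, key, value, is_causal=True)
```

Wrapping the training and inference entry points in such a context manager would be consistent with the files touched here (utils/train_utils.py, llama_finetuning.py, and the inference scripts), but the exact flag values used by the commit are not shown on this page.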