Commits · e2342c21e5cf42e19ab19c6c2b6d66ccb17a7430 · mirrored_repos / MachineLearning / meta-llama / Llama Recipes

This project is mirrored from https://github.com/meta-llama/llama-recipes. Pull mirroring failed 2 months ago.
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer or owner.
Last successful update 2 months ago.

Oct 24, 2024

Append epoch rather than best val. loss to val_loss (#744) · e2342c21
Sanyam Bhutani authored 5 months ago

e2342c21

Append epoch rather than best val. loss to val_loss · 2a94bfff

celestinoalan authored 5 months ago

**Problem**
Currently, we're val_loss.append(best_val_loss) in each epoch. This is misleading because we're appending the corresponding epoch (not best across epochs) quantities in train_loss, train_prep, and val_prep. This is also inconvenient, as one often would like to plot both train and validation losses as a function of the epochs to look for overfitting.

**Solution**
val_loss.append(eval_epoch_loss)

2a94bfff

Oct 21, 2024
- Save the `preprocessor_config.json` and `chat_template.json` for mllama model... · d8b0eba7
  Kai Wu authored 5 months ago
  
  Save the `preprocessor_config.json` and `chat_template.json` for mllama model after conversion (#741)
  d8b0eba7
- fixed wordlist · 28454212
  Kai Wu authored 5 months ago
  
  28454212
- fixed deadlinks · 945886b6
  Kai Wu authored 5 months ago
  
  945886b6
- fixed the missing processor after conversion · bd31680e
  Kai Wu authored 5 months ago
  
  bd31680e
Oct 18, 2024
- Support converting fine-tuned llama 3.2 vision model to HF format and then local inference (#737) · 799e90eb
  Sanyam Bhutani authored 5 months ago
  
  799e90eb
- Merge branch 'main' into fsdp_lmm · 6af36197
  Kai Wu authored 5 months ago
  
  6af36197
- fix typo · 8715e044
  Kai Wu authored 5 months ago
  
  8715e044
- add readme · 0ef6b8d0
  Kai Wu authored 5 months ago
  
  0ef6b8d0
- Update wordlist.txt (#736) · 82d40492
  Sanyam Bhutani authored 5 months ago
  
  82d40492
- Fix 3 · 916f309a
  Sanyam Bhutani authored 5 months ago
  
  916f309a
- Fix 2 · a6c7fe65
  Sanyam Bhutani authored 5 months ago
  
  a6c7fe65
- fix link 1 · ace827dd
  Sanyam Bhutani authored 5 months ago
  
  ace827dd
- convertion missing preprocessor_config.json. · 2ea7f579
  Kai Wu authored 5 months ago
  
  2ea7f579
- Update wordlist.txt · bf0fa150
  Sanyam Bhutani authored 5 months ago
  
  bf0fa150
Oct 17, 2024
- Tool Calling Tutorial and Example (#697) · dca7ecb3
  Sanyam Bhutani authored 5 months ago
  
  dca7ecb3
- Add readme · 2f7ef323
  Sanyam Bhutani authored 5 months ago
  
  2f7ef323
- Fix 201 nb · df4b0243
  Sanyam Bhutani authored 5 months ago
  
  df4b0243
- Fix notebooks 1 · ee08696f
  Sanyam Bhutani authored 5 months ago
  
  ee08696f
- API fix · 239b5658
  Sanyam Bhutani authored 5 months ago
  
  239b5658
- Fixed comments · 9dce2223
  Sanyam Bhutani authored 5 months ago
  
  9dce2223
- Fix/unit test 3.2 (#726) · b554b24b
  Sanyam Bhutani authored 5 months ago
  
  b554b24b
Oct 16, 2024
- Fix numpy seed in finetuning.py (#728) · a8e9f4ec
  Kai Wu authored 5 months ago
  
  a8e9f4ec
- Merge branch 'meta-llama:main' into set-numpy-seed-in-finetuning · b75a79e7
  Patrik Lambert authored 5 months ago
  
  b75a79e7
Oct 15, 2024
- Initial Crusoe examples to 3p_integrations recipes (#716) · 4e6e7e4f
  Hamid Shojanazeri authored 5 months ago
  
  4e6e7e4f
- Fix fixture in test_train_utils · d9ca0996
  Matthias Reso authored 5 months ago
  
  d9ca0996
- Merge remote-tracking branch 'origin/main' into fix/unit_test_3.2 · d58dea23
  Matthias Reso authored 5 months ago
  
  d58dea23
- Lets run the basic tests on all PRs · 8a06b1d3
  Matthias Reso authored 5 months ago
  
  8a06b1d3
- quick fix on readmes and deadlinks (#729) · 3128aee0
  Sanyam Bhutani authored 5 months ago
  
  3128aee0
- quick fix on readmes and deadlinks · 91bd59d4
  Kai Wu authored 5 months ago
  
  91bd59d4
- Updated spellcheck word list. · 268e473b
  Ethan authored 5 months ago
  
  268e473b
- Merge remote-tracking branch 'origin' into 3p-integrations-crusoe · f0850a3e
  Ethan authored 5 months ago
  
  f0850a3e
- Fix numpy seed in finetuning.py · f521c93d
  Patrik Lambert authored 5 months ago
  
  Set numpy seed in finetuning.py to fix it during finetuning (including in custom_dataset.py) and have it set in functions such as Dataset.train_test_split. This avoids having different train/test splits in different ranks, which may cause NCCL collective operation timeout errors.
  f521c93d
- Fix fine-tuning training loss accumulation (#725) · d6ae2031
  celestinoalan authored 5 months ago
  
  d6ae2031
Oct 14, 2024
- Correct gha runner workflow · 6748b6f2
  Matthias Reso authored 5 months ago
  
  6748b6f2
- mock samsum in test_batching · 1de6ac31
  Matthias Reso authored 5 months ago
  
  1de6ac31
- Remove old test · 26dff882
  Matthias Reso authored 5 months ago
  
  26dff882
- Remove debug print statements · 8075c480
  Matthias Reso authored 5 months ago
  
  8075c480
- Fix test on non cuda machine · 448af9d7
  Matthias Reso authored 5 months ago
  
  448af9d7