This project is mirrored from https://github.com/meta-llama/llama-recipes.
Pull mirroring failed. Repository mirroring has been paused due to too many failed attempts; it can be resumed by a project maintainer or owner.
- Jan 15, 2025
  - Sanyam Bhutani authored
  - Sanyam Bhutani authored
  - Igor Kasianenko authored
  - Igor Kasianenko authored
- Jan 10, 2025
  - Alexandre Bassel authored
  - Sanyam Bhutani authored
  - Sanyam Bhutani authored
  - Sanyam Bhutani authored
  - Sanyam Bhutani authored
- Jan 09, 2025
  - Sanyam Bhutani authored
  - Sanyam Bhutani authored
  - Sanyam Bhutani authored
- Jan 02, 2025
  - Alex Schrimpf authored
- Nov 19, 2024
  - Guanghui Qin authored: Fix a typo. The FSDP wrapper should wrap the `MllamaCrossAttentionDecoderLayer`, which was missing (see the sketch below).
  - JimChienTW authored
  - JimChienTW authored
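For context, a minimal sketch of an FSDP transformer auto-wrap policy that includes this layer. The class names are taken from Hugging Face `transformers`; the exact set of layer classes wrapped in llama-recipes is an assumption here, as only the inclusion of `MllamaCrossAttentionDecoderLayer` is confirmed by the commit.

```python
# Sketch: FSDP auto-wrap policy covering Mllama decoder layers.
# Assumption: the full layer set mirrors what llama-recipes wraps;
# only MllamaCrossAttentionDecoderLayer's inclusion is from the commit.
import functools

from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from torch.distributed.fsdp.wrap import transformer_auto_wrap_policy
from transformers.models.mllama.modeling_mllama import (
    MllamaCrossAttentionDecoderLayer,  # was missing from the wrap policy
    MllamaSelfAttentionDecoderLayer,
)

auto_wrap_policy = functools.partial(
    transformer_auto_wrap_policy,
    transformer_layer_cls={
        MllamaSelfAttentionDecoderLayer,
        MllamaCrossAttentionDecoderLayer,
    },
)

# Usage: model = FSDP(model, auto_wrap_policy=auto_wrap_policy)
```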
- Nov 16, 2024
  - JimChienTW authored
- Oct 24, 2024
  - celestinoalan authored: **Problem:** Currently we call `val_loss.append(best_val_loss)` in each epoch. This is misleading because `train_loss`, `train_prep`, and `val_prep` record the corresponding epoch's quantities (not the best across epochs). It is also inconvenient, since one often wants to plot both training and validation losses against the epochs to look for overfitting. **Solution:** `val_loss.append(eval_epoch_loss)` (see the sketch below).
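A minimal sketch of the corrected bookkeeping; `train_one_epoch` and `evaluate` are hypothetical stand-ins for the real training and evaluation steps, not functions from llama-recipes.

```python
# Sketch of the per-epoch loop with the fixed val_loss bookkeeping
# (train_one_epoch/evaluate are hypothetical placeholders).
train_loss, val_loss = [], []
best_val_loss = float("inf")

for epoch in range(num_epochs):
    train_epoch_loss = train_one_epoch(model, train_loader)
    eval_epoch_loss = evaluate(model, eval_loader)

    train_loss.append(train_epoch_loss)      # per-epoch quantity, as before
    # Before the fix: val_loss.append(best_val_loss)  # best-so-far, misleading
    val_loss.append(eval_epoch_loss)         # per-epoch, consistent with train_loss

    if eval_epoch_loss < best_val_loss:
        best_val_loss = eval_epoch_loss
```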
- Oct 21, 2024
  - Kai Wu authored
- Oct 18, 2024
  - Kai Wu authored
- Oct 15, 2024
  - Matthias Reso authored
  - Patrik Lambert authored: Set the numpy seed in finetuning.py so that it is fixed during finetuning (including in custom_dataset.py) and takes effect in functions such as `Dataset.train_test_split`. This avoids different train/test splits on different ranks, which can cause NCCL collective-operation timeout errors (see the sketch below).
  - celestinoalan authored
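A minimal sketch of such seeding; the helper name and the specific seed value are assumptions. Per the commit message, fixing numpy's global seed is what makes `Dataset.train_test_split` deterministic across ranks.

```python
# Sketch: fix all relevant RNG seeds before any data splitting so every
# rank computes the identical train/test split (seed value is an assumption).
import random

import numpy as np
import torch

def set_deterministic_seed(seed: int) -> None:
    random.seed(seed)
    np.random.seed(seed)   # per the commit, this also fixes Dataset.train_test_split
    torch.manual_seed(seed)

set_deterministic_seed(42)
```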
- Oct 14, 2024
  - Matthias Reso authored
  - Matthias Reso authored
  - Matthias Reso authored
  - Matthias Reso authored
  - Matthias Reso authored
  - Matthias Reso authored
  - Matthias Reso authored
  - Matthias Reso authored
- Oct 12, 2024
  - Matthias Reso authored
  - Matthias Reso authored
  - Matthias Reso authored
- Oct 11, 2024
  - Matthias Reso authored
  - Matthias Reso authored
  - Matthias Reso authored
- Oct 08, 2024
  - Huang Zhihong authored
- Oct 02, 2024
  - Lucas Ventura authored (Co-authored-by: Matthias Reso <13337103+mreso@users.noreply.github.com>)
- Sep 27, 2024
  - Ikko Eltociear Ashimine authored: availble -> available