  • Jerry Liu's avatar
    cr  
    Jerry Liu committed
    # Finetuning LLaMa + Text-to-SQL 
    
    This walkthrough shows how to fine-tune LLaMa-7B on a Text-to-SQL dataset and then use the fine-tuned model,
    through LlamaIndex, to run inference against any database of structured data.
    
    This code is taken and adapted from the Modal `doppel-bot` repo: https://github.com/modal-labs/doppel-bot.
    
    ### Stack
    
    - LlamaIndex
    - Modal
    - Hugging Face datasets
    - OpenLLaMA
    - PEFT
    
    ### Steps for running
    
    Please see the notebook (TODO) for full instructions.
    
    In the meantime, you can run each step individually with the commands below:
    
    Loading data:
    `modal run src.load_data_sql`
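
To make the data-loading step more concrete, here is a minimal sketch of how a single Text-to-SQL record might be turned into a prompt/completion pair for finetuning. The field names (`question`, `context`, `answer`), the prompt template, and the `format_example` helper are assumptions for illustration only; the actual `src.load_data_sql` module may use a different dataset schema and template.

```python
from dataclasses import dataclass

# Hypothetical prompt template (an assumption, not taken from the repository).
PROMPT_TEMPLATE = (
    "You are given a database schema and a question.\n"
    "Schema:\n{context}\n\n"
    "Question: {question}\n"
    "SQL: "
)


@dataclass
class TrainingExample:
    prompt: str      # text the model conditions on
    completion: str  # SQL the model should learn to produce


def format_example(record: dict) -> TrainingExample:
    """Turn one raw record into a prompt/completion pair.

    Assumes the record has `question`, `context`, and `answer` keys.
    """
    prompt = PROMPT_TEMPLATE.format(
        context=record["context"].strip(),
        question=record["question"].strip(),
    )
    return TrainingExample(prompt=prompt, completion=record["answer"].strip())


# Illustrative record; the schema and values are placeholders.
record = {
    "question": "Which city has the highest population?",
    "context": "CREATE TABLE city_stats (city_name TEXT, population INTEGER)",
    "answer": "SELECT city_name FROM city_stats ORDER BY population DESC LIMIT 1",
}

example = format_example(record)
print(example.prompt + example.completion)
```

Keeping the prompt and completion separate makes it possible to mask the prompt tokens out of the loss during finetuning, should the training code support that.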
    
    Finetuning:
    `modal run --detach src.finetune_sql`
    
    Inference:
    `modal run src.inference_sql_llamaindex::main --query "Which city has the highest population?" --sqlite-file-path "nbs/cities.db"`
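
The inference command takes a SQLite file. As a rough illustration of what it queries, the sketch below builds a small stand-in database with Python's built-in `sqlite3` module and runs the kind of SQL a Text-to-SQL model would be expected to produce for the example question. The table name `city_stats`, its columns, and the rows are placeholder assumptions; they are not the contents of `nbs/cities.db`.

```python
import sqlite3

# Placeholder rows (made-up names and numbers, for illustration only).
DEMO_ROWS = [
    ("Alphaville", 300, "Country A"),
    ("Betatown", 200, "Country B"),
    ("Gammaburg", 100, "Country C"),
]


def build_demo_db(path: str = ":memory:") -> sqlite3.Connection:
    """Create a tiny city table at `path` (in memory by default)."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS city_stats ("
        "city_name TEXT PRIMARY KEY, population INTEGER, country TEXT)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO city_stats VALUES (?, ?, ?)", DEMO_ROWS
    )
    conn.commit()
    return conn


conn = build_demo_db()

# SQL that a well-behaved model might generate for
# "Which city has the highest population?" against this schema.
EXPECTED_SQL = "SELECT city_name FROM city_stats ORDER BY population DESC LIMIT 1"
top_city = conn.execute(EXPECTED_SQL).fetchone()[0]
print(top_city)
```

Passing a file path instead of `:memory:` writes the database to disk, so the resulting file could then be supplied via `--sqlite-file-path`.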
    
    (Optional) Downloading model weights:
    `modal run src.download_weights --output-dir out_model`