notebooks/finetune_chinese_alpaca_lora.ipynb
More info: https://github.com/ymcui/Chinese-LLaMA-Alpaca
!pip install transformers==4.28.1
!pip install git+https://github.com/huggingface/peft.git@13e53fc
!pip install datasets
!pip install sentencepiece
!pip install deepspeed
!git clone https://github.com/ymcui/Chinese-LLaMA-Alpaca.git
This follows the settings in https://github.com/ymcui/Chinese-LLaMA-Alpaca/wiki/SFT-Script, except that, to simplify the tutorial, we train on a single GPU for only 100 steps and use the repository's 51k Chinese Alpaca instruction data as the SFT dataset.
!mkdir Chinese-LLaMA-Alpaca/sft_data
!cp Chinese-LLaMA-Alpaca/data/alpaca_data_zh_51k.json Chinese-LLaMA-Alpaca/sft_data
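As a quick sanity check, you can peek at the copied data. The field names below are an assumption based on the standard Alpaca instruction/input/output JSON format:
import json

# Field names assume the standard Alpaca format (instruction / input / output)
with open("/content/Chinese-LLaMA-Alpaca/sft_data/alpaca_data_zh_51k.json", encoding="utf-8") as f:
    sft_data = json.load(f)
print(len(sft_data), "examples")
print(sft_data[0])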
!cd Chinese-LLaMA-Alpaca/scripts && torchrun --nnodes 1 --nproc_per_node 1 run_clm_sft_with_peft.py \
--deepspeed ds_zero2_no_offload.json \
--model_name_or_path decapoda-research/llama-7b-hf \
--tokenizer_name_or_path ziqingyang/chinese-alpaca-lora-7b \
--dataset_dir /content/Chinese-LLaMA-Alpaca/sft_data \
--validation_split_percentage 0.001 \
--per_device_train_batch_size 1 \
--do_train \
--fp16 \
--seed $RANDOM \
--max_steps 100 \
--lr_scheduler_type cosine \
--learning_rate 1e-4 \
--warmup_ratio 0.03 \
--weight_decay 0 \
--logging_strategy steps \
--logging_steps 10 \
--save_strategy steps \
--save_total_limit 3 \
--save_steps 50 \
--gradient_accumulation_steps 1 \
--preprocessing_num_workers 8 \
--max_seq_length 512 \
--output_dir /content/output_model \
--overwrite_output_dir \
--ddp_timeout 30000 \
--logging_first_step True \
--torch_dtype float16 \
--peft_path ziqingyang/chinese-alpaca-lora-7b \
--gradient_checkpointing \
--ddp_find_unused_parameters False
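Once the run finishes, it helps to check what was written to the output directory; the exact layout depends on the script/Trainer version, but the top-level pytorch_model.bin is the file renamed in the next step:
import os

# List everything the training run wrote, with file sizes
for root, _, files in os.walk("/content/output_model"):
    for name in files:
        path = os.path.join(root, name)
        print(f"{path}  {os.path.getsize(path) / 1e6:.1f} MB")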
After training, rename the saved pytorch_model.bin to adapter_model.bin so that PEFT recognizes it as a LoRA adapter:
!mkdir output_model/peft_model
!mv output_model/pytorch_model.bin output_model/peft_model/adapter_model.bin
Lastly, you need to manually create an adapter_config.json under peft_model and fill in the hyperparameters such as lora_rank, lora_alpha, etc.; its content and format can be copied from the corresponding file in Chinese-Alpaca-LoRA (e.g. ziqingyang/chinese-alpaca-lora-7b on the Hugging Face Hub).
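One way to generate this file is with PEFT's own LoraConfig, which writes adapter_config.json via save_pretrained. This is only a sketch: the rank, alpha, target modules, and modules_to_save below are placeholder assumptions and should be replaced with the values actually used for training (or those in the adapter_config.json shipped with ziqingyang/chinese-alpaca-lora-7b):
from peft import LoraConfig

# Placeholder hyperparameters -- copy the real values from the reference
# adapter_config.json in ziqingyang/chinese-alpaca-lora-7b (or your training run).
config = LoraConfig(
    task_type="CAUSAL_LM",
    base_model_name_or_path="decapoda-research/llama-7b-hf",
    inference_mode=True,
    r=8,              # lora_rank used during training (assumed)
    lora_alpha=32,    # assumed
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj", "k_proj", "o_proj", "gate_proj", "down_proj", "up_proj"],
    modules_to_save=["embed_tokens", "lm_head"],
)
config.save_pretrained("/content/output_model/peft_model")  # writes adapter_config.json
With both adapter_model.bin and adapter_config.json in place, the peft_model directory can be loaded as a regular LoRA adapter with PeftModel.from_pretrained on top of the base LLaMA model (remember to resize the base model's token embeddings to the Chinese tokenizer's vocabulary first), as in the repository's inference scripts.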