awesome-llama: https://github.com/Joyce94/LLM-RLHF-Tuning
Score: 6.093569770045136
Last synced: about 1 hour ago
Repository metadata:
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
- Host: GitHub
- URL: https://github.com/Joyce94/LLM-RLHF-Tuning
- Owner: Joyce94
- Created: 2023-06-12T14:46:48.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-10-11T08:41:20.000Z (over 2 years ago)
- Last Synced: 2025-11-16T06:03:46.468Z (3 months ago)
- Topics: fine-tuning, language-model, llama, llm, lora, peft, ppo, reinforcement-learning, rlhf
- Language: Python
- Homepage:
- Size: 22.3 MB
- Stars: 440
- Watchers: 2
- Forks: 22
- Open Issues: 3
Metadata Files:
- Readme: README.md
Owner metadata:
- Name:
- Login: Joyce94
- Email:
- Kind: user
- Description:
- Website:
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/28557140?v=4
- Repositories: 1
- Last Synced at: 2023-06-12T17:53:20.329Z
- Profile URL: https://github.com/Joyce94
GitHub Events
Total
- Fork event: 7
- Watch event: 80
- Total: 87
Last Year
- Fork event: 7
- Watch event: 68
- Total: 75
Committers metadata
Last synced: 3 months ago
Total Commits: 27
Total Committers: 1
Avg Commits per committer: 27.0
Development Distribution Score (DDS): 0.0
Commits in past year: 0
Committers in past year: 0
Avg Commits per committer in past year: 0.0
Development Distribution Score (DDS) in past year: 0.0
| Name | Email | Commits |
|---|---|---|
| Joyce94 | 1****6@q****m | 27 |
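The Development Distribution Score of 0.0 above follows from there being a single committer. A minimal sketch, assuming DDS is defined as one minus the top committer's share of all commits (this definition and the function name are assumptions, not taken from this page):

```python
def development_distribution_score(commits_per_author):
    """Return 1 - (top committer's commits / total commits).

    Assumed definition; returns 0.0 when there are no commits at all.
    """
    total = sum(commits_per_author.values())
    if total == 0:
        return 0.0
    return 1.0 - max(commits_per_author.values()) / total


# One committer with all 27 commits, as in the table above.
print(development_distribution_score({"Joyce94": 27}))
```

Under this definition the score rises toward 1 as commits are spread more evenly across more authors.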
Issue and Pull Request metadata
Last synced: 5 months ago
Total issues: 3
Total pull requests: 0
Average time to close issues: about 2 months
Average time to close pull requests: N/A
Total issue authors: 3
Total pull request authors: 0
Average comments per issue: 1.0
Average comments per pull request: 0
Merged pull request: 0
Bot issues: 0
Bot pull requests: 0
Past year issues: 0
Past year pull requests: 0
Past year average time to close issues: N/A
Past year average time to close pull requests: N/A
Past year issue authors: 0
Past year pull request authors: 0
Past year average comments per issue: 0
Past year average comments per pull request: 0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- couldn (1)
- AI-Study-Han (1)
- pengjiao123 (1)
Top Pull Request Authors
Top Issue Labels
Top Pull Request Labels
Dependencies
- accelerate ==0.21.0
- datasets ==2.13.1
- peft ==0.4.0
- scikit-learn ==1.3.0
- sentencepiece ==0.1.99
- torch ==2.0.1
- tqdm ==4.65.0
- transformers ==4.31.0
- trl ==0.5.0
- wandb ==0.15.8
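Because every dependency is pinned to an exact version, it can help to compare a local environment against the pins before running anything. The following is a rough sketch using the standard-library `importlib.metadata`; the `PINS` dictionary and the `compare_pins` helper are illustrative names, not part of the repository:

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the dependency list above.
PINS = {
    "accelerate": "0.21.0",
    "datasets": "2.13.1",
    "peft": "0.4.0",
    "scikit-learn": "1.3.0",
    "sentencepiece": "0.1.99",
    "torch": "2.0.1",
    "tqdm": "4.65.0",
    "transformers": "4.31.0",
    "trl": "0.5.0",
    "wandb": "0.15.8",
}


def compare_pins(pins, lookup=version):
    """Map each package to (wanted, found, matches); found is None if absent.

    Uses exact string comparison, so a build-tagged version such as
    "2.0.1+cu118" is reported as a mismatch.
    """
    report = {}
    for name, wanted in pins.items():
        try:
            found = lookup(name)
        except PackageNotFoundError:
            found = None
        report[name] = (wanted, found, found == wanted)
    return report


if __name__ == "__main__":
    for name, (wanted, found, ok) in compare_pins(PINS).items():
        status = "ok" if ok else "MISMATCH"
        print(f"{name}: wanted {wanted}, found {found} [{status}]")
```

The `lookup` parameter exists only so the comparison can be exercised without installing the packages.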