awesome-llama: https://github.com/magpie-align/magpie
alignment dataset gemma llama2 llama3 llm nlp paper phi3 qwen2 supervised-finetuning synthetic-data synthetic-dataset-generation
Score: 8.717845704894915
Last synced: about 3 hours ago
JSON representation
Repository metadata:
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
- Host: GitHub
- URL: https://github.com/magpie-align/magpie
- Owner: magpie-align
- License: mit
- Created: 2024-06-12T08:03:39.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2025-03-17T20:23:26.000Z (over 1 year ago)
- Last Synced: 2026-05-20T05:38:55.111Z (about 1 month ago)
- Topics: alignment, dataset, gemma, llama2, llama3, llm, nlp, paper, phi3, qwen2, supervised-finetuning, synthetic-data, synthetic-dataset-generation
- Language: Python
- Homepage: https://magpie-align.github.io/
- Size: 1.08 MB
- Stars: 861
- Watchers: 5
- Forks: 68
- Open Issues: 12
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Owner metadata:
- Name: magpie-align
- Login: magpie-align
- Email:
- Kind: organization
- Description:
- Website:
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/171787786?v=4
- Repositories: 1
- Last Synced at: 2024-06-05T09:53:53.890Z
- Profile URL: https://github.com/magpie-align
GitHub Events
Total
- Fork event: 23
- Issue comment event: 42
- Issues event: 18
- Pull request event: 4
- Push event: 15
- Watch event: 340
- Total: 442
Last Year
- Fork event: 6
- Issue comment event: 1
- Issues event: 2
- Watch event: 86
- Total: 95
Committers metadata
Last synced: about 1 month ago
Total Commits: 69
Total Committers: 7
Avg Commits per committer: 9.857
Development Distribution Score (DDS): 0.101
Commits in past year: 0
Committers in past year: 0
Avg Commits per committer in past year: 0.0
Development Distribution Score (DDS) in past year: 0.0
| Name | Commits | |
|---|---|---|
| fly_dust | f****8@g****m | 62 |
| Tendo33 | s****2@g****m | 2 |
| 胡亮 | 1****2@q****m | 1 |
| Mandlin Sarah | m****h@g****m | 1 |
| Iker García-Ferrero | i****o@g****m | 1 |
| F. J. | 4****g | 1 |
| Andres Uribe | a****7@g****m | 1 |
Issue and Pull Request metadata
Last synced: 3 months ago
Total issues: 40
Total pull requests: 9
Average time to close issues: 15 days
Average time to close pull requests: 4 days
Total issue authors: 37
Total pull request authors: 5
Average comments per issue: 1.65
Average comments per pull request: 0.44
Merged pull request: 9
Bot issues: 0
Bot pull requests: 0
Past year issues: 10
Past year pull requests: 0
Past year average time to close issues: about 2 months
Past year average time to close pull requests: N/A
Past year issue authors: 10
Past year pull request authors: 0
Past year average comments per issue: 1.0
Past year average comments per pull request: 0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- Tendo33 (2)
- FlyCarrot (2)
- blackblue9 (2)
- rogerslh (1)
- mrcabbage972 (1)
- lyravv (1)
- wwwadx (1)
- xxlxms (1)
- mkaratayev (1)
- qychen2001 (1)
- DylanDDeng (1)
- impact-rm (1)
- slark-prime (1)
- hyunwoongko (1)
- johnr14 (1)
Top Pull Request Authors
- andresuribe87 (2)
- mandlinsarah (2)
- huliang2016 (2)
- Tendo33 (2)
- ikergarcia1996 (1)
Top Issue Labels
Top Pull Request Labels
Dependencies
- accelerate *
- anthropic *
- autoawq *
- bitsandbytes *
- boto3 *
- datasets *
- faiss-gpu *
- fschat *
- google-generativeai *
- ipykernel *
- ipywidgets *
- matplotlib *
- ml_collections *
- openai *
- peft *
- ray *
- sentence-transformers *
- sentencepiece *
- tenacity *
- transformers *
- trl *
- vllm *
- wandb *