awesome-llama: https://github.com/PKU-Alignment/llms-resist-alignment
Score: 4.406719247264253
Last synced: about 7 hours ago
Repository metadata:
[ACL2025 Best Paper] Language Models Resist Alignment
- Host: GitHub
- URL: https://github.com/PKU-Alignment/llms-resist-alignment
- Owner: PKU-Alignment
- Created: 2024-06-09T09:51:58.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-06-11T15:46:36.000Z (8 months ago)
- Last Synced: 2026-01-29T07:16:34.441Z (9 days ago)
- Topics: ai-safety, alignment, alpaca, llama, llama2, llama3, llm, llms, rlhf, safe, safe-rlhf, vicuna
- Language: Python
- Homepage: https://pku-lm-resist-alignment.github.io/
- Size: 6.65 MB
- Stars: 41
- Watchers: 2
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
Owner metadata:
- Name: PKU-Alignment
- Login: PKU-Alignment
- Email: yaodong.yang@outlook.com
- Kind: organization
- Description: Loves Sharing and Open-Source, Making AI Safer.
- Website:
- Location: China
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/129283536?v=4
- Repositories: 3
- Last Synced at: 2023-05-15T17:22:58.200Z
- Profile URL: https://github.com/PKU-Alignment
GitHub Events
Total
- Fork event: 1
- Push event: 1
- Watch event: 18
- Total: 20
Last Year
- Fork event: 1
- Push event: 1
- Watch event: 15
- Total: 17
Committers metadata
Last synced: 2 days ago
Total Commits: 13
Total Committers: 2
Avg Commits per committer: 6.5
Development Distribution Score (DDS): 0.231
Commits in past year: 9
Committers in past year: 2
Avg Commits per committer in past year: 4.5
Development Distribution Score (DDS) in past year: 0.333
| Name | Email | Commits |
|---|---|---|
| 2200017816@stu.pku.edu.cn | 2****6@s****n | 10 |
| zmsn-2077 | j****i@g****m | 3 |
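The all-time Development Distribution Score is consistent with the commit counts in the table, assuming DDS is defined as one minus the top committer's share of all commits. That definition is an assumption here, not something this page states, and the function name below is hypothetical. A minimal sketch:

```python
def development_distribution_score(commit_counts):
    """Return 1 - (largest committer's commits / total commits).

    Assumed definition of DDS; the page itself does not spell it out.
    A value near 0 means one committer dominates; higher values mean
    the work is spread across more people.
    """
    total = sum(commit_counts)
    if total == 0:
        # No commits at all: treat the score as 0.0 rather than divide by zero.
        return 0.0
    return 1 - max(commit_counts) / total


# Per-committer commit counts taken from the table above.
counts = [10, 3]
print(round(development_distribution_score(counts), 3))  # 0.231
```

With the two listed committers (10 and 3 commits out of 13), this gives 0.231, matching the reported all-time value. The past-year figure cannot be checked the same way because the page does not break past-year commits down by committer.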
Issue and Pull Request metadata
Last synced: 5 months ago
Total issues: 0
Total pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Total issue authors: 0
Total pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull request: 0
Bot issues: 0
Bot pull requests: 0
Past year issues: 0
Past year pull requests: 0
Past year average time to close issues: N/A
Past year average time to close pull requests: N/A
Past year issue authors: 0
Past year pull request authors: 0
Past year average comments per issue: 0
Past year average comments per pull request: 0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0