awesome-llama: https://github.com/ScrapeGraphAI/Scrapegraph-ai
ai-crawler ai-scraping ai-search crawler data-extraction firecrawl-alternative large-language-model llm markdown rag scraping scraping-python web-crawler web-crawlers web-data web-data-extraction web-scraper web-scraping web-search webscraping
Score: -Infinity
Last synced: about 3 hours ago
JSON representation
Repository metadata:
Python scraper based on AI
- Host: GitHub
- URL: https://github.com/ScrapeGraphAI/Scrapegraph-ai
- Owner: ScrapeGraphAI
- License: mit
- Created: 2024-01-27T16:54:38.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2026-02-16T14:01:45.000Z (4 months ago)
- Last Synced: 2026-02-16T21:58:26.454Z (4 months ago)
- Topics: ai-crawler, ai-scraping, ai-search, crawler, data-extraction, firecrawl-alternative, large-language-model, llm, markdown, rag, scraping, scraping-python, web-crawler, web-crawlers, web-data, web-data-extraction, web-scraper, web-scraping, web-search, webscraping
- Language: Python
- Homepage: https://scrapegraphai.com
- Size: 16.1 MB
- Stars: 22,679
- Watchers: 140
- Forks: 1,976
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- Funding: .github/FUNDING.yml
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Citation: citation.cff
- Security: SECURITY.md
-
Funding:
- Github: ScrapeGraphAI
- Open collective: scrapegraphai
Owner metadata:
- Name: ScrapeGraphAI
- Login: ScrapeGraphAI
- Email:
- Kind: organization
- Description:
- Website: https://scrapegraphai.com/
- Location: United States of America
- Twitter: scrapegraphai
- Company:
- Icon url: https://avatars.githubusercontent.com/u/171017415?v=4
- Repositories: 1
- Last Synced at: 2025-12-01T02:36:00.103Z
- Profile URL: https://github.com/ScrapeGraphAI
GitHub Events
Total
- Commit comment event: 51
- Create event: 165
- Delete event: 56
- Fork event: 585
- Issue comment event: 826
- Issues event: 212
- Pull request event: 211
- Pull request review comment event: 20
- Pull request review event: 50
- Push event: 457
- Release event: 104
- Watch event: 5670
- Total: 8407
Last Year
- Commit comment event: 51
- Create event: 165
- Delete event: 56
- Fork event: 585
- Issue comment event: 826
- Issues event: 212
- Pull request event: 211
- Pull request review comment event: 20
- Pull request review event: 50
- Push event: 457
- Release event: 104
- Watch event: 5670
- Total: 8407
Committers metadata
Last synced: 6 months ago
Total Commits: 2,080
Total Committers: 111
Avg Commits per committer: 18.739
Development Distribution Score (DDS): 0.57
Commits in past year: 298
Committers in past year: 30
Avg Commits per committer in past year: 9.933
Development Distribution Score (DDS) in past year: 0.685
| Name | Commits | |
|---|---|---|
| Marco Vinciguerra | m****1@g****m | 894 |
| semantic-release-bot | s****t@m****t | 436 |
| Marco Perini | p****8@g****m | 262 |
| Federico Aguzzi | 6****i | 84 |
| Matteo Vedovati | m****7@g****m | 53 |
| Lorenzo Padoan | l****7@g****m | 43 |
| codebeaver-ai[bot] | 1****] | 17 |
| roryhaung | r****1@g****m | 16 |
| copilot-swe-agent[bot] | 1****t | 15 |
| Eric Page | e****0@g****m | 14 |
| Lorenzo Paleari | 1****i | 13 |
| Federico Minutoli | f****i@r****t | 11 |
| Tejas Amol Hande | 5****e | 10 |
| ekinsenler | e****r@g****m | 10 |
| Santabot123 | p****m@g****m | 10 |
| smith peng | p****h@g****m | 9 |
| JGalego | j****0@g****m | 9 |
| aziz-ullah-khan | a****4@g****m | 8 |
| mayurdb | m****1@g****m | 8 |
| SwapnilSonker | s****2@g****m | 6 |
| DPende | p****e@g****m | 5 |
| Shubham Kamboj | s****j@s****i | 5 |
| Stefan Krawczyk | s****n@d****o | 4 |
| Ikko Eltociear Ashimine | e****r@g****m | 4 |
| Tom Robinson | t****n@g****m | 4 |
| dependabot[bot] | 4****] | 4 |
| ftoppi | f****i@g****m | 4 |
| CodeBeaver | i****o@c****i | 4 |
| Lrd | l****s@g****m | 4 |
| Alok Saboo | a****o@g****m | 4 |
| and 81 more... | ||
Issue and Pull Request metadata
Last synced: 7 months ago
Total issues: 268
Total pull requests: 562
Average time to close issues: 23 days
Average time to close pull requests: 1 day
Total issue authors: 198
Total pull request authors: 75
Average comments per issue: 2.5
Average comments per pull request: 1.68
Merged pull request: 457
Bot issues: 0
Bot pull requests: 44
Past year issues: 110
Past year pull requests: 210
Past year average time to close issues: about 1 month
Past year average time to close pull requests: 1 day
Past year issue authors: 90
Past year pull request authors: 35
Past year average comments per issue: 2.07
Past year average comments per pull request: 1.75
Past year merged pull request: 162
Past year bot issues: 0
Past year bot pull requests: 42
Top Issue Authors
- VinciGit00 (16)
- anth0nyhak1m (6)
- nyck33 (6)
- f-aguzzi (5)
- aleenprd (4)
- LorenzoPaleari (4)
- Kilowhisky (4)
- silgon (4)
- angelotc (3)
- DenisMarasescu (3)
- matheus-rossi (3)
- rjbks (3)
- tm-robinson (3)
- salman-khandu (2)
- ekmekovski (2)
Top Pull Request Authors
- VinciGit00 (264)
- codebeaver-ai[bot] (42)
- f-aguzzi (35)
- PeriniM (25)
- LorenzoPaleari (18)
- vedovati-matteo (12)
- aziz-ullah-khan (12)
- ekinsenler (9)
- goasleep (7)
- shenghongtw (7)
- SwapnilSonker (6)
- tm-robinson (6)
- aflansburg (4)
- AmosDinh (4)
- tuhinmallick (4)
Top Issue Labels
- bug (36)
- question (13)
- released on @dev (11)
- feature request (9)
- enhancement (8)
- stale (8)
- documentation (4)
- refactor (1)
- size:XL (1)
- dependencies (1)
Top Pull Request Labels
- released on @stable (206)
- released on @dev (193)
- tests (49)
- size:M (29)
- size:L (28)
- bug (24)
- size:XS (18)
- lgtm (18)
- size:XL (16)
- enhancement (15)
- dependencies (15)
- size:S (11)
- size:XXL (10)
- documentation (10)
- refactor (7)
- typo (7)
Package metadata
- Total packages: 2
- Total downloads: unknown
- Total dependent packages: 0 (may contain duplicates)
- Total dependent repositories: 0 (may contain duplicates)
- Total versions: 882
proxy.golang.org: github.com/scrapegraphai/scrapegraph-ai
- Homepage:
- Documentation: https://pkg.go.dev/github.com/scrapegraphai/scrapegraph-ai#section-documentation
- Licenses: mit
- Latest release: v1.71.0 (published 5 months ago)
- Last Synced: 2026-01-05T15:15:35.880Z (5 months ago)
- Versions: 441
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent packages count: 5.442%
- Average: 5.624%
- Dependent repos count: 5.807%
proxy.golang.org: github.com/ScrapeGraphAI/Scrapegraph-ai
- Homepage:
- Documentation: https://pkg.go.dev/github.com/ScrapeGraphAI/Scrapegraph-ai#section-documentation
- Licenses: mit
- Latest release: v1.71.0 (published 5 months ago)
- Last Synced: 2026-01-05T15:15:42.539Z (5 months ago)
- Versions: 441
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent packages count: 5.442%
- Average: 5.624%
- Dependent repos count: 5.807%
Dependencies
- Requests ==2.31.0
- beautifulsoup4 ==4.12.3
- langchain ==0.1.4
- langchain_core ==0.1.16
- langchain_openai ==0.0.5
- python-dotenv ==1.0.1
- pytest ==8.0.0 development
- sphinx ==7.1.2 development
- sphinx-rtd-theme ==2.0.0 development
- twine ==4.0.2 development
- wheel ==0.42.0 development
- actions/checkout v4 composite
- github/codeql-action/init v3 composite
- actions/checkout v4 composite
- actions/dependency-review-action v4 composite
- actions/checkout v3 composite
- actions/setup-python v3 composite
- actions/checkout v3 composite
- actions/setup-python v5 composite
- 143 dependencies
- pytest 8.0.0 develop
- sphinx 7.1.2 docs
- sphinx-rtd-theme 2.0.0 docs
- beautifulsoup4 4.12.3
- faiss-cpu 1.7.4
- graphviz 0.20.1
- html2text 2020.1.16
- langchain 0.1.6
- langchain_community 0.0.19
- langchain_core 0.1.22
- langchain_openai 0.0.5
- pandas 2.0.3
- python >3.9,<3.9.7 || >3.9.7,<3.12
- python-dotenv 1.0.1
- tiktoken >=0.5.2,<0.6.0
- tqdm 4.66.1
- trulens_eval 0.23.0