An open API service for producing an overview of a list of open source projects.

awesome-llama: https://github.com/ScrapeGraphAI/Scrapegraph-ai

ai-crawler ai-scraping ai-search crawler data-extraction firecrawl-alternative large-language-model llm markdown rag scraping scraping-python web-crawler web-crawlers web-data web-data-extraction web-scraper web-scraping web-search webscraping

Score: -Infinity

Last synced: about 3 hours ago
JSON representation

Repository metadata:

Python scraper based on AI


Owner metadata:


GitHub Events

Total
Last Year

Committers metadata

Last synced: 6 months ago

Total Commits: 2,080
Total Committers: 111
Avg Commits per committer: 18.739
Development Distribution Score (DDS): 0.57

Commits in past year: 298
Committers in past year: 30
Avg Commits per committer in past year: 9.933
Development Distribution Score (DDS) in past year: 0.685

Name Email Commits
Marco Vinciguerra m****1@g****m 894
semantic-release-bot s****t@m****t 436
Marco Perini p****8@g****m 262
Federico Aguzzi 6****i 84
Matteo Vedovati m****7@g****m 53
Lorenzo Padoan l****7@g****m 43
codebeaver-ai[bot] 1****] 17
roryhaung r****1@g****m 16
copilot-swe-agent[bot] 1****t 15
Eric Page e****0@g****m 14
Lorenzo Paleari 1****i 13
Federico Minutoli f****i@r****t 11
Tejas Amol Hande 5****e 10
ekinsenler e****r@g****m 10
Santabot123 p****m@g****m 10
smith peng p****h@g****m 9
JGalego j****0@g****m 9
aziz-ullah-khan a****4@g****m 8
mayurdb m****1@g****m 8
SwapnilSonker s****2@g****m 6
DPende p****e@g****m 5
Shubham Kamboj s****j@s****i 5
Stefan Krawczyk s****n@d****o 4
Ikko Eltociear Ashimine e****r@g****m 4
Tom Robinson t****n@g****m 4
dependabot[bot] 4****] 4
ftoppi f****i@g****m 4
CodeBeaver i****o@c****i 4
Lrd l****s@g****m 4
Alok Saboo a****o@g****m 4
and 81 more...

Issue and Pull Request metadata

Last synced: 7 months ago

Total issues: 268
Total pull requests: 562
Average time to close issues: 23 days
Average time to close pull requests: 1 day
Total issue authors: 198
Total pull request authors: 75
Average comments per issue: 2.5
Average comments per pull request: 1.68
Merged pull request: 457
Bot issues: 0
Bot pull requests: 44

Past year issues: 110
Past year pull requests: 210
Past year average time to close issues: about 1 month
Past year average time to close pull requests: 1 day
Past year issue authors: 90
Past year pull request authors: 35
Past year average comments per issue: 2.07
Past year average comments per pull request: 1.75
Past year merged pull request: 162
Past year bot issues: 0
Past year bot pull requests: 42

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/ScrapeGraphAI/Scrapegraph-ai

Top Issue Authors

  • VinciGit00 (16)
  • anth0nyhak1m (6)
  • nyck33 (6)
  • f-aguzzi (5)
  • aleenprd (4)
  • LorenzoPaleari (4)
  • Kilowhisky (4)
  • silgon (4)
  • angelotc (3)
  • DenisMarasescu (3)
  • matheus-rossi (3)
  • rjbks (3)
  • tm-robinson (3)
  • salman-khandu (2)
  • ekmekovski (2)

Top Pull Request Authors

  • VinciGit00 (264)
  • codebeaver-ai[bot] (42)
  • f-aguzzi (35)
  • PeriniM (25)
  • LorenzoPaleari (18)
  • vedovati-matteo (12)
  • aziz-ullah-khan (12)
  • ekinsenler (9)
  • goasleep (7)
  • shenghongtw (7)
  • SwapnilSonker (6)
  • tm-robinson (6)
  • aflansburg (4)
  • AmosDinh (4)
  • tuhinmallick (4)

Top Issue Labels

  • bug (36)
  • question (13)
  • released on @dev (11)
  • feature request (9)
  • enhancement (8)
  • stale (8)
  • documentation (4)
  • refactor (1)
  • size:XL (1)
  • dependencies (1)

Top Pull Request Labels

  • released on @stable (206)
  • released on @dev (193)
  • tests (49)
  • size:M (29)
  • size:L (28)
  • bug (24)
  • size:XS (18)
  • lgtm (18)
  • size:XL (16)
  • enhancement (15)
  • dependencies (15)
  • size:S (11)
  • size:XXL (10)
  • documentation (10)
  • refactor (7)
  • typo (7)

Package metadata

proxy.golang.org: github.com/scrapegraphai/scrapegraph-ai

proxy.golang.org: github.com/ScrapeGraphAI/Scrapegraph-ai


Dependencies

requirements.txt pypi
  • Requests ==2.31.0
  • beautifulsoup4 ==4.12.3
  • langchain ==0.1.4
  • langchain_core ==0.1.16
  • langchain_openai ==0.0.5
  • python-dotenv ==1.0.1
requirements-dev.txt pypi
  • pytest ==8.0.0 development
  • sphinx ==7.1.2 development
  • sphinx-rtd-theme ==2.0.0 development
  • twine ==4.0.2 development
  • wheel ==0.42.0 development
.github/workflows/codeql.yml actions
  • actions/checkout v4 composite
  • github/codeql-action/init v3 composite
.github/workflows/dependency-review.yml actions
  • actions/checkout v4 composite
  • actions/dependency-review-action v4 composite
.github/workflows/pylint.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v3 composite
.github/workflows/python-publish.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v5 composite
poetry.lock pypi
  • 143 dependencies
pyproject.toml pypi
  • pytest 8.0.0 develop
  • sphinx 7.1.2 docs
  • sphinx-rtd-theme 2.0.0 docs
  • beautifulsoup4 4.12.3
  • faiss-cpu 1.7.4
  • graphviz 0.20.1
  • html2text 2020.1.16
  • langchain 0.1.6
  • langchain_community 0.0.19
  • langchain_core 0.1.22
  • langchain_openai 0.0.5
  • pandas 2.0.3
  • python >3.9,<3.9.7 || >3.9.7,<3.12
  • python-dotenv 1.0.1
  • tiktoken >=0.5.2,<0.6.0
  • tqdm 4.66.1
  • trulens_eval 0.23.0