Commit cca2d8c

Merge pull request #19 from ScrapeGraphAI/main
allignment
2 parents 8c841b3 + 0b052eb commit cca2d8c

24 files changed, +306 -212 lines changed

.github/workflows/python-publish.yml

Lines changed: 4 additions & 0 deletions
@@ -6,11 +6,15 @@ name: Upload Python Package
 on:
   release:
     types: [published]
+    paths:
+      - 'scrapegraph-py/**'

 jobs:
   deploy:

     runs-on: ubuntu-latest
+    # Only run if scrapegraph-py has changes
+    if: contains(github.event.release.body, 'scrapegraph-py/')

     steps:
     - uses: actions/checkout@v4
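
The added `if:` condition relies on the `contains()` expression function. As documented for GitHub Actions expressions, `contains(search, item)` is a substring test when `search` is a string and an element-membership test when `search` is an array. A minimal Python sketch of that behaviour follows; the name `gha_contains` and the sample values are illustrative assumptions, and the model handles only strings and lists of strings.

```python
def gha_contains(search, item):
    """Simplified model of the GitHub Actions `contains(search, item)` function.

    Assumption: only strings and lists of strings are handled; comparison is
    case-insensitive, and a missing value is treated as "no match".
    """
    if search is None:
        return False
    needle = str(item).lower()
    if isinstance(search, str):
        # String input: plain substring test.
        return needle in search.lower()
    # Array input: the item must equal a whole element.
    return any(str(element).lower() == needle for element in search)


# Hypothetical release notes that mention a path inside the package.
release_body = "Fixes in scrapegraph-py/scrapegraph_py/client.py"
print(gha_contains(release_body, "scrapegraph-py/"))  # True

# With a list, a path prefix is not equal to any element.
modified = ["scrapegraph-py/setup.py"]
print(gha_contains(modified, "scrapegraph-py/"))  # False
```

Under this model, the release job runs only when the release notes text itself contains `scrapegraph-py/`.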

.github/workflows/release.yml

Lines changed: 4 additions & 0 deletions
@@ -4,11 +4,15 @@ on:
     branches:
       - main
       - pre/*
+    paths:
+      - 'scrapegraph-py/**'

 jobs:
   build:
     name: Build
     runs-on: ubuntu-latest
+    # Only run if scrapegraph-py has changes
+    if: contains(github.event.head_commit.modified, 'scrapegraph-py/') || contains(github.event.head_commit.added, 'scrapegraph-py/') || contains(github.event.head_commit.removed, 'scrapegraph-py/')
     steps:
       - name: Install git
         run: |
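
The comment in this hunk states the intent: run the build only when something under `scrapegraph-py/` changed. As a sketch of that intent (not of how GitHub evaluates the expression), the check amounts to a prefix test over the changed file paths. The name `touches_package` and the sample paths are illustrative assumptions.

```python
from typing import Iterable


def touches_package(changed_paths: Iterable[str], prefix: str = "scrapegraph-py/") -> bool:
    """Return True if at least one changed path lies under `prefix`."""
    return any(path.startswith(prefix) for path in changed_paths)


# Hypothetical lists of files touched by a push.
print(touches_package(["README.md", "scrapegraph-py/pyproject.toml"]))  # True
print(touches_package(["scrapegraph-js/package.json"]))                 # False
```

The same idea is what the `paths: 'scrapegraph-py/**'` filter added above expresses declaratively.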

CHANGELOG.MD

Lines changed: 0 additions & 6 deletions
This file was deleted.

LICENSE

Lines changed: 21 additions & 0 deletions
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2024 ScrapeGraphAI
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

README.md

Lines changed: 86 additions & 0 deletions
@@ -0,0 +1,86 @@
+# 🌐 ScrapeGraph AI SDKs
+
+[![License](https://img.shields.io/badge/License-MIT-blue.svg)](https://opensource.org/licenses/MIT)
+[![Python SDK](https://img.shields.io/badge/Python_SDK-Latest-blue)](https://github.com/ScrapeGraphAI/scrapegraph-sdk/tree/main/scrapegraph-py)
+[![JavaScript SDK](https://img.shields.io/badge/JavaScript_SDK-Latest-yellow)](https://github.com/ScrapeGraphAI/scrapegraph-sdk/tree/main/scrapegraph-js)
+[![Documentation](https://img.shields.io/badge/Documentation-Latest-green)](https://scrapegraphai.com/docs)
+
+Official SDKs for the ScrapeGraph AI API - Intelligent web scraping powered by AI. Extract structured data from any webpage with natural language prompts.
+
+The credits can be bought [here](https://scrapegraphai.com)!
+
+## 🚀 Quick Links
+
+- [Python SDK Documentation](scrapegraph-py/README.md)
+- [JavaScript SDK Documentation](scrapegraph-js/README.md)
+- [API Documentation](https://scrapegraphai.com/docs)
+- [Website](https://scrapegraphai.com)
+
+## 📦 Installation
+
+### Python
+```bash
+pip install scrapegraph-py
+```
+
+### JavaScript
+```bash
+npm install scrapegraph-js
+```
+
+## 🎯 Core Features
+
+- 🤖 **AI-Powered Extraction**: Use natural language to describe what data you want
+- 📊 **Structured Output**: Get clean, structured data with optional schema validation
+- 🔄 **Multiple Formats**: Extract data as JSON, Markdown, or custom schemas
+- **High Performance**: Concurrent processing and automatic retries
+- 🔒 **Enterprise Ready**: Production-grade security and rate limiting
+
+## 🛠️ Available Endpoints
+
+### 🔍 SmartScraper
+Extract structured data from any webpage using natural language prompts.
+
+### 📝 Markdownify
+Convert any webpage into clean, formatted markdown.
+
+### 💻 LocalScraper
+Extract information from a local HTML file using AI.
+
+
+## 🌟 Key Benefits
+
+- 📝 **Natural Language Queries**: No complex selectors or XPath needed
+- 🎯 **Precise Extraction**: AI understands context and structure
+- 🔄 **Adaptive Scraping**: Works with dynamic and static content
+- 📊 **Schema Validation**: Ensure data consistency with Pydantic/TypeScript
+- **Async Support**: Handle multiple requests efficiently
+
+## 💡 Use Cases
+
+- 🏢 **Business Intelligence**: Extract company information and contacts
+- 📊 **Market Research**: Gather product data and pricing
+- 📰 **Content Aggregation**: Convert articles to structured formats
+- 🔍 **Data Mining**: Extract specific information from multiple sources
+- 📱 **App Integration**: Feed clean data into your applications
+
+## 📖 Documentation
+
+For detailed documentation and examples, visit:
+- [Python SDK Guide](scrapegraph-py/README.md)
+- [JavaScript SDK Guide](scrapegraph-js/README.md)
+- [API Documentation](https://scrapegraphai.com/docs)
+
+## 💬 Support & Feedback
+
+- 📧 Email: [email protected]
+- 💻 GitHub Issues: [Create an issue](https://github.com/ScrapeGraphAI/scrapegraph-sdk/issues)
+- 🌟 Feature Requests: [Request a feature](https://github.com/ScrapeGraphAI/scrapegraph-sdk/issues/new)
+
+## 📄 License
+
+This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
+
+---
+
+Made with ❤️ by [ScrapeGraph AI](https://scrapegraphai.com)

readme.md

Lines changed: 0 additions & 137 deletions
This file was deleted.

scrapegraph-js/CODE_OF_CONDUCT.md

Lines changed: 11 additions & 11 deletions
@@ -17,23 +17,23 @@ diverse, inclusive, and healthy community.
 Examples of behavior that contributes to a positive environment for our
 community include:

-* Demonstrating empathy and kindness toward other people
-* Being respectful of differing opinions, viewpoints, and experiences
-* Giving and gracefully accepting constructive feedback
-* Accepting responsibility and apologizing to those affected by our mistakes,
+- Demonstrating empathy and kindness toward other people
+- Being respectful of differing opinions, viewpoints, and experiences
+- Giving and gracefully accepting constructive feedback
+- Accepting responsibility and apologizing to those affected by our mistakes,
   and learning from the experience
-* Focusing on what is best not just for us as individuals, but for the
+- Focusing on what is best not just for us as individuals, but for the
   overall community

 Examples of unacceptable behavior include:

-* The use of sexualized language or imagery, and sexual attention or
+- The use of sexualized language or imagery, and sexual attention or
   advances of any kind
-* Trolling, insulting or derogatory comments, and personal or political attacks
-* Public or private harassment
-* Publishing others' private information, such as a physical or email
+- Trolling, insulting or derogatory comments, and personal or political attacks
+- Public or private harassment
+- Publishing others' private information, such as a physical or email
   address, without their explicit permission
-* Other conduct which could reasonably be considered inappropriate in a
+- Other conduct which could reasonably be considered inappropriate in a
   professional setting

 ## Enforcement Responsibilities
@@ -106,7 +106,7 @@ Violating these terms may lead to a permanent ban.
 ### 4. Permanent Ban

 **Community Impact**: Demonstrating a pattern of violation of community
-standards, including sustained inappropriate behavior, harassment of an
+standards, including sustained inappropriate behavior, harassment of an
 individual, or aggression toward or disparagement of classes of individuals.

 **Consequence**: A permanent ban from any sort of public interaction within

scrapegraph-js/LICENSE

Lines changed: 0 additions & 7 deletions
This file was deleted.
