You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ScrapeGraphAI is a _web scraping_ python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, Markdown, etc.).
15
+
ScrapeGraphAI is a *web scraping* python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, Markdown, etc.).
15
16
16
17
Just say which information you want to extract and the library will do it for you!
17
18
@@ -38,11 +39,9 @@ Additional dependecies can be added while installing the library:
38
39
- <b>More Language Models</b>: additional language models are installed, such as Fireworks, Groq, Anthropic, Hugging Face, and Nvidia AI Endpoints.
39
40
40
41
This group allows you to use additional language models like Fireworks, Groq, Anthropic, Together AI, Hugging Face, and Nvidia AI Endpoints.
41
-
42
42
```bash
43
43
pip install scrapegraphai[other-language-models]
44
44
```
45
-
46
45
- <b>Semantic Options</b>: this group includes tools for advanced semantic processing, such as Graphviz.
47
46
48
47
```bash
@@ -57,12 +56,13 @@ Additional dependecies can be added while installing the library:
57
56
58
57
</details>
59
58
60
-
## 💻 Usage
61
59
60
+
## 💻 Usage
62
61
There are multiple standard scraping pipelines that can be used to extract information from a website (or local file).
63
62
64
63
The most common one is the `SmartScraperGraph`, which extracts information from a single page given a user prompt and a source URL.
65
64
65
+
66
66
```python
67
67
import json
68
68
from scrapegraphai.graphs import SmartScraperGraph
@@ -98,17 +98,16 @@ The output will be a dictionary like the following:
We collect anonymous usage metrics to enhance our package's quality and user experience. The data helps us prioritize improvements and ensure compatibility. If you wish to opt-out, set the environment variable SCRAPEGRAPHAI_TELEMETRY_ENABLED=false. For more information, please refer to the documentation [here](https://scrapegraph-ai.readthedocs.io/en/latest/scrapers/telemetry.html).
| Marco Vinciguerra |[](https://www.linkedin.com/in/marco-vinciguerra-7ba365242/)|
193
-
| Marco Perini |[](https://www.linkedin.com/in/perinim/)|
194
-
| Lorenzo Padoan |[](https://www.linkedin.com/in/lorenzo-padoan-4521a2154/)|
184
+
|| Contact Info |
185
+
|--------------------|----------------------|
186
+
| Marco Vinciguerra |[](https://www.linkedin.com/in/marco-vinciguerra-7ba365242/)|
187
+
| Marco Perini |[](https://www.linkedin.com/in/perinim/)|
188
+
| Lorenzo Padoan |[](https://www.linkedin.com/in/lorenzo-padoan-4521a2154/)|
0 commit comments