Commit 104d869

Merge branch 'pre/beta' into support
2 parents 591cab0 + 8f9f96f commit 104d869

7 files changed: 88 additions and 21 deletions

CHANGELOG.md

Lines changed: 35 additions & 0 deletions
@@ -1,3 +1,36 @@
+## [1.8.1-beta.1](https://github.com/ScrapeGraphAI/Scrapegraph-ai/compare/v1.8.0...v1.8.1-beta.1) (2024-07-04)
+
+
+### Bug Fixes
+
+* add test ([3a537ee](https://github.com/ScrapeGraphAI/Scrapegraph-ai/commit/3a537eec6fef1743924a9aa5cef0ba2f8d44bf11))
+
+
+### Docs
+
+* **roadmap:** fix urls ([14faba4](https://github.com/ScrapeGraphAI/Scrapegraph-ai/commit/14faba4f00dd9f947f8dc5e0b51be49ea684179f))
+* **roadmap:** next steps ([3e644f4](https://github.com/ScrapeGraphAI/Scrapegraph-ai/commit/3e644f498f05eb505fbd4e94b144c81567569aaa))
+
+## [1.8.0](https://github.com/VinciGit00/Scrapegraph-ai/compare/v1.7.5...v1.8.0) (2024-06-30)
+
+
+### Features
+
+* add new search engine avaiability and new tests ([073d226](https://github.com/VinciGit00/Scrapegraph-ai/commit/073d226723f5f03b960865d07408905b7a506180))
+* add research with bing + test function ([aa2160c](https://github.com/VinciGit00/Scrapegraph-ai/commit/aa2160c108764745a696ffc16038f370e9702c14))
+
+
+### Bug Fixes
+
+* updated for schema changes ([aedda44](https://github.com/VinciGit00/Scrapegraph-ai/commit/aedda448682ce5a921a62e661bffb02478bab75f))
+
+
+### CI
+
+* **release:** 1.7.0-beta.13 [skip ci] ([ce0a47a](https://github.com/VinciGit00/Scrapegraph-ai/commit/ce0a47aee5edbb26fd82e41f6688a4bc48a10822))
+* **release:** 1.7.0-beta.14 [skip ci] ([ec77ff7](https://github.com/VinciGit00/Scrapegraph-ai/commit/ec77ff7ea4eb071469c2fb53e5959d4ea1f73ad6))
+* **release:** 1.8.0-beta.1 [skip ci] ([bbfbbd9](https://github.com/VinciGit00/Scrapegraph-ai/commit/bbfbbd93be3c87c5f25e3c75ec7d677832d37467))
+
 ## [1.8.0-beta.1](https://github.com/VinciGit00/Scrapegraph-ai/compare/v1.7.4...v1.8.0-beta.1) (2024-06-25)
 
 
@@ -7,6 +40,7 @@
 * add research with bing + test function ([aa2160c](https://github.com/VinciGit00/Scrapegraph-ai/commit/aa2160c108764745a696ffc16038f370e9702c14))
 
 
+
 ### Bug Fixes
 
 * updated for schema changes ([aedda44](https://github.com/VinciGit00/Scrapegraph-ai/commit/aedda448682ce5a921a62e661bffb02478bab75f))
@@ -17,6 +51,7 @@
 * **release:** 1.7.0-beta.13 [skip ci] ([ce0a47a](https://github.com/VinciGit00/Scrapegraph-ai/commit/ce0a47aee5edbb26fd82e41f6688a4bc48a10822))
 * **release:** 1.7.0-beta.14 [skip ci] ([ec77ff7](https://github.com/VinciGit00/Scrapegraph-ai/commit/ec77ff7ea4eb071469c2fb53e5959d4ea1f73ad6))
 
+
 ## [1.7.4](https://github.com/VinciGit00/Scrapegraph-ai/compare/v1.7.3...v1.7.4) (2024-06-21)
 
 

README.md

Lines changed: 25 additions & 2 deletions
@@ -191,9 +191,32 @@ Please see the [contributing guidelines](https://github.com/VinciGit00/Scrapegra
 [![My Skills](https://skillicons.dev/icons?i=twitter)](https://twitter.com/scrapegraphai)
 
 ## 📈 Roadmap
-Check out the project roadmap [here](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/README.md)! 🚀
 
-Wanna visualize the roadmap in a more interactive way? Check out the [markmap](https://markmap.js.org/repl) visualization by copy pasting the markdown content in the editor!
+We are working on the following features! If you are interested in collaborating right-click on the feature and open in a new tab to file a PR. If you have doubts and wanna discuss them with us, just contact us on [discord](https://discord.gg/uJN7TYcpNa) or open a [Discussion](https://github.com/VinciGit00/Scrapegraph-ai/discussions) here on Github!
+
+```mermaid
+%%{init: {'theme': 'base', 'themeVariables': { 'primaryColor': '#5C4B9B', 'edgeLabelBackground':'#ffffff', 'tertiaryColor': '#ffffff', 'primaryBorderColor': '#5C4B9B', 'fontFamily': 'Arial', 'fontSize': '16px', 'textColor': '#5C4B9B' }}}%%
+graph LR
+A[DeepSearch Graph] --> F[Use Existing Chromium Instances]
+F --> B[Page Caching]
+B --> C[Screenshot Scraping]
+C --> D[Handle Dynamic Content]
+D --> E[New Webdrivers]
+
+style A fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
+style F fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
+style B fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
+style C fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
+style D fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
+style E fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
+
+click A href "https://github.com/VinciGit00/Scrapegraph-ai/issues/260" "Open DeepSearch Graph Issue"
+click F href "https://github.com/VinciGit00/Scrapegraph-ai/issues/329" "Open Chromium Instances Issue"
+click B href "https://github.com/VinciGit00/Scrapegraph-ai/issues/197" "Open Page Caching Issue"
+click C href "https://github.com/VinciGit00/Scrapegraph-ai/issues/197" "Open Screenshot Scraping Issue"
+click D href "https://github.com/VinciGit00/Scrapegraph-ai/issues/279" "Open Handle Dynamic Content Issue"
+click E href "https://github.com/VinciGit00/Scrapegraph-ai/issues/171" "Open New Webdrivers Issue"
+```
 
 ## ❤️ Contributors
 [![Contributors](https://contrib.rocks/image?repo=VinciGit00/Scrapegraph-ai)](https://github.com/VinciGit00/Scrapegraph-ai/graphs/contributors)

pyproject.toml

Lines changed: 1 addition & 2 deletions
@@ -2,8 +2,7 @@
 name = "scrapegraphai"
 
 
-version = "1.8.0b1"
-
+version = "1.8.1b1"
 
 
 description = "A web scraping library based on LangChain which uses LLM and direct graph logic to create scraping pipelines."
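
Review note: the CHANGELOG heading in this commit reads `1.8.1-beta.1` while `pyproject.toml` reads `1.8.1b1`. These look like the same release written in two notations, the second being the PEP 440 pre-release spelling. A minimal sketch of that mapping follows, assuming only `alpha`, `beta` and `rc` pre-release tags. The function name `semver_to_pep440` is hypothetical, and nothing in this commit shows how the project's release tooling actually performs the conversion.

```python
import re

# Semver-style pre-release tags mapped to their PEP 440 short forms.
_PRE_TAGS = {"alpha": "a", "beta": "b", "rc": "rc"}


def semver_to_pep440(version: str) -> str:
    """Convert e.g. '1.8.1-beta.1' to '1.8.1b1' (hypothetical helper)."""
    match = re.fullmatch(r"(\d+\.\d+\.\d+)(?:-(alpha|beta|rc)\.(\d+))?", version)
    if match is None:
        raise ValueError(f"unsupported version string: {version!r}")
    release, tag, number = match.groups()
    if tag is None:
        # Final releases are spelled the same way in both notations.
        return release
    return f"{release}{_PRE_TAGS[tag]}{number}"


print(semver_to_pep440("1.8.1-beta.1"))
```

A check like this could be run in CI to confirm that the changelog heading and the packaged version stay in sync.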

scrapegraphai/helpers/models_tokens.py

Lines changed: 5 additions & 1 deletion
@@ -78,7 +78,11 @@
         "claude2": 9000,
         "claude2.1": 200000,
         "claude3": 200000,
-        "claude3.5": 200000
+        "claude3.5": 200000,
+        "claude-3-opus-20240229": 200000,
+        "claude-3-sonnet-20240229": 200000,
+        "claude-3-haiku-20240307": 200000,
+        "claude-3-5-sonnet-20240620": 200000
     },
     "vertexai": {
         "gemini-1.5-flash": 128000,
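
Review note: the four new keys are dated model identifiers. Before this change, a lookup by one of them would presumably have missed the table, since only short aliases such as `claude3.5` were listed. The sketch below shows the kind of lookup such a table supports. It assumes a flat excerpt of the mapping, because the enclosing provider key is not visible in this hunk. The names `CLAUDE_TOKENS_EXCERPT`, `token_limit` and the fallback value of 8192 are hypothetical and not taken from the library.

```python
# Excerpt of the entries visible in this hunk (values copied from the diff).
CLAUDE_TOKENS_EXCERPT = {
    "claude3.5": 200000,
    "claude-3-opus-20240229": 200000,
    "claude-3-sonnet-20240229": 200000,
    "claude-3-haiku-20240307": 200000,
    "claude-3-5-sonnet-20240620": 200000,
}


def token_limit(model_name: str, table: dict, default: int = 8192) -> int:
    """Return the context size for model_name, or a fallback if it is unlisted.

    The default of 8192 is an assumption made for this sketch only.
    """
    return table.get(model_name, default)


print(token_limit("claude-3-opus-20240229", CLAUDE_TOKENS_EXCERPT))
print(token_limit("some-unlisted-model", CLAUDE_TOKENS_EXCERPT))
```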

scrapegraphai/nodes/search_internet_node.py

Lines changed: 1 addition & 1 deletion
@@ -84,7 +84,7 @@ def execute(self, state: dict) -> dict:
         You should return only the query string without any additional sentences. \n
         For example, if the user prompt is "What is the capital of France?",
         you should return "capital of France". \n
-        If yuo return something else, you will get a really bad grade. \n
+        If you return something else, you will get a really bad grade. \n
         USER PROMPT: {user_prompt}"""
 
         search_prompt = PromptTemplate(
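
Review note: the typo fix matters because this text is sent to the model verbatim once the `{user_prompt}` placeholder is filled. The sketch below illustrates that substitution using plain `str.format` as a stand-in for the `PromptTemplate` call, whose arguments are not shown in this hunk. Only the tail of the template that appears in the diff is reproduced, and the names `SEARCH_TEMPLATE_TAIL` and `render_search_prompt` are hypothetical.

```python
# Tail of the search template as it reads after this commit (excerpt only).
SEARCH_TEMPLATE_TAIL = (
    'For example, if the user prompt is "What is the capital of France?",\n'
    'you should return "capital of France".\n'
    "If you return something else, you will get a really bad grade.\n"
    "USER PROMPT: {user_prompt}"
)


def render_search_prompt(user_prompt: str) -> str:
    """Fill the {user_prompt} placeholder (stand-in for the real templating)."""
    return SEARCH_TEMPLATE_TAIL.format(user_prompt=user_prompt)


print(render_search_prompt("List the largest cities in Italy"))
```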

scrapegraphai/utils/parse_state_keys.py

Lines changed: 0 additions & 15 deletions
@@ -101,18 +101,3 @@ def evaluate_expression(expression):
             final_result.append(key)
 
     return final_result
-
-
-EXPRESSION = "user_input & (relevant_chunks | parsed_document | document)"
-state = {
-    "user_input": None,
-    "document": None,
-    "parsed_document": None,
-    "relevant_chunks": None,
-}
-
-try:
-    result = parse_expression(EXPRESSION, state)
-    print("Matched keys:", result)
-except ValueError as e:
-    print("Error:", e)

tests/utils/parse_state_keys_test.py

Lines changed: 21 additions & 0 deletions
@@ -0,0 +1,21 @@
+"""
+Parse_state_key test module
+"""
+import pytest
+from scrapegraphai.utils.parse_state_keys import parse_expression
+
+
+def test_parse_expression():
+    """Test parse_expression function."""
+    EXPRESSION = "user_input & (relevant_chunks | parsed_document | document)"
+    state = {
+        "user_input": None,
+        "document": None,
+        "parsed_document": None,
+        "relevant_chunks": None,
+    }
+    try:
+        result = parse_expression(EXPRESSION, state)
+        assert result != []
+    except ValueError as e:
+        assert "Error" in str(e)
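
Review note: the new test accepts either outcome, a non-empty result or a `ValueError`, so it does not pin down which one is expected for this input. One possible tightening is to check the success path and the error path separately. The sketch below shows that pattern against a hypothetical stand-in, `stand_in_parse_expression`, because the real `parse_expression` is not reproduced in this diff. The stand-in ignores the `&` and `|` operators, and its error messages are invented for illustration only.

```python
import re


def stand_in_parse_expression(expression: str, state: dict) -> list:
    """Hypothetical stand-in: list the state keys named in the expression.

    It ignores the & and | operators; it exists only to make the sketch runnable.
    """
    if not expression:
        raise ValueError("Empty expression.")
    matched = []
    for name in re.findall(r"[A-Za-z_]\w*", expression):
        if name in state and name not in matched:
            matched.append(name)
    if not matched:
        raise ValueError("No state keys matched the expression.")
    return matched


def check_success_path() -> list:
    """Success path: assert the exact keys, not just a non-empty list."""
    state = {"user_input": None, "document": None}
    result = stand_in_parse_expression("user_input & document", state)
    assert result == ["user_input", "document"]
    return result


def check_error_path() -> str:
    """Error path: fail loudly if no ValueError is raised."""
    try:
        stand_in_parse_expression("", {"user_input": None})
    except ValueError as exc:
        return str(exc)
    raise AssertionError("expected ValueError for an empty expression")


print(check_success_path())
print(check_error_path())
```

In a pytest suite the error path would more commonly be written with `pytest.raises(ValueError)`. The plain `try`/`except` form is used here only to keep the sketch dependency-free.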
