You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[🤖 Compatibility with MeiliSearch](#-compatibility-with-meilisearch)
@@ -459,6 +462,37 @@ If used, `min_indexed_level` is ignored.
459
462
}
460
463
```
461
464
465
+
#### `js_render` (optional)
466
+
467
+
When `js_render` is set to `true`, The scraper will use ChromeDriver. This is needed for pages that are rendered with JavaScript, for example pages generated with React or Vue. The default value is false.
468
+
469
+
```json
470
+
{
471
+
"js_render": true
472
+
}
473
+
```
474
+
475
+
#### `js_wait` (optional)
476
+
477
+
This setting can be used when `js_render` is set to `true` and the pages need time to fully load. `js_wait` takes an integer is specifies the number of seconds the scraper should wait for the page to load.
478
+
479
+
```json
480
+
{
481
+
"js_render": true,
482
+
"js_wait": 1
483
+
}
484
+
```
485
+
486
+
#### `allowed_domains` (optional)
487
+
488
+
This setting specifies the domains that the scraper is allowed to access. In most cases the `allowed_domains` will be automatically set using the `start_urls` and `stop_urls`. When scraping a domain that contains a port, for example `http://localhost:8080`, the domain needs to be manually added to the configuration.
489
+
490
+
```json
491
+
{
492
+
"allowed_domains": ["localhost"]
493
+
}
494
+
```
495
+
462
496
### Authentication
463
497
464
498
__WARNING:__ Please be aware that the scraper will send authentication headers to every scraped site, so use `allowed_domains` to adjust the scope accordingly!
0 commit comments