You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -19,6 +21,8 @@ This project supports Python 3.6+.
19
21
20
22
### From Source Code
21
23
24
+
The [`pipenv` command](https://pipenv.readthedocs.io/en/latest/install/#installing-pipenv) must be installed.
25
+
22
26
Set both environment variables `MEILISEARCH_HOST_URL` and `MEILISEARCH_API_KEY`.
23
27
24
28
Then, run:
@@ -111,10 +115,37 @@ After having crawled your documentation, you might need a search bar to improve
111
115
112
116
For the front part, check out the [docs-searchbar.js repository](https://github.com/meilisearch/docs-searchbar.js), wich provides a front-end search bar adapted for documentation.
113
117
118
+
## Authentication
119
+
120
+
__WARNING:__ Please be aware that the scraper will send authentication headers to every scraped site, so use `allowed_domains` to adjust the scope accordingly!
121
+
122
+
### Basic HTTP <!-- omit in TOC -->
123
+
124
+
Basic HTTP authentication is supported by setting these environment variables:
125
+
-`DOCS_SCRAPER_BASICAUTH_USERNAME`
126
+
-`DOCS_SCRAPER_BASICAUTH_PASSWORD`
127
+
128
+
### Cloudflare Access: Identity and Access Management <!-- omit in TOC -->
129
+
130
+
If it happens to you to scrape sites protected by Cloudflare Access, you have to set appropriate HTTP headers.
131
+
132
+
Values for these headers are taken from env variables `CF_ACCESS_CLIENT_ID` and `CF_ACCESS_CLIENT_SECRET`.
133
+
134
+
In case of Google Cloud Identity-Aware Proxy, please specify these env variables:
135
+
-`IAP_AUTH_CLIENT_ID` - # pick [client ID of the application](https://console.cloud.google.com/apis/credentials) you are connecting to
0 commit comments