Commit 11d8531

committed
Deploy latest docs: 527f137
1 parent 363d13d commit 11d8531

13 files changed, +62 -31 lines changed

docs/cuda-bindings/jupyter_execute/overview.ipynb

Lines changed: 25 additions & 25 deletions
@@ -2,7 +2,7 @@
 "cells": [
 {
 "cell_type": "markdown",
-"id": "2c2ee760",
+"id": "89cc298e",
 "metadata": {},
 "source": [
 "# Overview\n",
@@ -50,7 +50,7 @@
 {
 "cell_type": "code",
 "execution_count": 1,
-"id": "eec9439d",
+"id": "fbbf48f8",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -60,7 +60,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "a02f0fed",
+"id": "e7856b1c",
 "metadata": {},
 "source": [
 "Error checking is a fundamental best practice in code development and a code\n",
@@ -72,7 +72,7 @@
 {
 "cell_type": "code",
 "execution_count": 2,
-"id": "aba234e7",
+"id": "a15ca753",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -98,7 +98,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "aedde00f",
+"id": "2f6edb25",
 "metadata": {},
 "source": [
 "It’s common practice to write CUDA kernels near the top of a translation unit,\n",
@@ -112,7 +112,7 @@
 {
 "cell_type": "code",
 "execution_count": 3,
-"id": "9501b26e",
+"id": "ad3b35ea",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -130,7 +130,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "ba8aa6dd",
+"id": "3b497b44",
 "metadata": {},
 "source": [
 "Go ahead and compile the kernel into PTX. Remember that this is executed at runtime using NVRTC. There are three basic steps to NVRTC:\n",
@@ -147,7 +147,7 @@
 {
 "cell_type": "code",
 "execution_count": 4,
-"id": "3f34e779",
+"id": "183f49bc",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -177,7 +177,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "fb09c972",
+"id": "0981c1a8",
 "metadata": {},
 "source": [
 "Before you can use the PTX or do any work on the GPU, you must create a CUDA\n",
@@ -189,7 +189,7 @@
 {
 "cell_type": "code",
 "execution_count": 5,
-"id": "ccb32289",
+"id": "0fb562ab",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -199,7 +199,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "a2f56747",
+"id": "d8331acd",
 "metadata": {},
 "source": [
 "With a CUDA context created on device 0, load the PTX generated earlier into a\n",
@@ -211,7 +211,7 @@
 {
 "cell_type": "code",
 "execution_count": 6,
-"id": "d4fbd234",
+"id": "fb3af604",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -224,7 +224,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "84b6af96",
+"id": "cfda3062",
 "metadata": {},
 "source": [
 "Next, get all your data prepared and transferred to the GPU. For increased\n",
@@ -236,7 +236,7 @@
 {
 "cell_type": "code",
 "execution_count": 7,
-"id": "857cb9dc",
+"id": "a7678a2f",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -254,7 +254,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "299122b9",
+"id": "0a7b45b0",
 "metadata": {},
 "source": [
 "With the input data `a`, `x`, and `y` created for the SAXPY transform device,\n",
@@ -271,7 +271,7 @@
 {
 "cell_type": "code",
 "execution_count": 8,
-"id": "19f9e83f",
+"id": "d459bd0b",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -291,7 +291,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "6e703eba",
+"id": "39ccaa76",
 "metadata": {},
 "source": [
 "With data prep and resources allocation finished, the kernel is ready to be\n",
@@ -308,7 +308,7 @@
 {
 "cell_type": "code",
 "execution_count": 9,
-"id": "5a917142",
+"id": "d50cc757",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -324,7 +324,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "65d9e077",
+"id": "9137714f",
 "metadata": {},
 "source": [
 "Now the kernel can be launched:"
@@ -333,7 +333,7 @@
 {
 "cell_type": "code",
 "execution_count": 10,
-"id": "310fdfe7",
+"id": "2963c1cd",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -359,7 +359,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "c8c842c7",
+"id": "4bc876e6",
 "metadata": {},
 "source": [
 "The `cuLaunchKernel` function takes the compiled module kernel and execution\n",
@@ -374,7 +374,7 @@
 {
 "cell_type": "code",
 "execution_count": 11,
-"id": "3adea158",
+"id": "ac71e923",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -386,7 +386,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "96dd7306",
+"id": "ea813e42",
 "metadata": {},
 "source": [
 "Perform verification of the data to ensure correctness and finish the code with\n",
@@ -396,7 +396,7 @@
 {
 "cell_type": "code",
 "execution_count": 12,
-"id": "cd0c402e",
+"id": "c29ce423",
 "metadata": {},
 "outputs": [],
 "source": [
@@ -410,7 +410,7 @@
 },
 {
 "cell_type": "markdown",
-"id": "d08fe6ce",
+"id": "a4a86e92",
 "metadata": {},
 "source": [
 "## Performance\n",
Binary file not shown.
411 Bytes
Binary file not shown.
2.35 KB
Binary file not shown.
Binary file not shown.

docs/cuda-core/latest/_sources/install.md.txt

Lines changed: 15 additions & 0 deletions
@@ -25,6 +25,21 @@ and likewise use `[cu11]` for CUDA 11.
 Note that using `cuda.core` with NVRTC or nvJitLink installed from PyPI via `pip install` is currently
 not supported. This will be fixed in a future release.

+## Installing from Conda (conda-forge)
+
+Same as above, `cuda.core` can be installed in a CUDA 11 or 12 environment. For example with CUDA 12:
+```console
+$ conda install -c conda-forge cuda-core cuda-version=12
+```
+and likewise use `cuda-version=11` for CUDA 11.
+
+Note that to use `cuda.core` with nvJitLink installed from conda-forge currently requires it to
+be separately installed:
+```console
+$ conda install -c conda-forge libnvjitlink
+```
+(can be combined with the command above). This extra step will be removed in a future release.
+
 ## Installing from Source

 ```console

docs/cuda-core/latest/_sources/release/0.1.1-notes.md.txt

Lines changed: 1 addition & 1 deletion
@@ -16,7 +16,7 @@ Released on Dec 20, 2024
 - Add a `cuda.core.experimental.system` module for querying system- or process- wide information.
 - Add `LaunchConfig.cluster` to support thread block clusters on Hopper GPUs.

-## Enchancements
+## Enhancements

 - The internal handle held by `ObjectCode` is now lazily initialized upon first touch.
 - Support TCC devices with a default synchronous memory resource to avoid the use of memory pools.

docs/cuda-core/latest/index.html

Lines changed: 1 addition & 0 deletions
@@ -304,6 +304,7 @@ <h1><code class="docutils literal notranslate"><span class="pre">cuda.core</span
 <li class="toctree-l1"><a class="reference internal" href="install.html">Installation</a><ul>
 <li class="toctree-l2"><a class="reference internal" href="install.html#runtime-requirements">Runtime Requirements</a></li>
 <li class="toctree-l2"><a class="reference internal" href="install.html#installing-from-pypi">Installing from PyPI</a></li>
+<li class="toctree-l2"><a class="reference internal" href="install.html#installing-from-conda-conda-forge">Installing from Conda (conda-forge)</a></li>
 <li class="toctree-l2"><a class="reference internal" href="install.html#installing-from-source">Installing from Source</a></li>
 </ul>
 </li>

docs/cuda-core/latest/install.html

Lines changed: 15 additions & 0 deletions
@@ -327,6 +327,20 @@ <h2>Installing from PyPI<a class="headerlink" href="#installing-from-pypi" title
 <p>Note that using <code class="docutils literal notranslate"><span class="pre">cuda.core</span></code> with NVRTC or nvJitLink installed from PyPI via <code class="docutils literal notranslate"><span class="pre">pip</span> <span class="pre">install</span></code> is currently
 not supported. This will be fixed in a future release.</p>
 </section>
+<section id="installing-from-conda-conda-forge">
+<h2>Installing from Conda (conda-forge)<a class="headerlink" href="#installing-from-conda-conda-forge" title="Link to this heading"></a></h2>
+<p>Same as above, <code class="docutils literal notranslate"><span class="pre">cuda.core</span></code> can be installed in a CUDA 11 or 12 environment. For example with CUDA 12:</p>
+<div class="highlight-console notranslate"><div class="highlight"><pre><span></span><span class="gp">$ </span>conda<span class="w"> </span>install<span class="w"> </span>-c<span class="w"> </span>conda-forge<span class="w"> </span>cuda-core<span class="w"> </span>cuda-version<span class="o">=</span><span class="m">12</span>
+</pre></div>
+</div>
+<p>and likewise use <code class="docutils literal notranslate"><span class="pre">cuda-version=11</span></code> for CUDA 11.</p>
+<p>Note that to use <code class="docutils literal notranslate"><span class="pre">cuda.core</span></code> with nvJitLink installed from conda-forge currently requires it to
+be separately installed:</p>
+<div class="highlight-console notranslate"><div class="highlight"><pre><span></span><span class="gp">$ </span>conda<span class="w"> </span>install<span class="w"> </span>-c<span class="w"> </span>conda-forge<span class="w"> </span>libnvjitlink
+</pre></div>
+</div>
+<p>(can be combined with the command above). This extra step will be removed in a future release.</p>
+</section>
 <section id="installing-from-source">
 <h2>Installing from Source<a class="headerlink" href="#installing-from-source" title="Link to this heading"></a></h2>
 <div class="highlight-console notranslate"><div class="highlight"><pre><span></span><span class="gp">$ </span>git<span class="w"> </span>clone<span class="w"> </span>https://github.com/NVIDIA/cuda-python
@@ -403,6 +417,7 @@ <h2>Installing from Source<a class="headerlink" href="#installing-from-source" t
 <li><a class="reference internal" href="#">Installation</a><ul>
 <li><a class="reference internal" href="#runtime-requirements">Runtime Requirements</a></li>
 <li><a class="reference internal" href="#installing-from-pypi">Installing from PyPI</a></li>
+<li><a class="reference internal" href="#installing-from-conda-conda-forge">Installing from Conda (conda-forge)</a></li>
 <li><a class="reference internal" href="#installing-from-source">Installing from Source</a></li>
 </ul>
 </li>

docs/cuda-core/latest/release.html

Lines changed: 1 addition & 1 deletion
@@ -296,7 +296,7 @@ <h1>Release Notes<a class="headerlink" href="#release-notes" title="Link to this
 <li class="toctree-l1"><a class="reference internal" href="release/0.1.1-notes.html"> 0.1.1</a><ul>
 <li class="toctree-l2"><a class="reference internal" href="release/0.1.1-notes.html#hightlights">Hightlights</a></li>
 <li class="toctree-l2"><a class="reference internal" href="release/0.1.1-notes.html#new-features">New features</a></li>
-<li class="toctree-l2"><a class="reference internal" href="release/0.1.1-notes.html#enchancements">Enchancements</a></li>
+<li class="toctree-l2"><a class="reference internal" href="release/0.1.1-notes.html#enhancements">Enhancements</a></li>
 <li class="toctree-l2"><a class="reference internal" href="release/0.1.1-notes.html#bug-fixes">Bug fixes</a></li>
 <li class="toctree-l2"><a class="reference internal" href="release/0.1.1-notes.html#limitations">Limitations</a></li>
 </ul>

docs/cuda-core/latest/release/0.1.1-notes.html

Lines changed: 3 additions & 3 deletions
@@ -310,8 +310,8 @@ <h2>New features<a class="headerlink" href="#new-features" title="Link to this h
 <li><p>Add <code class="docutils literal notranslate"><span class="pre">LaunchConfig.cluster</span></code> to support thread block clusters on Hopper GPUs.</p></li>
 </ul>
 </section>
-<section id="enchancements">
-<h2>Enchancements<a class="headerlink" href="#enchancements" title="Link to this heading"></a></h2>
+<section id="enhancements">
+<h2>Enhancements<a class="headerlink" href="#enhancements" title="Link to this heading"></a></h2>
 <ul class="simple">
 <li><p>The internal handle held by <code class="docutils literal notranslate"><span class="pre">ObjectCode</span></code> is now lazily initialized upon first touch.</p></li>
 <li><p>Support TCC devices with a default synchronous memory resource to avoid the use of memory pools.</p></li>
@@ -402,7 +402,7 @@ <h2>Limitations<a class="headerlink" href="#limitations" title="Link to this hea
 <li><a class="reference internal" href="#"><code class="docutils literal notranslate"><span class="pre">cuda.core</span></code> v0.1.1 Release notes</a><ul>
 <li><a class="reference internal" href="#hightlights">Hightlights</a></li>
 <li><a class="reference internal" href="#new-features">New features</a></li>
-<li><a class="reference internal" href="#enchancements">Enchancements</a></li>
+<li><a class="reference internal" href="#enhancements">Enhancements</a></li>
 <li><a class="reference internal" href="#bug-fixes">Bug fixes</a></li>
 <li><a class="reference internal" href="#limitations">Limitations</a></li>
 </ul>

docs/cuda-core/latest/searchindex.js

Lines changed: 1 addition & 1 deletion
Some generated files are not rendered by default.
0 Bytes
Binary file not shown.