Default to CUDA if available #431


Merged: merged 1 commit into main from malfet-patch-2 on Apr 25, 2024
Conversation

@malfet (Contributor) commented on Apr 23, 2024

Unless the user specifies a device, default to CUDA if it's available on the platform.

For all models other than TinyLlama, inference on GPU will be faster than on CPU.
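
A minimal sketch of the behavior described above, assuming a PyTorch CLI that takes a `--device` flag (the actual diff is not shown on this page, and the argument names here are placeholders):

```python
# Sketch: default the device to CUDA when available, unless the user overrides it.
# Function and flag names are assumptions for illustration, not the PR's exact code.
import argparse

import torch


def get_default_device() -> str:
    """Return "cuda" when a CUDA device is available, otherwise "cpu"."""
    return "cuda" if torch.cuda.is_available() else "cpu"


parser = argparse.ArgumentParser()
# If --device is not passed, fall back to CUDA when the platform supports it.
parser.add_argument("--device", type=str, default=get_default_device())
args = parser.parse_args()

print(f"Using device: {args.device}")
```

An explicit `--device cpu` (or `--device mps`, etc.) still takes precedence; only the default changes.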

@facebook-github-bot added the CLA Signed label on Apr 23, 2024
@malfet (Contributor, Author) commented on Apr 23, 2024

Not sure if this will work though, need to run some local tests...

@mikekgfb mikekgfb self-requested a review April 25, 2024 02:58
@mikekgfb mikekgfb merged commit 7c011e4 into main Apr 25, 2024
@mikekgfb mikekgfb deleted the malfet-patch-2 branch April 25, 2024 07:32
malfet added commits that referenced this pull request on Jul 17, 2024

Unless the user specifies a device, default to CUDA if it's available on the platform.

For all models other than TinyLlama, inference on GPU will be faster than on CPU.