feat(llama.cpp): enable ROCm/HIPBLAS support #1100

65a · 2023-09-24T19:09:50Z

Description

This PR fixes lack of HIPBLAS support in LocalAI.

Notes for Reviewers
This PR builds on go-skynet/go-llama.cpp#235 to enable ROCm/HIPBLAS support for gguf models running under llama.cpp backend (not the stable ggml one). It can be enabled by using BUILD_TYPE=hipblas. This was tested on a gfx1100 card, but should work for gfx900,gfx1030 and other cards. Card support can be set with AMDGPU_TARGETS environment variable.

Signed commits

Yes, I signed my commits.

Need to dance around the fact llama-stable doesn't support this (I think?) by using a plain CPU build type there. I guess clblas would be ideal, but it requires additional parameters. Signed-off-by: 65a <[email protected]>

Signed-off-by: 65a <[email protected]>

65a · 2023-09-24T19:28:42Z

Additional testing:

✅ Clean checkout of PR builds correctly
✅ Inference works on gfx900 and gfx1100

mudler

Looking good here - not merging it yet as I'd like to cut a release of the current code first as it is well tested up to now, and I prefer to keep bumps separated.

ETA for merging this is 28-29 September.

mudler · 2023-09-28T19:42:13Z

Thanks for the contribution @65a , merging it

* wip: new sections Signed-off-by: mudler <[email protected]> * document hipblas mudler/LocalAI#1100 * add vllm, vall-e-x, minor updates Signed-off-by: mudler <[email protected]> * Add development docs: wip Signed-off-by: mudler <[email protected]> --------- Signed-off-by: mudler <[email protected]>

65a added 3 commits September 24, 2023 12:01

Add HIPBLAS/ROCm support to llama backend

d3148b5

Need to dance around the fact llama-stable doesn't support this (I think?) by using a plain CPU build type there. I guess clblas would be ideal, but it requires additional parameters. Signed-off-by: 65a <[email protected]>

Use recent go-llama.cpp for ROCm support

d707340

Signed-off-by: 65a <[email protected]>

New go-llama.cpp in go.sum for ROCm support

7c32c32

Signed-off-by: 65a <[email protected]>

65a marked this pull request as ready for review September 24, 2023 19:28

lunamidori5 requested a review from mudler September 25, 2023 11:00

lunamidori5 added the enhancement New feature or request label Sep 25, 2023

mudler approved these changes Sep 25, 2023

View reviewed changes

mudler changed the title ~~Enable ROCm/HIPBLAS support for LocalAI~~ feat(llama.cpp): enable ROCm/HIPBLAS support Sep 27, 2023

mudler merged commit 55e38fe into mudler:master Sep 28, 2023

mudler added a commit to go-skynet/localai-website that referenced this pull request Sep 30, 2023

document hipblas mudler/LocalAI#1100

7a7e153

mudler mentioned this pull request Jan 31, 2024

feat(llama.cpp): Vulkan, Kompute, SYCL #1647

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(llama.cpp): enable ROCm/HIPBLAS support #1100

feat(llama.cpp): enable ROCm/HIPBLAS support #1100

65a commented Sep 24, 2023

65a commented Sep 24, 2023

mudler left a comment

mudler commented Sep 28, 2023

feat(llama.cpp): enable ROCm/HIPBLAS support #1100

feat(llama.cpp): enable ROCm/HIPBLAS support #1100

Conversation

65a commented Sep 24, 2023

65a commented Sep 24, 2023

mudler left a comment

Choose a reason for hiding this comment

mudler commented Sep 28, 2023