Connection to locally hosted LLM on Apple Silicon

ron_dk
Level 2

Is there any way to set up a connection that connects to a locally running LLM? On Apple Silicon it is possible to run Hugging Face models locally with several different open source solutions, which make use of Metal for GPU compute. The solution I have running functions as a server accessible on localhost. It would be great to be able to set up a connection to run AI recipes against it.


Operating system used: macOS

6 Replies
ron_dk
Level 2
Author

Thanks, Turri. But I think I was unclear. I do have Metal running on macOS, and I do have a local LLM running (via LM Studio). What I am looking for is a no-code way to set up an LLM connection (using the tools available under the "Connections" tab of the admin section) to that local LLM. I have tried different workarounds (using the Azure connection, installing the deprecated GPT plugin), but none of them will let me direct the AI recipe to an LLM running locally (but outside of Dataiku).

Of course, this avenue of inquiry begins with the fact that the Hugging Face connection (also under the admin section) won't run on my Mac, since the setup requires Nvidia GPUs. If there is a way around that limitation, that would also be great. Is that maybe what you are suggesting could be achieved with the Apple PluggableDevice?

Turribeach

OK, I get what you want to do now. I am guessing Dataiku hasn't opened up the local avenue yet. I haven't played much with the LLM Mesh, but my simple understanding is that it lets you connect to different LLM providers. I wonder if it would be possible to duplicate one of the LLM providers' APIs locally and fake the DNS using your local hosts file, so that the connection talks to a local web server that executes your local LLM.
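The shim idea above can be sketched with the Python standard library: a tiny local web server that answers in an OpenAI-style chat-completions shape. Everything here (the endpoint path, the response shape, the canned reply) is a placeholder for illustration, not Dataiku's or any provider's actual expectations; a real shim would forward the prompt to the local LLM instead of echoing a stub.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

class FakeProviderHandler(BaseHTTPRequestHandler):
    """Minimal stand-in for a provider's chat-completions endpoint."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        request = json.loads(self.rfile.read(length) or b"{}")
        # A real shim would pass request["messages"] to the local LLM here;
        # this sketch just returns a canned reply in an OpenAI-like shape.
        reply = {
            "choices": [
                {"message": {"role": "assistant", "content": "stub reply"}}
            ]
        }
        body = json.dumps(reply).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

def make_server(port=8080):
    """Create (but don't start) the fake provider server on localhost."""
    return HTTPServer(("127.0.0.1", port), FakeProviderHandler)

# To run it: make_server().serve_forever()
```

Pointing the provider's hostname at 127.0.0.1 in /etc/hosts would then make the provider connection hit this server instead (TLS certificate validation would likely get in the way for HTTPS endpoints, though).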

Marlan

Hi @ron_dk,

I understand that an LLM Mesh connection plugin is in the works. This would enable creating custom connections for specific use cases (like yours). You may want to contact Dataiku directly for more information.

We will be using this when it is available to connect to an API that we are putting in front of an LLM.


Marlan

ron_dk
Level 2
Author

Thanks, Marlan. That would be a bit of a revolution for us non-coders. Fingers crossed!

ron_dk
Level 2
Author

Bump for this thread.

Has anyone managed to wrangle any of the connections in the LLM Mesh (e.g., the OpenAI connection or the Hugging Face connection) into connecting to a compatible local LLM host running on macOS/Apple Silicon?

I am thinking of LM Studio specifically, which seems to be the perfect partner for Dataiku. When running in server mode, LM Studio gives you a localhost address to send prompts to. All you need to do is ensure that your prompts are formatted as OpenAI prompts. The program uses Metal to offload inference to the GPU. I have a Python script that runs classification that way, but it would be much easier if I could just use Prompt Studio.
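For anyone curious what "formatted as OpenAI prompts" looks like, here is a minimal sketch of the kind of script described above, using only the standard library. The port (1234 is LM Studio's usual default, but check the Server tab), the endpoint path, and the model/system strings are assumptions to adjust for your own setup.

```python
import json
import urllib.request

# LM Studio's local server; 1234 is its usual default port (verify in the app)
LMSTUDIO_URL = "http://localhost:1234/v1/chat/completions"

def build_chat_payload(prompt, system="You are a helpful classifier."):
    """Build an OpenAI-style chat-completions request body."""
    return {
        "model": "local-model",  # LM Studio serves whatever model is loaded
        "messages": [
            {"role": "system", "content": system},
            {"role": "user", "content": prompt},
        ],
        "temperature": 0.0,  # deterministic-ish output for classification
    }

def classify(prompt):
    """Send the prompt to the local server and return the model's reply."""
    data = json.dumps(build_chat_payload(prompt)).encode("utf-8")
    req = urllib.request.Request(
        LMSTUDIO_URL,
        data=data,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

With LM Studio's server running, `classify("Is this review positive or negative? ...")` returns the model's text reply.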

Ollama is another alternative with a similar setup, except it uses the llama.cpp prompt format rather than the OpenAI one.
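Ollama exposes its own REST API rather than the OpenAI shape; a minimal sketch of the equivalent call follows. The port (11434 is Ollama's documented default) and the model name are assumptions; substitute whichever model you have pulled.

```python
import json
import urllib.request

# Ollama's local server; 11434 is its documented default port
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_generate_payload(prompt, model="llama2"):
    """Build an Ollama /api/generate request body (non-streaming)."""
    return {"model": model, "prompt": prompt, "stream": False}

def generate(prompt, model="llama2"):
    """Send the prompt to a local Ollama server and return its reply text."""
    data = json.dumps(build_generate_payload(prompt, model)).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL,
        data=data,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["response"]
```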

Any wrangling suggestions would be much appreciated.

