Comment by thot_experiment

Comment by thot_experiment a day ago

8 replies

I'd love to have something like this but calling a cloud is a no-go for me. I have a half baked tool that a friend of mine and I applied to the Mozilla Builders Grant with (didn't get in), it's janky and I don't have time to work on it right now but it does the thing. I also find myself using OpenWebUI's context RAG stuff sometimes but I'd really like to have a way to dump all of my private documents into a DB and have search/RAG work against them locally, preferably in a way that's agnostic of the LLM backend.

Does such a project exist?

Adityav369 a day ago

You can run this fully locally using Ollama for inference, although you'll need larger models and a beefy machine for great results. On my end llama 3.2 8B does a good job on technical docs, but bigger the better lol.

  • thot_experiment 21 hours ago

    Ahh, I didn't see that, I just saw them talking about a free tier or whatever and my eyes glazed over. I'll try it out with Mistral-small 3.1 at some point tonight, I've been having really great results with it's multimodal understanding.

  • mrtimo 18 hours ago

    how would you use this within open-web-ui locally?

osigurdson 20 hours ago

Just curious, are you fine with running things in your own AWS / Azure / GCP account or do you really mean that the solution has to be fully on-premise?

  • thot_experiment 16 hours ago

    Airgapped. It really makes threat modelling so so soooo much easier. It's temporal so if I were being attacked by a state level actor exfiltration is possible but this specific application I either have the data live and no internet, or internet and no data. I also have some lesser stuff that I allow on-prem w/ internet and just trust the firewall, but absolutely no way am I doing any sensitive data storage or inference in the cloud.

    Since people will be curious, one lesser thing I used this for is a diary/assistant and it's nice to have the peace of mind that I can dump my inner most thoughts without any concern for oversharing.

    • ArnavAgrawal03 3 hours ago

      totally agree that air-gapped provides unparalleled peace of mind. That's a major reason why we have strong support for local deployment. Nice to know that our hypothesis is somewhat accurate :)

    • rank0 16 hours ago

      What kind of hardware do you need for this setup?

      • thot_experiment 15 hours ago

        A computer with a couple gaming GPUs, a lan cable you can unplug and an encrypted external hard drive to offline your sensitive data.