If you don’t find the idea of writing scripts for that sort of thing literally fun…
I absolutely do. What I see as a potential showstopper right now is that I don’t have a dedicated (non-integrated) GPU, which makes complex LLMs hard to run. Basically, if I can’t push the processing off the CPU, I’m looking at around 2-5 seconds per token; it’s rough. But I like your workflow a lot, and I’m going to try to get something similar going on my incredibly old hardware and see whether CPU-only processing would be feasible (though I’m not super hopeful there).
And yes, I’m also aware of the hallucinations and such that come with the technology. But honestly, for this non-critical use case, I don’t really care.
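For the curious, here’s roughly the kind of CPU-only setup I have in mind: a minimal sketch using llama-cpp-python with a small quantized GGUF model. The model path, thread count, and prompt are all placeholders, not recommendations.

```python
from llama_cpp import Llama  # pip install llama-cpp-python

# Placeholder path: any small quantized GGUF model should do for a CPU-only test.
llm = Llama(
    model_path="./models/some-small-model.Q4_K_M.gguf",
    n_ctx=2048,    # context window; keep it modest on old hardware
    n_threads=4,   # roughly match your physical core count
)

# Calling the model object runs a plain text completion.
output = llm(
    "Summarize this journal entry in one sentence: ...",
    max_tokens=64,
)
print(output["choices"][0]["text"])
```

On hardware this old, a small heavily quantized model is the only realistic option; anything bigger lands right back in the seconds-per-token territory I described.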
I only recently discovered that my installation of Whisper was completely unaware that I had a GPU and was running entirely on my CPU. So even if you can’t get a good LLM running locally, you might still be able to get everything turned into text transcripts for eventual future processing. :)
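If you want to sanity-check whether Whisper actually sees your GPU, something like this works (assuming the openai-whisper package, which pulls in PyTorch; the audio path is just a placeholder):

```python
import torch
import whisper  # the openai-whisper package

# PyTorch is what Whisper uses under the hood, so this tells you
# whether transcription will run on the GPU or fall back to the CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"
print(f"Whisper will run on: {device}")

# "base" is a small model; swap in "medium" or "large" if your hardware allows.
model = whisper.load_model("base", device=device)

# Placeholder path for whatever audio you want transcribed.
result = model.transcribe("recording.mp3")
print(result["text"])
```

If that prints cpu despite a CUDA-capable card, the usual culprit is a CPU-only build of PyTorch.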
That’s awesome! Thank you!